BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004190
         (770 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score = 1327 bits (3435), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 622/748 (83%), Positives = 680/748 (90%), Gaps = 2/748 (0%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           AA EPFACD KD TT+ LPFCQV LPI  RV DLIGRL+L EKV LL++ A AV RLGIK
Sbjct: 23  AALEPFACDSKDGTTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIK 82

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
           GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN+TLWEAIGRVVSDEARAM
Sbjct: 83  GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAM 142

Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAA 202
           YNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+L GKYAASYV+GLQG+DG+RLKVAA
Sbjct: 143 YNGGAAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDGERLKVAA 202

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKHFTAYDLDNWNGVDRFHFNAKVSKQD++DTFDVPFRMCV EGKVASVMCSYNQVNG+
Sbjct: 203 CCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNGI 262

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADPN+L++T+R +W LNGYIVSDCDSVGV+YD QH+TSTPEEAAADAI+AGLDLDCG
Sbjct: 263 PTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDLDCG 322

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFL +HT+ AV+RGL+SE D+N AL NTLTVQMRLGMFDGEPS+QPYG+LGPKDVCTP H
Sbjct: 323 PFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPAH 382

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           QELALEA RQGIVLLKN GPSLPLS  RHRTVA+IGPNS+VTVTMIGNYAG+AC YTTPL
Sbjct: 383 QELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTPL 442

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGIG YA+TIHQQGC DV C  DQLF  AIDA+RQADAT+LVMGLDQSIEAE  DR GLL
Sbjct: 443 QGIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGLL 502

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPGRQQELVSKV+MASKGPTILVLMSGGPIDV+FAK DP+IAAI+WAGYPGQAGG AIAD
Sbjct: 503 LPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIAD 562

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT NPGGKLPMTWYPQEYITNLPMTEMAMR SQSK YPGRTYRFY+G VVYPFGHGM
Sbjct: 563 VLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQGKVVYPFGHGM 622

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SYT+FVH +A+APT+V+VPLDG  G  N +ISGKAI+VTH KCN+L+LG+QVDVKNVGSK
Sbjct: 623 SYTHFVHNIASAPTMVSVPLDGHRG--NTSISGKAIRVTHTKCNKLSLGIQVDVKNVGSK 680

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           DG HTLLV+S PPAG W+PHKQLVAFE+VHV AG Q+RVGI+IHVCK LSVVDRSG RRI
Sbjct: 681 DGTHTLLVYSAPPAGRWSPHKQLVAFERVHVSAGTQERVGISIHVCKLLSVVDRSGIRRI 740

Query: 743 PLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           P+GEH+IHIG  KHSVSL A  LGV+KS
Sbjct: 741 PIGEHSIHIGNVKHSVSLQATVLGVVKS 768


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score = 1292 bits (3343), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 602/769 (78%), Positives = 684/769 (88%), Gaps = 8/769 (1%)

Query: 2   ASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
           +S++  F + L ++S       AR+PFACDPKD      PFC+ S+ I +RV DLIGRL+
Sbjct: 8   SSSLLIFLVVLAVVSGE-----ARDPFACDPKDGANAGFPFCRKSIGIGERVKDLIGRLT 62

Query: 62  LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
           L+EKV+LL++ AA VPRLGIKGYEWWSEALHGVSNVGPGTKF GDFPGATSFPQVITTA+
Sbjct: 63  LEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAA 122

Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
           SFN++LWEAIG+VVSDEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+GK
Sbjct: 123 SFNSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGK 182

Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           YAA YVRGLQG+ GDRLKVAA CKHFTAYDLDNWNGVDRFHF+A+VSKQ++EDTFDVPFR
Sbjct: 183 YAARYVRGLQGNAGDRLKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFR 242

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV+EGKVASVMCSYNQVNGVPTCADPN+L+ T+R +W LNGY+VSDCDSVGV+YD QH+
Sbjct: 243 SCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHY 302

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T+TPEEAAADAI+AGLDLDCGPFL +HT+ A+++GL+SE D+++ALVNT+TVQMRLGMFD
Sbjct: 303 TNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFD 362

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           GEPS+QP+G LGPKDVC+P HQELA+EAARQGIVLLKN G SLPLS   HR++AVIGPNS
Sbjct: 363 GEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNS 422

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
           D  VTMIGNYAGI C YTTPLQGIGRY+RTIHQ+GC DVAC++DQLF  AIDA+ QADAT
Sbjct: 423 DANVTMIGNYAGIPCEYTTPLQGIGRYSRTIHQKGCADVACSEDQLFAGAIDAASQADAT 482

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LVMGLDQSIEAEA DRA LLLPGRQQELVSKV+MAS+GPT+LVLMSGGP+DV+FAK DP
Sbjct: 483 VLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDP 542

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           RIAAI+WAGYPGQAGG AIADILFG +NPGGKLPMTWYPQEY++ +PMT MAMR   SK 
Sbjct: 543 RIAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEYLSKVPMTTMAMRAIPSKA 602

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           YPGRTYRFYKGPVVY FGHG+SYTNFVHT+A APT VA+PL G H   N T+SGKAI+VT
Sbjct: 603 YPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAPTAVAIPLHGHH---NTTVSGKAIRVT 659

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
           HAKCNRL++ + +DVKNVG+KDG+HTLLVFS PPAGHWAPHKQLVAFEKVHV A  QQRV
Sbjct: 660 HAKCNRLSIALHLDVKNVGNKDGSHTLLVFSKPPAGHWAPHKQLVAFEKVHVAARTQQRV 719

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
            INIHVCKYLSVVDRSG RRIP+G+H +HIG TKH VSL AA LGVIKS
Sbjct: 720 QINIHVCKYLSVVDRSGIRRIPMGQHGLHIGDTKHLVSLQAAALGVIKS 768


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score = 1290 bits (3338), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 603/753 (80%), Positives = 686/753 (91%), Gaps = 4/753 (0%)

Query: 21  GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           G  AR+PFACDPK+  T+ LPFC+  L    RV DLIGRL+LQEKV LL++ AAAVPRLG
Sbjct: 20  GGEARDPFACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLG 79

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           IKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SFNA+LWEAIGRV SDEAR
Sbjct: 80  IKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEAR 139

Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
           AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAASYVRGLQ +DG+RLKV
Sbjct: 140 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGNRLKV 199

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           AASCKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPFRMCV EGKVASVMCSYNQVN
Sbjct: 200 AASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVN 259

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVPTCADP +LKRT+RG+W LNGYIVSDCDSVGV+Y++QH+TSTPEEAAADAI+AGLDLD
Sbjct: 260 GVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLD 319

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CGPFLG HT++AV++GL+SE D+N AL+NTLTVQMRLGM+DGEPSS PYG LGP+DVCTP
Sbjct: 320 CGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTP 379

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            HQELALEAARQGIVLLKN+GPSLPLS  RH TVAVIGPNS+VTVTMIGNYAGIACGYT+
Sbjct: 380 SHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTS 439

Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           PL+GIGRY +TIH+ GC +VAC +D+ FG AI+ ++QADAT+LVMGLDQSIEAE +DRAG
Sbjct: 440 PLEGIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAG 499

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPGRQQ+LVSKV+ ASKGPTILV+MSGGP+D+ FAKN+PRI AI+WAGYPGQAGG AI
Sbjct: 500 LLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAI 559

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           ADILFGTSNPGGKLPMTWYPQ YI NLPMT MAMR S+SK YPGRTYRFY GPVVYPFG+
Sbjct: 560 ADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGY 619

Query: 621 GMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           G+SYT+FVHT+A+AP +V++P+DG RHG+ +++I+ KAIKVTHA+C +L++ +QVDVKNV
Sbjct: 620 GLSYTHFVHTLASAPKLVSIPVDGHRHGN-SSSIANKAIKVTHARCGKLSISLQVDVKNV 678

Query: 680 GSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           GSKDG HTLLVFS PPA  GHWAPHKQLVAF+K+H+P+ AQQRV +NIHVCK LSVVDRS
Sbjct: 679 GSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKAQQRVNVNIHVCKLLSVVDRS 738

Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           GTRR+P+G H++HIG  KH VSL A TLG+IKS
Sbjct: 739 GTRRVPMGLHSLHIGDVKHYVSLQAETLGIIKS 771


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score = 1283 bits (3321), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 603/753 (80%), Positives = 680/753 (90%), Gaps = 4/753 (0%)

Query: 21  GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           G  AR+PFACDPK+  T+ LPFC+ SL    RV DLIGRL+LQEKV LL++ AAAVPRLG
Sbjct: 21  GGEARDPFACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLG 80

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           IKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SFNA+LWEAIGRV SDEAR
Sbjct: 81  IKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEAR 140

Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
           AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAASYVRGLQG+DG+RLKV
Sbjct: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGNRLKV 200

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           AASCKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPFRMCV EGKVASVMCSYNQVN
Sbjct: 201 AASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVN 260

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVPTCADP +LKRT+RG+W LNGYIVSDCDSVGV+Y++QH+TSTPEEAAADAI+AGLDLD
Sbjct: 261 GVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLD 320

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CGPFLG HT++AV++GL+SE D+N AL+NTLTVQMRLGM+DGEPSS PY +LGP+DVCT 
Sbjct: 321 CGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQ 380

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            HQELALEAARQGIVLLKN+GPSLPLS  R RTVAVIGPNS+VT TMIGNYAGIACGYT+
Sbjct: 381 SHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTS 440

Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           PLQGIG Y +TI++ GC +VAC DD+ FG AI+A++QADAT+LVMGLDQSIEAE +DRA 
Sbjct: 441 PLQGIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRAS 500

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG QQ+LVSKV+ ASKGPTILV+MSGGP+D+ FAKNDPRI  I+WAGYPGQAGG AI
Sbjct: 501 LLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAI 560

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           ADILFGTSNPGGKLPMTWYPQ YI NLPMT MAMR S+SK YPGRTYRFY GPVVYPFG+
Sbjct: 561 ADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGY 620

Query: 621 GMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           G+SYT+FVHT+ +AP +V++P+DG RHG+ ++ I+ KAIKVTHA+C +L++ + VDVKNV
Sbjct: 621 GLSYTHFVHTLTSAPKLVSIPVDGHRHGN-SSNIANKAIKVTHARCGKLSINLHVDVKNV 679

Query: 680 GSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           GSKDG HTLLVFS PPA  GHWAPHKQLVAFEKVH+PA AQQRV + IHVCK LSVVDRS
Sbjct: 680 GSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKAQQRVRVKIHVCKLLSVVDRS 739

Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           GTRRIP+G H++HIG  KHSVSL A TLG+IKS
Sbjct: 740 GTRRIPMGLHSLHIGDVKHSVSLQAETLGIIKS 772


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score = 1279 bits (3309), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 598/772 (77%), Positives = 684/772 (88%), Gaps = 6/772 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           + + IA F L L L+  +     AR+PFACDPK+  T  +PFC+ SL IP+RV DL+GRL
Sbjct: 8   LLNLIAVFLL-LFLVRHTCE---ARDPFACDPKNGATENMPFCKASLAIPERVKDLVGRL 63

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +LQEKV+LL++ AAAVPRLG+KGYEWWSEALHGVSNVGPG KF   FPGATSFPQVITTA
Sbjct: 64  TLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITTA 123

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWEAIG+VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G
Sbjct: 124 ASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAG 183

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
            YAASYVRGLQG+DG+RLKVAA CKHFTAYDLDNWNG+DRFHFNA+VSKQDIE+TFDVPF
Sbjct: 184 TYAASYVRGLQGTDGNRLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEETFDVPF 243

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           RMCV EGKVASVMCSYNQVNGVPTCADPN+LK+T+RG W+L+GYIVSDCDSVGV+YD QH
Sbjct: 244 RMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQH 303

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAADAI+AGLDLDCGPFL +HT++AV++GLLSE D+N ALVNTLTVQMRLGMF
Sbjct: 304 YTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLLSEADVNGALVNTLTVQMRLGMF 363

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DGEPS+  YG LGPKDVC P HQELALEAARQGIVLLKN GP LPLS  RH TVAVIGPN
Sbjct: 364 DGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSPQRHHTVAVIGPN 423

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           S  TVTMIGNYAG+ACGYT PLQGIGRYA+TIHQ GC++VAC +D+LFG+AI+A+RQADA
Sbjct: 424 SKATVTMIGNYAGVACGYTNPLQGIGRYAKTIHQLGCENVACKNDKLFGSAINAARQADA 483

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LVMGLDQSIEAE +DR GLLLPGRQQ+LVSKV+ ASKGPTILV+MSGG +D+ FAKN+
Sbjct: 484 TVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAASKGPTILVIMSGGSVDITFAKNN 543

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           PRI  I+WAGYPGQAGG AIADILFGT+NPGGKLP+TWYPQEY+T LPMT MAMR S+S 
Sbjct: 544 PRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTWYPQEYLTKLPMTNMAMRGSKSA 603

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            YPGRTYRFY GPVVYPFGHG++YT+FVHT+A+APTVV+VPL+G   +    IS +AI+V
Sbjct: 604 GYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPTVVSVPLNGHRRANVTNISNRAIRV 663

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQ 718
           THA+C++L++ ++VD+KNVGS+DG HTLLVFS PPA  GHWA  KQLVAFEK+HVPA   
Sbjct: 664 THARCDKLSISLEVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGL 723

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           QRVG+NIHVCK LSVVD+SG RRIPLGEH+ +IG  KHSVSL AA LG+IKS
Sbjct: 724 QRVGVNIHVCKLLSVVDKSGIRRIPLGEHSFNIGDVKHSVSLQAAALGIIKS 775


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score = 1273 bits (3294), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 600/769 (78%), Positives = 677/769 (88%), Gaps = 12/769 (1%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I   FL L+L+S+      AR+PFACDPK+ +T   PFC+ SLPIP RVNDLIGRL+LQE
Sbjct: 6   ITIVFL-LLLMSSE-----ARDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQE 59

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           KV +L++ AAAVPR+GIKGYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITT +SFN
Sbjct: 60  KVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFN 119

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
           A+LWEAIGRV SDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+GKYAA
Sbjct: 120 ASLWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAA 179

Query: 185 SYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           SYVRGLQG+D  RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD+EDTF+VPFRMCV
Sbjct: 180 SYVRGLQGTDSSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFRMCV 239

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
            EG VASVMCSYNQVNGVPTCADPN+LKRTIRG+W L+GYIVSDCDSVGV+Y  QH+TST
Sbjct: 240 KEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTNQHYTST 299

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
           PEEAAADAI+AGLDLDCGPFL  HT++AV++GLL+E D+N AL NTLTVQMRLGMFDGEP
Sbjct: 300 PEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFDGEP 359

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
           S+QPYG+LGP DVCTP HQELAL+AARQGIVLLKN GPSLPLS   H+TVAVIGPNS+ T
Sbjct: 360 SAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNSNAT 419

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILV 484
           VTMIGNYAGIACGYT+PLQGIG+YARTIH+ GC +VAC DD+ FG+A++A+RQADAT+LV
Sbjct: 420 VTMIGNYAGIACGYTSPLQGIGKYARTIHEPGCANVACNDDKQFGSALNAARQADATVLV 479

Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
           MGLDQSIEAE +DR GLLLPG QQ+LVSKV+ AS+GPTILVLMSGGPID+ FAKNDPRI 
Sbjct: 480 MGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKNDPRIM 539

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
            I+WAGYPGQAGG AIADILFGT+NPG KLPMTWYPQ Y+ NL MT MAMRPS S  YPG
Sbjct: 540 GILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGYLKNLAMTNMAMRPSSSTGYPG 599

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHA 663
           RTYRFY GPVVYPFG+G+SYTNFVHT+A+AP VV+VP+DG R G+   + +  AI+VTHA
Sbjct: 600 RTYRFYNGPVVYPFGYGLSYTNFVHTLASAPKVVSVPVDGHRRGN---SSNKAAIRVTHA 656

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRV 721
           +C +L++ + +DVKNVGSKDG +TLLVFS PP   GHWAP KQLVAFEKV+VPA AQQRV
Sbjct: 657 RCGKLSIRLDIDVKNVGSKDGTNTLLVFSVPPTGNGHWAPQKQLVAFEKVYVPAKAQQRV 716

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
            INIHVCK LSVVD+SGTRRIP+G H+IHIG  KH VSL    LG+IK+
Sbjct: 717 RINIHVCKLLSVVDKSGTRRIPMGAHSIHIGDVKHFVSLQEQKLGIIKT 765


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score = 1272 bits (3291), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 599/773 (77%), Positives = 681/773 (88%), Gaps = 5/773 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M+ST +   L  + +    S   AR+ FACD KDA T+ LPFC V L IP+RV DLIGRL
Sbjct: 1   MSSTFSLSPLITLFILLLQSSCDARDSFACDAKDAATKNLPFCNVKLAIPERVKDLIGRL 60

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           ++QEKV LL++ A AVPR+G+K YEWWSEALHGVSNVGPGT+FGG FP ATSFPQVITTA
Sbjct: 61  TMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFPAATSFPQVITTA 120

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWEAIGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G
Sbjct: 121 ASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAG 180

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           +YAASYV+GLQG+DG++LKVAA CKHFTAYD+DNWNGVDRFHFNA VSKQDIEDTFDVPF
Sbjct: 181 RYAASYVKGLQGTDGNKLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPF 240

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           RMCV EGKVASVMCSYNQVNGVPTCADPN+LK+T+RG W L+GYIVSDCDSVGV Y++QH
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQH 300

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +TSTPEEAAADAI+AGLDLDCGPFLG+HT+ AV++GLL+E D+NNALVNTL VQMRLGMF
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMF 360

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DGEPS+Q YG LGPKDVC P HQELALEAARQGIVLLKN GP+LPLS  RHRTVAVIGPN
Sbjct: 361 DGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPN 420

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           SDVTVTMIGNYAGIACGYT+PLQGIGRYA+TIHQQGC +VAC DD+ FG A+DA+R ADA
Sbjct: 421 SDVTVTMIGNYAGIACGYTSPLQGIGRYAKTIHQQGCSNVACRDDKQFGPALDAARHADA 480

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           TILV+GLDQSIEAE +DR  LLLPG QQ+LVSKV+ ASKGPTILVLMSGGP+D+ FAKND
Sbjct: 481 TILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKND 540

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           P++A I+WAGYPGQAGG AIADILFGT++PGGKLP+TWYPQEY+ NL MT MAMRPS+  
Sbjct: 541 PKVAGILWAGYPGQAGGAAIADILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRPSKIG 600

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
            YPGRTYRFYKGPVVYPFGHG++YT+FVH +++APTVV+VP+ G RHG+ N  IS KAI+
Sbjct: 601 -YPGRTYRFYKGPVVYPFGHGLTYTHFVHELSSAPTVVSVPVHGHRHGN-NTNISNKAIR 658

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGA 717
           VTHA+C +L++ + VDVKNVGS+DG HTLLVFS PP G  HW P K LVAFEKVHVPA  
Sbjct: 659 VTHARCGKLSIALHVDVKNVGSRDGTHTLLVFSAPPNGGNHWVPQKSLVAFEKVHVPAKT 718

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           +QRV +NIHVCK LSVVD+SG RRIP+GEH++HIG  KHSVSL A  LG+IKS
Sbjct: 719 KQRVRVNIHVCKLLSVVDKSGIRRIPMGEHSLHIGDVKHSVSLQAEALGIIKS 771


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score = 1270 bits (3286), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 588/749 (78%), Positives = 671/749 (89%), Gaps = 2/749 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AREPFACDPK+  T+ + FC+VSL I +RV DLIGRL+L+EKV+LL++ AAAVPRLG+KG
Sbjct: 23  AREPFACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKG 82

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSN+GP  KF   FP ATSFPQVITTA+SFNA+LWEAIG+VVSDEARAMY
Sbjct: 83  YEWWSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMY 142

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
           NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+G YAA+YVRGLQG+  +RLKVAA 
Sbjct: 143 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHANRLKVAAC 202

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKHFTAYDLDNWNG+DRFHFNA+VSKQDIEDTFDVPF+MCV EGKVASVMCSYNQVNGVP
Sbjct: 203 CKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNGVP 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADPN+LK+T+RG W+L+GYIVSDCDSVGV+YD QH+T TPEEAAADAI+AGLDLDCGP
Sbjct: 263 TCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDCGP 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FL +HT++AV++GLLSE D+N ALVNTLTVQMRLGMFDGEP++ PYGHLGPKDVC P HQ
Sbjct: 323 FLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPAHQ 382

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELALEAARQGIVLLKN GP LPLS   HRTVAVIGPNS  T+TMIGNYAG+ACGYT PLQ
Sbjct: 383 ELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNPLQ 442

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           GIGRYART+HQ GC++VAC +D+LFG AI+A+RQADAT+LVMGLDQSIEAE +DR GLLL
Sbjct: 443 GIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGLLL 502

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ +LVSKV+ ASKGPTILVLMSGGP+D+ FAKN+PRI  I+WAGYPGQAGG AIADI
Sbjct: 503 PGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIADI 562

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFGT+NPGGKLP+TWYP+EY+T LPMT MAMR ++S  YPGRTYRFY GPVVYPFGHG++
Sbjct: 563 LFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFYNGPVVYPFGHGLT 622

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT+FVHT+A+APTVV+VPL+G   +    IS +AI+VTHA+C++L++ +QVD+KNVGS+D
Sbjct: 623 YTHFVHTLASAPTVVSVPLNGHRRANVTNISNRAIRVTHARCDKLSITLQVDIKNVGSRD 682

Query: 684 GAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
           G HTLLVFS PPA  GHWA  KQLVAFEKVHVPA  Q RVG+NIHVCK LSVVDRSG RR
Sbjct: 683 GTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRVGVNIHVCKLLSVVDRSGIRR 742

Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           IPLGEH+ +IG  KHSVSL AA LG+IKS
Sbjct: 743 IPLGEHSFNIGDVKHSVSLQAAALGIIKS 771


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score = 1243 bits (3217), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 584/765 (76%), Positives = 668/765 (87%), Gaps = 7/765 (0%)

Query: 9   FLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           FL +IL   +S+SS  + +RE FACD KDA T TL FCQ+S+PIP+RV DLIGRL+L EK
Sbjct: 8   FLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEK 67

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT +SFNA
Sbjct: 68  VSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNA 127

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           +LWE+IGRVVS+EARAMYNGG  GLTYWSPNVNI RDPRWGRGQETPGEDPV++GKYAAS
Sbjct: 128 SLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAAS 187

Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
           YVRGLQG+D  RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 
Sbjct: 188 YVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVK 247

Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
           EG VAS+MCSYNQVNGVPTCADPN+LK+TIR +W LNGYIVSDCDSVGV YDTQH+T TP
Sbjct: 248 EGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTP 307

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
           EEAAAD+I+AGLDLDCGPFLG HT  AV++ LL E D++NAL+NTLTVQMRLGMFDG+ +
Sbjct: 308 EEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIA 367

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
           +QPYGHLGP  VCTP H+ LALEAA+QGIVLLKN G SLPLS  RHRTVAVIGPNSD TV
Sbjct: 368 AQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATV 427

Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
           TMIGNYAG+ACGYT+P+QGI  YARTIHQ+GC DV C DD+LF AA++A+R ADAT+LVM
Sbjct: 428 TMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVM 487

Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
           GLDQSIEAE  DR  LLLPG+QQELVS+V+ A+KGP ILVLMSGGPID++FA+ D +I A
Sbjct: 488 GLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPA 547

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
           I+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP  SKR PGR
Sbjct: 548 IVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGR 607

Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
           TYRFY GPVVYPFGHG+SYT F H +A+AP V+ + + GR+G    T+SGK+I+VTHA+C
Sbjct: 608 TYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAVRGRNG----TVSGKSIRVTHARC 663

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           +RL+LGV V+V NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV  G ++RV +NI
Sbjct: 664 DRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNI 723

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           HVCKYLSVVDR+G RRIP+G+H IHIG   H+VSL A+TLGVIKS
Sbjct: 724 HVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 768


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score = 1243 bits (3215), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 584/765 (76%), Positives = 668/765 (87%), Gaps = 7/765 (0%)

Query: 9   FLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           FL +IL   +S+SS  + +RE FACD KDA T TL FCQ+S+PIP+RV DLIGRL+L EK
Sbjct: 3   FLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEK 62

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT +SFNA
Sbjct: 63  VSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNA 122

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           +LWE+IGRVVS+EARAMYNGG  GLTYWSPNVNI RDPRWGRGQETPGEDPV++GKYAAS
Sbjct: 123 SLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAAS 182

Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
           YVRGLQG+D  RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 
Sbjct: 183 YVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVK 242

Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
           EG VAS+MCSYNQVNGVPTCADPN+LK+TIR +W LNGYIVSDCDSVGV YDTQH+T TP
Sbjct: 243 EGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTP 302

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
           EEAAAD+I+AGLDLDCGPFLG HT  AV++ LL E D++NAL+NTLTVQMRLGMFDG+ +
Sbjct: 303 EEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIA 362

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
           +QPYGHLGP  VCTP H+ LALEAA+QGIVLLKN G SLPLS  RHRTVAVIGPNSD TV
Sbjct: 363 AQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATV 422

Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
           TMIGNYAG+ACGYT+P+QGI  YARTIHQ+GC DV C DD+LF AA++A+R ADAT+LVM
Sbjct: 423 TMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVM 482

Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
           GLDQSIEAE  DR  LLLPG+QQELVS+V+ A+KGP ILVLMSGGPID++FA+ D +I A
Sbjct: 483 GLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPA 542

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
           I+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP  SKR PGR
Sbjct: 543 IVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGR 602

Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
           TYRFY GPVVYPFGHG+SYT F H +A+AP V+ + + GR+G    T+SGK+I+VTHA+C
Sbjct: 603 TYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAVRGRNG----TVSGKSIRVTHARC 658

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           +RL+LGV V+V NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV  G ++RV +NI
Sbjct: 659 DRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNI 718

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           HVCKYLSVVDR+G RRIP+G+H IHIG   H+VSL A+TLGVIKS
Sbjct: 719 HVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 763


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score = 1242 bits (3214), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 584/770 (75%), Positives = 667/770 (86%), Gaps = 7/770 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           MA   A  F    L+S+SS  + +RE FACD KDA T TL FCQ+S+PI +RV DLIGRL
Sbjct: 1   MAFLAAILFF---LISSSSVCVQSRETFACDIKDAATATLRFCQLSVPITERVKDLIGRL 57

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L EKV LL + AAA+PRLGIKGYEWWSEALHGVSNVGPGTKFGG +P ATSFPQVITT 
Sbjct: 58  TLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTV 117

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWE+IGRVVS+EARAMYNGG  GLTYWSPNVNI RDPRWGRGQETPGEDPV++G
Sbjct: 118 ASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAG 177

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           KYAASYVRGLQG+D  RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF
Sbjct: 178 KYAASYVRGLQGNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 237

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           RMCV EG VAS+MCSYN+VNGVPTCADPN+LK+TIR EW LNGYIVSDCDSVGV YDTQH
Sbjct: 238 RMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQH 297

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAAD+I+AGLDLDCGPFLG HT  AV++ LL E D++NAL+NTLTVQMRLGMF
Sbjct: 298 YTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMF 357

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+ ++QPYGHLGP  VCTP H+ LALEAA+QGIVLLKN G SLPLS  RHRTVAVIGPN
Sbjct: 358 DGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPN 417

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           SD TV MIGNYAGIACGYT+P+QGI  YART+HQ+GC DV C DD+LF AA++A+R ADA
Sbjct: 418 SDATVAMIGNYAGIACGYTSPVQGITGYARTVHQKGCVDVHCMDDRLFDAAVEAARGADA 477

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LVMGLDQSIEAE  DR  LLLPG+QQEL+S+V+ A+KGP ILVLMSGGPID++FA+ D
Sbjct: 478 TVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKD 537

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I AI+WAGYPGQ GGTAIADILFG++NPGGKLPMTWYPQ+Y+TNLPMTEM+MRP  SK
Sbjct: 538 RKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPIHSK 597

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
           R PGRTYRFY GPVVYPFGHG+SYT F H++A+AP V+ + + GR+G    T+SGK+I+V
Sbjct: 598 RIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVIPIAVRGRNG----TVSGKSIRV 653

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
           THA+CNRL+LGV VDV NVGS+DG HT+LVFS PP G WAP KQLVAFE+VHV  G ++R
Sbjct: 654 THARCNRLSLGVHVDVTNVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKR 713

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           V +NIHVCKYLSVVDR+G RRIP+G+H IHIG   H+VSL A+TLGVIKS
Sbjct: 714 VQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGDESHTVSLQASTLGVIKS 763


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score = 1225 bits (3170), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/769 (75%), Positives = 661/769 (85%), Gaps = 5/769 (0%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           ST + F    + +S      AAR PFACD K+   R  PFCQ +LPI  RV DLIGRL+L
Sbjct: 10  STSSLFIFIFLFVSIQ----AARPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTL 65

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           QEKVKLL + AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG+FPGATSFPQVITTA+S
Sbjct: 66  QEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAAS 125

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FNA+LWE IGRVVSDEARAMYNG   GLTYWSPNVNIFRDPRWGRGQETPGEDPV++  Y
Sbjct: 126 FNASLWEEIGRVVSDEARAMYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALY 185

Query: 183 AASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           A  YVRGLQG+ DGD LKVAA CKH+TAYDLDNW GVDRFHFNAKV+KQDIEDTFDVPFR
Sbjct: 186 AERYVRGLQGNEDGDSLKVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFR 245

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV +GKVAS+MCSYNQVNG+PTCADP +L++TIRG W LNGYIVSDCDSVGV+YDTQH+
Sbjct: 246 SCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHY 305

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           TSTPEEAAA AI+AGLDLDCGPFL  HTE+AV  G+L E  I+  L NT+ VQMRLGMFD
Sbjct: 306 TSTPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFD 365

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           GEPS+Q YGHLGP+DVC+P HQELA+EAARQGIVLLKN GP+LPLS  RHRTVAVIGPNS
Sbjct: 366 GEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNS 425

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
           DVTVTMIGNYAG+ACGYT+PLQGI +YA+TIH++GC DVAC+DD+LF  A++A+RQADAT
Sbjct: 426 DVTVTMIGNYAGVACGYTSPLQGISKYAKTIHEKGCGDVACSDDKLFAGAVNAARQADAT 485

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LVMGLDQSIEAE  DR GLLLPG QQEL+S+VS AS+GP +LVLMSGGP+DV FA NDP
Sbjct: 486 VLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDP 545

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           RI AI+WAGYPGQ GG AIAD+LFG  NPGGKLPMTWYPQEY+ NLPMT M MR + +K 
Sbjct: 546 RIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRSNLAKG 605

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           YPGRTYRFYKGP+VYPFGHG+SYT F+ T+  AP  +A+P+DGRH   ++TIS K+I+VT
Sbjct: 606 YPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDGRHTYNSSTISNKSIRVT 665

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
           HAKC+++++ + VDVKNVG KDG+HTLLVFS PP   W PHKQLVAF+KV+VPA ++QRV
Sbjct: 666 HAKCSKISVQIHVDVKNVGPKDGSHTLLVFSKPPVDIWVPHKQLVAFQKVYVPARSKQRV 725

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
            INIHVCKYLSVVDR+G RRIP+GEH+IHIG  KHS+SL A+ LGVIKS
Sbjct: 726 AINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQASVLGVIKS 774


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score = 1200 bits (3105), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 568/763 (74%), Positives = 657/763 (86%), Gaps = 6/763 (0%)

Query: 12  LILLSA-SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
           LI+LSA    G  AREPFACDPKDA     PFC+V+LPIP+RV DLIGRL+LQEKV+LL+
Sbjct: 12  LIILSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLV 71

Query: 71  SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEA 130
           + AAAVPRLGIKGYEWWSEALHGVSNVGPGT+FGGDFPGATSFPQVITT +SFN +LWEA
Sbjct: 72  NNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEA 131

Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           IGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++G+YAA Y++GL
Sbjct: 132 IGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGL 191

Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
           QG+DGDRLKVAA CKHFTAYDLDNWNG DRFHFNAKV++QD+ DTF+VPFR CV EGKVA
Sbjct: 192 QGNDGDRLKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVA 251

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           SVMCSYNQVNGVPTCADPN+LK TIR +W LNGYIVSDCDSVGV+YD QH+TST EEAAA
Sbjct: 252 SVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAA 311

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
           DAI+AGLDLDCGPFL +HTE AV++GLL++  INNAL NT+TVQMRLGMFDG PSS  YG
Sbjct: 312 DAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYG 371

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            LGPK+VC+P HQ+LAL+AARQGIVLLKN+ P LPLS   HRTVAVIGPNSDV VTMIGN
Sbjct: 372 KLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGN 431

Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
           YAG+ACGY TPL+GI RY   +H++GC +VACA D  F  A+ A+  ADAT+LVMGLDQS
Sbjct: 432 YAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQS 491

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +EAE  DR GLLLPGRQQELV KV+ AS+GPT+++LMSGGPIDV+FA NDPRI+AI+W G
Sbjct: 492 VEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVG 551

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           YPGQAGG AIAD+LFGT+NPGGKLPMTWYPQ Y++NLPMT MAMR + S  YPGRTYRFY
Sbjct: 552 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFY 609

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLT 669
            GPVVY FGHG+SYTNF+HT+  APT+V++ L G R     +T+S KAI+VTHAKC +L+
Sbjct: 610 AGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSKAIRVTHAKCQKLS 669

Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
           L + VDV+N G +DG HT+LVFSTPPA    W P KQLVAFEK+H+ +  ++R+ +++HV
Sbjct: 670 LVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHV 729

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           CKYLSVVD+ G RRIPLG+H IHIG  KH+VSL AATLG+IK+
Sbjct: 730 CKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSLQAATLGIIKT 772


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score = 1198 bits (3099), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/763 (74%), Positives = 656/763 (85%), Gaps = 6/763 (0%)

Query: 12  LILLSA-SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
           LI+LSA    G  AREPFACDPKDA     PFC+V+LPIP+RV DLIGRL+LQEKV+LL+
Sbjct: 12  LIILSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLV 71

Query: 71  SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEA 130
           + AAAVPRLGIKGYEWWSEALHGVSNVGPGT+FGGDFPGATSFPQVITT +SFN +LWEA
Sbjct: 72  NNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEA 131

Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           IGRVVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++G+YAA Y++GL
Sbjct: 132 IGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGL 191

Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
           QG+DGDRLKVAA CKHFTAYDLDNWNG DRFHFNAKV++QD+ DTF+VPFR CV EGKVA
Sbjct: 192 QGNDGDRLKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVA 251

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           SVMCSYNQVNGVPTCADPN+LK TIR +W LNGYIVSDCDSVGV+YD QH+TST EEAAA
Sbjct: 252 SVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAA 311

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
           DAI+AGLDLDCGPFL +HTE AV++ LL++  INNAL NT+TVQMRLGMFDG PSS  YG
Sbjct: 312 DAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYG 371

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            LGPK+VC+P HQ+LAL+AARQGIVLLKN+ P LPLS   HRTVAVIGPNSDV VTMIGN
Sbjct: 372 KLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGN 431

Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
           YAG+ACGY TPL+GI RY   +H++GC +VACA D  F  A+ A+  ADAT+LVMGLDQS
Sbjct: 432 YAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQS 491

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +EAE  DR GLLLPGRQQELV KV+ AS+GPT+++LMSGGPIDV+FA NDPRI+AI+W G
Sbjct: 492 VEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVG 551

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           YPGQAGG AIAD+LFGT+NPGGKLPMTWYPQ Y++NLPMT MAMR + S  YPGRTYRFY
Sbjct: 552 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFY 609

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLT 669
            GPVVY FGHG+SYTNF+HT+  APT+V++ L G R     +T+S KAI+VTHAKC +L+
Sbjct: 610 AGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQTHSASTLSSKAIRVTHAKCQKLS 669

Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPA--GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
           L + VDV+N G +DG HT+LVFSTPPA    W P KQLVAFEK+H+ +  ++R+ +++HV
Sbjct: 670 LVIHVDVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHV 729

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           CKYLSVVD+ G RRIPLG+H IHIG  KH+VSL AATLG+IK+
Sbjct: 730 CKYLSVVDKLGVRRIPLGDHYIHIGNVKHTVSLQAATLGIIKT 772


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score = 1187 bits (3070), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 560/749 (74%), Positives = 641/749 (85%), Gaps = 2/749 (0%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           + +REPFACDPK   TR+L FC+V+LPI  RV DLIGRL+LQEK++LL++ AAAVPRLGI
Sbjct: 22  VESREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 81

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
           +GYEWWSEALHGVSNVGPGTKFGG FPGAT+FPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 82  QGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARA 141

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
           MYNGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPV++GKYAASYVRGLQG++G RLKVA
Sbjct: 142 MYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLRLKVA 201

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV+ GKVASVMCSYNQVNG
Sbjct: 202 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSYNQVNG 261

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            PTCADP +LK TIRGEW LNGYIVSDCDSVGV +DTQH+T+TPEEAAA  IRAGLDLDC
Sbjct: 262 KPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAASTIRAGLDLDC 321

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           GPFL +HTE+AV+ GLL E D+N AL NT+TVQMRLGMFDGEPS+QP+G+LGP+DVCTP 
Sbjct: 322 GPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 381

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           HQ+LAL+AARQGIVLL+N+G +LPLS    +TVAVIGPNSDVTVTMIGNYAG+ACGYTTP
Sbjct: 382 HQQLALQAARQGIVLLQNRGRTLPLSRTL-QTVAVIGPNSDVTVTMIGNYAGVACGYTTP 440

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQGI RYA+T+H  GC DV C  +Q F AA  A+R ADATILVMGLDQSIEAE  DR GL
Sbjct: 441 LQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEFRDRKGL 500

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG QQELVS V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI+W GYPGQAGG AIA
Sbjct: 501 LLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGAAIA 560

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFGT+NPGGKLPMTWYP  Y+  +PMT M MR   S+ YPGRTYRFYKGPVV+PFGHG
Sbjct: 561 DVLFGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHG 620

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           MSYT F H++  AP  V+VPL   H S N T +  AI+V+HA C  L LGV +DVKN G 
Sbjct: 621 MSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHANCEALALGVHIDVKNTGD 680

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVFS+PP G W+  KQL+ FEKVH+  G+Q+RV I+IHVCK+LSVVDR G RR
Sbjct: 681 MDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRFGIRR 740

Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           IP+GEH+++IG  KHS+SL  A L  IKS
Sbjct: 741 IPIGEHDLYIGDLKHSISLQ-ANLEEIKS 768


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score = 1176 bits (3041), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 563/748 (75%), Positives = 654/748 (87%), Gaps = 2/748 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AREPFACDP++  TR LPFC+VSLPI +R  DL+GRL+LQEK++LL++ A  VPRLGIKG
Sbjct: 23  AREPFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKG 82

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 83  YEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMY 142

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP ++ KYAA+YVRGLQG+  DRLKVAA 
Sbjct: 143 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVAAC 202

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLD+W G+DRFHFNA+VSKQD+EDT+DVPF+ CV+EG VASVMCSYNQVNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L+ TIRGEW+LNGYIVSDCDSVGV+YD QH+T+TPEEAAA AI+AGLDLDCGP
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDCGP 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FL +HTE+A++ G L+E D+N AL+NT++VQMRLGMFDGEPS+QPYG+LGP+DVCTP HQ
Sbjct: 323 FLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAHQ 382

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LALEAARQGIVL++N+GP+LPLS  RHRT+AVIGPNSDVT TMIGNYAG+ACGYTTPLQ
Sbjct: 383 QLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPLQ 442

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           GIGRYARTIHQ GC  VAC DDQ FGAA+ A+RQADAT+LVMGLDQSIEAE  DR  +LL
Sbjct: 443 GIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDILL 502

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQQELVSKV++AS+GPT+LVLMSGGPIDV+FAKNDPRIAAIIW GYPGQAGGTAIAD+
Sbjct: 503 PGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIADV 562

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG +NPGGKLP+TWYPQ Y+   PMT MAMR   S+ YPGRTYRFY GPVV+PFGHG+S
Sbjct: 563 LFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHGLS 622

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           Y+ F H++A APT V+V L       N+TI S  AI+++HA CN   LG  +DVKN G+ 
Sbjct: 623 YSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGAIRISHANCNTQPLGFHIDVKNTGTM 682

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           DG+HTLL+FSTPP G W+P+K+L+AFEKVHV AG+Q+RV  ++HVCK+LSVVD  G  RI
Sbjct: 683 DGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVVDHFGIHRI 742

Query: 743 PLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           P+GEH+ HIG  KHS+SL  ATL  IKS
Sbjct: 743 PMGEHHFHIGDLKHSISLQ-ATLEEIKS 769


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score = 1170 bits (3028), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 559/740 (75%), Positives = 641/740 (86%), Gaps = 2/740 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AR PFACDP +  TR L FC+V++PI  RV DLIGRL+LQEK++LL++ A AVPRLGI+G
Sbjct: 28  ARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQG 87

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW+ IGRVV DEARAMY
Sbjct: 88  YEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARAMY 147

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+ KYAA YV+GLQG   G+RLKVAA
Sbjct: 148 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKVAA 207

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYDLDNWNGV+RFHFNA+VSKQD+ DT++VPF+ CV+EG VASVMCSYNQVNG 
Sbjct: 208 CCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGK 267

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP++LK TIRG+WRLNGYIVSDCDSVGV Y+ QH+T TPEEAAADAI+AGLDLDCG
Sbjct: 268 PTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLDCG 327

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+  YG+LGP+DVCTP H
Sbjct: 328 PFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAH 387

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q+LALEAARQGIVLL+N+G SLPLS  RHRTVAVIGPNSDVTVTMIGNYAG+ACGYTTPL
Sbjct: 388 QQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 447

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGIGRY RTIHQ GC DV C  +QLFGAA  A+RQADAT+LVMGLDQSIEAE +DR GLL
Sbjct: 448 QGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLL 507

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAGGTAIAD
Sbjct: 508 LPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIAD 567

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR   ++ YPGRTYRFY+GPVV+PFG G+
Sbjct: 568 VLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGL 627

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGS 681
           SYT F H +A+ PT V+VPL     + N+T+  KA++V+HA CN L+ L V VDVKN GS
Sbjct: 628 SYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVDVKNTGS 687

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVF++PP G WA  KQLV F K+H+ AG++ RV I +HVCK+LSVVDR G RR
Sbjct: 688 MDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRR 747

Query: 742 IPLGEHNIHIGGTKHSVSLH 761
           IPLGEH + IG   H VSL 
Sbjct: 748 IPLGEHKLQIGDLSHHVSLQ 767


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score = 1168 bits (3021), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 558/740 (75%), Positives = 640/740 (86%), Gaps = 2/740 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AR PFACDP +  TR L FC+V++PI  RV DLIGRL+LQEK++LL++ A AVPRLGI+G
Sbjct: 28  ARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQG 87

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW+ IGR V DEARAMY
Sbjct: 88  YEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARAMY 147

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+ KYAA YV+GLQG   G+RLKVAA
Sbjct: 148 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKVAA 207

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYDLDNWNGV+RFHFNA+VSKQD+ DT++VPF+ CV+EG VASVMCSYNQVNG 
Sbjct: 208 CCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGK 267

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP++LK TIRG+WRLNGYIVSDCDSVGV Y+ QH+T TPEEAAADAI+AGLDLDCG
Sbjct: 268 PTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLDCG 327

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+  YG+LGP+DVCTP H
Sbjct: 328 PFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAH 387

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q+LALEAARQGIVLL+N+G SLPLS  RHRTVAVIGPNSDVTVTMIGNYAG+ACGYTTPL
Sbjct: 388 QQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 447

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGIGRY RTIHQ GC DV C  +QLFGAA  A+RQADAT+LVMGLDQSIEAE +DR GLL
Sbjct: 448 QGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLL 507

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAGGTAIAD
Sbjct: 508 LPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIAD 567

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR   ++ YPGRTYRFY+GPVV+PFG G+
Sbjct: 568 VLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGL 627

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGS 681
           SYT F H +A+ PT V+VPL     + N+T+  KA++V+HA CN L+ L V VDVKN GS
Sbjct: 628 SYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVDVKNTGS 687

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVF++PP G WA  KQLV F K+H+ AG++ RV I +HVCK+LSVVDR G RR
Sbjct: 688 MDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRR 747

Query: 742 IPLGEHNIHIGGTKHSVSLH 761
           IPLGEH + IG   H VSL 
Sbjct: 748 IPLGEHKLQIGDLSHHVSLQ 767


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score = 1158 bits (2996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 550/742 (74%), Positives = 641/742 (86%), Gaps = 3/742 (0%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           + +R PFACDP++  TR L FC+ +LPI  RV DLI RL+LQEK++LL++ AAAVPRLGI
Sbjct: 36  VESRAPFACDPRNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGI 95

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
           +GYEWWSEALHGVSNVGPG KFGG FPGATSFPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 96  QGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARA 155

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
           MYNGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPVL+GKYAASYVRGLQ S G +LKVA
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLKLKVA 215

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+DVPF+ CV+EGKVASVMCSYNQVNG
Sbjct: 216 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNG 275

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            PTCADP +LK TIRG+W LNGYIVSDCDSVGV YD QH+TSTPEEAAA  I+AGLDLDC
Sbjct: 276 KPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTSTPEEAAAATIKAGLDLDC 335

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           GPFL +HTE+AV++GLL E D+N AL NT+TVQMRLGMFDGEPS+ PYG+LGP+DVCTP 
Sbjct: 336 GPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPA 395

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           HQELALEAARQGIVLL+N+G +LPLS  RH T+AVIGPNSDVTVTMIGNYAGIAC YT+P
Sbjct: 396 HQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSP 455

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQGI RYA+T+HQ GC DVAC  +Q FGAA  A+RQADAT+LVMGLDQSIEAE  DR GL
Sbjct: 456 LQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGL 515

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG QQELVS+V+ AS+GPTILVLMSGGPIDV+FAKNDPR+ AI+WAGYPGQAGG AIA
Sbjct: 516 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIA 575

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFGT+NPGGKLPMTWYPQ Y+  +PMT M MRP  +  YPGRTYRFYKG VV+PFGHG
Sbjct: 576 DVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKGNVVFPFGHG 635

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           MSYT+F H++  AP  V++P+   + ++N TIS KAI+V+H  C + +LG+ ++VKN G+
Sbjct: 636 MSYTSFSHSLTQAPKEVSLPITNLY-ALNTTISSKAIRVSHINC-QTSLGIDINVKNTGT 693

Query: 682 KDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
            DG HTLLVFS+PP+G   + +KQL+ FEKV + AG+Q +V I+IHVCK+LS VDR G R
Sbjct: 694 MDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDIHVCKHLSAVDRFGIR 753

Query: 741 RIPLGEHNIHIGGTKHSVSLHA 762
           RIP+G+H+I+IG  KHS+SL A
Sbjct: 754 RIPIGDHHIYIGDLKHSISLQA 775


>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score = 1158 bits (2995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/746 (73%), Positives = 634/746 (84%), Gaps = 11/746 (1%)

Query: 19  SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR 78
           S  + +R PFACDP++  TR+L FC+ S+PI  RV DLIGRL+LQEK++LL++ AAAVPR
Sbjct: 19  SRFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPR 78

Query: 79  LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
           LGI+GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA++FN +LWE IGRVVSDE
Sbjct: 79  LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDE 138

Query: 139 ARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
           ARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+GKYAASYVR LQG+ G RL
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQGNTGSRL 198

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           KVAA CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV+EGKVASVMCSYNQ
Sbjct: 199 KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 258

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
           VNG PTCADP+ILK TIRG+WRL+GYIVSDCDSVGV Y+TQH+T TPEEAAADAI+AGLD
Sbjct: 259 VNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 318

Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
           LDCGPFL +HTE AV+ GLL E D+N A   T+TVQMRLGMFDGEPS+QP+G+LGP+DVC
Sbjct: 319 LDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 378

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           TP HQ+LAL+AA QGIVLLKN   +LPLS +RH TVAVIGPNSDVTVTMIGNYAG+ACGY
Sbjct: 379 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 438

Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           TTPLQGI RYA+TIHQ GC  VAC  +QL GAA  A+RQADAT+LVMGLDQSIEAE +DR
Sbjct: 439 TTPLQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 498

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
           AGLLLPGRQQELVS+V+ AS+GP +LVLM GGP+DV+FAKNDPRI AI+W GYPGQAGG 
Sbjct: 499 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 558

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           AIAD+LFG +NPGGKLPMTWYPQ+Y+  LPMT+M MR  +   YPGRTYRFYKGPVV+PF
Sbjct: 559 AIADVLFGRANPGGKLPMTWYPQDYVARLPMTDMRMRAGRG--YPGRTYRFYKGPVVFPF 616

Query: 619 GHGMSYTNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDV 676
           GHGMSYT F HT++ AP   +VP+    +   N TIS  AI+V H  CN  ++LG+ VDV
Sbjct: 617 GHGMSYTTFAHTLSKAPNQFSVPIATSLYAFKNTTISSNAIRVAHTNCNDAMSLGLHVDV 676

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           KN G   G HTLLVF+ PPAG+W+P+KQL+ F+KVHV AGA Q V ++IHVCK+LSVVD+
Sbjct: 677 KNTGDMAGTHTLLVFAKPPAGNWSPNKQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVDK 736

Query: 737 SGTRRIP-------LGEHNIHIGGTK 755
            G RRIP       LG  +IH+  +K
Sbjct: 737 FGIRRIPWVNIVSILGISSIHLSSSK 762


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score = 1157 bits (2992), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/748 (73%), Positives = 631/748 (84%), Gaps = 4/748 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AR PFACDP++  TRTL FC+V +PI  RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28  ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFG  F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88  YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
           NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KY A YV+GLQG   G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQGDGAGNRLKVAA 206

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPFR CV++G VASVMCSYNQVNG 
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVMCSYNQVNGK 266

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP +LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPEEAAA AI+AGLDLDCG
Sbjct: 267 PTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAIKAGLDLDCG 326

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFLG+HTE+AV+ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG  DVC P  
Sbjct: 327 PFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLGLADVCKPSS 386

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
            ELALEAARQGIVLL+N+G SLPLS +RHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGI RY RTIHQ GC DV C  +QL GAA  A+RQADAT+LV+GLDQSIEAE  DR  LL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTDLL 506

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIAD 566

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR   ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGLGL 626

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA-IKVTHAKCNRLTLGVQVDVKNVGS 681
           SYT F H++A  PT+V+VP      S N T+ G   I+V+H  C+ L+L V +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGNHDIRVSHTNCDSLSLDVHIDIKNSGT 686

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVF+TPP G WAP+KQLV F KVH+ AG+++RV + + VCK+LSVVD  G RR
Sbjct: 687 MDGTHTLLVFATPPTGKWAPNKQLVGFHKVHIVAGSERRVRVGVQVCKHLSVVDELGIRR 746

Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           IPLG+H + IG  +H VS+  A LG IK
Sbjct: 747 IPLGQHKLEIGDLQHHVSVE-ANLGEIK 773


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score = 1155 bits (2988), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 547/772 (70%), Positives = 643/772 (83%), Gaps = 5/772 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M ST        ILLSA   G A+R PFACDP ++ T   PFC+ SL + +RV DLIGRL
Sbjct: 1   MPSTSTLILSIFILLSAIH-GRASRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRL 59

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKVKLL+S A  VPRLGIK Y+WWSEALHGVSNVGPGT+FGG+FP ATSFPQVI+TA
Sbjct: 60  TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWEAIGRVVSDEARAMYNGG  GLTYWSPNVNIFRDPRWGRGQETPGEDP+L+G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
            YA +YVRGLQG++G+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPF
Sbjct: 180 TYAVNYVRGLQGTEGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           RMCV  GKV+SVMCSYNQVNGVPTCADPN+L  T+R +W L+GYIVSDCDSVGV+Y++QH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +TSTPEEAAA AI+AGLDLDCG FL  HTE+AV+RGLL+E  IN AL NTL+VQMRLGMF
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+  +QPY HLG K VC+  +++LA++AARQGIVLL+N+  SLPLS  RHR VAV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           S+ T+TMIGNYAGIAC Y TPLQGI +Y RTIHQ+GC+ VAC  ++ FG AI+A+R ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
            +LVMGLDQSIEAE  DRAGLLLPG Q +LV KV+  +KGP ILVLMSGGPIDV+FAK+ 
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           P+I+ IIW GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y++ LPMT M++RP  S 
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS- 598

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
            YPGRTYRFYKGPVVYPFGHG+SYT F H + +APT + VP+ G RH    +   GKA++
Sbjct: 599 -YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGKAVR 657

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
           VTHAKC+RL+L ++V V+N+G++DGAHTLLV+S PP G W P KQLVAFEKVH+ A A +
Sbjct: 658 VTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALK 717

Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKHSVSLHAATLGVIKS 770
            V INIHVCK LSVVD+ G RR+P+GEH I IG   +H VSL   TLG+IKS
Sbjct: 718 EVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQPQTLGIIKS 769


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score = 1155 bits (2987), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 554/767 (72%), Positives = 651/767 (84%), Gaps = 9/767 (1%)

Query: 10  LGLILLSASSSGLA-----AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           L LI L    S L      AR PFACDP++  TR   FC+  +P+  RV DLIGRL+LQE
Sbjct: 9   LSLIALVLCVSALLFNLVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRLTLQE 68

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K++LL++ A AVPRLGI+GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN
Sbjct: 69  KIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFN 128

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
            +LW+ IG+VVSDEARAMYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS KYAA
Sbjct: 129 QSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSAKYAA 188

Query: 185 SYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           SYV+GLQG   G+RLKVAA CKH+TAYDLDNWNGVDRFHFNA+VSKQD+ DT+DVPFR C
Sbjct: 189 SYVKGLQGDGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLADTYDVPFRGC 248

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V+EGKVASVMCSYNQVNG PTCADP++LK TIRGEW+LNGYIVSDCDSVGV+YD QH+T 
Sbjct: 249 VLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYDQQHYTR 308

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           TPEEAAA+AI+AGLDLDCGPFL +HTE A++ GLL EID++ AL NTLTVQMRLGMFDGE
Sbjct: 309 TPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRLGMFDGE 368

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           PS+Q YG+LGP+DVCTP HQELALEA+RQGIVLL+N G +LPLS +RHRTVAV+GPNSDV
Sbjct: 369 PSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVVGPNSDV 428

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           T TMIGNYAG+ACGYTTPLQGIGRY +TIHQQGC +VAC  +QLFGAA  A+RQADAT+L
Sbjct: 429 TETMIGNYAGVACGYTTPLQGIGRYTKTIHQQGCTNVACTTNQLFGAAEAAARQADATVL 488

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           VMGLDQSIEAE  DR  L++PG QQELVS+V+ AS+GPT+LVLMSGGPIDV+FAKNDP+I
Sbjct: 489 VMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFAKNDPKI 548

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            AIIW GYPGQAGGTA+AD+LFGT+NP GKLPMTWYPQ+Y++ +PMT MAMR  +   YP
Sbjct: 549 GAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDYVSKVPMTNMAMRAGRG--YP 606

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYRFYKGPVV+PFG G+SYT F H++A  PT V+VPL     + N+T+   A++V+H 
Sbjct: 607 GRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVPTSVSVPLTSLSATTNSTMLSSAVRVSHT 666

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
            CN L+L + V VKN G++DG HTLLVFS+PP+G WA +KQLV F KVH+ AG+ +RV +
Sbjct: 667 NCNPLSLALHVVVKNTGARDGTHTLLVFSSPPSGKWAANKQLVGFHKVHIVAGSHKRVKV 726

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           ++HVCK+LSVVD+ G RRIP+GEH + IG  +H +S+  A +G I+S
Sbjct: 727 DVHVCKHLSVVDQFGIRRIPIGEHKLQIGDLEHHISVE-ANVGEIRS 772


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score = 1154 bits (2985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 547/772 (70%), Positives = 643/772 (83%), Gaps = 5/772 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M ST        ILLSA   G A+R PFACDP ++ T   PFC+ SL + +RV DLIGRL
Sbjct: 1   MPSTSTLILSIFILLSAIH-GRASRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRL 59

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKVKLL+S A  VPRLGIK Y+WWSEALHGVSNVGPGT+FGG+FP ATSFPQVI+TA
Sbjct: 60  TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWEAIGRVVSDEARAMYNGG  GLTYWSPNVNIFRDPRWGRGQETPGEDP+L+G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
            YA +YVRGLQG++G+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQDIEDTF+VPF
Sbjct: 180 TYAVNYVRGLQGTEGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           RMCV  GKV+SVMCSYNQVNGVPTCADPN+L  T+R +W L+GYIVSDCDSVGV+Y++QH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +TSTPEEAAA AI+AGLDLDCG FL  HTE+AV+RGLL+E  IN AL NTL+VQMRLGMF
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+  +QPY HLG K VC+  +++LA++AARQGIVLL+N+  SLPLS  RHR VAV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           S+ T+TMIGNYAGIAC Y TPLQGI +Y RTIHQ+GC+ VAC  ++ FG AI+A+R ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
            +LVMGLDQSIEAE  DRAGLLLPG Q +LV KV+  +KGP ILVLMSGGPIDV+FAK+ 
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           P+I+ IIW GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y++ LPMT M++RP  S 
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTS- 598

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIK 659
            YPGRTYRFYKGPVVYPFGHG+SYT F H + +APT + VP+ G RH    +   GKA++
Sbjct: 599 -YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGKAVR 657

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
           VTHAKC+RL+L ++V V+N+G++DGAHTLLV+S PP G W P KQLVAFEKVH+ A A +
Sbjct: 658 VTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALK 717

Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKHSVSLHAATLGVIKS 770
            V INIHVCK LSVVD+ G RR+P+GEH I IG   +H VSL   TLG+IKS
Sbjct: 718 EVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQPQTLGIIKS 769


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 550/748 (73%), Positives = 634/748 (84%), Gaps = 4/748 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AR PFACDP++  TRTL FC+V +PI  RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28  ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFG  F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88  YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
           NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+ KY A YV+GLQG   G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKVAA 206

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPF+ CV++G VASVMCSYNQVNG 
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGK 266

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP++LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPE AAA AI+AGLDLDCG
Sbjct: 267 PTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLDCG 326

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFLG+HTE+A++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG  DVC P  
Sbjct: 327 PFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSS 386

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
            ELALEAARQGIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGI RY RTIHQ GC DV C  +QL GAA  A+RQADAT+LV+GLDQSIEAE  DR GLL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLL 506

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIAD 566

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR   ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGL 626

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           SYT F H++A  PT+V+VPL     + N T +S   ++V+H  C+ L+L   +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGVRVSHTNCDSLSLDFHIDIKNTGT 686

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVF+T PAG WAP+KQLV F KVH+ AG+++RV + +HVCK+LS+VD+ G RR
Sbjct: 687 MDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRR 746

Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           IPLG+H + IG  KH VS+  A LG IK
Sbjct: 747 IPLGQHKLEIGDLKHYVSIE-ANLGEIK 773


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score = 1146 bits (2965), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 549/748 (73%), Positives = 633/748 (84%), Gaps = 4/748 (0%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AR PFACDP++  TRTL FC+V +PI  RV DLIGRL+LQEK+ LL++ A AVPRLGI+G
Sbjct: 28  ARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQG 87

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFG  F GATSFPQVITTA+SFN +LWE IGRVVSDEARAMY
Sbjct: 88  YEWWSEALHGVSNVGPGTKFG-TFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMY 146

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAA 202
           NGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL+ KY A YV+GLQG   G+RLKVAA
Sbjct: 147 NGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKVAA 206

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYDLDNWNGVDRFHFNA+VSKQD+EDT++VPF+ CV++G VASVMCSYNQVNG 
Sbjct: 207 CCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGK 266

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP++LK TIRG+W+LNGYIVSDCDSVGVYYD QH+T TPE AAA AI+AGLDLDCG
Sbjct: 267 PTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLDCG 326

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
           PFLG+HTE+A++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG  DVC P  
Sbjct: 327 PFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSS 386

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
            ELALEAARQGIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAGIACGYTTPL
Sbjct: 387 NELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPL 446

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           QGI RY RTIHQ GC DV C  +QL GAA  A+RQADAT+LV+GLDQSIEAE  DR GLL
Sbjct: 447 QGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLL 506

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQELVS+V+ AS+GPTILV+MSGGPIDV FAKNDP I AIIW GYPGQAGGTAIAD
Sbjct: 507 LPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPGQAGGTAIAD 566

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           +LFGT+NP GKLPMTWYPQ Y+ NLPMT+MAMR   ++ YPGRTYRFYKGPVV+PFG G+
Sbjct: 567 VLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGL 626

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           SYT F H++A  PT+V+VPL     + N T +S   ++V+H  C+ L+L   +D+KN G+
Sbjct: 627 SYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNHGVRVSHTNCDSLSLDFHIDIKNTGT 686

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG HTLLVF+T PAG WAP+KQLV F KVH+ AG+++RV + +HVCK+LS+VD+ G RR
Sbjct: 687 MDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRR 746

Query: 742 IPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           IPLG+H + IG  KH VS+  A LG IK
Sbjct: 747 IPLGQHKLEIGDLKHYVSIE-ANLGEIK 773


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score = 1142 bits (2954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/764 (71%), Positives = 637/764 (83%), Gaps = 5/764 (0%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M+S   FF   ++ +  S+   AA  P ACDPK   TR L FC+V+LPI  RV DLIGRL
Sbjct: 1   MSSRNNFFCFLVLFILLSAEARAA--PLACDPKSGLTRALRFCRVNLPIRARVQDLIGRL 58

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +LQEKVKLL++ AA VPRLGI GYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITTA
Sbjct: 59  NLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSFPQVITTA 118

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFNA+LWEAIG+VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ 
Sbjct: 119 ASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAS 178

Query: 181 KYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           +YAASYVRGLQG  + +RLKVAA CKH+TAYDLDNWN VDRFHFNAKVSKQD+EDT++VP
Sbjct: 179 QYAASYVRGLQGIYNKNRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVP 238

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV EG+VASVMCSYNQVNG PTCADP++L+ TIRG+WRLNGYIVSDCDSVGV YD Q
Sbjct: 239 FKGCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQ 298

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+T TPEEAAAD I+AGLDLDCGPFL +HTE+AV+RGLL+E D+N AL NT TVQMRLGM
Sbjct: 299 HYTRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGM 358

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDGE ++QP+GHLGPKDVC+P HQ+LAL+AARQGIVLL+N+G SLPLS  RHR +AVIGP
Sbjct: 359 FDGEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGP 418

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           N+D TVTMIGNYAG+ACGYT+PLQGI RYA+T+HQ GC  VAC  +Q FGAA  A+  AD
Sbjct: 419 NADATVTMIGNYAGVACGYTSPLQGIARYAKTVHQAGCIGVACTSNQQFGAATAAAAHAD 478

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           AT+LVMGLDQSIEAE  DRA +LLPG QQELVSKV++AS+GPTILVLM GGP+DV FAKN
Sbjct: 479 ATVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKN 538

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           DP+I+AI+W GYPGQAGGTAIAD+LFGT+NPGGKLP TWYPQ Y+  +PMT++AMR + S
Sbjct: 539 DPKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPS 598

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK-AI 658
             YPGRTYRFYKGPVV+PFG G+SYT F  ++A+APT V VPL  +  + N T   K A+
Sbjct: 599 NGYPGRTYRFYKGPVVFPFGFGLSYTRFTQSLAHAPTKVMVPLANQFTNSNITSFNKDAL 658

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
           KV H  C+ + L + +DVKN G  DG+HT+LVFSTPP G  +  KQL+ F++VHV AG++
Sbjct: 659 KVLHTNCDNIPLSLHIDVKNKGKVDGSHTILVFSTPPKGTKSSEKQLIGFKRVHVFAGSK 718

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT-KHSVSLH 761
           QRV +NIHVC +LS  D  G RRIP+GEH +HIG   KH +SLH
Sbjct: 719 QRVRMNIHVCNHLSRADEFGVRRIPIGEHTLHIGDDHKHKLSLH 762


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score = 1140 bits (2948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 545/741 (73%), Positives = 636/741 (85%), Gaps = 1/741 (0%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           +  R PFACD K+  TR+L FC+V++P+  RV DLIGRL+LQEK++LL++ AAAVPRLGI
Sbjct: 14  VECRAPFACDAKNGLTRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 73

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
           +GYEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LWE IGRVVSDEARA
Sbjct: 74  QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARA 133

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
           M+NGG AGLTYWSPNVN+FRDPRWGRGQETPGEDPV++GKYAASYVRGLQG+ G RLKVA
Sbjct: 134 MFNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFRLKVA 193

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+DVPF+ CV+EGKVASVMCSYNQVNG
Sbjct: 194 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNG 253

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            PTCADPN+LK TIRGEWRLNGYIVSDCDSVGV Y+ QH+T+TPEEAAA  I+AGLDLDC
Sbjct: 254 KPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDLDC 313

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           GPFL +HTE+AV+ GLL+E D+N AL NT+TVQMRLG+FDGEPS+QP+G LGP+DVCTP 
Sbjct: 314 GPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPA 373

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           HQ+LAL AA+QGIVLL+N G +LPLS   + TVAVIGP +DVTVTMIGNYAG+ACGYTTP
Sbjct: 374 HQQLALHAAQQGIVLLQNSGRTLPLSR-PNLTVAVIGPIADVTVTMIGNYAGVACGYTTP 432

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQGI RYA+TIHQ GC DVAC  +Q FG A  A+ QADAT+LVMGLDQSIEAE  DR  L
Sbjct: 433 LQGISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDL 492

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG QQEL+S+V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI+WAGYPGQAGG AIA
Sbjct: 493 LLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIA 552

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFGT+NPGGKLPMTWYPQ+Y+  +PMT M MR   S+ YPGRTYRFYKGPVV+PFGHG
Sbjct: 553 DVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHG 612

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           MSYT F H++  AP  VAVP    +   N T +  +I+V+HA C  L LGV +DVKN G 
Sbjct: 613 MSYTTFAHSLVQAPQEVAVPFTSLYALQNTTAARNSIRVSHANCEPLVLGVHIDVKNTGD 672

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            DG  TLLVFS+PP G W+ +K+L+ FEKVH+ AG+++RV I+I VCK+LSVVDR G RR
Sbjct: 673 MDGIQTLLVFSSPPEGKWSANKKLIGFEKVHIVAGSKKRVKIDIPVCKHLSVVDRFGIRR 732

Query: 742 IPLGEHNIHIGGTKHSVSLHA 762
           +P+G+H++HIG  KHS+SL A
Sbjct: 733 LPIGKHDLHIGDLKHSISLQA 753


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score = 1138 bits (2943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 550/747 (73%), Positives = 636/747 (85%), Gaps = 32/747 (4%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           AREPFACDP++  TR LPFC+VSLPI +R  DL+GRL+LQEK++LL++ A  VPRLGIKG
Sbjct: 23  AREPFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKG 82

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 83  YEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMY 142

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP ++ KYAA+YVRGLQG+  DRLKVAA 
Sbjct: 143 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVAAC 202

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLD+W G+DRFHFNA+VSKQD+EDT+DVPF+ CV+EG VASVMCSYNQVNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L+ TIRGEW+LNGYIVSDCDSVGV+YD QH+T+TPEEAAA AI+AGLDLDCGP
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDCGP 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FL +HTE+A++ G L+E D+N AL+NT++VQMRLGMFDGEPS+QPYG+LGP+DVCTP HQ
Sbjct: 323 FLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAHQ 382

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LALEAARQGIVL++N+GP+LPLS  RHRT+AVIGPNSDVT TMIGNYAG+ACGYTTPLQ
Sbjct: 383 QLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPLQ 442

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           GIGRYARTIHQ GC  VAC DDQ FGAA+ A+RQADAT+LVMGLDQSIEAE  DR  +LL
Sbjct: 443 GIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDILL 502

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQQELVSKV++AS+GPT+LVLMSGGPIDV+FAKNDPRIAAIIW GYPGQAGGTAIAD+
Sbjct: 503 PGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIADV 562

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG +NPGGKLP+TWYPQ Y+   PMT MAMR   S+ YPGRTYRFY GPVV+PFGHG+S
Sbjct: 563 LFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHGLS 622

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y+ F H++A APT                                 LG  +DVKN G+ D
Sbjct: 623 YSTFAHSLAQAPTT-------------------------------PLGFHIDVKNTGTMD 651

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           G+HTLL+FSTPP G W+P+K+L+AFEKVHV AG+Q+RV  ++HVCK+LSVVD  G  RIP
Sbjct: 652 GSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVVDHFGIHRIP 711

Query: 744 LGEHNIHIGGTKHSVSLHAATLGVIKS 770
           +GEH+ HIG  KHS+SL  ATL  IKS
Sbjct: 712 MGEHHFHIGDLKHSISLQ-ATLEEIKS 737


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score = 1126 bits (2913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 522/739 (70%), Positives = 626/739 (84%), Gaps = 2/739 (0%)

Query: 25  REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           R PFACDP++  TR   FC   +PI  RV DLI RL+L EK++L+++ A AVPRLGI+GY
Sbjct: 34  RVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGY 93

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           EWWSEALHGVSNVGPGTKFGG FPGAT FPQVI+TA+SFN +LW+ IGRVVSDEARAMYN
Sbjct: 94  EWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYN 153

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAAS 203
           GG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ KYAASYV+GLQG S G+ LKVAA 
Sbjct: 154 GGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDSAGNHLKVAAC 213

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNWNGVDRFHFNAKVSKQD+EDT+DVPF+ CV+EG+VASVMCSYNQVNG P
Sbjct: 214 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 273

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L+ TIRG+WRLNGYIVSDCDSVGV++D QH+T TPEEAAA+AI+AGLDLDCGP
Sbjct: 274 TCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKTPEEAAAEAIKAGLDLDCGP 333

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FL +HT+SA+++GL+SE D+N AL N ++VQMRLGMFDGEPS+QPYG+LGP+DVCT  HQ
Sbjct: 334 FLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTSAHQ 393

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LALEAAR+ IVLL+N+G SLPLS  R RT+ V+GPN+D TVTMIGNYAG+ACGYTTPLQ
Sbjct: 394 QLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTTPLQ 453

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           GI RY +T HQ GC+ VAC  ++LFGAA   +RQADA +LVMGLDQ++EAE  DR GLLL
Sbjct: 454 GIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVGLLL 513

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG QQELV++V+ A+KGP IL++MSGGP+D++FAKNDP+I+AI+W GYPGQAGGTAIAD+
Sbjct: 514 PGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAIADV 573

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FGT+NPGG+LPMTWYPQ Y+  +PMT M MRP+ +  YPGRTYRFYKGPVV+PFGHG+S
Sbjct: 574 IFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKGPVVFPFGHGLS 633

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQVDVKNVGSK 682
           Y+ F H++A AP  V+VP+       N+T+S KA+KV+HA C + L +   VDVKN GS 
Sbjct: 634 YSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVKVSHANCDDSLEMEFHVDVKNEGSM 693

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           DG HTLL+FS PP G W+  KQLV F K HV AG++QRV + +HVCK+LSVVD+ G RRI
Sbjct: 694 DGTHTLLIFSQPPHGKWSQIKQLVGFHKTHVLAGSKQRVKVGVHVCKHLSVVDQFGVRRI 753

Query: 743 PLGEHNIHIGGTKHSVSLH 761
           P GEH +HIG  KHS+S+ 
Sbjct: 754 PTGEHELHIGDVKHSISVQ 772


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score = 1117 bits (2888), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 521/739 (70%), Positives = 622/739 (84%), Gaps = 2/739 (0%)

Query: 25  REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           R PFACDP++  TR   FC   +PI  RV DLI RL+L EK++L+++ A AVPRLGI+GY
Sbjct: 33  RVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGY 92

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           EWWSEALHGVSNVGPGTKFGG FPGAT FPQVI+TA+SFN +LW+ IGRVVSDEARAMYN
Sbjct: 93  EWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYN 152

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAAS 203
           GG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ KYAASYV+GLQG   G+RLKVAA 
Sbjct: 153 GGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDGAGNRLKVAAC 212

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNWNGVDRFHFNAKVSKQD+EDT+DVPF+ CV+EG+VASVMCSYNQVNG P
Sbjct: 213 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 272

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L+ TIRG+W LNGYIVSDCDSVGV++D QH+T TPEEAAA+AI+AGLDLDCGP
Sbjct: 273 TCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTRTPEEAAAEAIKAGLDLDCGP 332

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FL +HT+SA+++GL+SE D+N AL N +TVQMRLGMFDGEPS+QP+G+LGP+DVCTP HQ
Sbjct: 333 FLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTPAHQ 392

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LALEAAR+ IVLL+N+G SLPLS  R R V VIGPN+D TVTMIGNYAG+ACGYTTPLQ
Sbjct: 393 QLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTTPLQ 452

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           GI RY +T HQ GC+ VAC  ++LFGAA   +RQ DAT+LVMGLDQ+IEAE  DR GLLL
Sbjct: 453 GIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVGLLL 512

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG QQELV++V+ A+KGP ILV+MSGGP+DV+FAKN+P+I+AI+W GYPGQAGGTAIAD+
Sbjct: 513 PGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAIADV 572

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG +NPGG+LPMTWYPQ Y+  +PMT M MRP+ +  YPGRTYRFYKGPVV+PFGHG+S
Sbjct: 573 IFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKGPVVFPFGHGLS 632

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQVDVKNVGSK 682
           Y+ F  ++A AP  V+V +       N+T+S KA+KV+HA C + L     VDVKN GS 
Sbjct: 633 YSRFSQSLALAPKQVSVQILSLQALTNSTLSSKAVKVSHANCDDSLETEFHVDVKNEGSM 692

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           DG HTLL+FS PP G W+  KQLV F K HVPAG++QR+ +N+H CK+LSVVD+ G RRI
Sbjct: 693 DGTHTLLIFSKPPPGKWSQIKQLVTFHKTHVPAGSKQRLKVNVHSCKHLSVVDQFGVRRI 752

Query: 743 PLGEHNIHIGGTKHSVSLH 761
           P GEH +HIG  KHS+++ 
Sbjct: 753 PTGEHELHIGDLKHSINVQ 771


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score = 1097 bits (2838), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 527/766 (68%), Positives = 618/766 (80%), Gaps = 13/766 (1%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I F  L  IL++ S      R PFACDPK+  TR+  FC   +PI  RV DLIGRL+L E
Sbjct: 8   ITFMLLISILVTLSE----GRVPFACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPE 63

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K++L+++ A AVPRLGI+GYEWWSEALHGVSNVGPGTKFGG F  ATSFPQVITTA+SFN
Sbjct: 64  KIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFN 123

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
            +LW  IGR+VSDEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++GKYAA
Sbjct: 124 QSLWLEIGRIVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAA 183

Query: 185 SYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           SYV+GLQG+  G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKVSKQD+ DT+DVPF+ C
Sbjct: 184 SYVQGLQGNGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKAC 243

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V +GKVASVMCSYNQVNG PTCADP +L+ TIRGEW LNGYIVSDCDSVGV YD QH+T 
Sbjct: 244 VRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTR 303

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           TPE+AAA AI+AGLDLDCGPFL LHT+ A+++GL+SE D+N AL N +TVQMRLGMFDG+
Sbjct: 304 TPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGD 363

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
             +QPYG+LG +DVC P H ++ALEAARQGIVLL+N+G +LPLS  R+RTV VIGPNSDV
Sbjct: 364 --AQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDV 421

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           TVTMIGNYAGIACGYTTPLQGI RY +TIHQ GCKDV C  +QLFG +   +RQADAT+L
Sbjct: 422 TVTMIGNYAGIACGYTTPLQGIARYVKTIHQAGCKDVGCGGNQLFGLSEQVARQADATVL 481

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           VMGLDQSIEAE  DR GLLLPG QQELVS+V+ A++GP ILVLMSGGPIDV FAKNDP+I
Sbjct: 482 VMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKI 541

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
           +AI+W GYPGQ+GGTAIAD++FG +NP G+LP TWYPQ+Y+  +PMT M MR + +  YP
Sbjct: 542 SAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPATGYP 601

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYRFYKGPVV+PFGHG+SY+ F H++A AP  V+V             S KA+KV+HA
Sbjct: 602 GRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQAFTNSSNKAMKVSHA 661

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
            C+ L +G  VDVKN GS DGAHTLLV+S  P G     KQLV F K +VPAG++ RV +
Sbjct: 662 NCDELEVGFHVDVKNEGSMDGAHTLLVYSKAPNG----VKQLVNFHKTYVPAGSKTRVKV 717

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
            +HVC +LS VD  G RRIP+GEH + IG  KHS+ +   TL  IK
Sbjct: 718 GVHVCNHLSAVDEFGVRRIPMGEHELQIGDLKHSILVQ--TLDQIK 761


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score = 1088 bits (2814), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 519/735 (70%), Positives = 615/735 (83%), Gaps = 7/735 (0%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           CD ++  TR + FCQ SL I +RV DLIGRL+L EK++LL++ A AVPRLGI+GYEWWSE
Sbjct: 34  CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93

Query: 90  ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
           ALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW  IGRVVSDEARAMYNGGTAG
Sbjct: 94  ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAG 153

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFT 208
           LTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA+YV+GLQG+DG  RLKVAA CKH+T
Sbjct: 154 LTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKRLKVAACCKHYT 213

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AYDLDNWNGVDR+HFNAKVSKQD+EDT++VPF+ CV+EGKVASVMCSYNQVNG PTCADP
Sbjct: 214 AYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 273

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
           ++LK TIRG W L+GYIVSDCDSVGV YD+QHFT TPEEAAA  I+AGLDLDCGPFL +H
Sbjct: 274 DLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVH 333

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T +AV RGLL E+D+NNAL N L+VQMRLGMFDGEP++QPYG+LGPKDVCTP H+ LALE
Sbjct: 334 TATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALE 393

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
           AARQGIVLL+N+  +LPLS  RHRTVAVIGPNSD TVTMIGNYAG+AC YTTP+QGI +Y
Sbjct: 394 AARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKY 453

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
            +TIH +GC +VAC  DQL G A  A+R ADA ++V+GLDQSIEAE+ DR G+LLPG+Q+
Sbjct: 454 VKTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQE 513

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           ELV ++ +A KGPT++VLMSGGPIDV+FAKND +I+ I+W GYPGQAGG AIAD+LFG +
Sbjct: 514 ELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGAT 573

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NPGGKLPMTWYPQ Y+  +PMT M +RP  S  YPGRTYRFYKGPVV+PFG G+SY+ F 
Sbjct: 574 NPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFS 633

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHT 687
            + A APT +++PL     + +AT     +KV+H  C  ++ L + +DVKN G+ DG+HT
Sbjct: 634 QSFAEAPTKISLPLSSLSPNSSAT-----VKVSHTDCASVSDLPIMIDVKNTGTVDGSHT 688

Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
           +LVFST P   W+P K L+ FEKVH+ AG+Q+RV I IHVC +LS VD  GTRRIP+GEH
Sbjct: 689 ILVFSTVPNQTWSPEKHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEH 748

Query: 748 NIHIGGTKHSVSLHA 762
            +HIG   HS+SL A
Sbjct: 749 KLHIGDLTHSISLQA 763


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score = 1072 bits (2773), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 520/753 (69%), Positives = 609/753 (80%), Gaps = 18/753 (2%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           +R  FACD     TR LPFC+VSLPI  RV DLIGRL+LQEK++LL++ AAAVPRLGIKG
Sbjct: 27  SRPAFACD---GATRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKG 83

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSN  PG KFGG FPGATSFPQVI+TA+SFNA+LWE IGRVVSDEARAMY
Sbjct: 84  YEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMY 143

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+GKYAASYVRGLQG+ G++LKVAA 
Sbjct: 144 NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQLKVAAC 203

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNWN VDR+ FNA+VSKQD+ DT+DVPF+ CV+EGK   V C++     + 
Sbjct: 204 CKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKY-QVYCAHT----IK 258

Query: 264 TCADPNILK----RTIRGEWR--LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
             A+P +L     +     W   L+ + +  C    + + T H  STPE+AAA  I+AGL
Sbjct: 259 LMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGF-ICHSTLH--STPEDAAAATIKAGL 315

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           DL+CGPFL +HTE AV++G L E D+N AL+NTL+VQMRLGMFDGEPSSQPYG+LGP+DV
Sbjct: 316 DLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPRDV 375

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           CTP HQ+LALEAARQGIVLL+N+G SLPLS   HRTVAVIGPNSDVTVTM+GNYAG+ACG
Sbjct: 376 CTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVACG 435

Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
           +TTPLQGI RY RTIHQ GC  VAC+++QLFG A  A+RQADAT+LVMGLDQSIE E  D
Sbjct: 436 FTTPLQGIERYVRTIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEFKD 495

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R GLLLPG QQELVS+V+MAS+GP +LVLMSGGPIDV+FAKNDPRI AI+W GYPGQAGG
Sbjct: 496 RVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGG 555

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           TAIAD+LFG +NPGG+LPMTWYPQ+Y+   PMT MAMR + S  YPGRTYRFYKGPVV+P
Sbjct: 556 TAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSGYPGRTYRFYKGPVVFP 615

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FGHGMSYT F H +A+APT V+VPL   +G  N+T     I+VTH  C+ L LG+ +DVK
Sbjct: 616 FGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQNSTTFNNGIRVTHTNCDTLILGIHIDVK 675

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           N G  DG HT+LVFSTPP G W  +KQL+ F+KVHV A  +QRV I++HVC  LSVVD+ 
Sbjct: 676 NTGDMDGTHTVLVFSTPPVGKWGANKQLIGFKKVHVVARGRQRVKIHVHVCNQLSVVDQF 735

Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           G RRIP+GEH++HIG  KHS+SL   TL  IK+
Sbjct: 736 GIRRIPIGEHSLHIGDIKHSISLQ-VTLDNIKT 767


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score = 1072 bits (2771), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 525/767 (68%), Positives = 618/767 (80%), Gaps = 17/767 (2%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           + + +    + L+ L  SS  L  R  FACDP +  TRTL FC+ ++PI  RV DL+GRL
Sbjct: 10  IGNKVVVILVFLLCLVHSSESL--RPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRL 67

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +LQEK++ L++ AAAVPRLGI GYEWWSEALHG+S+VGPG KFGG FPGATSFPQVITTA
Sbjct: 68  TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTA 127

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFN +LWE IGRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++ 
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187

Query: 181 KYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           KYAASYVRGLQG+  G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV++QD+EDT++VP
Sbjct: 188 KYAASYVRGLQGTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVP 247

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV EGKVASVMCSYNQVNG PTCAD N+LK TIRG+WRLNGYIVSDCDSV V+++ Q
Sbjct: 248 FKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFNQQ 307

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+TSTPEEAAA +I+AGLDLDCGPFL + TE AV++GLL+E DIN AL NTLTVQMRLGM
Sbjct: 308 HYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGM 367

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDG     PY +LGP+DVCTP H+ LALEAA QGIVLLKN   SLPLS  RHRTVAVIGP
Sbjct: 368 FDGNLG--PYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVIGP 425

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           NSDVT TMIGNYAG AC YT+PLQGI RYART+HQ GC  VAC  +Q FGAA  A+R+AD
Sbjct: 426 NSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREAD 485

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           AT+LVMGLDQSIEAE  DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKN
Sbjct: 486 ATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKN 545

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           DPR+AAIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+  +PMT MAMR S +
Sbjct: 546 DPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRASGN 605

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISG 655
             YPGRTYRFYKGPVV+PFG G+SYT F H++A +P    +V    L+  +  +N+  S 
Sbjct: 606 --YPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLSNLNSANTILNS--SS 661

Query: 656 KAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKV 711
            +IKV+H  CN    + + V+V N G  DG HT+ VF+ PP         +KQL+AFEKV
Sbjct: 662 HSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKV 721

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           HV AGA+Q V +++  CK+L VVD  G RRIP+GEH +HIG  KH++
Sbjct: 722 HVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTI 768


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score = 1069 bits (2765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/767 (68%), Positives = 620/767 (80%), Gaps = 17/767 (2%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           + + +    + L+ L  SS  L  R  FACDP +  TRTL FC+V++PI  RV DLIGRL
Sbjct: 10  IGNKVVVILVFLLCLVHSSESL--RPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRL 67

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +LQEK++ L++ AAAVPRLGI GYEWWSEALHGVS+VGPG+KFGG FPGATSFPQVITTA
Sbjct: 68  TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTA 127

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           +SFN +LWE IGRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++ 
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187

Query: 181 KYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           KYAASYVRGLQG+  G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV++QD+EDT++VP
Sbjct: 188 KYAASYVRGLQGTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVP 247

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV EGKVASVMCSYNQVNG PTCAD N+LK TIRG+WRLNGYIVSDCDSV V+++ Q
Sbjct: 248 FKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFNQQ 307

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+TSTPEEAAA +I+AGLDLDCGPFL + TE AV++GLL+E DIN AL NTLTVQMRLGM
Sbjct: 308 HYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGM 367

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDG     PY +LGP+DVC+  H+ LALEAA QGIVLLKN G SLPLS  RHRTVAVIGP
Sbjct: 368 FDGNLG--PYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVIGP 425

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           NSDVT TMIGNYAG AC YTTPLQGI RYART+HQ GC  VAC  +Q FGAA  A+R+AD
Sbjct: 426 NSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREAD 485

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           AT+LVMGLDQSIEAE  DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKN
Sbjct: 486 ATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKN 545

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           DPR+AAIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+  +PMT MAMR S +
Sbjct: 546 DPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRASGN 605

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISG 655
             YPGRTYRFYKGPVV+PFG G+SYT F +++A +P    +V    L+  +  +N+T   
Sbjct: 606 --YPGRTYRFYKGPVVFPFGFGLSYTTFTNSLAKSPLAQLSVSLSNLNSANAILNST--S 661

Query: 656 KAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKV 711
            +IKV+H  CN    + + V+V N G  DG HT+ VF+ PP         +KQL+AFEKV
Sbjct: 662 HSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPKNGIKGLGVNKQLIAFEKV 721

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           HV AGA+Q V +++  CK+L VVD  G RRIP+G+H +HIG  KH++
Sbjct: 722 HVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKHKLHIGDLKHTI 768


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score = 1052 bits (2720), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 504/639 (78%), Positives = 560/639 (87%), Gaps = 6/639 (0%)

Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           +VVSDEARAM+NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ GKYAASYVRGLQG
Sbjct: 2   QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61

Query: 193 SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
           SDG+RLKVAA CKHFTAYDLDNWNGVDRFHFNA+VSKQD+EDTFDVPFRMCV EGKVASV
Sbjct: 62  SDGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVKEGKVASV 121

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MCSYNQVNG+PTCADPN+LK+T+RG       ++     +G     Q     P       
Sbjct: 122 MCSYNQVNGIPTCADPNLLKKTVRGTLFQTVTLLEFI--MGSNTILQPRRKQPRMLLK-- 177

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
            +A LDLDCGPFLG HTE AV++GLL+E +INNAL+NTLTVQMRLGMFDGEPSSQ YG+L
Sbjct: 178 -QASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEPSSQLYGNL 236

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           GP DVCTP HQELALEAARQGIVLLKN GPSLPLS  RH +VA++GPNS+VT TMIGNYA
Sbjct: 237 GPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVTATMIGNYA 296

Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
           G+ACGYTTPLQGI RYA+TIH+QGC DVAC  DQ F AAIDA+RQADAT+LVMGLDQSIE
Sbjct: 297 GLACGYTTPLQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATVLVMGLDQSIE 356

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
           AE  DR GLLLPGRQQELVSKV+ ASKGPTILVLMSGGPIDV+FA+NDP+I +I+WAGYP
Sbjct: 357 AEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKIGSIVWAGYP 416

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           GQAGG AI+D+LFG +NPGGKLPMTWYPQ+YITNLPMT MAMR S+SK YPGRTYRFYKG
Sbjct: 417 GQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKGYPGRTYRFYKG 476

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLG 671
            VVYPFGHG+SYTNFVHT+A+APT+V+VPLDG RHGS NATISGKAI+VTHA+CNRL+LG
Sbjct: 477 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRHGSGNATISGKAIRVTHARCNRLSLG 536

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           +QVDVKN GS DG HTLLV+S PPA HWAPHKQLVAFEKVHV AG QQRVGINIHVCK L
Sbjct: 537 MQVDVKNTGSMDGTHTLLVYSRPPARHWAPHKQLVAFEKVHVAAGTQQRVGINIHVCKSL 596

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           SVVD SG RRIP+GEH++HIG  KHSVSL A+ LGV++S
Sbjct: 597 SVVDGSGIRRIPMGEHSLHIGDVKHSVSLQASILGVVES 635


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score = 1051 bits (2719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 507/764 (66%), Positives = 600/764 (78%), Gaps = 14/764 (1%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           +F L + +L+ + SG  +R+PFACDP +A  R L FC+ SLPI  RV DLI RL+LQEK+
Sbjct: 5   YFILIIFVLAFAYSG-ESRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKI 63

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
           +LL++ AA V RLGI GYEWWSEALHGVSN G G KFGG FPGATSFPQVITTA+SFNA+
Sbjct: 64  RLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNAS 123

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LWE IGRVVS+E RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP L  +Y  SY
Sbjct: 124 LWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSY 183

Query: 187 VRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           V+GLQG       RLKVAA CKH+TAYDLD+WNG DR+HFNAKVS QD+EDT++ PF+ C
Sbjct: 184 VKGLQGGGGRGNTRLKVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKAC 243

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V+EG VASVMCSYNQ+NG P+CADP +L+ TIR +W LNGYIVSDCDSVGV ++ QH+T 
Sbjct: 244 VVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTR 303

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
            PE+AAA  I+AGLDLDCGPFL +HT+ AV  G +S+++INNAL NT+TVQMRLGMFDG 
Sbjct: 304 YPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDG- 362

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
             + PY +LGPKDVC+P HQ+LAL+AAR+GIVLLKN G +LPLS  RHRTVAVIGPNSD 
Sbjct: 363 -PNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDA 421

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           T+ MIGNYAG+ CGY +PLQGI RYARTIHQQGC  VAC  +Q FG A  A+R ADAT+L
Sbjct: 422 TLAMIGNYAGVPCGYISPLQGISRYARTIHQQGCMGVACPGNQNFGLAEVAARHADATVL 481

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           VMGLDQSIEAEA DR  LLLPG QQ+L+S+V+MASKGP +LVLMSGGPIDV FAKNDPR+
Sbjct: 482 VMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRV 541

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
           ++I+W GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ+Y+  + M  M MR + SK YP
Sbjct: 542 SSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQDYVAKVSMANMDMRANPSKGYP 601

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG--SINATISGKA---I 658
           GRTYRFYKGP V+PFG G+SYT F   + +AP  V+VP    H   S N T   KA   +
Sbjct: 602 GRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITVSVPTLHSHDLVSNNTTTLMKAKATV 661

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
           +  H  C  L + + +DVKN G  DG H +L+FSTPP       KQLVAFEKVHV AGA+
Sbjct: 662 RTIHTNCESLDIDMHIDVKNTGDMDGTHAVLIFSTPPDP--TETKQLVAFEKVHVVAGAK 719

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG-TKHSVSLH 761
           QRV IN++ CK+LSV D  G RRI +GEH IH+G   KHS++  
Sbjct: 720 QRVKINMNACKHLSVADEYGVRRIYMGEHKIHVGDHLKHSITFQ 763


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score = 1004 bits (2597), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/756 (64%), Positives = 588/756 (77%), Gaps = 18/756 (2%)

Query: 25  REPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           R PFAC P   +TR LPFC+ +LP   R  DL+ RL+  EKV+LL++ AA VPRLG++GY
Sbjct: 24  RPPFACAPGGPSTR-LPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGY 82

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           EWWSEALHGVS+ GPG +FGG FPGAT+FPQVI TA+SFNA+LWE IGR VSDE RA+YN
Sbjct: 83  EWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYN 142

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           G  AGLT+WSPNVNIFRDPRWGRGQETPGEDP +SG+YAA+YVRGLQ     RLK AA C
Sbjct: 143 GRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAGRLKTAACC 202

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHFTAYDLD W+G DRFHFNA V+ QD+EDTF+ PFR CV+EG+ A+VMCSYNQVNGVPT
Sbjct: 203 KHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQVNGVPT 262

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           CAD   L+ TIRG+W+L+GYIVSDCDSV V+Y  QH+T T E+A A  +RAGLDLDCGPF
Sbjct: 263 CADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAGLDLDCGPF 322

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
           L  +TE+AV +G + E DI+ A+VNT+TVQMRLGMFDG+ ++QP+GHLGP+ VCTP H+E
Sbjct: 323 LAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCTPAHRE 382

Query: 385 LALEAARQGIVLLKNQGPS---LPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           LALEAA Q IVLLKN G +   LPL SH R  TVAV+GP+S+ TV MIGNYAG  C YTT
Sbjct: 383 LALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYAGKPCAYTT 442

Query: 441 PLQGIGRYAR-TIHQQGCKDVAC-ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           PLQG+GRYAR T+HQ GC DVAC    Q   AA+DA+R ADAT++V+GLDQS+EAE LDR
Sbjct: 443 PLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEAEGLDR 502

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             LLLPGRQ ELVS V+ ASKGP ILVLMSGGP+D+AFA+ND  +AAI+WAGYPGQAGG 
Sbjct: 503 TTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPGQAGGQ 562

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           AIAD++FG  NPGGKLP+TWYP++Y+   PMT MAMR   ++ YPGRTYRFY GP ++PF
Sbjct: 563 AIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGYPGRTYRFYAGPTIHPF 622

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA-----IKVTHAKCNRLTLGVQ 673
           GHG+SYT F HT+A+AP  + V     H +  A  +  A     ++V HA+C  L++ V 
Sbjct: 623 GHGLSYTKFAHTLAHAPAHLTVRRAAGHRTTAAINTTTASHLNDVRVAHAQCEGLSVSVH 682

Query: 674 VDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           VDVKNVGS+DGAHT+ V+++PP  A H AP +QLVAFEKVHV AGA  RV + + VC  L
Sbjct: 683 VDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHVAAGAVARVKMGVDVCGSL 742

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           S+ D+ G RRIP+GEH + IG        HA  LGV
Sbjct: 743 SIADQEGVRRIPIGEHRLMIG----EELTHAVMLGV 774


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  995 bits (2573), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 475/634 (74%), Positives = 548/634 (86%), Gaps = 2/634 (0%)

Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
           DEARAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+G YAASYVRGLQG+DG+
Sbjct: 1   DEARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDGE 60

Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
           RLKVAA CKH+TAYDLDNW GVDRFHFNA+VSKQDI+DTF++PFR CV+ GKVASVMCSY
Sbjct: 61  RLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVASVMCSY 120

Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
           NQVNG+PTCA+P +LK TIRG WRLNGYIVSDCDSVGV+++ QH+TS PEEA A AI+AG
Sbjct: 121 NQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVAAAIKAG 180

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           LDLDCGPFL +HTE+AV+RGL+S+++IN AL NT+T QMRLGMFDGEPS+  YG+LGP+D
Sbjct: 181 LDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYGNLGPRD 240

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           VCTP HQ+LALEAARQGIVLL+N+G SLPLS  RHRTVAVIGPNSDVTVTMIGNYAG+AC
Sbjct: 241 VCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVAC 300

Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
           GYTTPLQGIGRY RTIHQ GC DV C  +QLFGAA  A+RQADAT+LVMGLDQSIEAE +
Sbjct: 301 GYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFV 360

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DRAG LLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW GYPGQAG
Sbjct: 361 DRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAG 420

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           GTAIAD+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR   ++ YPGRTYRFY+GPVV+
Sbjct: 421 GTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVF 480

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LGVQVD 675
           PFG G+SYT F H +A+ PT+V+VPL     + N+T+  KA++V+HA CN L+ L V VD
Sbjct: 481 PFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDVHVD 540

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           VKN GS DG HTLLVF++PP G WA  KQLV F K+H+ AG++ RV I +HVCK+LSVVD
Sbjct: 541 VKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVD 600

Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           R G RRIPLGEH + IG   H VSL   + G IK
Sbjct: 601 RFGIRRIPLGEHKLQIGDLSHHVSLQTNS-GEIK 633


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  979 bits (2532), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 485/769 (63%), Positives = 581/769 (75%), Gaps = 30/769 (3%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FAC P      +LPFC+ SLP+  R  DL+ RL+  EKV+LL++ AA VPRLG+ GYEWW
Sbjct: 27  FACAPGGPAA-SLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWW 85

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS+ GPG +FGG FPGAT+FPQVI TA+S NATLWE +GR VSDEARAMYNGG 
Sbjct: 86  SEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGR 145

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSDGDRLKV 200
           AGLT+WSPNVNIFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ       G   +RLK+
Sbjct: 146 AGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHRNRLKL 205

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           AA CKHFTAYDLD W G DRFHFNA V+ QD+EDTF+VPFR CV +G+ ASVMCSYNQVN
Sbjct: 206 AACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQVN 265

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVPTCAD   L+ TIRG W L+GYIVSDCDSV V++  QH+T TPE+AAA  +RAGLDLD
Sbjct: 266 GVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLRAGLDLD 325

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CGPFL L+  SAV  G +++ D++ AL+NT+TVQMRLGMFDG+P++ P+G LGP DVCT 
Sbjct: 326 CGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVCTR 385

Query: 381 DHQELALEAARQGIVLLKNQGPS------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
           +HQ+LAL+AARQG+VLLKN+  +      LPL    HR VAV+GP++D TV MIGNYAG 
Sbjct: 386 EHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGK 445

Query: 435 ACGYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            C YTTPLQG+  Y AR  HQ GC DVAC  +Q   AA++A+RQADAT++V GLDQ +EA
Sbjct: 446 PCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRVEA 505

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E LDR  LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+AFA+NDPRI  I+W GYPG
Sbjct: 506 EGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPG 565

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           QAGG AIAD++FG  NPG KLP+TWY Q+Y+  +PMT MAMR + ++ YPGRTYRFY GP
Sbjct: 566 QAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTYRFYTGP 625

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG---------KAIKVTHAK 664
            +YPFGHG+SYT F HT+A+APT + V L G   S  +  S          +A++V HA+
Sbjct: 626 TIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAVRVAHAR 685

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA------GHWAPHKQLVAFEKVHVPAGAQ 718
           C  LT+ V VDV NVG +DGAH +LV+   P+      G  AP +QLVAFEKVHVPAG  
Sbjct: 686 CEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVHVPAGGV 745

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
            RV + I VC  LSV DR+G RR+P+GEH + IG   HSVSL    LGV
Sbjct: 746 ARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVEQLGV 794


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  944 bits (2440), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 470/756 (62%), Positives = 560/756 (74%), Gaps = 38/756 (5%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           TLPFC+ SLP+  R  DL+ RL+  EKV+LL++ AA V RLG+ GYEWWSEALHGVS+ G
Sbjct: 44  TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDTG 103

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           PG KFGG FPGAT+FPQVI  A++ NATLWE IGR VSDEARAMYNGG AGLT+WSPNVN
Sbjct: 104 PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFWSPNVN 163

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
           IFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ   D +RLK+AA CKHFTAYDLD+W G
Sbjct: 164 IFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPYDHNRLKLAACCKHFTAYDLDSWGG 223

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            DRFHFNA VS QD+EDTF+VPFR CV  G+ ASVMCSYNQVNGVPTCAD   L+ TIR 
Sbjct: 224 TDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPTCADQGFLRGTIRK 283

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
            W L+GYIVSDCDSV V++  QH+T T E+A A  +RAGLDLDCGPFL L+TE+AV R  
Sbjct: 284 AWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLDCGPFLALYTENAVARKK 343

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +S+ D++ AL+NT+TVQMRLGMFDG+P+S P+GHLG  DVCT  HQ+LAL+AARQ +VLL
Sbjct: 344 VSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQDLALDAARQSVVLL 403

Query: 398 KNQGPS-------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-A 449
           KNQ          LPL    HR VAV+GP++D TV MIGNYAG  C YTTPLQG+  Y A
Sbjct: 404 KNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAA 463

Query: 450 RTIHQQGCKDVAC-ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
           R +HQ GC DVAC   +Q   AA+DA+R+        GL +S          LLLPGRQ 
Sbjct: 464 RVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS----------LLLPGRQA 513

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           EL+S V+ A+KGP ILVLMSGGPID+AFA+NDPRI  I+W GYPGQAGG AIAD++FG  
Sbjct: 514 ELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGQH 573

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NPGGKLP+TWYPQ+Y+  +PMT MAMR + ++ YPGRTYRFY GP ++ FGHG+SYT F 
Sbjct: 574 NPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTGPTIHAFGHGLSYTQFT 633

Query: 629 HTVANAPTVVAVPLDGRHGS----------INATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           HT+A+AP  + V L     S          +NAT   +A++V HA+C  LT+ V VDV+N
Sbjct: 634 HTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVRVAHARCEGLTVPVHVDVRN 693

Query: 679 VGSKDGAHTLLVF--------STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           VG +DGAH +LV+        S+ PAG  AP +QLVAFEKVHVPAG   RV + I VC  
Sbjct: 694 VGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKVHVPAGGVARVEMGIDVCDR 753

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
           LSV DR G RRIP+GEH + IG   HSV+L    LG
Sbjct: 754 LSVADRDGVRRIPVGEHRLMIGELTHSVTLGVEQLG 789


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  920 bits (2377), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 485/783 (61%), Positives = 580/783 (74%), Gaps = 54/783 (6%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           TLPFC+ SLP   R  DL+ RL+  EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV------------------------ 134
           PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+V                        
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159

Query: 135 --------VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
                   VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+Y
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAY 219

Query: 187 VRGLQGSD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           VRGLQ       RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV
Sbjct: 220 VRGLQQQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCV 279

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           ++G+ ASVMCSYNQVNGVPTCAD   L+ TIR  W L GYIVSDCDSV V+Y  QH+T T
Sbjct: 280 VDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRT 339

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
            E+A A  +RAGLDLDCGPFL  +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P
Sbjct: 340 REDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDP 399

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDV 423
           ++QP+GHLGP+ VCT  HQELA+EAARQGIVLLKN G +LPLS     R VAV+GP+++ 
Sbjct: 400 AAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEA 459

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADAT 481
           TV MIGNYAG  C YTTPLQG+ RY AR  HQ GC DVACA   Q   AA+DA+R+ADAT
Sbjct: 460 TVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADAT 519

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           I+V GLDQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP
Sbjct: 520 IVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDP 579

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +IA I+WAGYPGQAGG AIAD++FG  NPGGKLP+TWYPQ+Y+  +PMT MAMR + +K 
Sbjct: 580 KIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKG 639

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL------DGRHGSINATI-- 653
           YPGRTYRFY GP ++PFGHG+SYT+F H++A+AP+ + V L           S+NAT   
Sbjct: 640 YPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATARL 699

Query: 654 -SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQ 704
               A++V HA+C  L + V VDV+NVG +DGAHT+LV++  PA   A          +Q
Sbjct: 700 SRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQ 759

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
           LVAFEKVHV AG   RV + I VC  LSV DR+G RRIP+GEH + IG   H+V++    
Sbjct: 760 LVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQ 819

Query: 765 LGV 767
           LGV
Sbjct: 820 LGV 822


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  915 bits (2365), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 447/737 (60%), Positives = 545/737 (73%), Gaps = 7/737 (0%)

Query: 28  FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           FACD + + +     FC  SL    RV DL+ RL+LQEK+  LI+ A  V RLGI  YEW
Sbjct: 47  FACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEW 106

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVS VGPGTKF    PGATSFPQVI TA+SFNA+L+EAIG+VVS EARAMYN G
Sbjct: 107 WSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVG 166

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
            AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYAA YVRGLQ  D    DRLKVAA 
Sbjct: 167 LAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRLKVAAC 226

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNW G DR+HFNA VS QD+EDTF  PF+ CV++G VASVMCSYNQVNG P
Sbjct: 227 CKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNGKP 286

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L   IRG+W+LNGYIVSDCDSV V Y++QH+T +PEEAAA  I AGLDLDCG 
Sbjct: 287 TCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLDLDCGD 346

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FLG HTE+AV  GL++E  I+ A+ N L   MRLG FDG PS Q YG LGPKDVCTP+HQ
Sbjct: 347 FLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTPEHQ 406

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELA EAARQGIVLLKN   SLPLS    +++AVIGPN++VT TMIGNY G  C YTTPLQ
Sbjct: 407 ELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 466

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           G+     T  Q GC +VAC   QL  A   A+  ADAT+LV+G DQSIEAE+ DR  L L
Sbjct: 467 GLSAVVSTSFQPGCANVACTSAQLDEAKKIAA-SADATVLVVGSDQSIEAESRDRVDLNL 525

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+Q  L+++V+ ASKGP ILV+M+GG +D+ FAK D +I +I+W G+PG+AGG AIAD+
Sbjct: 526 PGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAAIADV 585

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG+ NP G+LPMTWYPQ Y+  +PMT+M MRPS S  +PGRTYRFY G  +Y FG G+S
Sbjct: 586 IFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFGDGLS 645

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y++F H +  AP +V++PL+   G I  +    +++V    C  L   V + VKNVG + 
Sbjct: 646 YSDFKHHLVKAPKLVSIPLE--EGHICHSSKCHSLEVVQESCQNLGFDVHLRVKNVGQRS 703

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           G+HT+ ++STPP+ H +P K L+ FEKV +  G +  V   + VCK LSV D  G+R++ 
Sbjct: 704 GSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLSVADEVGSRKVA 763

Query: 744 LGEHNIHIGGTKHSVSL 760
           LG H +H+G  KHS+++
Sbjct: 764 LGLHILHVGTLKHSLNV 780


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  915 bits (2364), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 447/755 (59%), Positives = 550/755 (72%), Gaps = 11/755 (1%)

Query: 14  LLSASSSGLAAREP-FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           +LS+ +  LA   P FACD K + +  +  FC VSL I  RV DL+ RL+LQEK+  L++
Sbjct: 26  VLSSRNQVLAQSSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVN 85

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            A +V RLGI  YEWWSEALHGVS VGPGT F    PGATSFPQVI TA+SFNA+L+EAI
Sbjct: 86  SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAI 145

Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
           G+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY + YVRGLQ
Sbjct: 146 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ 205

Query: 192 GSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
            +D    +RLKVAA CKH+TAYDLDNW G DR+HFNA V+KQD++DTF  PF+ CV++G 
Sbjct: 206 QTDNGDSERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGN 265

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
           VASVMCSYNQVNG PTCADP++L   IRGEW+LNGYIVSDCDSV V Y++QH+T TPEEA
Sbjct: 266 VASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEA 325

Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
           AA  I AGLDL+CG FLG HTE+AV  GLL+   ++ A+ N     MRLG FDG+PS Q 
Sbjct: 326 AAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQL 385

Query: 369 YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
           YG LGPKDVCT  +QELA EAARQGIVLLKN   SLPLS    +T+AVIGPN++VT TMI
Sbjct: 386 YGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMI 445

Query: 429 GNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
           GNY G  C YTTPLQG+     T +  GC +VACA  Q+  A   A+  ADAT+LVMG D
Sbjct: 446 GNYEGTPCKYTTPLQGLTASVATTYLAGCSNVACAAAQVDDAKKLAA-SADATVLVMGAD 504

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
           QSIEAE+ DR  +LLPG+QQ L+++V+  SKGP ILV+MSGG +DV+FAK + +I +I+W
Sbjct: 505 QSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILW 564

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
            GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP  S  YPGRTYR
Sbjct: 565 VGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYR 624

Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT--HAKCN 666
           FY G  VY FG G+SY+ + H +  AP +V++PL+  H   +   S K I V      C 
Sbjct: 625 FYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPLEDDHVCRS---SSKCISVDAGEQNCQ 681

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
            L   + + V+N+G   G HT+ +F TPP+ H +P K LV FEKV + A     V   + 
Sbjct: 682 GLAFNIDLKVRNIGKVRGTHTVFLFFTPPSVHNSPQKHLVDFEKVSLDAKTYGMVSFKVD 741

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           VCK+LSVVD  G+R++ LG H +H+G  +HS+++ 
Sbjct: 742 VCKHLSVVDEFGSRKVALGGHVLHVGNLEHSLTVR 776


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  913 bits (2359), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/737 (60%), Positives = 544/737 (73%), Gaps = 7/737 (0%)

Query: 28  FACDPK-DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           FACD + + +     FC  SL    RV DL+ RL+LQEK+  LI+ A  V RLGI  YEW
Sbjct: 74  FACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEW 133

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVS VGPGTKF    PGATSFPQVI TA+SFNA+L+EAIG+VVS EARAMYN G
Sbjct: 134 WSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVG 193

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
            AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYAA YVRGLQ  D    DRLKVAA 
Sbjct: 194 LAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRLKVAAC 253

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNW G DR+HFNA VS QD+EDTF  PF+ CV++G VASVMCSYNQVNG P
Sbjct: 254 CKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNGKP 313

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++L   IRG+W+LNGYIVSDCDSV V Y++QH+T +PEEAAA  I AGLDLDCG 
Sbjct: 314 TCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLDLDCGD 373

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           FLG HTE+AV  GL++E  I+ A+ N L   MRLG FDG PS Q YG LGPKDVCTP+HQ
Sbjct: 374 FLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTPEHQ 433

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELA EAARQGIVLLKN   SLPLS    +++AVIGPN++VT TMIGNY G  C YTTPLQ
Sbjct: 434 ELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 493

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           G+     T  Q GC +VAC   QL   A   +  ADAT+LV+G DQSIEAE+ DR  L L
Sbjct: 494 GLSAVVSTSFQPGCANVACTSAQL-DEAKKIAASADATVLVVGSDQSIEAESRDRVDLNL 552

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+Q  L+++V+ ASKGP ILV+M+GG +D+ FAK D +I +I+W G+PG+AGG AIAD+
Sbjct: 553 PGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAAIADV 612

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG+ NP G+LPMTWYPQ Y+  +PMT+M MRPS S  +PGRTYRFY G  +Y FG G+S
Sbjct: 613 IFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFGDGLS 672

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y++F H +  AP +V++PL+   G I  +    +++V    C  L   V + VKNVG + 
Sbjct: 673 YSDFKHHLVKAPKLVSIPLE--EGHICHSSKCHSLEVVQESCQNLGFDVHLRVKNVGQRS 730

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           G+HT+ ++STPP+ H +P K L+ FEKV +  G +  V   + VCK LSV D  G+R++ 
Sbjct: 731 GSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLSVADEVGSRKVA 790

Query: 744 LGEHNIHIGGTKHSVSL 760
           LG H +H+G  KHS+++
Sbjct: 791 LGLHILHVGTLKHSLNV 807


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  908 bits (2347), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/764 (58%), Positives = 548/764 (71%), Gaps = 10/764 (1%)

Query: 4   TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
           T+   FL      LS+    LA   P FACD   + T     FC  SL    RV DL+ R
Sbjct: 12  TVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           L+L+EK+  L++ AA+V RLGI  YEWWSEALHGVS VGPGT F    PGATSFPQVI T
Sbjct: 72  LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILT 131

Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           A+SFNA+L+EAIG+ VS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 180 GKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
            KYA+ YVRGLQ SD    DRLKVAA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF
Sbjct: 192 SKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTF 251

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
             PF+ CV++G VASVMCSYNQVNG P CADP++L   +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
           ++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E  ++ A+ N     MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
           LG FDG PS   YG LGPKDVCT +HQELA EAARQGIVLLKN   SLPLS    +T+AV
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAV 431

Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
           IGPN++VT TMIGNY G  C YTTPLQG+     T +  GC +VAC   Q+   A   + 
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
            ADAT+L++G+DQSIEAE  DR  + LPG+Q  L+++V+ ASKG  ILV+MSGG  D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AKND +I +I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
             +  YPGRTYRFY G  +Y FG G+SYT F H +  AP  V++P++  H   ++    K
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEGHSCHSSKC--K 668

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
           ++      C  L   + + V N G+  G+HT+ +FS+PP+ H +P K L+ FEKV V A 
Sbjct: 669 SVDAVQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 728

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           A+  V   + VCK LS+VD  GTR++ LG H +H+G  KHS+++
Sbjct: 729 AKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 772


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  905 bits (2339), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 443/764 (57%), Positives = 551/764 (72%), Gaps = 10/764 (1%)

Query: 4   TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
           T+   FL      LS+    L    P FACD   + T     FC  SL    RV DL+ R
Sbjct: 12  TVFLCFLSCFSHFLSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           L+L+EK+  L++ AA+V RLGI  YEWWSEALHGVS VGPGT F    PGATSFPQVI T
Sbjct: 72  LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILT 131

Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           A+SFNA+L+EAIG+VVS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 180 GKYAASYVRGLQ-GSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
            KYA++YVRGLQ G DG  DRLKVAA CKH+TAYDLDNW GVDR HFNA V+KQD++DTF
Sbjct: 192 SKYASAYVRGLQQGDDGSPDRLKVAACCKHYTAYDLDNWKGVDRLHFNAVVTKQDMDDTF 251

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
             PF+ CV++G VASVMCS+NQVNG PTCADP++L   +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
           ++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E  ++ A+ N     MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
           LG FDG PS   YG LGPKDVCT +HQE+A EAARQGIVLLKN   SLPLS    +T+A+
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLPLSPTAIKTLAI 431

Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
           IGPN++VT TMIGNY G  C YTTPLQG+     T +  GC +VAC   Q+   A   + 
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
            ADAT+L++G+DQSIEAE  DR  + LPG+Q  L+++V+ ASKG  ILV+MSGG  D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AKND +IA+I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP
Sbjct: 551 AKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
             +  YPGRTYRFY G  +Y FG G+SYT F H +  AP  V++P++  H   ++    K
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEGHSCHSSKC--K 668

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
           ++      C  L   + + V N G+  G+HT+ +FS+PP+ H +P K L+ FEKV V A 
Sbjct: 669 SVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 728

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           A+  V   + VCK LS+VD  GT+++ LG H +H+G  KHS+++
Sbjct: 729 AEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHVGSLKHSLNV 772


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  904 bits (2337), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/739 (59%), Positives = 543/739 (73%), Gaps = 9/739 (1%)

Query: 28  FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           FACD   + +  +  FC  SL +  RV DL+ RL+LQEK+  L++ A +V RLGI  YEW
Sbjct: 4   FACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEW 63

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVS VGPGT F    PGATSFPQVI TA+SFN +L+ AIG+VVS EARAMYN G
Sbjct: 64  WSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYNVG 123

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVAAS 203
            AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY + YV+GLQ  D    D LKVAA 
Sbjct: 124 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDGLKVAAC 183

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYDLDNW GVDR+HFNA V+KQD++DTF  PF+ CV++G VASVMCSYN+VNG+P
Sbjct: 184 CKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIP 243

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG--LDLDC 321
           TCADP++L   IRGEW+LNGYIV+DCDS+ V+Y++QH+T TPEEAAA AI AG  LDL+C
Sbjct: 244 TCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIRLDLNC 303

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G FLG HTE+AV  GL++E  I+ A+ N     MRLG FDG+PS Q YG LGPKDVCT +
Sbjct: 304 GSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAE 363

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           +QELA EAARQGIVLLKN   SLPLS    + +AVIGPN++VT TMIGNY G  C YTTP
Sbjct: 364 NQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 423

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQG+     T +  GC +VAC+  Q+   A   +  ADAT+LVMG D SIEAE+ DR  +
Sbjct: 424 LQGLAALVATTYLPGCSNVACSTAQV-DDAKKIAAAADATVLVMGADLSIEAESRDRVDI 482

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+QQ L++ V+ AS GP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIA
Sbjct: 483 LLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIA 542

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           DI+FG+ NP G+LPMTWYPQ Y+  +PMT M MRP  S  YPGRTYRFY G  VY FG G
Sbjct: 543 DIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDG 602

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SY+ F H +  AP +V+VPL+  H  +  +   K++      C  LT  V + +KN G+
Sbjct: 603 LSYSEFSHELTQAPGLVSVPLEENH--VCYSSECKSVAAAEQTCQNLTFDVHLRIKNTGT 660

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
             G+HT+ +FSTPP+ H +P K LV FEKV + A     VG  + VCK LSVVD  G+++
Sbjct: 661 TSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKK 720

Query: 742 IPLGEHNIHIGGTKHSVSL 760
           + LGEH +HIG  KHS+++
Sbjct: 721 VALGEHVLHIGSLKHSMTV 739


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  902 bits (2330), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/739 (58%), Positives = 541/739 (73%), Gaps = 7/739 (0%)

Query: 26  EPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           + FACD  K        FC  SL +  RV DL+GRL+LQEK+  L++ A  V RLGI  Y
Sbjct: 28  QTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKY 87

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           EWWSEALHGVSNVGPGT+F    PGATSFP  I TA+SFN +L+E IGRVVS EARAMYN
Sbjct: 88  EWWSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYN 147

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVA 201
            G AGLTYWSPN+NIFRDPRWGRG ETPGEDPVL+ KYAA YV+GLQ +DG   ++LKVA
Sbjct: 148 VGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVA 207

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH+TAYD+DNW G+ R+ FNA V+KQD+EDTF  PF+ CV++G VASVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            PTCADP++LK  +RGEW+LNGYIVSDCDSV V Y  QH+T TPEEAAA +I AGLDL+C
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLDLNC 327

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G FLG +TE AV++GL+ E  INNA+ N     MRLG FDG+P  QPYG+LGPKDVCT +
Sbjct: 328 GRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTQE 387

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           +QELA EAARQGIVLLKN   SLPL+    +++AVIGPN++ T  MIGNY GI C Y +P
Sbjct: 388 NQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISP 447

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQG+  +A T +  GC DV C +  L  A   A+  ADAT++V+G   +IEAE+LDR  +
Sbjct: 448 LQGLTAFAPTSYAAGCLDVRCPNPVLDDAKKIAA-SADATVIVVGASLAIEAESLDRVNI 506

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAKN+ +I +I+W GYPG+AGG AIA
Sbjct: 507 LLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAAIA 566

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D++FG  NP G+LPMTWYPQ Y+  +PMT M MRP  +  YPGRTYRFYKG  V+ FG G
Sbjct: 567 DVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFAFGDG 626

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SY++ VH +  AP +V+V L   H  +  +   K+I V    C  L   + + +KN G 
Sbjct: 627 LSYSSIVHKLVKAPQLVSVQLAEDH--VCRSSECKSIDVVGEHCQNLVFDIHLRIKNKGK 684

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
              AHT+ +FSTPPA H AP K L+ FEKVH+   ++  V   + VCK LS+VD  G R+
Sbjct: 685 MSSAHTVFLFSTPPAVHNAPQKHLLGFEKVHLIGKSEALVSFKVDVCKDLSIVDELGNRK 744

Query: 742 IPLGEHNIHIGGTKHSVSL 760
           + LG+H +H+G  KH +S+
Sbjct: 745 VALGQHLLHVGDLKHPLSV 763


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  900 bits (2325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/760 (56%), Positives = 546/760 (71%), Gaps = 7/760 (0%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
            +F F+  +LL+           FACD  K+       FC  SL +  RV DL+ RL+LQ
Sbjct: 18  FSFMFVATVLLNCDRVSGQTSSVFACDVAKNPALAGYGFCDKSLSLEDRVADLVKRLTLQ 77

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EK+  L++ A +V RLGI  YEWWSEALHGVSNVGPGT F    PGATSFP  I TA+SF
Sbjct: 78  EKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASF 137

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           NA+L+EAIGRVVS EARAMYN G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 138 NASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYA 197

Query: 184 ASYVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
             YV+GLQ +D    ++LKVAA CKH+TAYDLDNW G+ R+ FNA V++QD++DTF  PF
Sbjct: 198 TGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPF 257

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV++G VASVMCSYNQVNG PTCADP++LK  IRGEW+LNGYIVSDCDSV V +  QH
Sbjct: 258 KSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQH 317

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAA+ I AGLDL+CG +LG +TE AV++GLL E  INNA+ N     MRLG F
Sbjct: 318 YTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFF 377

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+PS Q YG+LGP DVCT +++ELA EAARQGIVLLKN   SLPL+    +++AVIGPN
Sbjct: 378 DGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGPN 437

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           ++ T  MIGNY GI C Y +PLQ +     T +  GC +V CA+ +L   A   +  ADA
Sbjct: 438 ANATRVMIGNYEGIPCNYISPLQALTALVPTSYAAGCPNVQCANAEL-DDATQIAASADA 496

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T++V+G   +IEAE+LDR  +LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAK++
Sbjct: 497 TVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSN 556

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I +I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MR   + 
Sbjct: 557 DKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADPAT 616

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            YPGRTYRFYKG  V+ FG G+S++N  H +  AP +V+VPL   H   ++     ++ V
Sbjct: 617 GYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPLAEDHECRSSEC--MSLDV 674

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
               C  L   + + VKN+G    +H +L+F TPP  H AP K L+ FEKVH+P  ++ +
Sbjct: 675 ADEHCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQ 734

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           V   + +CK LSVVD  G R++PLG+H +H+G  KH +S+
Sbjct: 735 VRFKVDICKDLSVVDELGNRKVPLGQHLLHVGNLKHQLSV 774


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  899 bits (2324), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/761 (56%), Positives = 546/761 (71%), Gaps = 7/761 (0%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
            +F F+  +LL+ +         FACD  K+       FC  SL +  RV DL+ RL+LQ
Sbjct: 18  FSFTFVASVLLNCNRVSGQTSAVFACDVAKNPALAGYGFCDKSLSVEDRVADLVKRLTLQ 77

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EK+  L++ A +V RLGI  YEWWSEALHGVSNVGPGT F    PGATSFP  I TA+SF
Sbjct: 78  EKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASF 137

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           NA+L+EAIGRVVS EARAMYN G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 138 NASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYA 197

Query: 184 ASYVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
             YV+GLQ +D    ++LKVAA CKH+TAYDLDNW G+ R+ FNA V++QD++DTF  PF
Sbjct: 198 TGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPF 257

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV++G VASVMCSYNQVNG PTCADP++LK  IRGEW+LNGYIVSDCDSV V +  QH
Sbjct: 258 KSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQH 317

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAA  I AGLDL+CG +LG +TE AV++GLL E  INNA+ N     MRLG F
Sbjct: 318 YTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFF 377

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+PS QPYG+LGPKDVCT +++ELA EAARQGIVLLKN   SLPL+    +++AVIGPN
Sbjct: 378 DGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGPN 437

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           ++ T  MIGNY GI C Y +PLQ +     T +  GC +V CA+ +L   A   +  ADA
Sbjct: 438 ANATRVMIGNYEGIPCNYISPLQTLTALVPTSYAAGCPNVQCANAEL-DDATQIAASADA 496

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T++++G   +IEAE+LDR  +LLPG+QQ LVS+V+ ASKGP ILV+MSGG +DV+FAK++
Sbjct: 497 TVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSN 556

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I +I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MR   + 
Sbjct: 557 DKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADPAT 616

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            YPGRTYRFYKG  V+ FG G+S+++  H +  AP +V+VPL   H   ++      I  
Sbjct: 617 GYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSVPLAEDHECRSSECMSLDIAD 676

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
            H  C  L   + + VKN G    +H +L+F TPP  H AP K L+ FEKVH+P  ++ +
Sbjct: 677 EH--CQNLAFDIHLGVKNTGKMSTSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQ 734

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           V   + VCK LSVVD  G R++PLG+H +H+G  KH +SL 
Sbjct: 735 VRFKVDVCKDLSVVDELGNRKVPLGQHLLHVGNLKHPLSLR 775


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  899 bits (2323), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/768 (57%), Positives = 550/768 (71%), Gaps = 16/768 (2%)

Query: 6   AFFFLGLILLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
           + F L LI L    S  A   P FACD   + +     FC  ++ I  RV DL+ RL+LQ
Sbjct: 16  SVFSLSLIFLCLLDSSNAQSTPVFACDVAGNPSLAAYGFCNTAIKIEYRVADLVARLTLQ 75

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EK+ +L S    V RLGI  YEWWSEALHGVS VGPGT+F G  PGATSFPQVI TA+SF
Sbjct: 76  EKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVILTAASF 135

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           N +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+LS KYA
Sbjct: 136 NVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYA 195

Query: 184 ASYVRGLQ---GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           + YV+GLQ    SD +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+  PF
Sbjct: 196 SGYVKGLQETDSSDANRLKVAACCKHYTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPF 255

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV++G VASVMCSYN+VNG PTCADP++L   IRGEW+LNGYIVSDCDSV V Y  QH
Sbjct: 256 KSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQH 315

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAA +I AGLDL+CG FLG HTE+AV+ GL+ E  I+ A+ N     MRLG F
Sbjct: 316 YTKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFF 375

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+P  Q YG LGPKDVCTP +QELA EAARQGIVLLKN G +LPLS    +T+AVIGPN
Sbjct: 376 DGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKNTG-ALPLSPKTIKTLAVIGPN 434

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           ++VT TMIGNY G  C YTTPLQG+     T +  GC +VACA   + G+   A+  +DA
Sbjct: 435 ANVTKTMIGNYEGTPCKYTTPLQGLAGTVHTTYLPGCSNVACAVADVAGSTKLAA-ASDA 493

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LV+G DQSIEAE+ DR  L LPG+QQELV++V+ A+KGP  LV+MSGG  D+ FAKND
Sbjct: 494 TVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKND 553

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +IA I+W GYPG+AGG A AD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP +S 
Sbjct: 554 AKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSN 613

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            YPGRTYRFY G  VY FG G+SYT F H++  AP +V++ L+  H  +  +   +++  
Sbjct: 614 GYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPRLVSLSLEENH--VCRSSECQSLNA 671

Query: 661 THAKCNRLTLG-------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
               C+    G       V + V+N G ++G HT+ +F+TPPA H +P K L+ FEK+ +
Sbjct: 672 IGPHCDNAVSGTGGKAFEVHIKVQNGGDREGIHTVFLFTTPPAVHGSPRKHLLGFEKIRL 731

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
               +  V   + VCK LSVVD  G R+I LG+H +H+G  KHS+S+ 
Sbjct: 732 GKMEEAVVKFKVDVCKDLSVVDEVGKRKIGLGQHLLHVGDVKHSLSIR 779


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  895 bits (2313), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/767 (57%), Positives = 549/767 (71%), Gaps = 19/767 (2%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
           + FF   L L +A SS +     FACD   + +     FC   L I  RV DL+ RL+LQ
Sbjct: 26  LCFFLYFLDLSNAQSSPV-----FACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQ 80

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EK+  L+S A  V RLGI  YEWWSEALHGVS +GPGT F    PGATSFPQVI TA+SF
Sbjct: 81  EKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASF 140

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           N +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYA
Sbjct: 141 NVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYA 200

Query: 184 ASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           + YV+GLQ +DG   +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+  PF
Sbjct: 201 SGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPF 260

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV++G VASVMCSYNQVNG PTCADP++L   IRGEW+LNGYIVSDCDSV V Y  QH
Sbjct: 261 KSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQH 320

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E  I+ A+ N     MRLG F
Sbjct: 321 YTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFF 380

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG P +Q YG LGP DVCT  +QELA +AARQGIVLLKN G  LPLS    +T+AVIGPN
Sbjct: 381 DGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-FLPLSPKSIKTLAVIGPN 439

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
           ++VT TMIGNY G  C YTTPLQG+     T +  GC +VACA   + GA   A+  AD 
Sbjct: 440 ANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTTYLPGCSNVACAVADVAGATKLAA-TADV 498

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+L++G DQSIEAE+ DR  L LPG+QQELV +V+ A+KGP +LV+MSGG  D+ FAKND
Sbjct: 499 TVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKND 558

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           P+IA I+W GYPG+AGG AIADI+FG  NP G+LPMTWYPQ Y+  +PMT M MRP +SK
Sbjct: 559 PKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIMNMRPDKSK 618

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            YPGRTYRFY G  VY FG G+SYT F H++  AP++V++ L+  H  +  +   +++  
Sbjct: 619 GYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLVSLSLEENH--VCRSSECQSLDA 676

Query: 661 THAKCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
               C     G      VQ+ V+N G ++G HT+ +F+TPPA H +P K L+ FEK+ + 
Sbjct: 677 IGPHCENAVSGGGSAFEVQIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLLGFEKIRLG 736

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
              +  V   + VCK LSVVD  G R+I LG+H +H+G  KHS+S+ 
Sbjct: 737 KMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  894 bits (2309), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 439/769 (57%), Positives = 546/769 (71%), Gaps = 14/769 (1%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
           S++   FL   L   + S   +   FACD   + +     FC   L I  RV DL+ RL+
Sbjct: 19  SSVYLIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLT 78

Query: 62  LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
           LQEK+  L+S A  V RLGI  YEWWSEALHGVS +GPGT F    PGATSFPQVI TA+
Sbjct: 79  LQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAA 138

Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
           SFN +L++AIG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ K
Sbjct: 139 SFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASK 198

Query: 182 YAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
           YA+ YV+GLQ +DG   +RLKVAA CKH+TAYD+DNW GV+R+ FNA V++QD++DT+  
Sbjct: 199 YASGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQP 258

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
           PF+ CV++G VASVMCSYNQVNG PTCADP++L   IRGEW+LNGYIVSDCDSV V Y  
Sbjct: 259 PFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKN 318

Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
           QH+T TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E  I+ A+ N     MRLG
Sbjct: 319 QHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLG 378

Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
            FDG P +Q YG LGP DVCT  +QELA +AARQGIVLLKN G  LPLS    +T+AVIG
Sbjct: 379 FFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIG 437

Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
           PN++VT TMIGNY G  C YTTPLQG+     T +  GC +VACA   + GA   A+  A
Sbjct: 438 PNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TA 496

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           D ++LV+G DQSIEAE+ DR  L LPG+QQELV +V+ A+KGP +LV+MSGG  D+ FAK
Sbjct: 497 DVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAK 556

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
           NDP+IA I+W GYPG+AGG AIADI+FG  NP GKLPMTWYPQ Y+  +PMT M MRP +
Sbjct: 557 NDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDK 616

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
           +  YPGRTYRFY G  VY FG G+SYT F HT+  AP++V++ L+  H  +  +   +++
Sbjct: 617 ASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENH--VCRSSECQSL 674

Query: 659 KVTHAKCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH 712
                 C     G      V + V+N G ++G HT+ +F+TPPA H +P K LV FEK+ 
Sbjct: 675 DAIGPHCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIR 734

Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +    +  V   + +CK LSVVD  G R+I LG+H +H+G  KHS+S+ 
Sbjct: 735 LGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  890 bits (2300), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/738 (57%), Positives = 540/738 (73%), Gaps = 7/738 (0%)

Query: 28  FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           FACD  K+    +  FC  SL +  RV+DL+ RL+LQEK+  L + A  V RLGI  YEW
Sbjct: 40  FACDVAKNTNVSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 99

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVSN+GPGT F    PGATSFP  I TA+SFN +L++AIG VVS+EARAMYN G
Sbjct: 100 WSEALHGVSNIGPGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 159

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRLKVAAS 203
            AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYAA YV+GLQ +D    D+LKVAA 
Sbjct: 160 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAAC 219

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYD+DNW GV R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYN+VNG P
Sbjct: 220 CKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 279

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++LK  IRG+W+LNGYIVSDCDSV V +  QH+T TPEEAAA  I +GLDLDCG 
Sbjct: 280 TCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQHYTKTPEEAAAKTILSGLDLDCGS 339

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           +LG +T  AV++GL+ E  INNA+ N     MRLG FDG+PS QPYG+LGPKDVCTP++Q
Sbjct: 340 YLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQ 399

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELA EAARQGIVLLKN   SLPLS    +++AVIGPN++ T  MIGNY GI C YT+PLQ
Sbjct: 400 ELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQ 459

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           G+  +  T +  GC DV CA+ Q+   A   +  ADATI+V+G + +IEAE+LDR  +LL
Sbjct: 460 GLTAFVPTSYAPGCPDVQCANAQI-DDAAKIAASADATIIVVGANLAIEAESLDRVNILL 518

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+QQ+LV++V+  SKGP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIAD+
Sbjct: 519 PGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADV 578

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG+ NP G+LPMTWYPQ Y+  +PMT M MR   +  YPGRTYRFYKG  V+ FG GMS
Sbjct: 579 IFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFGDGMS 638

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           +    H +  AP +V+VPL   H     ++  K++ V    C  L   + + VKN+G   
Sbjct: 639 FGTVEHKIVKAPQLVSVPLAEDHEC--RSLECKSLDVADEHCQNLAFDIHLSVKNMGKMS 696

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            +H++L+F TPP  H AP K L+ FEKV +   ++  V   + VC  LSVVD  G R++P
Sbjct: 697 SSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVP 756

Query: 744 LGEHNIHIGGTKHSVSLH 761
           LG+H +H+G  KHS+S+ 
Sbjct: 757 LGDHMLHVGNLKHSLSVR 774


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  888 bits (2295), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/764 (57%), Positives = 538/764 (70%), Gaps = 33/764 (4%)

Query: 4   TIAFFFLGLI--LLSASSSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
           T+   FL      LS+    LA   P FACD   + T     FC  SL    RV DL+ R
Sbjct: 12  TVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKR 71

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           L+L+EK+  L++ AA+V RLGI  YEWWSEALHGVS VGPGT F    PGATSFPQVI T
Sbjct: 72  LTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILT 131

Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           A+SFNA+L+EAIG+ VS EARAMYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS
Sbjct: 132 AASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 180 GKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
            KYA+ YVRGLQ SD    DRLKVAA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF
Sbjct: 192 SKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTF 251

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
             PF+ CV++G VASVMCSYNQVNG P CADP++L   +RGEW+LNGYIVSDCDSV V+Y
Sbjct: 252 QPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFY 311

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
           ++QH+T TPEEAAA AI AGLDL+CG FLG HTE+AV+ GL+ E  ++ A+ N     MR
Sbjct: 312 NSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMR 371

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
           LG FDG PS   YG LGPKDVCT +HQELA EAARQGIVLLKN   SLPLS    +T+AV
Sbjct: 372 LGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAV 431

Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASR 476
           IGPN++VT TMIGNY G  C YTTPLQG+     T +  GC +VAC   Q+   A   + 
Sbjct: 432 IGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAA 490

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
            ADAT+L++G+DQSIEAE  DR  + LPG+Q  L+++V+ ASKG  ILV+MSGG  D++F
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AKND +I +I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
             +  YPGRTYRFY G  +Y FG G+SYT F H                H S++A     
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNH----------------HLSVDA----- 649

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
                   C  L   + + V N G+  G+HT+ +FS+PP+ H +P K L+ FEKV V A 
Sbjct: 650 ----VQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAK 705

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           A+  V   + VCK LS+VD  GTR++ LG H +H+G  KHS+++
Sbjct: 706 AKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 749


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  888 bits (2295), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/738 (57%), Positives = 539/738 (73%), Gaps = 7/738 (0%)

Query: 28  FACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           FACD  K+    +  FC  SL +  RV+DL+ RL+LQEK+  L + A  V RLGI  YEW
Sbjct: 39  FACDVAKNTNVSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 98

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVSN+GPGT F    PGAT+FP  I TA+SFN +L++AIG VVS+EARAMYN G
Sbjct: 99  WSEALHGVSNIGPGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 158

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRLKVAAS 203
            AGLTYWSPN+NIFRDPRWGRGQETPGEDP+LS KYAA YV+GLQ +D    D+LKVAA 
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAAC 218

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYD+DNW GV R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYN+VNG P
Sbjct: 219 CKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 278

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           TCADP++LK  IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I +GLDLDCG 
Sbjct: 279 TCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGS 338

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           +LG +T  AV++GL+ E  I NA+ N     MRLG FDG+PS QPYG+LGPKDVCTP++Q
Sbjct: 339 YLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQ 398

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELA EAARQGIVLLKN   SLPLS    +++AVIGPN++ T  MIGNY GI C YT+PLQ
Sbjct: 399 ELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQ 458

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           G+  +  T +  GC DV CA+ Q+   A   +  ADATI+V+G + +IEAE+LDR  +LL
Sbjct: 459 GLTAFVPTSYAPGCPDVQCANAQI-DDAAKIAASADATIIVVGANLAIEAESLDRVNILL 517

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+QQ+LV++V+  SKGP ILV+MSGG +DV+FAK + +I +I+W GYPG+AGG AIAD+
Sbjct: 518 PGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADV 577

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG+ NP G+LPMTWYPQ Y+  +PMT M MR   +  YPGRTYRFYKG  V+ FG GMS
Sbjct: 578 IFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGMS 637

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           +    H +  AP +V+VPL   H     ++  K++ V    C  L   + + VKN+G   
Sbjct: 638 FGTVEHKIVKAPQLVSVPLAEDHEC--RSLECKSLDVADKHCQNLAFDIHLSVKNMGKMS 695

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            +H++L+F TPP  H AP K L+ FEKV +   ++  V   + VC  LSVVD  G R++P
Sbjct: 696 SSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVP 755

Query: 744 LGEHNIHIGGTKHSVSLH 761
           LG+H +H+G  KHS+S+ 
Sbjct: 756 LGDHMLHVGNLKHSLSVR 773


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  883 bits (2281), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/770 (55%), Positives = 547/770 (71%), Gaps = 17/770 (2%)

Query: 1   MASTIAFFFLGLILLSAS-SSGLAAREP-FACD-PKDATTRTLPFCQVSLPIPQRVNDLI 57
           MA    F  +G ILLS+     LA   P FACD   +     L FC  SL +  RVNDL+
Sbjct: 6   MAYNNHFSIIGFILLSSLLKQVLAQNSPVFACDVTSNPALGNLTFCDASLAVENRVNDLV 65

Query: 58  GRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVI 117
            RL+L EK+  L+SGA  V RLGI  YEWWSEALHGV+  GPG  F    PGATSFPQVI
Sbjct: 66  NRLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVI 125

Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
            TA+SFN TL++ IG+VVS EARAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP 
Sbjct: 126 LTAASFNVTLFQTIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPT 185

Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
           L+ KY  +YV GLQ +D    ++LKVAA CKH+TAYD+DNW G++R+ FNA V +QD++D
Sbjct: 186 LTSKYGVAYVEGLQQTDDGSTNKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDD 245

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
           TF  PFR CV+EG VASVMCSYNQVNG PTC DPN+L   +RGEW+LNGYIV+DCDS+ V
Sbjct: 246 TFQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQV 305

Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
            + +Q++T TPEEAAA  + +G+DL+CG +L  +T+ AV + L++E  I+ A+ N     
Sbjct: 306 IFKSQNYTKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATL 365

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
           MRLG FDG P S+ YG+LGPKDVCTP++QELA EAARQGIVLLKN   SLPL+    +++
Sbjct: 366 MRLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSL 425

Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
           AVIGPN++VT TMIGNY GI C YTTPLQG+     TI++ GC DV+C   Q     ID 
Sbjct: 426 AVIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVATIYKPGCADVSCNTAQ-----IDD 480

Query: 475 SRQ----ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
           ++Q    ADA +LVMG DQSIE E+LDR  + LPG+Q  LV++V+  +KGP ILV+MSGG
Sbjct: 481 AKQIATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGG 540

Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
            +DV FA ++P+I +I+W G+PG+AGG A+AD++FG  NP G+LPMTWYPQ Y   +PMT
Sbjct: 541 GMDVQFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVPMT 600

Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
           +M MRP+ +  YPGRTYRFY GP V+ FGHG+SY+ F H +  AP  V++PL  +H    
Sbjct: 601 DMNMRPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFVSLPLGEKHTCRL 660

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
           +    K +      C+ +   + + VKNVG   G+H + +F++PP+ H AP K L+ FEK
Sbjct: 661 SKC--KTVDAVGQSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSVHNAPKKHLLGFEK 718

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           VH+    +  V  N++VCK+LSV D  G R++ LG H +HIG  KHS+++
Sbjct: 719 VHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGDLKHSLTV 768


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  881 bits (2276), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/766 (55%), Positives = 543/766 (70%), Gaps = 13/766 (1%)

Query: 4   TIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           +++  FL  I+  +  S   +   FACD   + +   L FC   L I  RV DL+GRL+L
Sbjct: 10  SVSTLFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTDLVGRLTL 69

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           +EK+  L S A  V RLGI  Y+WWSEALHGVSNVG G++F G  PGATSFPQVI TA+S
Sbjct: 70  EEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQVILTAAS 129

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FN +L++AIG+VVS EARAMYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP LS KY
Sbjct: 130 FNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKY 189

Query: 183 AASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           A +YV+GLQ +DG   +RLKVAA CKH+TAYD+DNW  V+R  FNA V++QD+ DTF  P
Sbjct: 190 AVAYVKGLQETDGGDPNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPP 249

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV++G VASVMCSYNQVNG PTCADP++L   IRG+W+LNGYIVSDCDSV V +  Q
Sbjct: 250 FKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQ 309

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+  TPEEA A ++ AGLDL+C  F G H   AV+ GL++E  I+ A+ N     MRLG 
Sbjct: 310 HYAKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGF 369

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDG+P  Q YG LGPKDVCT D+QELA + ARQGIVLLKN   SLPLS    +T+AVIGP
Sbjct: 370 FDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGP 429

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           N++ T TMIGNY G+ C YTTPLQG+     + +Q GC +VAC D  + G+A+D +  AD
Sbjct: 430 NANATETMIGNYHGVPCKYTTPLQGLAETVSSTYQLGC-NVACVDADI-GSAVDLAASAD 487

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           A +LV+G DQSIE E  DR  L LPG+QQELV++V+MA++GP +LV+MSGG  D+ FAKN
Sbjct: 488 AVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKN 547

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           D +I +I+W GYPG+AGG AIAD++FG  NP G LPMTWYPQ Y+  +PM+ M MRP +S
Sbjct: 548 DKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKS 607

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIK 659
           K YPGR+YRFY G  VY F   ++YT F H +  AP +V++ LD  H   ++    +++ 
Sbjct: 608 KGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSEC--QSLD 665

Query: 660 VTHAKCNRLTLG-----VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
                C     G     V ++VKN G + G+HT+ +F+T P  H +P KQL+ FEK+ + 
Sbjct: 666 AIGPHCENAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRLG 725

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
              +  V  N++VCK LSVVD +G R+I LG H +H+G  KHS+++
Sbjct: 726 KSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 771


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  880 bits (2273), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 422/762 (55%), Positives = 540/762 (70%), Gaps = 15/762 (1%)

Query: 7   FFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           F     +LLS           FACD  K+       FC   L +  RV DL+ RL+LQEK
Sbjct: 18  FVLFATLLLSGGRVSSQTSAVFACDVAKNPALANYGFCNKKLSVDARVKDLVRRLTLQEK 77

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V  L++ A  V RLGI  YEWWSEALHGVSN+GPGT F    PGATSFP  I  A+SFNA
Sbjct: 78  VGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATSFPMPILIAASFNA 137

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           +L++ IG+VVS EARAM+N G AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA 
Sbjct: 138 SLFQTIGKVVSTEARAMHNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAG 197

Query: 186 YVRGLQGSD---GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           YV+GLQ +D    ++LKVAA CKH+TAYD+D+W GV R+ FNA V++QD++DT+  PF+ 
Sbjct: 198 YVKGLQQTDDGDSNKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPPFKS 257

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV++G VASVMCSYNQVNG PTCADP++LK  IRG+W+LNGYIVSDCDSV V +  QH+T
Sbjct: 258 CVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQHYT 317

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            TPEEAAA +I AGLDL+CG FLG +TE AV++GL+ E  INNA+ N     MRLG FDG
Sbjct: 318 KTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGFFDG 377

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +PS QPYG+LGPKDVCT  +QELA EAARQGIVLLKN   SLPL+    +++AVIGPN++
Sbjct: 378 DPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGPNAN 437

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ----A 478
            T  MIGNY GI C YT+PLQG+     T    GC DV C +     AA+D +++    A
Sbjct: 438 ATRAMIGNYEGIPCKYTSPLQGLTALVPTSFAAGCPDVQCTN-----AALDDAKKIAASA 492

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           DAT++V+G + +IEAE+ DR  +LLPG+QQ+LV++V+  +KGP IL +MSGG +DV+FAK
Sbjct: 493 DATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAK 552

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
            + +I +I+W GYPG+AGG AIAD++FG  NP G+LPMTWYPQ Y+  +PMT M MRP  
Sbjct: 553 TNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDP 612

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
           +  YPGRTYRFYKG  V+ FG G+SY+ F H +  AP +V+VPL   H  +  +   K++
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPLAEDH--VCRSSKCKSL 670

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
            V    C  L   + + +KN G    + T+ +FSTPPA H AP K L+AFEKV +   ++
Sbjct: 671 DVVGEHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPPAVHNAPQKHLLAFEKVLLTGKSE 730

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             V   + VCK L +VD  G R++ LG+H +H+G  KH +S+
Sbjct: 731 ALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGDLKHPLSV 772


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  877 bits (2265), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/759 (57%), Positives = 546/759 (71%), Gaps = 8/759 (1%)

Query: 7   FFFLGLILLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           F    + L+  S+   A   P FACD   + +  +L FC  S+ I  RV DL+ RL+LQE
Sbjct: 16  FLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQE 75

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  L++ A  V RLGI  YEWWSEALHGVS VGPGT F  D  GATSFPQVI TA+SFN
Sbjct: 76  KIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASFN 135

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
            +L+EAIG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KY +
Sbjct: 136 TSLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGS 195

Query: 185 SYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
            YV+GLQ  D    D+LKVAA CKH+TAYDLDNW G DR+HFNA V+KQD++DTF  PF+
Sbjct: 196 CYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTKQDMDDTFQPPFK 255

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV++G VASVMCSYNQVNG PTCADP++L   IRGEW LNGYIV+DCDS+ V+Y +Q++
Sbjct: 256 SCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDCDSLDVFYKSQNY 315

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T TPEEAAA AI AG+DL+CG FLG HTE+AV+ GL++E  I+ A+ N     MRLG FD
Sbjct: 316 TKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFD 375

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN   SLPLS    + +AVIGPN+
Sbjct: 376 GDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 435

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
           +VT TMIGNY G  C YTTPLQG+     T +  GC +VAC+  Q+   A   +  ADAT
Sbjct: 436 NVTKTMIGNYEGTPCKYTTPLQGLAASVATTYLPGCSNVACSTAQV-DDAKKLAAAADAT 494

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LVMG D SIEAE+ DR  +LLPG+QQ L++ V+  S GP ILV+MSGG +DV+FA+ + 
Sbjct: 495 VLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTND 554

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I +I+W GYPG+AGG AIADI+FG  NP G+LPMTWYPQ Y+  +PMT M MRP  S  
Sbjct: 555 KITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNG 614

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           YPGRTYRFY G  VY FG G+SY+ F H +  AP +V VPL+  H  +  +   +++  +
Sbjct: 615 YPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESH--VCHSSECQSVVAS 672

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
              C   T  + + VKN G+  G+HT+ +FS+PPA H +P K LV FEKV + A   + V
Sbjct: 673 EQTCQNSTFDMLLRVKNEGTISGSHTVFLFSSPPAVHNSPQKHLVGFEKVFLNAQTGRHV 732

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
              + +CK LSVVD  G++++ LGEH +H+G  KH +S+
Sbjct: 733 RFKVDICKDLSVVDELGSKKVALGEHVLHVGSLKHFLSV 771


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  874 bits (2259), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/736 (58%), Positives = 534/736 (72%), Gaps = 7/736 (0%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT  +  FC  S     R  DL+ RL+L EKV  L+   AA+PRLG+  YEWW
Sbjct: 40  FACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWW 99

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G 
Sbjct: 100 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 159

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCK 205
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQG  S    LKVAA CK
Sbjct: 160 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALKVAACCK 219

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+TAYD+DNW GV+R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 220 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 279

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           AD ++L   IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 280 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGTFL 339

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             HT +AVQ G LSE D++ A+ N L   MRLG FDG+P   P+G+LGP DVCTP +QEL
Sbjct: 340 AQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTPSNQEL 399

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           A EAARQGIVLLKN G  LPLS    +++AVIGPN++ + TMIGNY G  C YTTPLQG+
Sbjct: 400 AREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 458

Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
           G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQSIE E+LDR  LLLP
Sbjct: 459 GANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRTSLLLP 518

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           G+Q +LVS V+ AS GP ILV+MSGGP D++FAK+  +IAAI+W GYPG+AGG AIAD+L
Sbjct: 519 GQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAAIADVL 578

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           FG  NP G+LP+TWYP+ + T +PMT+M MRP  S  YPGRTYRFY G  VY FG G+SY
Sbjct: 579 FGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYTGDTVYAFGDGLSY 637

Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
           T+F H + +AP  +A+ L   H  +  T    +++   A C  L   V + V+N G + G
Sbjct: 638 TSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFDVHLRVRNAGERSG 695

Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
            HT+ +FS+PPA H AP K L+ FEKV +  G    V   + VCK LSVVD  G R++ L
Sbjct: 696 GHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 755

Query: 745 GEHNIHIGGTKHSVSL 760
           G H +H+G  KH+++L
Sbjct: 756 GSHTLHVGDLKHTLNL 771


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  872 bits (2253), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/736 (58%), Positives = 532/736 (72%), Gaps = 7/736 (0%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT  +  FC  S     R  DL+ RL+L EKV  L+   AA+PRLGI  YEWW
Sbjct: 34  FACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYEWW 93

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    P ATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G 
Sbjct: 94  SEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 153

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRLKVAASCK 205
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ  GS    LKVAA CK
Sbjct: 154 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGSGSGSLKVAACCK 213

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+TAYD+DNW GV+R+ FNA VS+QD++DTF  PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 214 HYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 273

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           AD ++L   IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 274 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGNFL 333

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             HT +AVQ G LSE D++ A+ N     MRLG FDG+P   P+G+LGP DVCT  +QEL
Sbjct: 334 AQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCTSSNQEL 393

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           A EAARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPLQG+
Sbjct: 394 AREAARQGIVLLKNSG-ALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 452

Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
           G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQSIE E+LDR  LLLP
Sbjct: 453 GANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRTSLLLP 512

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           G+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+  +IAAI+W GYPG+AGG AIAD+L
Sbjct: 513 GQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAAIADVL 572

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           FG  NP G+LP+TWYP+ + T +PM +M MRP  S  YPGRTYRFY G  VY FG G+SY
Sbjct: 573 FGHHNPSGRLPVTWYPESF-TKVPMIDMRMRPDASTGYPGRTYRFYTGDTVYAFGDGLSY 631

Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
           T+F H + +AP  VA+ L   H  +  T    +++   A C  L   V + V+N G   G
Sbjct: 632 TSFAHHLVSAPKQVALQLAEGHTCL--TEQCPSVEAEGAHCEGLAFDVHLRVRNAGDMSG 689

Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
           AHT+ +FS+PPA H AP K L+ FEKV +  G    V   + VCK LSVVD  G R++ L
Sbjct: 690 AHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 749

Query: 745 GEHNIHIGGTKHSVSL 760
           G H +H+G  KH++SL
Sbjct: 750 GNHTLHVGDLKHTLSL 765


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  872 bits (2252), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/736 (57%), Positives = 531/736 (72%), Gaps = 6/736 (0%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC  +     R  DL+GRL+L EKV  L++  AA+PRLGI  YEWW
Sbjct: 31  FACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWW 90

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G 
Sbjct: 91  SEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 150

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCK 205
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ + G  D LKVAA CK
Sbjct: 151 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALKVAACCK 210

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+TAYD+DNW GV+R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYN+VNG PTC
Sbjct: 211 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTC 270

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           AD ++L   IRG+W+LNGYIVSDCDSV V Y+ QH+T  PE+AAA  I++GLDL+CG FL
Sbjct: 271 ADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFL 330

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             HT +AVQ G LSE D++ A+ N   V MRLG FDG+P   P+G LGPKDVCT  +QEL
Sbjct: 331 AQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQEL 390

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           A EAARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPLQG+
Sbjct: 391 AREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGL 449

Query: 446 GRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
           G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQS+E E+LDR  LLLP
Sbjct: 450 GANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLP 509

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           G+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+  +I+AI+W GYPG+AGG A+ADIL
Sbjct: 510 GQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGAALADIL 569

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           FG  NPGG+LP+TWYP  +   + MT+M MRP  S  YPGRTYRFY G  VY FG G+SY
Sbjct: 570 FGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSY 629

Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
           T F H++ +AP  VAV L   H     T    +++     C  L+  V + V+N G   G
Sbjct: 630 TKFAHSLVSAPEQVAVQLAEGHAC--HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAG 687

Query: 685 AHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
            HT+ +FS+PP+ H AP K L+ FEKV +  G    V   + VCK LSVVD  G R++ L
Sbjct: 688 GHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVAL 747

Query: 745 GEHNIHIGGTKHSVSL 760
           G H +H+G  KH+++L
Sbjct: 748 GSHTLHVGDLKHTLNL 763


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  871 bits (2251), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/770 (55%), Positives = 543/770 (70%), Gaps = 14/770 (1%)

Query: 4   TIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           +++   L  +L  +  S   +   FACD   + +   L FC   L I  RV DL+GRL+L
Sbjct: 15  SVSTLLLCFLLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKSRVTDLVGRLTL 74

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           +EK+  L S A  V RLGI  Y+WWSEALHGVSNVG G+ F G  PGATSFPQVI TA+S
Sbjct: 75  EEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGATSFPQVILTAAS 134

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FN +L++AIG+VVS EARAMYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP LS KY
Sbjct: 135 FNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPELSSKY 194

Query: 183 AASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           A +YVRGLQ +DG   +RLKVAA CKH+TAYD+DNW  V RF FNA V++QD+ DTF  P
Sbjct: 195 AVAYVRGLQETDGGDPNRLKVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPP 254

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV++G VASVMCSYNQVNG PTCADP++L   IRG+W+LNGYIVSDCDSV V Y  Q
Sbjct: 255 FKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQ 314

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+T TPEEA A +I AGLDL+C  F G +   AV+ GL++E  I+ A+ N     MRLG 
Sbjct: 315 HYTKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGF 374

Query: 360 FDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           FDG+P  Q  YG LGP DVCT ++QELA +AARQGIVLLKN   SLPLS    +T+AVIG
Sbjct: 375 FDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIG 434

Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
           PN++ T TMIGNY GI C YTTPLQG+     + +Q GC +VACA+  L G+A   +  A
Sbjct: 435 PNANATETMIGNYNGIPCKYTTPLQGLAETVSSTYQLGC-NVACAEPDL-GSAAALAASA 492

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           DA +LVMG DQSIE E LDR  L LPG+QQELV++V+  +KGP +LV+MSGG  D+ FAK
Sbjct: 493 DAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAK 552

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
           N+ +I  I+W GYPG+AGG AIAD++FG  NP G LPMTWYPQ Y+  +PMT M MRP +
Sbjct: 553 NEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDK 612

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
           S  YPGRTYRFY G  VY FG G+SYTNF H +  AP +V++ LD  H   ++    +++
Sbjct: 613 SNGYPGRTYRFYTGETVYAFGDGLSYTNFNHQILKAPKLVSLDLDENHACRSSEC--QSV 670

Query: 659 KVTHAKCNR-----LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
                 C+      L   VQ+ V+NVG ++G+HT+ +F+TPP  H +P K L+ FEK+ +
Sbjct: 671 DAIGPHCDNAVGGGLNFEVQLKVRNVGDREGSHTVFLFTTPPEVHGSPRKHLLGFEKIRL 730

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
               +  +  N+ VCK LSVVD  G R+I LG + +H+G  KHS+++  +
Sbjct: 731 GEKEETVIRFNVDVCKDLSVVDEIGKRKIALGHYLLHVGSFKHSLTISVS 780


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  848 bits (2191), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 402/767 (52%), Positives = 537/767 (70%), Gaps = 17/767 (2%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           F  L L++++  SS   +++ FACD     T    FC VSL    R  DL+ RLSL+EKV
Sbjct: 6   FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
           + L++ A  VPRLG+  YEWWSEALHGVS+VGPG  F G  PGATSFP  I TA+SFN +
Sbjct: 66  QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  +G VVS EARAM+N G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNY 185

Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           V+GLQ         RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+  PF+ 
Sbjct: 186 VKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKS 245

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV EG V+SVMCSYN+VNG+PTCADPN+L+  IRG+WRL+GYIVSDCDS+ VY++  H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYT 305

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            T E+A A A++AGL+++CG FLG +TE+AV+   L+  D++ AL+    V MRLG FDG
Sbjct: 306 KTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDG 365

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +P S P+G+LGP DVC+ DHQ LALEAA+QGIVLL+N+G  LPL     + +AVIGPN++
Sbjct: 366 DPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNAN 424

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
            T  MI NYAG+ C YT+P+QG+ +Y   + +++ GCKDV C D  L  AA+ A  +AD 
Sbjct: 425 ATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADV 484

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LV+GLDQ++EAE LDR  L LPG Q++LV  V+ A+K   +LV+MS GPID++FAKN 
Sbjct: 485 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 544

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
             I A++W GYPG+AGG AIA ++FG  NP G+LP TWYPQE+   + MT+M MRP+ + 
Sbjct: 545 STIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTS 604

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            +PGR+YRFY G  +Y FG+G+SY++F   V +AP+++ +  +    ++N T S   + +
Sbjct: 605 GFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVLSAPSIIHIKTNPIM-NLNKTTS---VDI 660

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
           +   C+ L + + + VKN G + G+H +LVF  PP       G   P  QLV FE+V V 
Sbjct: 661 STVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVG 720

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
               ++  ++  VCK LS+VD  G R++  G H + IG        H
Sbjct: 721 RSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIGSNSDQQIYH 767


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  848 bits (2191), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 402/767 (52%), Positives = 537/767 (70%), Gaps = 17/767 (2%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           F  L L++++  SS   +++ FACD     T    FC VSL    R  DL+ RLSL+EKV
Sbjct: 6   FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
           + L++ A  VPRLG+  YEWWSEALHGVS+VGPG  F G  PGATSFP  I TA+SFN +
Sbjct: 66  QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  +G VVS EARAM+N G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNY 185

Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           V+GLQ         RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+  PF+ 
Sbjct: 186 VKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKS 245

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV EG V+SVMCSYN+VNG+PTCADPN+L+  IRG+WRL+GYIVSDCDS+ VY++  H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYT 305

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            T E+A A A++AGL+++CG FLG +TE+AV+   L+  D++ AL+    V MRLG FDG
Sbjct: 306 KTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDG 365

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +P S P+G+LGP DVC+ DHQ LALEAA+QGIVLL+N+G  LPL     + +AVIGPN++
Sbjct: 366 DPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNAN 424

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
            T  MI NYAG+ C YT+P+QG+ +Y   + +++ GCKDV C D  L  AA+ A  +AD 
Sbjct: 425 ATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADV 484

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LV+GLDQ++EAE LDR  L LPG Q++LV  V+ A+K   +LV+MS GPID++FAKN 
Sbjct: 485 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 544

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
             I A++W GYPG+AGG AIA ++FG  NP G+LP TWYPQE+   + MT+M MRP+ + 
Sbjct: 545 STIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTS 604

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            +PGR+YRFY G  +Y FG+G+SY++F   V +AP+++ +  +    ++N T S   + +
Sbjct: 605 GFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVLSAPSIIHIKTNPIM-NLNKTTS---VDI 660

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
           +   C+ L + + + VKN G + G+H +LVF  PP       G   P  QLV FE+V V 
Sbjct: 661 STVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVG 720

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
               ++  ++  VCK LS+VD  G R++  G H + IG        H
Sbjct: 721 RSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIGSNSDQQIYH 767


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  845 bits (2184), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/738 (56%), Positives = 526/738 (71%), Gaps = 8/738 (1%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  ++T     FC  +     R  DL+ RL+L +KV  L++   A+ RLGI  YEWW
Sbjct: 38  FACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYEWW 97

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS+EARAM+N G 
Sbjct: 98  SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNVGL 157

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGD-RLKVAASC 204
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ +YA  YV GLQ  G+D D  LKVAA C
Sbjct: 158 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADADGPLKVAACC 217

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF+ CV++GKVASVMCSYN+VNG PT
Sbjct: 218 KHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNKVNGKPT 277

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           CAD ++L   IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I++GLDL+CG F
Sbjct: 278 CADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLDLNCGDF 337

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
           L  HT +AVQ G LSE D++ A+ N   + MRLG FDG+P    YG LGPKDVCT  +QE
Sbjct: 338 LAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVCTSSNQE 397

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LA E ARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPL G
Sbjct: 398 LARETARQGIVLLKNDG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLHG 456

Query: 445 IGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           +G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQSIE EALDR  LLL
Sbjct: 457 LGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALDRTSLLL 516

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+Q +L+S V+ ASKG  ILV+MSGGP D++FAK   +I+AI+W GYPG+AGG AIADI
Sbjct: 517 PGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGGAAIADI 576

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG  NP G+LP+TWYP  +   +PMT+M MRP  S  YPGRTYRFY G  V+ FG G+S
Sbjct: 577 IFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRTYRFYTGETVFAFGDGLS 636

Query: 624 YTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           YT   H  VA  P+ V++ L   H     T    +++     C  +   V++ V N G  
Sbjct: 637 YTTMSHNLVAAPPSEVSMQLAEGHAC--HTKECASVEAAGDHCEGMAFEVRLRVHNTGEM 694

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
            GAHT+L+FS+PPA H AP K L+ FEK+++  G        + VCK LSVVD  G R++
Sbjct: 695 AGAHTVLLFSSPPAVHNAPAKHLLGFEKLNLEPGQAGVAAFKVDVCKDLSVVDELGNRKV 754

Query: 743 PLGEHNIHIGGTKHSVSL 760
            LG H +H+G  KH+++L
Sbjct: 755 ALGGHTLHVGDLKHTLNL 772


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  842 bits (2176), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/741 (56%), Positives = 524/741 (70%), Gaps = 10/741 (1%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC        R  DL+ RL+L EKV  L++   A+ RLGI  YEWW
Sbjct: 37  FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G 
Sbjct: 97  SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ     G     LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 216

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF+ CV++G VASVMCSYN+VNG 
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCAD ++L+  IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
            FL  HT +AVQ G LSE D++ A+ N   + MRLG FDG+P    +G LGPKDVCT  +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +ELA E ARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455

Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           QG+G    T++Q GC +V C+ + L    A+ A+  AD T+LV+G DQSIE E+LDR  L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK   +IAAI+W GYPG+AGG A+A
Sbjct: 516 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALA 575

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           DILFG+ NP GKLP+TWYP  Y   + MT+M MRP  S  YPGRTYRFY G  V+ FG G
Sbjct: 576 DILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 635

Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           +SYT   H++ +A P+ V++ L   H       +  +++     C+ L   V++ V+N G
Sbjct: 636 LSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SVEAAGDHCDDLAFDVKLQVRNAG 693

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
              GAH++L+FS+PP  H AP K L+ FEKV +  G    V   + VC+ LSVVD  G R
Sbjct: 694 EVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 753

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
           ++ LG H +H+G  KH+V L 
Sbjct: 754 KVALGGHTLHVGDLKHTVELR 774


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  841 bits (2173), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/741 (56%), Positives = 524/741 (70%), Gaps = 10/741 (1%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC        R  DL+ RL+L EKV  L++   A+ RLGI  YEWW
Sbjct: 37  FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G 
Sbjct: 97  SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ     G     LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 216

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF+ CV++G VASVMCSYN+VNG 
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCAD ++L+  IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
            FL  HT +AVQ G LSE D++ A+ N   + MRLG FDG+P    +G LGPKDVCT  +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +ELA E ARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455

Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           QG+G    T++Q GC +V C+ + L    A+ A+  AD T+LV+G DQSIE E+LDR  L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK   +IAAI+W GYPG+AGG A+A
Sbjct: 516 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALA 575

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           DILFG+ NP G+LP+TWYP  Y   + MT+M MRP  S  YPGRTYRFY G  V+ FG G
Sbjct: 576 DILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 635

Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           +SYT   H++ +A P+ V++ L   H       +  +++     C+ L   V++ V+N G
Sbjct: 636 LSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SVEAAGDHCDDLAFDVKLQVRNAG 693

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
              GAH++L+FS+PP  H AP K L+ FEKV +  G    V   + VC+ LSVVD  G R
Sbjct: 694 EVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 753

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
           ++ LG H +H+G  KH+V L 
Sbjct: 754 KVALGGHTLHVGDLKHTVELR 774


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  834 bits (2155), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 415/741 (56%), Positives = 521/741 (70%), Gaps = 10/741 (1%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC        R  DL+ RL+L EKV  L++   A+ RLGI  YEWW
Sbjct: 39  FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 98

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G 
Sbjct: 99  SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 158

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ     G     LKVAA
Sbjct: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAA 218

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF+ CV++G VASVMCSYN+VNG 
Sbjct: 219 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 278

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCAD ++L+  IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I++G+DL+CG
Sbjct: 279 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGVDLNCG 338

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
            FL  HT +AVQ G LSE D++ A+ N   + MRLG FDG+P    +G LGPKDVCT  +
Sbjct: 339 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 398

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +ELA E ARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPL
Sbjct: 399 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 457

Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           QG+G    T++Q GC +V C+ + L    A+ A+  AD T+LV+G DQSIE E+LDR  L
Sbjct: 458 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 517

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK   +IAA +W GYPG+AGG A+ 
Sbjct: 518 LLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAGGAALD 577

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D LFG+ NP G+LP+TWYP  Y   + MT+M MRP  S  YPGRTYRFY G  V+ FG G
Sbjct: 578 DTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDG 637

Query: 622 MSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           +SYT   H++ +A P+ V++ L   H  +       +++     C+ L L V++ V+N G
Sbjct: 638 LSYTKMSHSLVSAPPSYVSMRLAEDH--LCRAEECASVEAAGDHCDDLALDVKLQVRNAG 695

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
              GAH++L+FS+PP  H AP K LV FEKV +  G    V   + VC+ LSVVD  G R
Sbjct: 696 EVAGAHSVLLFSSPPPAHNAPAKHLVGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGR 755

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
           ++ LG H +H G  KH+V L 
Sbjct: 756 KVALGGHTLHDGDLKHTVELR 776


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  834 bits (2154), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/777 (54%), Positives = 530/777 (68%), Gaps = 60/777 (7%)

Query: 7   FFFLGLI---LLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLS 61
           F F  L+   L S+S   LA   P FACD   + +  +  FC  SL +  RV DL+ RL+
Sbjct: 16  FLFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLT 75

Query: 62  LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
           LQEK+  L++ A +V RLGI  YEWWSEALHGVS VGPGT F    PGATSFPQVI TA+
Sbjct: 76  LQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAA 135

Query: 122 SFNATLWEAIGR----VVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
           SFN +L+ AIG+    VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 136 SFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 195

Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIE 233
           LS KY + YV+GLQ  D    D LKVAA CKH+TAYDLDNW GVDR+HFNA  V+KQD++
Sbjct: 196 LSSKYGSGYVKGLQQRDDGNPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQDMD 255

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YIVSDCDS 291
           DTF  PF+ CV++G VASVMCSYN+VNG+PTCADP++L   IRGEW+LNG  YIV+DCDS
Sbjct: 256 DTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTDCDS 315

Query: 292 VGVYYDTQHFTSTPEEAAADAIRA--GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
           + V+Y++QH+T TPEEAAA AI A  GLDL+CG FLG HTE+AV  GL++E  I+ A+ N
Sbjct: 316 IDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSN 375

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
                MRLG FDG+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN          
Sbjct: 376 NFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKN---------- 425

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFG 469
                                  G  C YTTPLQG+     T +  GC +VAC+  Q+  
Sbjct: 426 ----------------------TGTPCKYTTPLQGLAALVATTYLPGCSNVACSTAQV-D 462

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
            A   +  ADAT+LVMG D SIEAE+ DR  +LLPG+QQ L++ V+ AS GP ILV+MSG
Sbjct: 463 DAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSG 522

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN------PGGKLPMTWYPQEY 583
           G +DV+FAK + +I +I+W GYPG+AGG AIADI+FG+ N      PGG+LPMTWYPQ Y
Sbjct: 523 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTHQPPGGRLPMTWYPQSY 582

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           +  +PMT M MRP  S  YPGRTYRFY G  VY FG G+SY+ F H +  AP +V+VPL+
Sbjct: 583 VDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLE 642

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
             H  +  +   K++      C      V + +KN G+  G+HT+ +FSTPP+ H +P K
Sbjct: 643 ENH--VCYSSECKSVAAAEQTCQNFD--VHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQK 698

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            LV FEKV + A     VG  + VCK LSVVD  G++++ LGEH +HIG  KHS+++
Sbjct: 699 HLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTV 755


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  832 bits (2148), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/767 (52%), Positives = 529/767 (68%), Gaps = 30/767 (3%)

Query: 9   FLGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           F+G+ LL A+  SS   +++ FACD  D  T    FC VSL    R  DL+ RLSL+EKV
Sbjct: 6   FVGVSLLIAALVSSLCESQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
           + L++ A  V RLG+  YEWWSEALHGVS+VGPG +F G  PGATSFP  I TA+SFN +
Sbjct: 66  QQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTS 125

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  +G VVS EARAM+N G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  KYA +Y
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNY 185

Query: 187 VRGLQ----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           V+GLQ         RLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+EDT+  PF+ 
Sbjct: 186 VKGLQDVQDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKS 245

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV EG V+SVMCSYN+VNG+PTCADPN+L+  IRG+WRL+GYIVSDCDS+ VY+D  H+T
Sbjct: 246 CVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHYT 305

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            T            L+++CG FLG +TE+AV+   L+  +++ AL+    V MRLG FDG
Sbjct: 306 KTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDG 354

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +P S P+G LGP DVC+ DHQ LALEAA+QGIVLL+N+G  LPLS    + +AVIGPN++
Sbjct: 355 DPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLENRG-DLPLSKTAVKKIAVIGPNAN 413

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADA 480
            T  MI NYAG+ C YT+PLQG+ +Y   + +++ GCKDV C +  L  AA+ A  +AD 
Sbjct: 414 ATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSEADV 473

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T+LV+GLDQ++EAE LDR  L LPG Q++LV  V+ A+K   +LV+MS GPID++FAKN 
Sbjct: 474 TVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNL 533

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
             I+A++W GYPG+AGG AIA ++FG  NP G+LP TWY QE+   + MT+M MRP+ + 
Sbjct: 534 STISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNMRPNSTS 593

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            +PGR+YRFY G  +Y FG+G+SY+ F   V +AP+++ +  +    ++N T S   I +
Sbjct: 594 GFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHIKTNPIL-NLNKTTS---IDI 649

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP------AGHWAPHKQLVAFEKVHVP 714
           +   C+ L + + + VKN G + G+H +LVF  PP       G   P  QLV FE+V V 
Sbjct: 650 STVNCHDLKIRIVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQTQLVGFERVEVG 709

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
               ++V +   VCK LS+VD  G R++  G H + IG        H
Sbjct: 710 RSMTEKVTVEFDVCKALSLVDTHGKRKLVTGHHTLVIGSNSDQQIYH 756


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  826 bits (2134), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/748 (52%), Positives = 527/748 (70%), Gaps = 19/748 (2%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           ACD     T   PFC  SL    R  DL+ RL+LQEK + L++ +  + RLG+  YEWWS
Sbjct: 25  ACDKGSPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWS 84

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
           EALHGVSNVGPGT+F    PGATSFP VI +A+SFN TLW  +G+VVS+EARAMYN   A
Sbjct: 85  EALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLA 144

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ------GSDGDRLKVAA 202
           GLT+WSPNVN+FRDPRWGRGQETPGEDP++  +YA +YVRGLQ       + GDRLKV++
Sbjct: 145 GLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEASAKGDRLKVSS 204

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYD+DNW GVDRFHF+AKV+KQD+EDT+  PF+ CV+EG V+SVMCSYN+VNG+
Sbjct: 205 CCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCSYNRVNGI 264

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCADP++L+  IRG+W L+GYIVSDCDSV VYY++ H+T TPE+A A A++AGL+++CG
Sbjct: 265 PTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPEDAVALALKAGLNMNCG 324

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
            FL  +T +AV    +    ++ ALV    V MRLG F+  P S P+ +LGP DVCT ++
Sbjct: 325 DFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSLPFANLGPSDVCTKEN 383

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q+LALEAA+QGIVLL+N   +LPLS  + + +AVIGPN++ T  MI NYAGI C Y++PL
Sbjct: 384 QQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYAGIPCRYSSPL 443

Query: 443 QGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           QG+ +Y  ++ + +GC DV C++  LF AA+ A+  ADA +LV+GLDQSIEAE LDR  L
Sbjct: 444 QGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSIEAEGLDRVNL 503

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
            LPG Q++LV  V+ A+KG  ILV+M+ GPID++F K+   I  I+W GYPGQ GG AIA
Sbjct: 504 TLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGYPGQDGGNAIA 563

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
            ++FG  NPGG+ P TWYPQ Y+  +PMT+M MR + S+ +PGRTYRFY G  +Y FG+G
Sbjct: 564 QVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYRFYNGKSLYEFGYG 623

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSI-----NATISGKAIKVTHAKCNRLTLGVQVDV 676
           +SY+ F   +A+AP+ + +    ++ SI     N  +  + I ++   C  LT  + + V
Sbjct: 624 LSYSTFSTHIASAPSTIMLQ---KNTSISKPLNNIFLDDQVIDISTISCFNLTFSLVIGV 680

Query: 677 KNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
           KN G  DG+H +LVF  PP+       P KQL+ FE+  V  G  + V + I +CK LS 
Sbjct: 681 KNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGKTEFVTVKIDICKMLSN 740

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           VD  G R++ +G+HNI +G +      H
Sbjct: 741 VDSDGKRKLVIGQHNILVGTSSEKQVTH 768


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  826 bits (2134), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/760 (53%), Positives = 523/760 (68%), Gaps = 23/760 (3%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           A + FACD     T    FC  SL    R  DLI RL+LQEKV+ +++ AA +PRLGI  
Sbjct: 21  ASQNFACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPA 80

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVG G +F G  PGATSFP +I +A+SFN TLW  +G+VVS EAR M+
Sbjct: 81  YEWWSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMH 140

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ------GSDGDR 197
           + G AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  +YA +YVRGLQ       S  D+
Sbjct: 141 SVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTADK 200

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           LKV++ CKH+TAYDLD W GVDRFHF+AKV+KQD+EDT+  PFR CV E  V+SVMCSYN
Sbjct: 201 LKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYN 260

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           +VNG+PTCADP++LK  IRGEW L+GYIVSDCDS+ VYYD+ ++T+TPE+A A A++AGL
Sbjct: 261 RVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKAGL 320

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           +++CG FLG +T  AV+   + E  ++ AL+    V MRLG FDG+P S  +G+LGP DV
Sbjct: 321 NMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDV 380

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           C+  HQ+LAL+AARQGIVLL N+G +LPLS    R +AVIGPN++VT TMI NYAGI C 
Sbjct: 381 CSDGHQKLALDAARQGIVLLYNKG-ALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCK 439

Query: 438 YTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
           YTTPLQG+ +Y  T+ +  GCK V+C+DD L  AA  A+  ADA +L++GLDQSIE E L
Sbjct: 440 YTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGL 499

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  L LPG Q++LV  V  A+ G  +LV+MS  PIDV+FA N  +I  I+W GYPGQAG
Sbjct: 500 DRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAG 559

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+A ++FG  NP G+ P TWYPQEY   +PMT+M MR + +  +PGRTYRFY G  +Y
Sbjct: 560 GDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTYRFYAGNTLY 619

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLD---------GRHGSINA--TISGKAIKVTHAKC 665
            FGHG+SY+ F + + + P+ + +  +           H S      I+ +A+ +T   C
Sbjct: 620 KFGHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQAMDITTLNC 679

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVG 722
               L + + V+N G   G H +LVF  PP       A + QLV F +V V  G  Q V 
Sbjct: 680 TNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEVNRGKTQNVT 739

Query: 723 INIHVCKYLSVVDRSGTRRIPLGEHNIHIGG-TKHSVSLH 761
           + I VCK LS+VD  G R++  G+H   IG  T+H V  H
Sbjct: 740 LEIDVCKRLSLVDSEGKRKLVTGQHIFTIGSDTEHQVRHH 779


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  824 bits (2129), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/784 (51%), Positives = 532/784 (67%), Gaps = 26/784 (3%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
            FL L LL+ S         +ACD  D  +   PFC  SLP   R +DL+ RL+LQEK K
Sbjct: 35  IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 90

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
            LI+ A  + RLG+  YEWWSEALHGVSN G G  F    P  T FP VI +A+SFN +L
Sbjct: 91  QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 150

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
           W  +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YV
Sbjct: 151 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 210

Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           RGLQ  G +G    DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+  PF+
Sbjct: 211 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 270

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV EG V+SVMCSYN+VNGVPTCA+P +LK  IR +W L+GYIVSDCDS+ VY++  ++
Sbjct: 271 SCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 330

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T TPE+A A A++AGL+L+CG +LG +T++AV  G + E  +N AL+    V MRLG FD
Sbjct: 331 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFD 390

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G+P+  P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS    +T+AVIGPN+
Sbjct: 391 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 449

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
           D T TM+ NYAG+ C YT+PLQG+ +Y   + +++GC +V+C+++ L   A   +  ADA
Sbjct: 450 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 509

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T++V+GLD  IEAE LDR  L LPG Q++LV + + A+ G  ILV+MS GP+D++F KN 
Sbjct: 510 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 569

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I  I+W GYPGQAGG AI+ ++FG  NPGG+ P TWYPQEY+  +PMT+M MRP+ + 
Sbjct: 570 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATS 629

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL----------DGRHGSI- 649
            +PGRTYRFY G  +Y FGHG+SY+ F   + +APT V V L             + ++ 
Sbjct: 630 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMP 689

Query: 650 NATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLV 706
           N   +G+AI ++   C  L+ + + + VKN G  DG H +L F  PP      AP  +LV
Sbjct: 690 NPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 749

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
            FE+V V  G  + VG+ + VC  +S VD  G R++ +G H + +G +    + H     
Sbjct: 750 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFR 809

Query: 767 VIKS 770
           + KS
Sbjct: 810 LAKS 813


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  821 bits (2120), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/655 (62%), Positives = 492/655 (75%), Gaps = 29/655 (4%)

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSD 194
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +S +YAA+YVRGLQ       G  
Sbjct: 1   MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60

Query: 195 GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
            +RLK+AA CKHFTAYDLD W G DRFHFNA V+ QD+EDTF+VPFR CV +G+ ASVMC
Sbjct: 61  RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMC 120

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
           SYNQVNGVPTCAD   L+ TIRG W L+GYIVSDCDSV V++  QH+T TPE+AAA  +R
Sbjct: 121 SYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLR 180

Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           AGLDLDCGPFL L+  SAV  G +++ D++ AL+NT+TVQMRLGMFDG+P++ P+G LGP
Sbjct: 181 AGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGP 240

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPS------LPLSHIRHRTVAVIGPNSDVTVTMI 428
            DVCT +HQ+LAL+AARQG+VLLKN+  +      LPL    HR VAV+GP++D TV MI
Sbjct: 241 ADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMI 300

Query: 429 GNYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
           GNYAG  C YTTPLQG+  Y AR  HQ GC DVAC  +Q   AA++A+RQADAT++V GL
Sbjct: 301 GNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGL 360

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
           DQ +EAE LDR  LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+AFA+NDPRI  I+
Sbjct: 361 DQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGIL 420

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
           W GYPGQAGG AIAD++FG  NPG KLP+TWY Q+Y+  +PMT MAMR + ++ YPGRTY
Sbjct: 421 WVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTY 480

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG---------KAI 658
           RFY GP +YPFGHG+SYT F HT+A+APT + V L G   S  +  S          +A+
Sbjct: 481 RFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAV 540

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA------GHWAPHKQLVAFEKVH 712
           +V HA+C  LT+ V VDV NVG +DGAH +LV+   P+      G  AP +QLVAFEKVH
Sbjct: 541 RVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVH 600

Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           VPAG   RV + I VC  LSV DR+G RR+P+GEH + IG   HSVSL    LGV
Sbjct: 601 VPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVEQLGV 655


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  821 bits (2120), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/784 (50%), Positives = 531/784 (67%), Gaps = 26/784 (3%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
            FL L LL+ S         +ACD  D  +   PFC  SLP   R +DL+ RL+LQEK K
Sbjct: 11  IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 66

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
            LI+ A  + RLG+  YEWWSEALHGVSN G G  F    P  T FP VI +A+SFN +L
Sbjct: 67  QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 126

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
           W  +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YV
Sbjct: 127 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 186

Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           RGLQ  G +G    DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+  PF+
Sbjct: 187 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 246

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV EG V+SVMCSYN+VNGVPTCA+P +LK  IR +W L+GYIVSDCDS+ VY++  ++
Sbjct: 247 XCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 306

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T TPE+A A A++AGL+L+CG +LG +T++AV  G + E  +B AL+    V MRLG FD
Sbjct: 307 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFD 366

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G+P+  P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS    +T+AVIGPN+
Sbjct: 367 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 425

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
           D T TM+ NYAG+ C YT+PLQG+ +Y   + +++GC +V+C+++ L   A   +  ADA
Sbjct: 426 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 485

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T++V+GLD  IEAE LDR  L LPG Q++LV + + A+ G  ILV+MS GP+D++F KN 
Sbjct: 486 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 545

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I  I+W GYPGQAGG AI+ ++FG  NPGG+ P TWYPQEY+  +PMT+M MRP+ + 
Sbjct: 546 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATX 605

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL----------DGRHGSI- 649
            +PGRTYRFY G  +Y FGHG+SY+ F   + +AP  V V L             + ++ 
Sbjct: 606 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMP 665

Query: 650 NATISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLV 706
           N   +G+AI ++   C  L+ + + + VKN G  DG H +L F  PP      AP  +LV
Sbjct: 666 NPNTNGQAIDISAIDCRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 725

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
            FE+V V  G  + VG+ + VC  +S VD  G R++ +G H + +G +    + H     
Sbjct: 726 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFR 785

Query: 767 VIKS 770
           + KS
Sbjct: 786 LAKS 789


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  813 bits (2101), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/711 (56%), Positives = 509/711 (71%), Gaps = 10/711 (1%)

Query: 58  GRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVI 117
           GR    +KV  L++   A+ RLGI  YEWWSEALHGVS VGPGT+F    PGATSFPQ I
Sbjct: 1   GRGGESQKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPI 60

Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
            TA+SFNA+L+ AIG VVS EARAM+N G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 61  LTAASFNASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 120

Query: 178 LSGKYAASYVRGLQ-----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
           L+ KYA  YV GLQ     G     LKVAA CKH+TAYD+DNW GV+R+ F+AKVS+QD+
Sbjct: 121 LASKYAVGYVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDL 180

Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
           +DTF  PF+ CV++G VASVMCSYN+VNG PTCAD ++L+  IRG+W+LNGYIVSDCDSV
Sbjct: 181 DDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSV 240

Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
            V Y  QH+T TPEEAAA  I++GLDL+CG FL  HT +AVQ G LSE D++ A+ N   
Sbjct: 241 DVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFI 300

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
           + MRLG FDG+P    +G LGPKDVCT  ++ELA E ARQGIVLLKN G +LPLS    +
Sbjct: 301 MLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIK 359

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAA 471
           ++AVIGPN++ + TMIGNY G  C YTTPLQG+G    T++Q GC +V C+ + L    A
Sbjct: 360 SMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTA 419

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + A+  AD T+LV+G DQSIE E+LDR  LLLPG+Q +LVS V+ AS GP ILV+MSGGP
Sbjct: 420 VAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGP 479

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
            D++FAK   +IAAI+W GYPG+AGG A+ADILFG+ NP G+LP+TWYP  Y   + MT+
Sbjct: 480 FDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTD 539

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSIN 650
           M MRP  S  YPGRTYRFY G  V+ FG G+SYT   H++ +A P+ V++ L   H    
Sbjct: 540 MRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCRA 599

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
              +  +++     C+ L   V++ V+N G   GAH++L+FS+PP  H AP K L+ FEK
Sbjct: 600 EECA--SVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEK 657

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           V +  G    V   + VC+ LSVVD  G R++ LG H +H+G  KH+V L 
Sbjct: 658 VSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 708


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  812 bits (2098), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/785 (50%), Positives = 539/785 (68%), Gaps = 33/785 (4%)

Query: 1   MASTIAFF--FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
           M + + F   FL  ++L  +S      +  AC    A T   PFC  SL    R  DL+ 
Sbjct: 1   MKNQLMFLSMFLSFLILPTAS------QKHACG--SAKTSNFPFCDTSLSYEDRAKDLVS 52

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           RL+LQEK + L++ +A + RLG+  YEWWSEALHGVSN+GPGT+F    PGATSFP VI 
Sbjct: 53  RLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVSNLGPGTRFDKKVPGATSFPAVIL 112

Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
           +A+SFNA+LW+ +G+VVS EARAMYN   AGLT+WSPNVN+FRDPRWGRGQETPGEDP++
Sbjct: 113 SAASFNASLWQKMGQVVSTEARAMYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLV 172

Query: 179 SGKYAASYVRGLQ------GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
             +YA  Y+RGLQ       +  DRLKV++ CKH+TAYDLDNW G+DRFHF+AKV+KQD+
Sbjct: 173 VSRYAVMYLRGLQEVEDEASAKADRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDL 232

Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
           ED++  PF+ CV+EG V+SVMCSYN+VNG+PTCADP++LK  IRG+W L+GYIVSDCDSV
Sbjct: 233 EDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSV 292

Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
            VYY+  H+T+TPE+A A A++AGL+++CG FL  +T +AV    +    ++ ALV    
Sbjct: 293 EVYYNAIHYTATPEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYI 352

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
           V MRLG FD +P S P+ +LGP DVCT D+Q+LAL+AA+QGIVLL+N   +LPLS    +
Sbjct: 353 VLMRLGFFD-DPKSLPFANLGPSDVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIK 411

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAA 471
            +AVIGPN++ T  MI NYAGI C YT+PLQG+ +Y  ++ +  GC +V C +  L  AA
Sbjct: 412 KLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAA 471

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + A+  ADA +LV+GLDQSIEAE LDR  L LPG Q++ V  V+ A+KG  ILV+M+ GP
Sbjct: 472 VKAAASADAVVLVVGLDQSIEAEGLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGP 531

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           ID++  K+   I  I+W GYPGQAGG AIA ++FG  NPGG+ P TWYPQ Y+  +PMT+
Sbjct: 532 IDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTD 591

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGS 648
           M MR ++S+ +PGRTYRFY G  +Y FGHG+SY+ F   VA+AP+ + +    +   H  
Sbjct: 592 MNMRANKSRNFPGRTYRFYNGNSLYEFGHGLSYSTFSMYVASAPSSIMIENTSISEPHNM 651

Query: 649 INATIS---------GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
           +++  S         G+AI ++   C  LT  + + VKN G  +G+H +LVF  P    +
Sbjct: 652 LSSNNSGTQVESLSDGQAIDISTINCQDLTFLLVIGVKNNGPLNGSHVVLVFWEPATSEF 711

Query: 700 ---APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
              AP KQL+ FE+V V  G  + V + I +C+ +S VD  G R++ +G+H I +G +  
Sbjct: 712 VIGAPIKQLIGFERVQVVVGVTEFVTVKIDICQLISNVDSDGKRKLVIGQHTILVGSSSE 771

Query: 757 SVSLH 761
           +   H
Sbjct: 772 TQVRH 776


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  810 bits (2093), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 444/780 (56%), Positives = 534/780 (68%), Gaps = 68/780 (8%)

Query: 4   TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
           T++  FLG+ L ++      A +PFACDPKD T R LPFCQV+LPI  RVNDLIGR++LQ
Sbjct: 8   TLSVLFLGVSLQTSK-----ALDPFACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQ 62

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EKV LL++ AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG FP ATSFPQVITTA+SF
Sbjct: 63  EKVGLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASF 122

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           NATLWEAIGRVVSDEARAM+NGG AGLTYWSPNV     PRWGRGQETPGEDPV+ GKYA
Sbjct: 123 NATLWEAIGRVVSDEARAMFNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYA 182

Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           ASYVRGLQGSDG RLKVAA CKHFTAYDLDNWNGVDRFHFNAKVSKQD+ DTFDVPFRMC
Sbjct: 183 ASYVRGLQGSDGIRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMC 242

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V EGKVASVMCSYNQVNG+PTCADPN+LK+T+RG+WRLNGYIVSDCDS GVYY  QHFTS
Sbjct: 243 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFTS 302

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
            P  ++    +AGLDLDCGPFL  H + AV++    E +INNA + TLT Q+ LG+FDG 
Sbjct: 303 -PRRSSLGCYKAGLDLDCGPFLVTHRD-AVKKA-AEEAEINNAWLKTLTFQISLGIFDGS 359

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P  Q  G + P  +  P +Q+LA+ A ++ + + KN+  +  L   RH    + GP +  
Sbjct: 360 P-LQAVGDVVPT-MGPPTNQDLAVNAPKR-LFIFKNR--AFLLYSPRH----IFGPVALF 410

Query: 424 TVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
                M+GNY G+ C Y  PLQG+  +   ++  GC +V CA   + G+A+D +  ADA 
Sbjct: 411 KSLPFMLGNYEGLPCKYLFPLQGLAGFVSLLYLPGCSNVICAVADV-GSAVDLAASADAV 469

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LV+G DQSIE E  DR    LPG+QQELV++V+MA+KGP +LV+M     D+A +    
Sbjct: 470 VLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM-----DLAISGGG- 523

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGT-------SNPGGKLPMTWYPQEYITNLPMTEMAM 594
                    Y  Q  G  I+D+  G+       SN  G +P   Y +     L  T++  
Sbjct: 524 -------CSY-NQVNGIPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNW 575

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFG--HGMSYTNFVHTVANAPTVV--AVPLDGRHGSIN 650
            P         T+ + K   ++ FG  H     +   T    P  +       GR   ++
Sbjct: 576 VP---------TWSWNK---LHKFGSHHSKCTDDGFGTPRRPPPWLRKCNHFQGRQSELH 623

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
                 ++           LG+QVDVKN GS DG HTLLV+  PPA HWAPHKQLVAFEK
Sbjct: 624 MLDVIDSL-----------LGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPHKQLVAFEK 672

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           VHV AG QQRVGINIHVCK LSVVD SG RRIP+GEH++HIG  KHSVSL A+ LGV++S
Sbjct: 673 VHVAAGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSLQASILGVVES 732


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  803 bits (2074), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/772 (50%), Positives = 512/772 (66%), Gaps = 50/772 (6%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
            FL L LL+ S         +ACD  D  +   PFC  SLP   R +DL+ RL+LQEK K
Sbjct: 35  IFLSLPLLAISQ----PTHRYACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAK 90

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
            LI+ A  + RLG+  YEWWSEALHGVSN G G  F    P  T FP VI +A+SFN +L
Sbjct: 91  QLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESL 150

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
           W  +G+VVS E RAMYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YV
Sbjct: 151 WYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV 210

Query: 188 RGLQ--GSDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           RGLQ  G +G    DRLKV++ CKH+TAYD+D W GVDRFHF+AKV+ QD+EDT+  PF+
Sbjct: 211 RGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFK 270

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
            CV EG V+SVMCSYN+VNGVPTCA+P +LK  IR +W L+GYIVSDCDS+ VY++  ++
Sbjct: 271 SCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY 330

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T TPE+A A A++AGL+L+CG +LG +T++AV  G + E  +N AL+    V MRLG FD
Sbjct: 331 TETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFD 390

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G+P+  P+G +GP DVCT DHQ LAL+AA+QGIVLL N G +LPLS    +T+AVIGPN+
Sbjct: 391 GDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNA 449

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADA 480
           D T TM+ NYAG+ C YT+PLQG+ +Y   + +++GC +V+C+++ L   A   +  ADA
Sbjct: 450 DATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADA 509

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
           T++V+GLD  IEAE LDR  L LPG Q++LV + + A+ G  ILV+MS GP+D++F KN 
Sbjct: 510 TVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNV 569

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
            +I  I+W GYPGQAGG AI+ ++FG  NPGG+ P TWYPQEY+  +PMT+M MRP+ + 
Sbjct: 570 SKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATS 629

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            +PGRTYRFY G  +Y FGHG+SY+ F   ++N   V+                      
Sbjct: 630 NFPGRTYRFYTGKSLYQFGHGLSYSTFYKNLSNIDIVIG--------------------- 668

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQ 718
                          VKN G  DG H +L F  PP      AP  +LV FE+V V  G  
Sbjct: 669 ---------------VKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKT 713

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           + VG+ + VC  +S VD  G R++ +G H + +G +    + H     + KS
Sbjct: 714 EMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVGSSSEQQARHHVNFRLAKS 765


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  799 bits (2064), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/761 (52%), Positives = 512/761 (67%), Gaps = 41/761 (5%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           +A + P+ACD ++AT     FC   LP   RV DLI R++LQEK+  L++ AA +PRLG+
Sbjct: 26  VAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGL 85

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             YEWW EALHGV+ V PG KFGG FPGATSFP  I TA+SF+A         VS EARA
Sbjct: 86  PRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARA 135

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
           M+N   AGLTYWSPNVNI+RDPRWGRGQETPGEDP+LS KYA  YVRGLQ ++  GD+LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           V+A CKH TAYD+DNW G  RF FNA V++QD+ DT++ PF+ CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255

Query: 260 NGVPTCADPNILKRTIRGEWRLNG----------------YIVSDCDSVGVYYDTQHFTS 303
           NGVPTCAD N+L  T+R  W LNG                YIVSDCDS+  ++D  ++  
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           T E+  ADA+ AGL+LDCGPFL +HT+SA+  G ++E ++N AL     VQMRLG++DG 
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P SQPYG+LGP+ VCT ++Q+LAL+AA++GIVLLKN G  LP S    RTVA IGP++  
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           T  MIGNY GI C YTTP  G+  YAR ++  GC DVAC  D L G+A+  + QADA +L
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTASQADAVVL 495

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
            +GLD + EAE  DR  LLLPG+QQELV++V+ A+KGP +LV+ SGG +DV+FAK + ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKV 555

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
             I+WAGYPG+AGG AIA +LFG  NPGG+LP+TWYP+ + T + M +M MRP  S+ YP
Sbjct: 556 QGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF-TGITMLDMNMRPDASRGYP 614

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYRFY G  VY FG+G +Y+   H    AP  +  P      ++  +  G  +   H 
Sbjct: 615 GRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFP---EAAAVKRSCDGN-LTCFHL 670

Query: 664 K------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGH-WAPHKQLVAFEKVHVPA 715
                  C+ LT  V++ V N G +     +L++S+PP AG   AP +QL  F KV V  
Sbjct: 671 NAHDEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAP 730

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
           GA + V I I  CK+LS    +G R +  G H + +G  +H
Sbjct: 731 GAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARH 771


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  797 bits (2058), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/765 (51%), Positives = 513/765 (67%), Gaps = 41/765 (5%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           +A + P+ACD ++AT     FC   LP   RV DLI R++LQEK+  L++ AA +PRLG+
Sbjct: 26  VAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGL 85

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             YEWW EALHGV+ V PG KFGG FPGATSFP  I TA+SF+A         VS EARA
Sbjct: 86  PRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARA 135

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
           M+N   AGLTYWSPNVNI+RDPRWGRGQETPGEDP+LS KYA  YVRGLQ ++  GD+LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           V+A CKH TAYD+DNW G  RF FNA V++QD+ DT++ PF+ CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255

Query: 260 NGVPTCADPNILKRTIRGEWRLNG----------------YIVSDCDSVGVYYDTQHFTS 303
           NGVPTCAD N+L  T+R  W LNG                YIVSDCDS+  ++D  ++  
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           T E+  ADA+ AGL+LDCGPFL +HT+SA+  G ++E ++N AL     VQMRLG++DG 
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P SQPYG+LGP+ VCT ++Q+LAL+AA++GIVLLKN G  LP S    RTVA IGP++  
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           T  MIGNY GI C YTTP  G+  YAR ++  GC DVAC  + L G+A   + QADA +L
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTASQADAVVL 495

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
            +GLD + EAE  DR  LLLPG+QQELV++V+ A+KGP +LV+ SGG +DV+FAK D ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKV 555

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
             ++WAGYPG+AGG AIA +LFG  NPGG+LP+TWYP+ + T + M +M MRP  S+ YP
Sbjct: 556 QGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF-TGITMLDMNMRPDASRGYP 614

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYRFY G  VY FG+G +Y+   H    AP  +  P      ++  +  G  +   H 
Sbjct: 615 GRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFP---EAAAVKRSCDGN-LTCFHL 670

Query: 664 K------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGH-WAPHKQLVAFEKVHVPA 715
                  C+ LT  V++ V N G +     +L++S+PP AG   AP +QL  F KV V  
Sbjct: 671 NAHDEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAP 730

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           GA + V I I  CK+LS    +G R +  G H + +G  +H + +
Sbjct: 731 GAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  795 bits (2052), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/657 (63%), Positives = 502/657 (76%), Gaps = 22/657 (3%)

Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           + VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286

Query: 193 SD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
                 RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ A
Sbjct: 287 QQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAA 346

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           SVMCSYNQVNGVPTCAD   L+ TIR  W L GYIVSDCDSV V+Y  QH+T T E+A A
Sbjct: 347 SVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVA 406

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
             +RAGLDLDCGPFL  +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P++QP+G
Sbjct: 407 ATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFG 466

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIG 429
           HLGP+ VCT  HQELA+EAARQGIVLLKN G +LPLS     R VAV+GP+++ TV MIG
Sbjct: 467 HLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIG 526

Query: 430 NYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADATILVMGL 487
           NYAG  C YTTPLQG+ RY AR  HQ GC DVACA   Q   AA+DA+R+ADATI+V GL
Sbjct: 527 NYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGL 586

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
           DQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+
Sbjct: 587 DQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGIL 646

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
           WAGYPGQAGG AIAD++FG  NPGGKLP+TWYPQ+Y+  +PMT MAMR + +K YPGRTY
Sbjct: 647 WAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTY 706

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL------DGRHGSINATI---SGKAI 658
           RFY GP ++PFGHG+SYT+F H++A+AP+ + V L           S+NAT       A+
Sbjct: 707 RFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATARLSRAAAV 766

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQLVAFEK 710
           +V HA+C  L + V VDV+NVG +DGAHT+LV++  PA   A          +QLVAFEK
Sbjct: 767 RVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAFEK 826

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           VHV AG   RV + I VC  LSV DR+G RRIP+GEH + IG   H+V++    LGV
Sbjct: 827 VHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQLGV 883



 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 67/98 (68%), Positives = 79/98 (80%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           TLPFC+ SLP   R  DL+ RL+  EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
           PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+  S
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRS 137


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  794 bits (2050), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/659 (63%), Positives = 502/659 (76%), Gaps = 24/659 (3%)

Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           + VSDE RAMYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286

Query: 193 SD--GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
                 RLK+AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ A
Sbjct: 287 QQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAA 346

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           SVMCSYNQVNGVPTCAD   L+ TIR  W L GYIVSDCDSV V+Y  QH+T T E+A A
Sbjct: 347 SVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVA 406

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
             +RAGLDLDCGPFL  +TE AV +G + + DI+ A+ NT+TVQMRLGMFDG+P++QP+G
Sbjct: 407 ATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFG 466

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIG 429
           HLGP+ VCT  HQELA+EAARQGIVLLKN G +LPLS     R VAV+GP+++ TV MIG
Sbjct: 467 HLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIG 526

Query: 430 NYAGIACGYTTPLQGIGRY-ARTIHQQGCKDVACA-DDQLFGAAIDASRQADATILVMGL 487
           NYAG  C YTTPLQG+ RY AR  HQ GC DVACA   Q   AA+DA+R+ADATI+V GL
Sbjct: 527 NYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGL 586

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
           DQ IEAE LDRA LLLPGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+
Sbjct: 587 DQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGIL 646

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
           WAGYPGQAGG AIAD++FG  NPGGKLP+TWYPQ+Y+  +PMT MAMR + +K YPGRTY
Sbjct: 647 WAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTY 706

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL--------DGRHGSINATI---SGK 656
           RFY GP ++PFGHG+SYT+F H++A+AP+ + V L             S+NAT       
Sbjct: 707 RFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNATARLSRAA 766

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--------KQLVAF 708
           A++V HA+C  L + V VDV+NVG +DGAHT+LV++  PA   A          +QLVAF
Sbjct: 767 AVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAF 826

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           EKVHV AG   RV + I VC  LSV DR+G RRIP+GEH + IG   H+V++    LGV
Sbjct: 827 EKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIALEQLGV 885



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 66/98 (67%), Positives = 79/98 (80%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           TLPFC+ SLP   R  DL+ R++  EKV+LL++ AA VPRLG+ GYEWWSEALHGVS+ G
Sbjct: 40  TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
           PG +FGG FPGAT+FPQVI TA+SFNATLWE IG+  S
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRS 137


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  794 bits (2050), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/777 (48%), Positives = 518/777 (66%), Gaps = 36/777 (4%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            FF  +    A S G A     AC   D+ + + PFC VSLP+P RV DL+GR++L EK+
Sbjct: 5   LFFFSIAWAPAESRGGAGA---AC--TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKI 59

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             ++S A+ +PRLGI GY+WW EALHGV+   PG KF    P ATSFPQVI T +SFN++
Sbjct: 60  AQIVSNASGIPRLGIPGYQWWEEALHGVAE-SPGVKFAAPVPSATSFPQVILTVASFNSS 118

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  I + +S EA AMYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KYAA +
Sbjct: 119 LWNKIAQAISIEAIAMYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYF 178

Query: 187 VRGLQGSDGD-------------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
           VRGLQ  D D             RLKV++ CKHFTAYD++   G D FHFNA+V+ QD++
Sbjct: 179 VRGLQEGDYDEGTAISTMQGSPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQ 238

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
           DTFD PFR C+++G+ + +MCSYN+VNGVP+CAD   L  T+R  W   GYIVSDCD+V 
Sbjct: 239 DTFDPPFRSCIVDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVA 298

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
           + Y+  ++T+T E+A AD + AG+DL+CG FL  HT +A+++G ++E  ++ AL N +TV
Sbjct: 299 LLYEYINYTTTAEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTV 358

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
           +MRLG+FDG  S + Y  +GP  VCTP+H++L+LEAA QGIVLLKN G  LP       T
Sbjct: 359 RMRLGLFDGN-SGETYNSIGPDAVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMT 417

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
           +AVIGP+ + T TM+GNYAG+ C Y TP QG+  Y +  + + GCKD+ C D  LF AA+
Sbjct: 418 IAVIGPSGNATETMLGNYAGVPCQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAV 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
            A+  +DA ++V+GLD+  E E LDR  LLLPG QQ LV +VS  +KGP ILV+MSGGPI
Sbjct: 478 RAAENSDAVVIVVGLDKDQEREGLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPI 537

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           DV FAK + +I+ ++W GYPG+AGG AIA ++FG  NP G+LPMTWYPQ +  ++ +  M
Sbjct: 538 DVTFAKENCKISNVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNM 597

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT------VVAVPLDGRH 646
            +RP+ S  +PGRTYRFY G  VY FGHG+SYTNF +T   AP+       VA+    R 
Sbjct: 598 HLRPNTSTGFPGRTYRFYTGENVYEFGHGLSYTNFTYTNFCAPSNITARNTVAIRTPLRE 657

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA---PHK 703
                    +   + +  C  L   V   + N G++D  H  L+++ PPA   +   P K
Sbjct: 658 D------GARQFPIDYTGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRK 711

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           QL++F++ H+ AG   +V  ++  CK L + + +GT+ +  G++ + +G  +H +SL
Sbjct: 712 QLISFKRQHLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  793 bits (2048), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/782 (49%), Positives = 523/782 (66%), Gaps = 46/782 (5%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            FF  +    A S G A     AC   D+ + + PFC VSLP+P RV DL+GR++L EK+
Sbjct: 5   LFFFSIAWAPAESRGGAGA---AC--TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKI 59

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             ++S A+ +PRLGI GY+WW EALHGV+   PG KF    P ATSFPQVI T +SFN++
Sbjct: 60  AQIVSNASGIPRLGIPGYQWWEEALHGVAE-SPGVKFAAPVPSATSFPQVILTVASFNSS 118

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  I + +S EA AMYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+LS KYAA +
Sbjct: 119 LWNKIAQAISIEAIAMYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYF 178

Query: 187 VRGLQGSDGD-------------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
           VRGLQ  D D             RLKV++ CKHFTAYD++   G D FHFNA+V+ QD++
Sbjct: 179 VRGLQEGDYDEGTAISTMQRRPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQ 238

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
           DTFD PFR C+++G+ + +MCSYN+VNGVP+CAD   L  T+R  W   GYIVSDCD+V 
Sbjct: 239 DTFDPPFRSCIVDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVA 298

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
           + Y+  ++T+T E+A AD + AG+DL+CG FL  HT +A+++G ++E  ++ AL N +TV
Sbjct: 299 LLYEYINYTTTAEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTV 358

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
           +MRLG+FDG  S + Y  +GP  VCT +H++L+LEAA QGIVLLKN G  LP       T
Sbjct: 359 RMRLGLFDGN-SGETYNSIGPDAVCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMT 417

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
           +AVIGP+ + T TM+GNYAG+ C Y TP QG+  Y +  + + GCKD+ C D  LF AA+
Sbjct: 418 IAVIGPSGNATETMLGNYAGVPCQYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAV 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
            A+  +DA ++V+GLD+  E E LDR  LLLPG QQ+LV +VS  +KGP ILV+MSGGPI
Sbjct: 478 RAAENSDAVVIVVGLDKDQEREGLDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPI 537

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           DV FAK + +I++++W GYPG+AGG AIA ++FG  NP G+LPMTWYPQ +  ++ +  M
Sbjct: 538 DVTFAKGNCKISSVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNM 597

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT--------VVAVPL-- 642
            +RP+ S  +PGRTYRFY G  VY FGHG+SYTNF +T  +AP+         +  PL  
Sbjct: 598 HLRPNTSTGFPGRTYRFYTGENVYEFGHGLSYTNFTYTNFSAPSNITARNTVAIRTPLRE 657

Query: 643 DG-RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA- 700
           DG RH  I+ T            C  L   V   + N G++D  H  L+++ PPA   + 
Sbjct: 658 DGARHFPIDYT-----------GCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSL 706

Query: 701 --PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
             P KQL++F++ H+ AG   +V  ++  CK L + + +GT+ +  G++ + +G  +H +
Sbjct: 707 SPPRKQLISFKRQHLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVI 766

Query: 759 SL 760
           SL
Sbjct: 767 SL 768


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  793 bits (2048), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/748 (52%), Positives = 527/748 (70%), Gaps = 12/748 (1%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           LA + P+ACD  +AT    PFC  SLPI  RV D + RL+L+EK+  LI+ A  +PRLG+
Sbjct: 24  LAGQAPYACDQSNATLAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGV 83

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             Y+WW EALHGV++  PG +FGG  P ATSFP  ITTA+SFN +L+  IG+ VS EARA
Sbjct: 84  PKYQWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARA 142

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLK 199
           M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+LS  +A  YVRGLQ S    D+LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           V+A CKH TAYD+DNW G DR+HFNA V++QD+EDT++ PF+ CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVPTCAD  +L  T+R  W+LNGYIVSDCDS+ V++D  ++ +T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG FL  HT SA+Q+  ++E  IN AL   +TVQMRLG++DG+P SQ YG LG  DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +HQ LALEAARQG+VLLKN G +LPLS  + +++AV+GP+++ T  MIGNYAGI C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441

Query: 440 TPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
           +PLQ   +YA+  +  GC +VAC+ D L   A+ A+  ADA ++ +GLD +IEAE+LDR 
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            LLLPG+QQELVS+V  A+KGP ++V++S G ID+ FA +D RIA I+WAGYPGQAGG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IA+++FG  NP GKLP TWYPQ + T++ M +M MRP+ S  YPGRTYRFY GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNF-TSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFG 620

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRH---GSINATISGKAIKVTHAK-CNRLTLGVQVD 675
            G+SYT+       AP+ +++P          +  + S   +  T  K C  L   V + 
Sbjct: 621 DGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDATDEKSCESLKSQVAIS 680

Query: 676 VKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGA-QQRVGINIHVCKYLS 732
           V+N G+   +HTL++FSTPP+      P +QLV F K+ +   +    V  ++  C++  
Sbjct: 681 VRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFV 740

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             DR G + +  G H +  G  +HS+ L
Sbjct: 741 HADRDGKKLLRSGTHVLTAGNEQHSLRL 768


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  791 bits (2043), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/766 (50%), Positives = 530/766 (69%), Gaps = 33/766 (4%)

Query: 13  ILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           IL +A +S    R  +ACD   +A+  + PFC   L I  RV DL+ RL+L EKV  +++
Sbjct: 23  ILSNARASTAQLR--YACDVSSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVN 80

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            A  +PRLG+  Y+WW EALHGV++  PG +FGG  P ATSFP  I TA+SFN+TL+ +I
Sbjct: 81  AAQGIPRLGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSI 139

Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
           G  VS EARA++N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ K+A+ YVRGLQ
Sbjct: 140 GEAVSSEARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ 199

Query: 192 G-------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           G       SDG  LKV+A CKH TAYD+DNW G+DR+HFNA+VS+QD+ DT++ PF+ C+
Sbjct: 200 GGAYEGSASDG-FLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCI 258

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
            +G+V+SVMCSYN+VNGVPTCAD N+L  T+R  W  NGYIVSDCD++ V ++   +  +
Sbjct: 259 EDGRVSSVMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPS 318

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
            E+A AD+I AGLDL+CG FLG H +SA+Q G ++E D+++A+ N +  +MRLG+FDG+P
Sbjct: 319 AEDAVADSILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDP 378

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
           +SQPY  LG  D+C+ DHQ+LAL+AA QG+VLLKN G SLPLS    +TVA+IGPN++ T
Sbjct: 379 NSQPYSSLGATDICSNDHQQLALDAALQGVVLLKNDG-SLPLS-TALKTVALIGPNANAT 436

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATIL 483
            TM+GNY GI C Y +PLQG+  Y+  I +  GC++VAC +  L  +A++ + +ADA +L
Sbjct: 437 YTMLGNYEGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVL 496

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           V+GLDQS E E  DR  LLLPG Q +LVS ++ A   P +LV+MS GP+D++  K++ RI
Sbjct: 497 VVGLDQSQERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRI 556

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
           +++IW GYPGQ+GG A+A ++FG  NPGG+LP TWY +E+ TN+ M +M MRP+    YP
Sbjct: 557 SSVIWLGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEF-TNVSMLDMQMRPNPLSGYP 615

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GR+YRFY G  +Y FG G+SY+ + +    APT     L     +   +    A+  + A
Sbjct: 616 GRSYRFYTGTPLYNFGDGLSYSTYFYKFLLAPT----KLSFFKSNTGNSRGCPAVNRSKA 671

Query: 664 K-------------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
           K             CN +   V V+V N+G + G+H++L+FS PP    AP KQL+AF+K
Sbjct: 672 KSGCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQK 731

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
           VH+ +   QR+   I  CK+LS V R+G R +  G H + IG   H
Sbjct: 732 VHLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVH 777


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score =  790 bits (2041), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/748 (51%), Positives = 504/748 (67%), Gaps = 27/748 (3%)

Query: 28  FACDPKDATTRTLPF-----CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           F CDP       L F     C  SL  P+R  DLI R++L EK   L   A+ V RLG+ 
Sbjct: 56  FVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLP 115

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
            Y WWSEALHGVSNVGPGT+F    PGATSFP VITTASSFN  LW+ IG+ VS EARAM
Sbjct: 116 PYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175

Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR----- 197
           YN G AGLTYWSP +N+ RDPRWGR  ETPGEDP + GKYA +YVRGLQ  +G       
Sbjct: 176 YNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDL 235

Query: 198 ----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
               LKV++ CKH+ AYD+DNW GV+R+ F+A+V++QD+ +TF+ PF MCV EG V+SVM
Sbjct: 236 NSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVM 295

Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF-TSTPEEAAADA 312
           CSYN+VNG+PTCADP +LK TIRG W L+GYIVSDCDSV V  +  H+   T E+A A  
Sbjct: 296 CSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQT 355

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           ++AGLDLDCG     +TES V++G +   +I+NAL N   V MRLG FDG      +  L
Sbjct: 356 LKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTG---FESL 412

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           G  D+C+ +H ELA EAARQG VLLKN   +LP     ++T+AV+GP+++ T  M+GNYA
Sbjct: 413 GKPDICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYA 472

Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
           G+ C   +P+ G+  YA+  +Q GC  VAC +D     A++A+R +DAT++ +G+D SIE
Sbjct: 473 GVPCRMNSPMDGLSEYAKVKYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIE 532

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
           AE+LDR  LLLPG Q +LV +V+  SKGP +LV++S G IDV+FAKN+  I AIIWAGYP
Sbjct: 533 AESLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYP 592

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           G+ GG AIAD++FG  NPGG+LP+TWY  +Y+  LPMT M +RP +S  YPGRTY+FY G
Sbjct: 593 GEEGGRAIADVIFGKFNPGGRLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDG 652

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH-----GSINATISGK--AIKVTHAKC 665
           PVVYPFGHG+SYT F+H + +A   +A+ L  R         N T   +  A+ V    C
Sbjct: 653 PVVYPFGHGLSYTFFLHNLTSAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTC 712

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGIN 724
               +  Q++V+N G +DG+  LLV+S PP G  + H KQ+V F++V + AG  + V   
Sbjct: 713 TE-EIEFQMEVENTGERDGSQVLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFK 771

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           ++ CK L +VD +G   +P G H I +G
Sbjct: 772 LNACKSLGLVDFTGYNLLPAGGHTIVVG 799


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  786 bits (2029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/748 (52%), Positives = 524/748 (70%), Gaps = 12/748 (1%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           LA + P+ACD  +AT    PFC  SL I  RV D + RL+L+EK+  LI+ A  +PRLG+
Sbjct: 24  LAGQAPYACDQSNATLAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGV 83

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             Y+WW EALHGV++  PG +FGG  P ATSFP  ITTA+SFN +L+  IG+ VS EARA
Sbjct: 84  PKYQWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARA 142

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLK 199
           M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+LS  +A  YVRGLQ S    D+LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           V+A CKH TAYD+DNW G DR+HFNA V++QD+EDT++ PF+ CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVPTCAD  +L  T+R  W+LNGYIVSDCDS+ V++D  ++ +T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG FL  HT SA+Q+  ++E  IN AL   +TVQMRLG++DG+P SQ YG LG  DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +HQ LALEAARQG+VLLKN G +LPLS  + +++AV+GP+++ T  MIGNYAGI C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441

Query: 440 TPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
           +PLQ   +YA+  +  GC +VAC+ D L   A+ A+  ADA ++ +GLD +IEAE+LDR 
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            LLLPG+QQELVS+V  A+KGP ++V++S G ID+ FA +D RIA I+WAGYPGQAGG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IA+++FG  NP GKLP TWYPQ + T++ M +M MRP+ S  YPGRTYRFY GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNF-TSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFG 620

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRH---GSINATISGKAIKVTHAK-CNRLTLGVQVD 675
            G+SYT+       AP+ +++P          +  + S   +  T  K C  L   V + 
Sbjct: 621 DGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDATDEKSCESLKSQVAIS 680

Query: 676 VKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGA-QQRVGINIHVCKYLS 732
           V+N G+   +HTL++FSTPP       P +QLV F K+ +   +    V  ++  C++  
Sbjct: 681 VRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFV 740

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             D  G + +  G H +  G  +HS+ L
Sbjct: 741 HADPDGKKLLRSGTHVLTAGNEQHSLRL 768


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  783 bits (2021), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/751 (50%), Positives = 523/751 (69%), Gaps = 23/751 (3%)

Query: 24  AREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           A+  +ACD   +A+  + PFC   L +  RV DL+ RL+L EKV  +++ A  +PRLG+ 
Sbjct: 33  AQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVP 92

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
            Y+WW EALHGV++  PG +FGG  P ATSFP  I  A+SFN+TL+ +IG  VS EARA+
Sbjct: 93  SYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSEARAL 151

Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-------SDG 195
           +N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ K+A+ YVRGLQG       SDG
Sbjct: 152 HNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGGSASDG 211

Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
             LKV+A CKH TAYD+DNW G+DR+HFNA+VS+QD+ DT++ PF+ C+ +G+V+SVMCS
Sbjct: 212 -FLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCS 270

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN+VNGVPTCAD ++L  T+R  W  NGYIVSDCD++ V ++   +  + E+A AD+I A
Sbjct: 271 YNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADSILA 330

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           GLDL+CG FLG H +SA+Q G ++E D+++A+ N +  +MRLG+FDG+ +++PY  LG  
Sbjct: 331 GLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSLGAT 390

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
           D+C+ DHQ+LAL+AA QG+VLLKN G SLPLS    +TVA+IGPN++ T TM+GNY GI 
Sbjct: 391 DICSNDHQQLALDAALQGVVLLKNDG-SLPLS-TALKTVALIGPNANATYTMLGNYEGIP 448

Query: 436 CGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
           C Y +PLQG+  Y   I +  GC+DVAC++  L  +A++ + +ADA +LV+GLDQS E E
Sbjct: 449 CKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQERE 508

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR  LLLPG Q +LVS ++ A   P +LV+MS GP+D++  K++ RI+++IW GYPGQ
Sbjct: 509 TFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGYPGQ 568

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
           +GG A+A ++FG  NPGG+LP TWY +E+ TN+ M +M MRP+    YPGR+YRFY G  
Sbjct: 569 SGGAALAHVVFGAYNPGGRLPNTWYHEEF-TNVSMLDMRMRPNPPSGYPGRSYRFYTGTP 627

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA---IKVTH------AKC 665
           +Y FG G+SY+ +++    APT ++        S +     ++       H        C
Sbjct: 628 LYNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRNSRDCPTVNRSEAEFGCFHLPADDLETC 687

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           N +   V V+V N+G + G+H++L+FS PP    AP KQL+AF+KVH+ +   QR+   I
Sbjct: 688 NSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFGI 747

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
             CK+LS V R+G R +  G H + IG   H
Sbjct: 748 DPCKHLSSVRRNGKRFLHSGRHKLLIGNAVH 778


>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score =  780 bits (2015), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/767 (48%), Positives = 523/767 (68%), Gaps = 19/767 (2%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           F    L++L  +S+      PF+CDP + +T +  FC+ SLPI QRV DL+ RL+L EK+
Sbjct: 12  FTIFPLLILQITST----EPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLVSRLTLDEKI 67

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             L+S A ++PRLGI  YEWWSEALHGV+NVG G  F G    ATSFPQVI TA+SF+A 
Sbjct: 68  SQLVSSAPSIPRLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAY 127

Query: 127 LWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            W  IG+V+  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA S
Sbjct: 128 QWYRIGQVIGREARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 187

Query: 186 YVRGLQGS--DGDRLK----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
           YVRG+QG    G +LK     +A CKHFTAYDLDNW GV+RF F+A+V+ QD+ DT+  P
Sbjct: 188 YVRGVQGDSFQGGKLKGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPP 247

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F+ CV +GK + +MC+YN+VNG+P+CAD N+L RT RG+W  +GYI SDCD+V + YD Q
Sbjct: 248 FQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQ 307

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
            +  +PE+A  D ++AG+D++CG +L  HT++AV++  L E  I+ AL N  +V+MRLG+
Sbjct: 308 GYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGL 367

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           F+G P+ QP+ ++GP  VC+ +HQ LALEAAR GIVLLKN    LPL   +  ++AVIGP
Sbjct: 368 FNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGP 427

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQA 478
           N++   T++GNYAG  C   TPLQ +  Y + TI+  GC  V C+   +   A+D ++  
Sbjct: 428 NANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSSASI-DKAVDIAKGV 486

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           D  +++MGLDQ+ E E LDR  L+LPG+QQEL++ V+ ++K P +LVL+SGGP+D++FAK
Sbjct: 487 DRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAK 546

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
            D  I +I+WAGYPG+AGG A+A+I+FG  NPGGKLPMTWYPQE++  +PMT+M MRP  
Sbjct: 547 YDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFV-KVPMTDMRMRPDP 605

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSINATISGKA 657
           S  YPGRTYRFYKG  V+ FG+G+SY+ + + +   + T + +        I+ +   +A
Sbjct: 606 SSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQTKLYLNQSSTMRIIDNSDPVRA 665

Query: 658 IKVTH---AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-GHWAPHKQLVAFEKVHV 713
             V       C      V+V V+N G   G H +L+F+     G+  P +QL+ F+ V +
Sbjct: 666 TLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSVIL 725

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            AG +  +   +  C++ S  +  G R +  G H + +GG K+ +S+
Sbjct: 726 NAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMVGGDKYPISV 772


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  778 bits (2009), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/458 (80%), Positives = 413/458 (90%), Gaps = 1/458 (0%)

Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
           +A LDLDCGPFLG HTE AV++GLL+E +INNAL+NTLTVQMRLGMFDGEPSS+PYG+LG
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           P DVCTP HQELALEAARQGIVLLKN GP LPLS   H++VA+IGPNS+VTVTMIGNYAG
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
           +ACGYTTPLQGIGRYA+TI+QQGC DVAC  DQ F AA+DA+RQADAT+LVMGLDQSIEA
Sbjct: 125 VACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEA 184

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E+ DR  LLLPGRQQEL+SKV+ ASKGPTILVLMSGGPIDV+FA+NDP+I  I+WAGYPG
Sbjct: 185 ESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPG 244

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           QAGG AI+D+LFGT+NPGGKLPMTWYPQ+Y+TNLPMT MAMRPS+S  YPGRTYRFYKG 
Sbjct: 245 QAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKGK 304

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGKAIKVTHAKCNRLTLGV 672
           VVYPFGHG+SYTNFVHT+A+APT+V+VPLDG R  S NATISGKAI+VTHA+CNRL+ GV
Sbjct: 305 VVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNATISGKAIRVTHARCNRLSFGV 364

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           QVDVKN GS DG HTLLV+S PPAGHWAP KQLVAFEKVHV AG QQRVGIN+HVCK+LS
Sbjct: 365 QVDVKNTGSMDGTHTLLVYSKPPAGHWAPLKQLVAFEKVHVAAGTQQRVGINVHVCKFLS 424

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           VVDRSG RRIP+G H++HIG  KHSVSL A+ LGVIKS
Sbjct: 425 VVDRSGIRRIPMGAHSLHIGDVKHSVSLQASILGVIKS 462


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  769 bits (1986), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/765 (47%), Positives = 508/765 (66%), Gaps = 39/765 (5%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
             F    +I+L   S+    + PF+CD  + +T+T PFC+ +LPI QR NDL+ RL+L+E
Sbjct: 9   FTFTIFTVIVLQVDST----QPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEE 64

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  L++ A  +PRLGI GY+WWSEALHGV+  GPG +F G    ATSFPQVI +A+SF+
Sbjct: 65  KISQLVNSAQPIPRLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFD 124

Query: 125 ATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           A  W  I + +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA
Sbjct: 125 ANQWYRISQAIGKEARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYA 184

Query: 184 ASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
            SYVRGLQG           L+ +A CKHFTAYDL+NWNG  R+ F+A V+ QD+ DT+ 
Sbjct: 185 VSYVRGLQGDSFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQ 244

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PF+ CV EG+ + +MC+YN+VNG+P CAD N L RT R +W  +GYI SDCD+V + +D
Sbjct: 245 PPFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHD 304

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
            Q +  TPE+A    ++AG+D++CG +L  HT++AV +  L+  +I+ AL N  +V+MRL
Sbjct: 305 AQGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRL 364

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+F+G P+ Q +G++GP  VC+ ++Q LAL+AAR GIVLLKN    LPLS  +  ++AVI
Sbjct: 365 GLFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVI 424

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
           GPN++   T++GNYAG  C   TPLQ +  Y + TI   GC  V C+   + G A++ ++
Sbjct: 425 GPNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVG-AVNVAK 483

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
            AD  +L+MGLD + E E LDR  L+LPG+QQEL+  V+ A+K P +LVL+SGGP+D++F
Sbjct: 484 GADHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISF 543

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AKND  I +I+WAGYPG+AG  A+A+I+FG  NPGGKLPMTWYPQE++  +PMT+M MRP
Sbjct: 544 AKNDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFV-KVPMTDMRMRP 602

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
             S  YPGRTYRFYKGP V+ FG+G+SY+ + + +                        +
Sbjct: 603 ETSSGYPGRTYRFYKGPTVFEFGYGLSYSKYTYEL------------------------R 638

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPA 715
           AI +   +C  +   V V VKN G   G H +L+F+     G   P K+LV F+ V + A
Sbjct: 639 AIYIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQTVKLGA 698

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           G +  +   +  C++LS  +  G   +  G   + +G  +H V++
Sbjct: 699 GEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVTI 743


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  769 bits (1986), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/731 (53%), Positives = 490/731 (67%), Gaps = 50/731 (6%)

Query: 75  AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIG-- 132
           A+PRLGI  YEWWSEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG  
Sbjct: 44  ALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGES 103

Query: 133 ----------------------------------------RVVSDEARAMYNGGTAGLTY 152
                                                   RVVS EARAM+N G AGLT+
Sbjct: 104 ACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTF 163

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCKHFTAY 210
           WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ + G  D LKVAA CKH+TAY
Sbjct: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALKVAACCKHYTAY 223

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
           D+DNW GV+R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYN+VNG PTCAD ++
Sbjct: 224 DVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADKDL 283

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           L   IRG+W+LNGYIVSDCDSV V Y+ QH+T  PE+AAA  I++GLDL+CG FL  HT 
Sbjct: 284 LSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTV 343

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +AVQ G LSE D++ A+ N   V MRLG FDG+P   P+G LGPKDVCT  +QELA EAA
Sbjct: 344 AAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAA 403

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
           RQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPLQG+G    
Sbjct: 404 RQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA 462

Query: 451 TIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
           T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQS+E E+LDR  LLLPG+Q +
Sbjct: 463 TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLPGQQPQ 522

Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
           LVS V+ AS+GP ILV+MSGGP D++FAK+  +I+AI+W GYP ++              
Sbjct: 523 LVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRPRRHPLRIPQ 582

Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
               LP+TWYP  +   + MT+M MRP  S  YPGRTYRFY G  VY FG G+SYT F H
Sbjct: 583 --SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAH 640

Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
           ++ +AP  VAV L   H     T    +++     C  L+  V + V+N G   G HT+ 
Sbjct: 641 SLVSAPEQVAVQLAEGHAC--HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVF 698

Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
           +FS+PP+ H AP K L+ FEKV +  G    V   + VCK LSVVD  G R++ LG H +
Sbjct: 699 LFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTL 758

Query: 750 HIGGTKHSVSL 760
           H+G  KH+++L
Sbjct: 759 HVGDLKHTLNL 769


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  764 bits (1974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/751 (47%), Positives = 511/751 (68%), Gaps = 15/751 (1%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           + + PF+CD  + +T+  PFC+ +LPI QR  DL+ RL+L EK+  L++ A  +PRLGI 
Sbjct: 23  STQPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIP 82

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
           GYEWWSEALHGVSN GPG  F  +  GATSFPQVI TA+SF+A  W  IG+ +  EARA+
Sbjct: 83  GYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARAL 142

Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
           YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YAASYV+G+QG   +     
Sbjct: 143 YNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGGKIK 202

Query: 197 -RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
             L+ +A CKHFTAYDLDNW G++RF F+A+V+ QD+ DT+  PF+ CV +G+ + +MC+
Sbjct: 203 GHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCA 262

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN+VNGVP+CAD N+L +T R +W   GYI SDCD+V + +D Q +  +PE+A  D ++A
Sbjct: 263 YNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDVLKA 322

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           G+D++CG +L  H + AV++  LSE DI+ AL N  +V+MRLG+F+G P  Q +G++GP 
Sbjct: 323 GMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPD 382

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
            VC+ +HQ LALEAAR GIVLLKN    LPLS  + +++AVIGPN++    ++GNYAG  
Sbjct: 383 QVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPP 442

Query: 436 CGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
           C + TPLQ +  Y + T++   C  V C+   +   A+D ++ AD  +L+MGLDQ+ E E
Sbjct: 443 CRFVTPLQALQSYIKQTVYHPACDTVQCSSASV-DRAVDVAKGADNVVLMMGLDQTQERE 501

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
            LDR  LLLPG+QQEL+  V+ A+K P +LVL SGGP+D++FAKND  I +I+WAGYPG+
Sbjct: 502 ELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGE 561

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            G  A+A+I+FG  NPGG+LPMTWYPQE++  +PMT+M MRP  S  YPGRTYRFY+G  
Sbjct: 562 GGAIALAEIVFGDHNPGGRLPMTWYPQEFV-KVPMTDMGMRPEASSGYPGRTYRFYRGRS 620

Query: 615 VYPFGHGMSYTNFVHTV-ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK---CNRLTL 670
           V+ FG+G+SY+ + + + A +   + +        IN   S ++  ++      C +   
Sbjct: 621 VFEFGYGISYSKYSYELTAVSQNTLYLNQSSTMHIINDFDSVRSTLISELGTEFCEQNKC 680

Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPA-GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
             ++ VKN G   G H +L+F+     G+  P KQL+ F+ V + AG +  +   +  C+
Sbjct: 681 RARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVSPCE 740

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           +LS  +  G   +  G H + + G ++ +S+
Sbjct: 741 HLSRANEDGLMVMEEGRHFLVVDGDEYPISV 771


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/754 (47%), Positives = 510/754 (67%), Gaps = 15/754 (1%)

Query: 20  SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
           +G +++ P+ACD  +  T+TLPFC+  LPI  R  DL+ RL+L EKV  L++    +PRL
Sbjct: 30  AGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRL 89

Query: 80  GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
           GI  YEWWSEALHGV+NVG G +  G    ATSFPQVI TA+SF+  LW  IG+ +  EA
Sbjct: 90  GIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEA 149

Query: 140 RAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---- 194
           RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY+ +YVRG+QG      
Sbjct: 150 RAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGG 209

Query: 195 --GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
             G++LK +A CKHFTAYDLD WNG+ R+ F+AKV+ QD+ DT+  PF  CV EGK + +
Sbjct: 210 KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGI 269

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MC+YN+VNGVP+CAD ++L  T R +W+ NGYI SDCD+V + +D Q +   PE+A AD 
Sbjct: 270 MCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADV 329

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           +RAG+D++CG +L  HT+SAV+   +  + I+ AL N  +V+MRLG+FDG P+  P+G +
Sbjct: 330 LRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQI 389

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           G   VC+  HQ LAL+AAR+GIVLLKN    LPLS     ++AVIG N +   T+ GNYA
Sbjct: 390 GRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYA 449

Query: 433 GIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
           GI C   TP QG+  Y + T++ +GC    C +  ++  A+  ++  D  +LVMGLDQ+ 
Sbjct: 450 GIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIY-QAVKIAKSVDYVVLVMGLDQTQ 508

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
           E E  DR  L LPG+Q +L+++V+ A+K P ILV++SGGP+D++ AK + +I +I+WAGY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGY 568

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQAGGTAIA+I+FG  NPGG+LP+TWYP ++I   PMT+M MR   S  YPGRTYRFY 
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADSSTGYPGRTYRFYN 627

Query: 612 GPVVYPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNR 667
           GP VY FG+G+SY+N ++   +V+ +  +++ P   +    +  +S + +     K C  
Sbjct: 628 GPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSELDKKFCES 687

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
            T+ V V V+N G   G H++L+F  P    + +P KQLV F+KV + AG ++ +   + 
Sbjct: 688 KTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVS 747

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            C ++S     G   I  G +++ +G  +H + +
Sbjct: 748 PCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/671 (58%), Positives = 478/671 (71%), Gaps = 30/671 (4%)

Query: 5   IAFFFLGLILLSASSSGLAAREP-FACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           + F    + L+  S+   A   P FACD   + +  +L FC  S+ I  RV DL+ RL+L
Sbjct: 15  VVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTL 74

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           QEK+  L++ A  V RLGI  YEWWSEALHGVS VGPGT F  D  GATSFPQVI TA+S
Sbjct: 75  QEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAAS 134

Query: 123 FNATLWEAIGRV-----VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
           FN +L+EAIG+V     VS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+
Sbjct: 135 FNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 194

Query: 178 LSGKYAASYVRGLQGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV-SKQDIE 233
           LS KY + YV+GLQ  D    D+LKVAA CKH+TAYDLDNW G DR+HFNA V +KQD++
Sbjct: 195 LSSKYGSCYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVVTKQDMD 254

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY-------IV 286
           DTF  PF+ CV++G VASVMCSYNQVNG PTCADP++L   IRGEW LNGY       IV
Sbjct: 255 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYQWGCCRYIV 314

Query: 287 SDCDSVGVYYDTQHFTSTPEEA------AADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
           +DCDS+ V+Y +Q++T TPEEA      A +++  G+DL+CG FLG HTE+AV+ GL++E
Sbjct: 315 TDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCGSFLGQHTEAAVKGGLVNE 374

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             I+ A+ N     MRLG FDG+PS Q YG LGPKDVCT ++QELA EAARQGIVLLKN 
Sbjct: 375 HAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNT 434

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-AGIACGYTTPLQGIGRYARTIHQQGCKD 459
             SLPLS    + +AVIGPN++VT TMIGNY  G  C YTTPLQG+     T +  GC +
Sbjct: 435 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTPLQGLAASVATTYLPGCSN 494

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
           VAC+  Q+   A   +  ADAT+LVMG D SIEAE+ DR  +LLPG+QQ L++ V+  S 
Sbjct: 495 VACSTAQV-DDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSC 553

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG----GKLP 575
           GP ILV+MSGG +DV+FA+ + +I +I+W GYPG+AGG AIADI+FG  NP     G+LP
Sbjct: 554 GPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSTHQPGRLP 613

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           MTWYPQ Y+  +PMT M MRP  S  YPGRTYRFY G  VY FG G+SY+ F H +  AP
Sbjct: 614 MTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAP 673

Query: 636 TVVAVPLDGRH 646
            +V VPL+  H
Sbjct: 674 QLVYVPLEESH 684


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  760 bits (1962), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/754 (47%), Positives = 510/754 (67%), Gaps = 15/754 (1%)

Query: 20  SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
           +G +++ P+ACD  +  T+TLPFC+  LPI  R  DL+ RL+L EKV  L++    +PRL
Sbjct: 30  AGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRL 89

Query: 80  GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
           GI  YEWWSEALHGV+NVG G +  G    ATSFPQVI TA+SF+  LW  IG+ +  EA
Sbjct: 90  GIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEA 149

Query: 140 RAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---- 194
           RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY+ +YVRG+QG      
Sbjct: 150 RAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGG 209

Query: 195 --GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
             G++LK +A CKHFTAYDLD WNG+ R+ F+AKV+ QD+ DT+  PF  CV EGK + +
Sbjct: 210 KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGI 269

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MC+YN+VNGVP+CAD ++L  T R +W+ NGYI SDCD+V + +D Q +   PE+A AD 
Sbjct: 270 MCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADV 329

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           +RAG+D++CG +L  HT+SAV+   +  + I+ AL N  +V+MRLG+FDG P+  P+G +
Sbjct: 330 LRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQI 389

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           G   VC+  HQ LAL+AAR+GIVLLKN    LPLS     ++AVIG N +   T+ GNYA
Sbjct: 390 GRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYA 449

Query: 433 GIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
           GI C   TP QG+  Y + T++ +GC    C +  ++  A+  ++  D  +LVMGLDQ+ 
Sbjct: 450 GIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIY-QAVKIAKSVDYVVLVMGLDQTQ 508

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
           E E  DR  L LPG+Q +L+++V+ A+K P ILV++SGGP+D++ AK + +I +I+WAGY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGY 568

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQAGGTAIA+I+FG  NPGG+LP+TWYP ++I   PMT+M MR   S  YPGRTYRFY 
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADSSTGYPGRTYRFYN 627

Query: 612 GPVVYPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNR 667
           GP VY FG+G+SY+N ++   +V+ +  +++ P   +    +  +S + +     K C  
Sbjct: 628 GPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSELDKKFCES 687

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
            T+ V V V+N G   G H++L+F  P    + +P KQLV F+KV + AG ++ +   + 
Sbjct: 688 KTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVS 747

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            C ++S     G   I  G +++ +G  +H + +
Sbjct: 748 PCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/766 (47%), Positives = 511/766 (66%), Gaps = 22/766 (2%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           LI LS  S   + + PF+CD  +  T++L FCQ  LPI  RV DL+ RL+L EK+  L++
Sbjct: 13  LISLSLVSIVQSTQPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLVSRLTLDEKISQLVN 72

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            A A+PRLGI  YEWWSE+LHGV + G G  F G   GATSFPQVI TA++F+  LW  I
Sbjct: 73  SAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQVILTAATFDENLWYRI 132

Query: 132 GRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           G+V+  EAR +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA  YVRG+
Sbjct: 133 GQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGV 192

Query: 191 QGS--DGDRLK-----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           QG   +G +LK      +A CKHFTAYDLD W  +DRF FNA V+ QD+ DTF  PF+ C
Sbjct: 193 QGDSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTFQPPFQDC 252

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           + + + + +MCSYN VNG+P+CA+ N+L +T R +W  +GYI SDCD+V V +D   + +
Sbjct: 253 IQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQVMHDNHRYGN 312

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           TPE++ A A++AG+D+DCG +L  +T+SAV +  +S++ I+ AL N  +++MRLG+F+G+
Sbjct: 313 TPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMRLGLFNGD 372

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P  Q YG++ P  VC P HQ+LALEAAR GIVLLKN G  LPLS  +  ++AVIG N++ 
Sbjct: 373 PRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAVIGHNANN 432

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATI 482
              + GNY G  C Y   L+ +  YA+++ +QQGC    C    +   A++ +R AD  +
Sbjct: 433 AYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTSANI-DQAVNIARNADYVV 491

Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
           L+MGLDQ+ E E  DR  L+LPG+Q+ L++ V+ A+K P ILV++SGGP+D++FAK +P+
Sbjct: 492 LIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVDISFAKYNPK 551

Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
           I +I+WAGYPG+AGG A+A+I+FG  NPGGKLP+TWYPQ ++  +PMT+M MRP     Y
Sbjct: 552 IGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFV-KIPMTDMRMRPDPKTGY 610

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKVT 661
           PGRTYRFYKGP VY FG+G+SYT + +   +A P  + +    +  S+    +  +I+ T
Sbjct: 611 PGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQL---NQLLSVKTVENSDSIRYT 667

Query: 662 HA------KCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVP 714
                    C +      V V+N G  DG H +L+F     A + +P KQLV F+ V + 
Sbjct: 668 FVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLVGFQSVSLK 727

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           AG   ++   I  C++LS  +  G   I  G   + +G  +H +++
Sbjct: 728 AGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVGDAEHPINI 773


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  753 bits (1943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/732 (51%), Positives = 502/732 (68%), Gaps = 27/732 (3%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           + FC  SL    RV DL+ RL+L+EKV  L++ A+A+PRL I  YEWW E LHGV++V  
Sbjct: 1   MRFCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHV-- 58

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              FGG  P ATSFP  I T +SFN  LW  IG+  S EARA YN G AGLTYWSP +NI
Sbjct: 59  --SFGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINI 116

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
            RDPRWGR QET GEDP  +  YA  +V+G+Q  D +  RLK++A CKHFTAYD+DNW G
Sbjct: 117 ARDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANSKRLKLSACCKHFTAYDVDNWEG 176

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
           +DR+HF+AK    ++ DT++ PF+ CV EG+ AS+MCSYN+VNGVPTCA+ + L+ T+R 
Sbjct: 177 IDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLENTVRR 233

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
            W LNGYIVSDCDSV V +++ ++  T E+AAADA+ AGLDL+CG +L  +TE AV  G 
Sbjct: 234 AWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDLNCGDYLASYTEGAVAMGK 293

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++   ++NA+ N   V+MRLGMFDG P++Q +G++G  DVCTP HQELA+EAARQGIVLL
Sbjct: 294 VNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAARQGIVLL 353

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRYARTI 452
           KN G  LPLS  ++   AVIGPN++ T TM+GNY GI C Y TPLQG+     G Y +  
Sbjct: 354 KNDGNILPLS--KNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSGDYHKVW 411

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
             +GC + AC  D    +A+  +  ADA +LV+GL Q  E+EALDR  LLLPG QQ L+ 
Sbjct: 412 FSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGYQQTLID 471

Query: 513 KVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
           +V+ A+ G P +LVLM  GP+D+ FAKND RI +I+W GYPGQ+GG AIA+++FG  NPG
Sbjct: 472 EVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIFGAHNPG 531

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           GKLPM+WYP++Y T + MT M MRP     YPGRTYRFY G  +Y FG+G+SYT + H+ 
Sbjct: 532 GKLPMSWYPEDY-TKISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFGYGLSYTEYKHSF 590

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLV 690
           A APT V  P      SI++ +         +K C+     V ++V+N+G+  G HTLL+
Sbjct: 591 ALAPTTVMTP------SIHSQLCDPHQTSAGSKTCSSSNFDVHINVENIGAMAGNHTLLL 644

Query: 691 FSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
           F T P+   +  P KQL AF+ V++ +G+Q++V + ++ C++L  V   GTR +  G H 
Sbjct: 645 FFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQHLGTVAEDGTRMLEAGNHI 704

Query: 749 IHIGGTKHSVSL 760
           + +G  KHS+S+
Sbjct: 705 LSVGDAKHSLSV 716


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  750 bits (1937), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/780 (47%), Positives = 510/780 (65%), Gaps = 28/780 (3%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           ++ST  F  +  + L+ + S LA   PFACD  + +TR+ PFC   LPI QR  DL+ RL
Sbjct: 3   LSSTFTFVTIISLFLTLTYSVLAQLPPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRL 62

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L EK+  L++ A  +PRLGI  YEWWSEALHG+ NVG G  F G    ATSFPQVI TA
Sbjct: 63  TLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATSFPQVILTA 122

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           +SF++ LW  IG+ +  EARA+YNGG A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 123 ASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMT 182

Query: 180 GKYAASYVRGLQGSDGD------RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
             YA SYVRGLQG           L+ +A CKHFTAYDLDNW GV+RFHF+A+VS QD+ 
Sbjct: 183 SNYAVSYVRGLQGDSFQGGKLRGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLA 242

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
           DT+  PFR C+ +G+ + +MC+YN+VNG+P+CAD N+L  T+R +W  +GYIVSDC +VG
Sbjct: 243 DTYQPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVG 302

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
           + +D Q +  + E+A AD + AG+DL+CG +L  H +SAVQ+  L  + I+ AL N  ++
Sbjct: 303 IIHDEQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSI 362

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
           ++RLG FDG P+  P+G +GP  VC+ +H  LALEAAR GIVLLKN    LPL      +
Sbjct: 363 RIRLGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTS-IS 421

Query: 414 VAVIGPNSDVT-VTMIGNYAGIACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAA 471
           +AVIGPN++ + +T++GNYAG  C   T LQG   Y +  +   GC             A
Sbjct: 422 LAVIGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKA 481

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           +  ++ AD  +LVMGLDQS+E E  DR  L LPG+Q EL++ V+ ASK P ILVL+ GGP
Sbjct: 482 VKVAKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGP 541

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           ID++ AKN+ +I  IIWAGYPG+ GG A+A I+FG  NPGG+LP+TWYP++YI  +PMT+
Sbjct: 542 IDISSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDYI-KVPMTD 600

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
           M MR   +  YPGRTYRFYKGP VY FGHG+SYT + +        V+V  D  H + ++
Sbjct: 601 MRMRADPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSY------EFVSVTHDKLHFNQSS 654

Query: 652 ---------TISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH-WA 700
                    TI  K + ++    C  +++ V V VKN G+  G H +L+F  P      +
Sbjct: 655 THLMTENSETIRYKLVSELDEETCKSMSVSVTVGVKNHGNIVGRHPILLFMRPQKHRTRS 714

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P KQLV F  + + AG    VG  +  C++LS  + +G + I  G H +H+G  ++ + +
Sbjct: 715 PMKQLVGFHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKIIEEGSHLLHVGEEEYLIDI 774


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  750 bits (1937), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/772 (47%), Positives = 496/772 (64%), Gaps = 16/772 (2%)

Query: 1   MASTIAFFFLGLIL--LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
           M++   F F+ L L  L   S   +    F C P   T    PFC  SLPI  R   L+ 
Sbjct: 1   MSTEWRFLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVS 58

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
            L+L EK++ L   AAA+PRL I  YEWWSE+LHG++  GPG  F G    ATSFPQV+ 
Sbjct: 59  LLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLL 118

Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
           TA+SFN +LW +IG  ++ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 119 TAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV 178

Query: 179 SGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
           +  YA  +VRG QG SDGD L ++A CKH TAYDL+ W    R+ F+A VS QD+EDT+ 
Sbjct: 179 ASAYAVEFVRGFQGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQ 238

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PFR CV +GK + +MCSYN+VNGVP CA  ++ ++  + EW   GYI SDCD+V   Y+
Sbjct: 239 PPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKA-KTEWGFKGYITSDCDAVATVYE 297

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
            QH+ ++PE+A AD ++AG D++CG ++  HT+SA+ +G + E DI+ AL N  +VQMRL
Sbjct: 298 YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRL 357

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+FDG+P++  YG+LGPKDVCT +H+ LALEAARQGIVLLKN    LPL   R  ++A+I
Sbjct: 358 GLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAII 417

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASR 476
           GP +D    + G Y GI C   + ++G+  Y  +T    GC DV C  D  F  A+  +R
Sbjct: 418 GPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIAR 476

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
           +AD  ++V GLD S E E  DR  LLLPG+Q  L+S V+ A + P +LVL  GGP+DV+F
Sbjct: 477 KADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSF 536

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           A+ DPRIA+I+W GYPG+AG  A+A+I+FG  NPGG+LPMTWYP+ + T +PM +M MR 
Sbjct: 537 AEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TRVPMNDMNMRA 595

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
              + YPGRTYRFY G  VY FG G+SYT F +   +AP  + +       S       +
Sbjct: 596 DPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQR 655

Query: 657 AIKVTH------AKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFE 709
             +V +        C+ L   V++ V NVG  DG+H +++FS  P      P KQL+ F 
Sbjct: 656 REEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFS 715

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +VH  +       I +  C++ S+ +  G R +PLG+H I +G   HSVS+ 
Sbjct: 716 RVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVSVE 767


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/639 (56%), Positives = 461/639 (72%), Gaps = 6/639 (0%)

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
           + ++  I  VVS EARAM+N G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA 
Sbjct: 202 SMMYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAV 261

Query: 185 SYVRGLQGSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
            YV GLQ + G  D LKVAA CKH+TAYD+DNW GV+R+ F+A VS+QD++DTF  PF+ 
Sbjct: 262 GYVTGLQDAGGGSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKS 321

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV++G VASVMCSYN+VNG PTCAD ++L   IRG+W+LNGYIVSDCDSV V Y+ QH+T
Sbjct: 322 CVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYT 381

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
             PE+AAA  I++GLDL+CG FL  HT +AVQ G LSE D++ A+ N   V MRLG FDG
Sbjct: 382 KNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDG 441

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +P   P+G LGPKDVCT  +QELA EAARQGIVLLKN G +LPLS    +++AVIGPN++
Sbjct: 442 DPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNAN 500

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADAT 481
            + TMIGNY G  C YTTPLQG+G    T++Q GC +V C+ + L   AA  A+  AD T
Sbjct: 501 ASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVT 560

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LV+G DQS+E E+LDR  LLLPG+Q +LVS V+ AS+GP ILV+MSGGP D++FAK+  
Sbjct: 561 VLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSD 620

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I+AI+W GYPG+AGG A+ADILFG  NPGG+LP+TWYP  +   + MT+M MRP  S  
Sbjct: 621 KISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTG 680

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           YPGRTYRFY G  VY FG G+SYT F H++ +AP  VAV L   H     T    +++  
Sbjct: 681 YPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC--HTEHCFSVEAA 738

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
              C  L+  V + V+N G   G HT+ +FS+PP+ H AP K L+ FEKV +  G    V
Sbjct: 739 GEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVV 798

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
              + VCK LSVVD  G R++ LG H +H+G  KH+++L
Sbjct: 799 AFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 837



 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 63/106 (59%), Positives = 74/106 (69%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC  +     R  DL+GRL+L EKV  L++  AA+PRLGI  YEWW
Sbjct: 31  FACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWW 90

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGR 133
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG 
Sbjct: 91  SEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score =  746 bits (1926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/782 (46%), Positives = 501/782 (64%), Gaps = 31/782 (3%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           STI  F   L+L+            + C P  +     PFC +SLPI  R   LI  L+L
Sbjct: 8   STIIIFLFSLLLIHLPK--FFTTPDYPCKPPHSH---YPFCNISLPISTRTTSLISLLTL 62

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
            +K+  L + A+++  LGI  Y+WWSEALHG++  GPG  F G    AT+FPQVI +A++
Sbjct: 63  SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 122

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FN +LW  IG  V  E RAM+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++   Y
Sbjct: 123 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 182

Query: 183 AASYVRGLQGSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           A  +VRG+QG DG            D L V+A CKHFTAYDL+ W    R++FNA V++Q
Sbjct: 183 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQ 242

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
           D+EDT+  PFR CV +GK + +MCSYN+VNGVP CA  ++L   +R +W   GYI SDCD
Sbjct: 243 DLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASDCD 301

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
           +V   ++ Q +  + E+A AD ++AG+D++CG F+  HTESA+++GL+ E D++ AL N 
Sbjct: 302 AVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNL 361

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
            +VQMRLG+F+G+P    +G LGP+DVCTP+H++LALEAARQGIVLLKN    LPL    
Sbjct: 362 FSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKD 421

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFG 469
             ++A+IGP +  T  + G Y+GI C   +   G+  Y +TI +  GC DV C  D  F 
Sbjct: 422 RVSLAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFA 480

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
            AID ++QAD  ++V GLD ++E E LDR  LLLPG+Q +LVS+V+ ASK P ILVL  G
Sbjct: 481 VAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGG 540

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
           GP+DV+FA+++  I +I+W GYPG+AGG A+A+I+FG  NP G+LPMTWYP+ + TN+PM
Sbjct: 541 GPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESF-TNVPM 599

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD---GRH 646
            +M MR   S+ YPGRTYRFY G  +Y FGHG+SY++F + V +AP+ +++      G  
Sbjct: 600 NDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLR 659

Query: 647 GSINATISGKAIKVTHA------KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HW 699
            S+   +     +V H        CN L+  V + V NVG  DG+H +++FS  P     
Sbjct: 660 RSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQG 719

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +P  QLV   ++H  +       I    C++ S  D  G R +PLG H +++G  +H VS
Sbjct: 720 SPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGDVEHIVS 779

Query: 760 LH 761
           + 
Sbjct: 780 IE 781


>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score =  741 bits (1914), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/793 (46%), Positives = 496/793 (62%), Gaps = 37/793 (4%)

Query: 1   MASTIAFFFLGLIL--LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
           M++   F F+ L L  L   S   +    F C P   T    PFC  SLPI  R   L+ 
Sbjct: 1   MSTEWRFLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVS 58

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
            L+L EK++ L   AAA+PRL I  YEWWSE+LHG++  GPG  F G    ATSFPQV+ 
Sbjct: 59  LLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLL 118

Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
           TA+SFN +LW +IG  ++ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 119 TAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV 178

Query: 179 SGKYAASYVRGLQG----------------------SDGDRLKVAASCKHFTAYDLDNWN 216
           +  YA  +VRG QG                      SDGD L ++A CKH TAYDL+ W 
Sbjct: 179 ASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWG 238

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
              R+ F+A VS QD+EDT+  PFR CV +GK + +MCSYN+VNGVP CA  ++ ++  +
Sbjct: 239 NFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKA-K 297

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            EW   GYI SDCD+V   Y+ QH+ ++PE+A AD ++AG D++CG ++  HT+SA+ +G
Sbjct: 298 TEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQG 357

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
            + E DI+ AL N  +VQMRLG+FDG+P++  YG+LGPKDVCT +H+ LALEAARQGIVL
Sbjct: 358 KVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVL 417

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQ 455
           LKN    LPL   R  ++A+IGP +D    + G Y GI C   + ++G+  Y  +T    
Sbjct: 418 LKNDKKFLPLDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAA 476

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
           GC DV C  D  F  A+  +R+AD  ++V GLD S E E  DR  LLLPG+Q  L+S V+
Sbjct: 477 GCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVA 536

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
            A + P +LVL  GGP+DV+FA+ DPRIA+I+W GYPG+AG  A+A+I+FG  NPGG+LP
Sbjct: 537 SAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLP 596

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           MTWYP+ + T +PM +M MR    + YPGRTYRFY G  VY FG G+SYT F +   +AP
Sbjct: 597 MTWYPESF-TRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAP 655

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTH------AKCNRLTLGVQVDVKNVGSKDGAHTLL 689
             + +       S       +  +V +        C+ L   V++ V NVG  DG+H ++
Sbjct: 656 NKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVM 715

Query: 690 VFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
           +FS  P      P KQL+ F +VH  +       I +  C++ S+ +  G R +PLG+H 
Sbjct: 716 LFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHT 775

Query: 749 IHIGGTKHSVSLH 761
           I +G   HSVS+ 
Sbjct: 776 IMLGDVVHSVSVE 788


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  741 bits (1914), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/749 (48%), Positives = 499/749 (66%), Gaps = 37/749 (4%)

Query: 22   LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
            L++  PFACD  D  T++  FC  +L I QR +DLI RL+L EK+  LIS AA++PRLGI
Sbjct: 692  LSSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGI 751

Query: 82   KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
              YEWWSEALHG+ +   G +F G    ATSFPQVI TA+SF+A LW  IG+ +  E RA
Sbjct: 752  PAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRA 810

Query: 142  MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-----DG 195
            MYN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPV++GKYA SYVRGLQG        
Sbjct: 811  MYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKV 870

Query: 196  DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
            D L+ +A CKHFTAYDLDNW  +DR+ F+A+V+ QD+ DT+  PFR C+ EG+ + +MC+
Sbjct: 871  DVLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCA 930

Query: 256  YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
            YN VNGVP CAD N+L +T RG+W  +GYIVSDCD+V + +D Q +  +PE+A A  + A
Sbjct: 931  YNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTA 990

Query: 316  GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
            G+D+ CG +L  H +SAV +  L+E +I+ AL+N  TV+MRLG+F+G P   P+G++GP 
Sbjct: 991  GMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPD 1050

Query: 376  DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
             VC+ +HQ LALEAAR GIVLLKN    LPLS     ++AVIGPN++ T T++GNYAG  
Sbjct: 1051 QVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPP 1110

Query: 436  CGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
            C + +PLQG+  Y   T++  GC DVAC+   +   A+D ++QAD  +LVMGLDQ+ E E
Sbjct: 1111 CKFISPLQGLQSYVNNTMYHAGCNDVACSSASIEN-AVDVAKQADYVVLVMGLDQTQERE 1169

Query: 495  ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
              DR  L+LPG+Q++L++ V+ A+K P +LVL+ GGP+D++FAK    I +I+WAGYPG+
Sbjct: 1170 KYDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGE 1229

Query: 555  AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            AGG AIA+ +FG  NPGG+LP+TWYP+++I  +PMT+M MRP     YPGRT+RFY G  
Sbjct: 1230 AGGAAIAETIFGDHNPGGRLPVTWYPKDFI-KIPMTDMRMRPEPQSGYPGRTHRFYTGKT 1288

Query: 615  VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            V+ FG+G+SY+ +                          S + + VT  K          
Sbjct: 1289 VFEFGNGLSYSPY--------------------------SYEFLSVTPNKLYLNQPSTTH 1322

Query: 675  DVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
             V+N G   G H +L+F     AG+ +P KQLV F+ V + AG    V   +  C++LS 
Sbjct: 1323 VVENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSR 1382

Query: 734  VDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
             ++ G   +  G H + +G  ++ +++ A
Sbjct: 1383 ANKDGLMVMEQGIHLLVVGDKEYPIAIVA 1411



 Score =  691 bits (1784), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/631 (52%), Positives = 451/631 (71%), Gaps = 13/631 (2%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I   ++ +IL+   S+      PF+CD  + +T++  FC+ +LPIP RV DL+ RL+L E
Sbjct: 9   INLIYVTVILVGVEST---QSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDLVSRLTLDE 65

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  L++ A A+PRLGI  YEWWSEALHGV++ GPG +F G    ATSFPQVI TA+SF+
Sbjct: 66  KISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAASFD 125

Query: 125 ATLWEAIGRVVSDEARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             LW  IGR +  EARA+YN G T G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA
Sbjct: 126 VHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYA 185

Query: 184 ASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
            SYVRG+QG           L+ +A CKHFTAYDLD+W G+DRF F+A+V+ QD+ DT+ 
Sbjct: 186 VSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQ 245

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PF  C+ EG+ + +MC+YN+VNGVP+CAD N+L  T R  W   GYI SDCD+V + +D
Sbjct: 246 PPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHD 305

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
           +  F  TPE+A  D ++AG+D++CG +L  HT+SAV +  L E +++ AL N   V+MRL
Sbjct: 306 SYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRL 365

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+F+G P  QPYG +GP  VC+ +HQ LAL+AAR GIVLLKN    LPL   +  ++AVI
Sbjct: 366 GLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVI 425

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
           GPN++   T+IGNYAG  C + TPLQ +  Y + T++  GC  VAC+   +   A++ ++
Sbjct: 426 GPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSI-EKAVEIAQ 484

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
           +AD  +LVMGLDQ+ E EA DR  L+LPG+QQ+L+  V+ A+K P +LVL+SGGP+D++F
Sbjct: 485 KADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISF 544

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AK    I +I+WAGYPG AGG AIA+ +FG  NPGG+LP+TWYPQ++ T +PMT+M MRP
Sbjct: 545 AKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF-TKIPMTDMRMRP 603

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
             +  YPGRTYRFY G  V+ FG+G+SY+ +
Sbjct: 604 ESNSGYPGRTYRFYTGEKVFEFGYGLSYSTY 634


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score =  740 bits (1910), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/746 (48%), Positives = 496/746 (66%), Gaps = 24/746 (3%)

Query: 37  TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN 96
           +++ PFC  SLP   R   L+  L+L EK+ LL + A+++PRLGI  Y+WWSE+LHG++ 
Sbjct: 36  SQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLAL 95

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
            GPG  F G  P ATSFPQVI +A+SFN +LW      ++ EARAM+N G AGLT+W+PN
Sbjct: 96  NGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWAPN 155

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR----------LKVAASCKH 206
           +N+FRDPRWGRGQETPGEDP+L+  YA  YVRGLQG  G +          L V+A CKH
Sbjct: 156 INLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCKH 215

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           FTAYDLD W    R++FNA VS+QD+EDT+  PFR C+ +GK + +MCSYN+VNGVP CA
Sbjct: 216 FTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPACA 275

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
              +L    R +W   GYI SDCD+V   Y+ Q +  + E+A AD ++AG+D++CG F+ 
Sbjct: 276 SEELLGLA-RDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVADVLKAGMDINCGTFML 334

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
            HTESA+++G + E D++ AL+N  +VQ+RLG+FDG+P    +G LGPKDVCT +H+ LA
Sbjct: 335 RHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTLA 394

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
           L+AARQGIVLLKN    LPL      ++AVIGP +  T  + G Y+GI C  ++  +G+G
Sbjct: 395 LDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLA-TTTKLGGGYSGIPCSSSSLYEGLG 453

Query: 447 RYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
            +A  I +  GC DV C  D  F  AID ++QAD  ++V GLD + E E  DR  LLLPG
Sbjct: 454 EFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLPG 513

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           +Q  LVS V+ ASK P ILVL+ GGP+DV+FA+ +P+IA+IIW GYPG+AGG A+A+I+F
Sbjct: 514 KQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEIIF 573

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G  NP G+LPMTWYP+ + TN+PM EM+MR   S+ YPGRTYRFY G  VY FGHG+S++
Sbjct: 574 GEFNPAGRLPMTWYPEAF-TNVPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLSFS 632

Query: 626 NFVHTVANAPTVVAVPL---DGRHGSINATISGKAIKVTHA------KCNRLTLGVQVDV 676
           +F +   +AP+ +++     DG    +   +  +   V +        CN+L+  V + V
Sbjct: 633 DFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHISV 692

Query: 677 KNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
            N+G  DG+H +++FS  P     +P  QLV F ++H  +       I +H C++LS  D
Sbjct: 693 MNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSFAD 752

Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLH 761
           + G R +PLG H + +G  +H VS+ 
Sbjct: 753 KQGKRILPLGPHTLSVGDLEHVVSIE 778


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/532 (66%), Positives = 432/532 (81%), Gaps = 6/532 (1%)

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           +EDT++VPF+ CV+EGKVASVMCSYNQVNG PTCADP++LK TIRG W L+GYIVSDCDS
Sbjct: 1   LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
           VGV YD+QHFT TPEEAAA  I+AGLDLDCGPFL +HT +AV RGLL E+D+NNAL N L
Sbjct: 61  VGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLL 120

Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH 411
           +VQMRLGMFDGEP++QPYG+LGPKDVCTP H+ LALEAARQGIVLL+N+  +LPLS  RH
Sbjct: 121 SVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRH 180

Query: 412 RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAA 471
           RTVAVIGPNSD TVTMIGNYAG+AC YTTP+QGI +Y +TIH +GC +VAC  DQL G A
Sbjct: 181 RTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACVGDQLIGEA 240

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
             A+R ADA ++V+GLDQSIEAE+ DR G+LLPG+Q+ELV ++ +A KGPT++VLMSGGP
Sbjct: 241 EAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGP 300

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           IDV+FAKND +I+ I+W GYPGQAGG AIAD+LFG +NPGGKLPMTWYPQ Y+  +PMT 
Sbjct: 301 IDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTN 360

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
           M +RP  S  YPGRTYRFYKGPVV+PFG G+SY+ F  + A APT +++PL     + +A
Sbjct: 361 MGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSA 420

Query: 652 TISGKAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
           T     +KV+H  C  ++ L + +DVKN G+ DG+HT+LVFST P   W+P K L+ FEK
Sbjct: 421 T-----VKVSHTDCASVSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTWSPEKHLIGFEK 475

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           VH+ AG+Q+RV I IHVC +LS VD  GTRRIP+GEH +HIG   HS+SL A
Sbjct: 476 VHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQA 527


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score =  738 bits (1904), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/783 (47%), Positives = 506/783 (64%), Gaps = 40/783 (5%)

Query: 5   IAFFFLGLILLSASSSG-----LAAREP------FACDPKDATT-----RTLPFCQVSLP 48
           IAFF +   + S    G     +AAR P      + CD           +   +C  S P
Sbjct: 17  IAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFAALGLDMKDFHYCDSSSP 76

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
              R  DL+ R++L EKV      A+ V R+G+  Y WWSEALHGVSN G    F    P
Sbjct: 77  YEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGRCVFFDEVVP 136

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GATSFP VI +A+SFN +LW+ +G+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR 
Sbjct: 137 GATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINVVRDPRWGRI 196

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTAYDLDNWNGVD 219
            ETPGEDP L G YA +YVRGLQ   G           LKV++ CKH+ AYDLDNW G D
Sbjct: 197 LETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDLNSRPLKVSSCCKHYAAYDLDNWKGAD 256

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R HF+A+VS QD+ +TF +PF MCV EG V+SVMCSYN++NG+P+CAD  +LK+TIRGEW
Sbjct: 257 RVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCADSRLLKQTIRGEW 316

Query: 280 RLNGYIVSDCDSVGVY-YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
            L+GYIVSDCDSV V   D +   S+  ++AA A+ AG++LDCG F       AV +G  
Sbjct: 317 DLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNNRSLTEAVNQGKA 376

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           ++ D++++L     + MR+G FDG P+   +  LG  D+C+ +H ELA EAARQGIVLLK
Sbjct: 377 NQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELAREAARQGIVLLK 433

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N   +LPL  +++  +A++GP+++ T  MIGNYAGI C Y +PL          +++GC 
Sbjct: 434 NDNATLPLKSVKN--IALVGPHANATDAMIGNYAGIPCYYVSPLDAFSSMGEVRYEKGCA 491

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           DV C ++     A++A+++ADATI+  G D SIEAEALDR  LLLPG Q +L+++V+  S
Sbjct: 492 DVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGYQTQLINQVADLS 551

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
            GP +LV+MSGG +D++FA+++P+IAAI+WAGYPG+ GG AIAD++ G  NPGG+LP+TW
Sbjct: 552 TGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILGKYNPGGRLPITW 611

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           Y  +Y+  LPMT MA+RP  S  YPGRTY+F+ G  VYPFG+GMSYTNF ++++ +    
Sbjct: 612 YEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSYTNFSYSLSTSQRWT 671

Query: 639 AVPLDGRHGS-----INATI--SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
            + L           IN T      A+ V    C   ++  +V VKNVG  DG+  ++V+
Sbjct: 672 NINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKE-SIEFEVAVKNVGRMDGSEVVVVY 730

Query: 692 STPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
           S+PP G    H K++V FE+V V  G  ++V  +++VCK L +VD +G   +P G H I 
Sbjct: 731 SSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDSTGYALLPSGSHTIK 790

Query: 751 IGG 753
           +GG
Sbjct: 791 VGG 793


>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score =  734 bits (1895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/741 (48%), Positives = 497/741 (67%), Gaps = 25/741 (3%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           T  FC  SL    R  DL+ +++L+EKV+ L   A  VPRLGI  YEWWSEALHGVS+VG
Sbjct: 65  TFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWSEALHGVSDVG 124

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           PGT F    PGATSFP  I T +SFN +LW+ IG+  S +ARAMYN G AGLTYWSPNVN
Sbjct: 125 PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAMYNLGRAGLTYWSPNVN 183

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTA 209
           + RDPRWGR  ETPGEDP + G+YA +YVRGLQ  +G           LKV++ CKH+ A
Sbjct: 184 VVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDLNTRPLKVSSCCKHYAA 243

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++ W GV+R  F+A+V++QD+ +TF  PF MCV EG V+SVMCS+N+VNG+PTCADP 
Sbjct: 244 YDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADPK 303

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGLH 328
           +L +TIRG+W L+GYIVSDCDS+ V  D   F   T E+A A  ++AGLDLDCG +    
Sbjct: 304 LLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTNF 363

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           TE++V++G   E  I+ +L     V MRLG FDG P    Y  LG KD+CT ++ ELA +
Sbjct: 364 TETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQ---YQKLGKKDICTKENVELAKQ 420

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
           AAR+GIVLLKN   +LPLS  + + +AV+GP+++ T  MIGNYAG+ C Y +P+ G   Y
Sbjct: 421 AAREGIVLLKNND-TLPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSIY 479

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
           +   ++ GC DV C ++ L   A+ A++ ADATI+V GLD +IEAE LDR  LLLPG Q 
Sbjct: 480 SNVTYEIGC-DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQT 538

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           +L+++V+ A+ GP ILV+M+ G +D++FA+++ +I AI+W GYPGQ GG AIAD++FG  
Sbjct: 539 QLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGKY 598

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NPGG+LP+TWY  +++  +PMT M +RP +   YPG+TY+FY G  VYPFG+G+SYT F 
Sbjct: 599 NPGGRLPITWYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGSTVYPFGYGLSYTTFS 658

Query: 629 HTVANAPTVVAVPL-------DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           + + +A     + L       D R+G+     S  A+   H  CN     ++V+V+N GS
Sbjct: 659 YNITSAKRSKHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND-DFELEVEVENTGS 717

Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           +DG+  ++V+S  P G    + KQ++ F++V V AG+ ++V    +VCK   ++D +   
Sbjct: 718 RDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFNVCKSFRIIDYNAYS 777

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
            +P G H I +G    S+ L+
Sbjct: 778 ILPSGGHTIMVGDDIVSIPLY 798


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  734 bits (1895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/757 (48%), Positives = 489/757 (64%), Gaps = 23/757 (3%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           +AA  PF+C        +  FC   LPI QR  DL+ +L+L+EK+  L   + AV RLG+
Sbjct: 28  VAADPPFSC----GAPSSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGV 83

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             Y+WWSEALHGV+N G G    G    ATSFPQVI TA+SFN  LW  IG+V+  EAR 
Sbjct: 84  PAYKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARG 143

Query: 142 MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------D 194
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG       +
Sbjct: 144 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAIN 203

Query: 195 GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
              L+ +A CKHFTAYDL+NW GV RF F+AKV++QD+ DT++ PF+ CV +G  + +MC
Sbjct: 204 SSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMC 263

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
           SYN+VNGVPTCAD N+L +T RG+W  NGYI SDCD+V + +D Q +   PE+A AD ++
Sbjct: 264 SYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLK 323

Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           AG+D++CG ++  H  SA Q+G ++  DI+ AL N   ++MRLG+FDG P    YG++G 
Sbjct: 324 AGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGA 383

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
             VC+ +HQ+LAL+AAR GIVLLKN G +LPLS  +  ++AVIGPN +    ++GNY G 
Sbjct: 384 DQVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGP 443

Query: 435 ACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            C   TPLQ +  Y +     QGC    C    + G A+ A+  AD  +L MGLDQ+ E 
Sbjct: 444 PCISVTPLQALQGYVKDARFVQGCNAAVCNVSNI-GEAVHAAGSADYVVLFMGLDQNQER 502

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E +DR  L LPG Q+ LV+ V+ A+K P ILVL+ GGP+DV FAKN+P+I AI+WAGYPG
Sbjct: 503 EEVDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPG 562

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           QAGG AIA +LFG  NPGG+LP+TWYP+E+ T +PMT+M MR   S  YPGRTYRFYKG 
Sbjct: 563 QAGGIAIAQVLFGDHNPGGRLPVTWYPKEF-TAVPMTDMRMRADPSTGYPGRTYRFYKGK 621

Query: 614 VVYPFGHGMSYTNFVHTVANAPT-------VVAVPLDGRHGSINATISGKAIKVTHAKCN 666
            VY FG+G+SY+ + H  A+  T       +  +    R  S   T+S    ++    C+
Sbjct: 622 TVYNFGYGLSYSKYSHRFASKGTKPPSMSGIEGLKATAR-ASAAGTVSYDVEEMGAEACD 680

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           RL     V V+N G  DG H +L+F   P A    P  QL+ F+ VH+ A     V   +
Sbjct: 681 RLRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEV 740

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
             CK+LS     G + I  G H + +G  +  +S  A
Sbjct: 741 SPCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSFMA 777


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score =  733 bits (1891), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/717 (49%), Positives = 483/717 (67%), Gaps = 38/717 (5%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           DL+ +++L EKV  L + A  VPRLG+  Y+WWSEALHGVSNVGPGT F    PG+TSFP
Sbjct: 2   DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61

Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGE 174
            VITTA++FN +LW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGE
Sbjct: 62  TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGE 121

Query: 175 DPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFTAYDLDNWNGVDRFHFNA 225
           DP L G+YA +YVRGLQ  +G           LKV++ CKH+ AYD+DNW GV+R+ F+A
Sbjct: 122 DPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDA 181

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
           +VS+QD+ +TF  PF MCV +G V+SVMCSYN+VNG+PTCADP +L +TIRG+W L+GYI
Sbjct: 182 RVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYI 241

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSDCDS+ V  +   +              GLDLDCG +   + E+AV++G + E DI+ 
Sbjct: 242 VSDCDSLQVMVENHKWL-------------GLDLDCGAYYTENVEAAVRQGKVREADIDK 288

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           +L     V MRLG FDG P    Y   G  DVC+ ++ ELA EAAR+G VLLKN+  SLP
Sbjct: 289 SLNFLYVVLMRLGFFDGIPQ---YNSFGKNDVCSKENIELATEAAREGAVLLKNENDSLP 345

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADD 465
           LS  + +T+AVIGP+S+ T  MIGNYAGI C   TP++G+ +YA+  +Q GC D+AC D+
Sbjct: 346 LSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKVDYQMGCSDIACKDE 405

Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILV 525
                A++++++ADATI++ G+D SIEAE+LDR  LLLPG Q +L+++V+  S GP +LV
Sbjct: 406 SFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVVLV 465

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
           LMS G +D++FAK++  I +I+W GYPG+ GG AIAD++FG  NPGG+LP+TW+  +Y+ 
Sbjct: 466 LMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADYVD 525

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LPMT M +RP  S  YPGRTY+F+ G  VYPFGHG+SYT F + + +    + + LD  
Sbjct: 526 MLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDIKLDKY 585

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQ 704
               +      + K +             +V N G+KDG+  ++V++ PP G  A + KQ
Sbjct: 586 QYCHDLGYKNDSFKPSF------------EVLNAGAKDGSEVVIVYAKPPEGIDATYIKQ 633

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           ++ F++V VPAG  ++V    +  K L VVD +    +P G H I +G    S S+ 
Sbjct: 634 VIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLGDDIISFSVQ 690


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score =  732 bits (1889), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/741 (50%), Positives = 489/741 (65%), Gaps = 24/741 (3%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +   +C  SLP   RV DL+ R++L+EK + +I  A+ VPR+G+  Y+WWSEALHGV+NV
Sbjct: 63  KDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANV 122

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G  T F    PGATSFP VI +A+SFN +LW+ +G+VVS EARAMYN G AGLT+WSPN+
Sbjct: 123 GSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNI 182

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFT 208
           N+ RDPRWGR  ETPGEDP+  G Y  +YVRGLQ  +G           LK+A+SCKHF 
Sbjct: 183 NVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFA 242

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AYDLD W  VDR HF+AKVS+QD+ +TF  PF MCV EG  +SVMCS+N +NG+P CADP
Sbjct: 243 AYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADP 302

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLDLDCGPFLGL 327
             LK  IR +W L+GYIVSDC ++      Q F   T EE  A +++AGLDL+CG +   
Sbjct: 303 RFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYND 362

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
              +AV+ G +SE D++ +L     V MR+G FDG PS      LG KD+C  +H ELA 
Sbjct: 363 SLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAR 419

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           EAARQGIVLLKN   +LPL  ++   +A++GP+++ TV MIGNYAGI C Y +PL     
Sbjct: 420 EAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE 477

Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
                ++ GC DV C +D     A +A++ ADATI+++G D SIEAE  DR  LLLPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
            E+V++V+  S GP ILV+M GGPID++FAKN+P+IAAI+WAG+PG+ GG AIADI+FG 
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            NPGG+ P+TWY   Y+  LPMT MA+RP +S  YPGRTY+F+ G  VYPFG+G+SYTNF
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISG-------KAIKVTHAKCNRLTLGVQVDVKNVG 680
            +++      V + L       +   S         A+ V    C+  +   QV VKNVG
Sbjct: 658 SYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSCDE-SFEFQVAVKNVG 716

Query: 681 SKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           S DG+  ++V+S+PP+G    H KQ++ FE+V V  G  ++V  +++VCK L +VD SG 
Sbjct: 717 SMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSGY 776

Query: 740 RRIPLGEHNIHIGGTKHSVSL 760
             +P G H I  G    SVS 
Sbjct: 777 ILLPSGSHTIMAGDNSTSVSF 797


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  732 bits (1889), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/757 (46%), Positives = 506/757 (66%), Gaps = 27/757 (3%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           +A  P +CDP + TT+   FC+  LPI QR  DL+ RL++ EK+  L + A  +PRLG+ 
Sbjct: 18  SAPPPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVP 77

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
            YEWWSEALHGV+  GPG +F G    ATSFPQVI TA+SF++  W  I +V+  EAR +
Sbjct: 78  AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 137

Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
           YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG   D     
Sbjct: 138 YNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRKTL 197

Query: 197 --RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
              L+ +A CKHFTAYDLD W G+ R+ FNA+VS  D+ +T+  PF+ C+ EG+ + +MC
Sbjct: 198 SIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMC 257

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
           +YN+VNG+P+CADPN+L RT RG WR  GYI SDCD+V + +D Q +  TPE+A AD ++
Sbjct: 258 AYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADVLK 317

Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           AG+D++CG +L  HT+SA+Q+  +SE DI+ AL+N  +V++RLG+F+G+P+  PYG++ P
Sbjct: 318 AGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISP 377

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
            DVC+P HQ LALEAAR GIVLLKN    LP S     ++AVIGPN+ V  T++GNYAG 
Sbjct: 378 NDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAGP 437

Query: 435 ACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            C   TPL  +  Y +  ++  GC  VAC++  +   A+  +R AD  +L+MGLDQ+ E 
Sbjct: 438 PCKTVTPLDALRSYVKNAVYHNGCDSVACSNAAI-DQAVAIARNADHVVLIMGLDQTQEK 496

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E +DR  L LPG+QQEL++ V+ A+K P +LVL+ GGP+D++FA N+ +I +I+WAGYPG
Sbjct: 497 EDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYPG 556

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           +AGG A+A+I+FG  NPGG+LP+TWYPQ ++ N+ MT+M MR +    YPGRTY+FYKGP
Sbjct: 557 EAGGIALAEIIFGDHNPGGRLPVTWYPQSFV-NVQMTDMRMRSATG--YPGRTYKFYKGP 613

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT------HAKCNR 667
            V+ FGHG+SY+ + +      T+ A  L        A ++  +++ T         CN 
Sbjct: 614 KVFEFGHGLSYSTYSYRFK---TLGATNLYLNQS--KAQLNSDSVRYTLVSEMGEEGCNI 668

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGIN 724
               V V V+N G   G H +L+F+    G        KQLV F+ + +  G +  +   
Sbjct: 669 AKTKVIVTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFKSIVLSNGEKAEMEFE 728

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           I +C++LS  +  G   +  G++ + +G ++  ++++
Sbjct: 729 IGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTIN 765


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  731 bits (1888), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/746 (46%), Positives = 502/746 (67%), Gaps = 17/746 (2%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           +A  P +CDP + TT+   FC+  LPI +R  DL+ RL++ EK+  L++ A  +PRLG+ 
Sbjct: 19  SAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVP 78

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
            YEWWSEALHGV+  GPG +F G    ATSFPQVI TA+SF++  W  I +V+  EAR +
Sbjct: 79  AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 138

Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
           YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG   D     
Sbjct: 139 YNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTL 198

Query: 197 --RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
              L+ +A CKHFTAYDLD W G+ R+ FNA+VS  D+ +T+  PF+ C+ EG+ + +MC
Sbjct: 199 SNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMC 258

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
           +YN+VNG+P+CADPN+L RT RG+W   GYI SDCD+V + YD Q +  +PE+A AD ++
Sbjct: 259 AYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADVLK 318

Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           AG+D++CG +L  HT+SA+Q+  +SE DI+ AL+N  +V++RLG+F+G+P+  PYG++ P
Sbjct: 319 AGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISP 378

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
            +VC+P HQ LAL+AAR GIVLLKN    LP S     ++AVIGPN+ V  T++GNYAG 
Sbjct: 379 NEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGP 438

Query: 435 ACGYTTPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            C   TPL  +  Y +  ++ QGC  VAC++  +   A+  ++ AD  +L+MGLDQ+ E 
Sbjct: 439 PCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAI-DQAVAIAKNADHVVLIMGLDQTQEK 497

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DR  L LPG+QQEL++ V+ A+K P +LVL+ GGP+D++FA N+ +I +IIWAGYPG
Sbjct: 498 EDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPG 557

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           +AGG AI++I+FG  NPGG+LP+TWYPQ ++ N+ MT+M MR +    YPGRTY+FYKGP
Sbjct: 558 EAGGIAISEIIFGDHNPGGRLPVTWYPQSFV-NIQMTDMRMRSATG--YPGRTYKFYKGP 614

Query: 614 VVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            VY FGHG+SY+ + +     A T + +       + ++       ++    C+     V
Sbjct: 615 KVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSEMGKEGCDVAKTKV 674

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V+V+N G   G H +L+F+    G        KQLV F+ + +  G +  +   I +C+
Sbjct: 675 TVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCE 734

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTK 755
           +LS  +  G   +  G++ + +G ++
Sbjct: 735 HLSRANEFGVMVLEEGKYFLTVGDSE 760


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/783 (46%), Positives = 495/783 (63%), Gaps = 40/783 (5%)

Query: 15  LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA 74
           ++ +   L +   F C P   ++   PFC VSL I QR   L+  L+L EK+  L + AA
Sbjct: 17  IAETFKNLDSHPQFPCKPPHFSS--YPFCNVSLSIKQRAISLVSLLTLPEKIGQLSTTAA 74

Query: 75  AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
           +VPRLGI  YEWWSE+LHG+++ GPG  F G    ATSFPQVI +A+SFN TLW  IG  
Sbjct: 75  SVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSA 134

Query: 135 VSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--- 191
           V+ EARAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +Y   +VRG Q   
Sbjct: 135 VAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKK 194

Query: 192 ---------GSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
                    GSD              +L ++A CKHFTAYDL+ W    R+ FNA V++Q
Sbjct: 195 KRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQ 254

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
           D+EDT+  PF  C+ +GK + +MCSYN VNGVP CA  ++L++  R EW  +GYI SDCD
Sbjct: 255 DMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFDGYITSDCD 313

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
           +V   ++ Q +T +PEEA ADAI+AG+D++CG ++  +T+SA+++G +SE  ++ AL+N 
Sbjct: 314 AVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQSAIEQGKVSEELVDRALLNL 373

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
             VQ+RLG+FDG+P    YG LG  D+C+ DH++LALEAARQGIVLLKN    LPL+   
Sbjct: 374 FAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQGIVLLKNDYKLLPLNKNH 433

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFG 469
             ++A++GP ++    M G Y G  C   T    +  Y  +T +  GC DV+C  D  FG
Sbjct: 434 VSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCVSDTGFG 493

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
            A+  ++ AD  I+V GLD S E E  DR  L LPG+Q++LVS V+  SK P ILVL  G
Sbjct: 494 EAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLVSSVAAVSKKPVILVLTGG 553

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
           GP+DV FAK DPRI +IIW GYPG+ GG A+A+I+FG  NPGG+LP+TWYP+ +  ++PM
Sbjct: 554 GPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPITWYPESF-ADVPM 612

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA----VPLDGR 645
           ++M MR   S+ YPGRTYRFY GP VY FG G+SYT F + + +AP  ++    +P    
Sbjct: 613 SDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDYKIISAPIRLSLSELLPQQSS 672

Query: 646 HGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGH 698
           H         + ++      V    C  L   V+V+V+N G  DG+H L++FS       
Sbjct: 673 HKKQLLQHGEEQLQYIQLDDVMVNSCESLRFNVRVNVRNTGEIDGSHVLMLFSKMARVLS 732

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
             P KQL+ F++VH+ +         I  CKYLSV +  G R IPLG H + +G  +HS+
Sbjct: 733 GVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVANDVGKRVIPLGIHALFLGDLQHSL 792

Query: 759 SLH 761
           S+ 
Sbjct: 793 SVE 795


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  730 bits (1885), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/771 (45%), Positives = 505/771 (65%), Gaps = 23/771 (2%)

Query: 4   TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
           T+ F F+ L+LL   +   + + PF+CD  +  T + PFC  +LPIPQRVNDL+ RL++ 
Sbjct: 10  TVMFIFVILVLLFRRTE--STKPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVSRLTVD 67

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EK+  L++GA  +PRLGI  YEWWSE LHG+S  G GT F G    AT FPQ+I TASSF
Sbjct: 68  EKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIILTASSF 127

Query: 124 NATLWEAIGRVVSDEARAMYNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           +  LW  I + +  EARA+YN G   G+T W+PN+NI RDPRWGRGQETPGEDP++ GKY
Sbjct: 128 DENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMVGKY 187

Query: 183 AASYVRGLQGS--DGDRLK-----VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDT 235
             +YVRGLQG   +G +LK      +A CKHF A D+DNW+   R+ F+A+V KQD+ D+
Sbjct: 188 GVAYVRGLQGDSFEGGKLKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQDLADS 247

Query: 236 FDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVY 295
           ++ PF+ CV +GK +SVMC+YN VNG+P CA+ ++L  T RG+W L GYIVSDCD+V   
Sbjct: 248 YEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCDAVDKM 307

Query: 296 YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
           Y  QH+   PE+A A  ++AG+D++CG  L  +T+SA+++  + E DI+ AL N  +V+M
Sbjct: 308 YSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNLFSVRM 367

Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
           RLG+F+G+PS   YG +   +VC+ +H+ LA+EAAR G VLLKN    LPLS ++  ++A
Sbjct: 368 RLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMKTASLA 427

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCKDVACADDQLFGAAIDA 474
           VIGP ++ +  ++GNY G +C   T  QG+ G  A T++  GC  + C    +   A++ 
Sbjct: 428 VIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAI-DEAVNI 486

Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +++AD  +LVMGLDQ++E E  DR  L LPG Q++L++ ++ A+  P ILVLM GGP+DV
Sbjct: 487 AKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGGPVDV 546

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            FAK++P+I  I+W GYPG+ G  A+A ILFG  NPGG+ P+TWYP+E+   + M +M M
Sbjct: 547 TFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEF-NKVAMNDMRM 605

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN-ATI 653
           RP  S  YPGRTYRFY GP V+ FG+G+SYTN+ +T A   +V    L  ++  IN +T 
Sbjct: 606 RPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFA---SVSKNQLLFKNPKINQSTE 662

Query: 654 SGKAIKVTHAK-----CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVA 707
            G  + +  +      CN   + V+V VKN G   G H +L+F    +     P K L+ 
Sbjct: 663 KGSVLNIAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHPVLLFLKHSSTVDEVPKKTLIG 722

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           F+ V++ AGA  +V  ++  C++ +  +R GT  I  G+H + +G  ++ +
Sbjct: 723 FKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGKHFLLLGDQEYPI 773


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  729 bits (1883), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/771 (47%), Positives = 513/771 (66%), Gaps = 22/771 (2%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I   ++ +IL+   S+      PF+CD  + +T++  FC+ +LPIP RV DL+ RL+L E
Sbjct: 9   INLIYVTVILVGVEST---QSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDLVSRLTLDE 65

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  L++ A A+PRLGI  YEWWSEALHGV++ GPG +F G    ATSFPQVI TA+SF+
Sbjct: 66  KISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAASFD 125

Query: 125 ATLWEAIGRVVSDEARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             LW  IGR +  EARA+YN G T G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA
Sbjct: 126 VHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYA 185

Query: 184 ASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
            SYVRG+QG           L+ +A CKHFTAYDLD+W G+DRF F+A+V+ QD+ DT+ 
Sbjct: 186 VSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQ 245

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PF  C+ EG+ + +MC+YN+VNGVP+CAD N+L  T R  W   GYI SDCD+V + +D
Sbjct: 246 PPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHD 305

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
           +  F  TPE+A  D ++AG+D++CG +L  HT+SAV +  L E +++ AL N   V+MRL
Sbjct: 306 SYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRL 365

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+F+G P  QPYG +GP  VC+ +HQ LAL+AAR GIVLLKN    LPL   +  ++AVI
Sbjct: 366 GLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVI 425

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASR 476
           GPN++   T+IGNYAG  C + TPLQ +  Y + T++  GC  VAC+   +   A++ ++
Sbjct: 426 GPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSI-EKAVEIAQ 484

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
           +AD  +LVMGLDQ+ E EA DR  L+LPG+QQ+L+  V+ A+K P +LVL+SGGP+D++F
Sbjct: 485 KADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISF 544

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           AK    I +I+WAGYPG AGG AIA+ +FG  NPGG+LP+TWYPQ++ T +PMT+M MRP
Sbjct: 545 AKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF-TKIPMTDMRMRP 603

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
             +  YPGRTYRFY G  V+ FG+G+SY+ +  +    P         +  + +   +  
Sbjct: 604 ESNSGYPGRTYRFYTGEKVFEFGYGLSYSTY--SCETIPVTRNKLYFNQSSTAHVYENTD 661

Query: 657 AIKVTHAK------CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFE 709
           +I+ T         C+   + + + V+N G   G H++L+F     A   +P KQLVAF+
Sbjct: 662 SIRYTSVAELGKELCDSNNISISIRVRNDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQ 721

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            VH+  G    VG  ++ C++ S  ++ G   I  G H + +G  +H V++
Sbjct: 722 SVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHFLVVGDQEHPVTV 772


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/707 (50%), Positives = 482/707 (68%), Gaps = 27/707 (3%)

Query: 12  LILLSASSSGL-----AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           LIL++  ++ L     + + P++CD  D +T+  PFCQ  LPI QRV DL+ RL+L EKV
Sbjct: 11  LILIAIHTTSLHLYVESTQPPYSCDSSDPSTKLYPFCQTKLPISQRVEDLVSRLTLDEKV 70

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGV---SNVGPGTKFGGDFPGATSFPQVITTASSF 123
             L+  A A+PRLGI  YEWWSEALHGV   + V  G +F G    ATSFPQVI TA+SF
Sbjct: 71  SQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSFPQVILTAASF 130

Query: 124 NATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           +A LW  IG+V+  EAR +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKY
Sbjct: 131 DAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVAGKY 190

Query: 183 AASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
           A SYVRG+QG        G++L+ +A CKHFTAYDLD W G++RF F+A    QD+ DT+
Sbjct: 191 AVSYVRGVQGDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVFDA----QDLADTY 246

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
             PF+ C+ EGK + +MC+YN+VNGVP CAD N+L +  RG+W   GYI SDCD+V + +
Sbjct: 247 QPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYGYITSDCDAVAIIH 306

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
           D Q +  +PE+A AD ++AG+D++CG +L  +T+SAV++  L E +I+ AL N  +++MR
Sbjct: 307 DDQGYAKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMR 366

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
           LG+F+G P+ QPYG++ P  VC+ +HQ LAL+AA+ GIVLLKN    LPLS +  +++AV
Sbjct: 367 LGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDKLLPLSKLETKSLAV 426

Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDAS 475
           IGPN++ +  ++GNY G  C   TPLQG+  Y + T +  GC  VAC+   +   A+  +
Sbjct: 427 IGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVACSSASI-NQAVKIA 485

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           + AD  ILVMGLDQ+ E E  DR  L+LPG+Q+EL++ V+ A+K P +LVL  GGP+DV+
Sbjct: 486 KGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVS 545

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           FAK D  I +IIWAGYPG+AGGTA+A I+FG  NPGG+LPMTWYPQ++ T +PMT+M MR
Sbjct: 546 FAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPMTWYPQDF-TKVPMTDMRMR 604

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA---T 652
           P  S  YPGRTYRFY G  V+ FG+G+SY+N+ + +A + T   + L      I     T
Sbjct: 605 PQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELA-SDTQNKLYLRASSNQITKNSNT 663

Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
           I  K I  +    C +    V V VKN G   G +  + +   P  H
Sbjct: 664 IRHKLISNIGKELCEKTKFTVTVRVKNHGEMAGENAEIQYELSPCEH 710


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  727 bits (1877), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/757 (47%), Positives = 494/757 (65%), Gaps = 20/757 (2%)

Query: 21  GLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           G A+  P+ C         +PFC  +LPI +RV+DL+ RL++ EK+  L   + A+PRLG
Sbjct: 33  GAASEPPYTCGA--GAPPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLG 90

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           +  Y+WWSEALHGV+N G G    G    ATSFPQVI TA+SFN  LW  IG+V+  EAR
Sbjct: 91  VPAYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEAR 150

Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------ 193
           A+YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG       
Sbjct: 151 AVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPV 210

Query: 194 DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
           +   L+ +A CKHFTAYDL+NW G+ R+ ++AKV+ QD+EDT++ PF+ CV +G  + +M
Sbjct: 211 NSTDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIM 270

Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
           CSYN+VNGVPTCAD N+L +T R  W   GYI SDCD+V + +D Q +  T E+A AD +
Sbjct: 271 CSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVL 330

Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
           +AG+D++CG ++  +  SA+Q+G ++E DIN AL N  TV+MRLG+F+G+P    YG++G
Sbjct: 331 KAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIG 390

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           P  VCT +HQ+LALEAA+ GIVLLKN G +LPLS     ++AVIG N++   +++GNY G
Sbjct: 391 PDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFG 450

Query: 434 IACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
             C   TPLQ +  Y + T    GC   AC +      A+ A+  AD+ +L MGLDQ+ E
Sbjct: 451 PPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQNQE 509

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
            E +DR  L LPG+QQ L+  V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WAGYP
Sbjct: 510 REEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYP 569

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           G+AGG AIA +LFG  NPGG+LP+TWYPQ++ T +PMT+M MR   +  YPGRTYRFY+G
Sbjct: 570 GEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TKVPMTDMRMRADPATGYPGRTYRFYRG 628

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA----KCNRL 668
           P V+ FG+G+SY+ + H     P      + G   ++  T  G A     A     C+RL
Sbjct: 629 PTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLK-ALATTAGGVATYDVEAIGSETCDRL 687

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
                V V+N G  DG H +LVF   P    G   P +QL+ F+ +H+ A     V   +
Sbjct: 688 KFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEV 747

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
             CK+ S     G + I  G H + +G  +  +S  A
Sbjct: 748 SPCKHFSRATEDGRKVIDQGSHFVMVGDDEFEMSFMA 784


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  727 bits (1876), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/734 (50%), Positives = 486/734 (66%), Gaps = 41/734 (5%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +   +C  SLP   RV DL+ R++L+EK + +I  A+ VPR+G+  Y+WWSEALHGV+NV
Sbjct: 63  KDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANV 122

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G  T F    PGATSFP VI +A+SFN +LW+ +G+VVS EARAMYN G AGLT+WSPN+
Sbjct: 123 GSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNI 182

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---------LKVAASCKHFT 208
           N+ RDPRWGR  ETPGEDP+  G Y  +YVRGLQ  +G           LK+A+SCKHF 
Sbjct: 183 NVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFA 242

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AYDLD W  VDR HF+AKVS+QD+ +TF  PF MCV EG  +SVMCS+N +NG+P CADP
Sbjct: 243 AYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADP 302

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLDLDCGPFLGL 327
             LK  IR +W L+GYIVSDC ++      Q F   T EE  A +++AGLDL+CG +   
Sbjct: 303 RFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYND 362

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
              +AV+ G +SE D++ +L     V MR+G FDG PS      LG KD+C  +H ELA 
Sbjct: 363 SLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAR 419

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           EAARQGIVLLKN   +LPL  ++   +A++GP+++ TV MIGNYAGI C Y +PL     
Sbjct: 420 EAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE 477

Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
                ++ GC DV C +D     A +A++ ADATI+++G D SIEAE  DR  LLLPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
            E+V++V+  S GP ILV+M GGPID++FAKN+P+IAAI+WAG+PG+ GG AIADI+FG 
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            NPGG+ P+TWY   Y+  LPMT MA+RP +S  YPGRTY+F+ G  VYPFG+G+SYTNF
Sbjct: 598 YNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYTNF 657

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
            +++  APT        R   I+ T                +   QV VKNVGS DG+  
Sbjct: 658 SYSLT-APT--------RSVHISLT----------------SFEFQVAVKNVGSMDGSEV 692

Query: 688 LLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
           ++V+S+PP+G    H KQ++ FE+V V  G  ++V  +++VCK L +VD SG   +P G 
Sbjct: 693 VMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSGYILLPSGS 752

Query: 747 HNIHIGGTKHSVSL 760
           H I  G    SVS 
Sbjct: 753 HTIMAGDNSTSVSF 766


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score =  726 bits (1873), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/787 (46%), Positives = 490/787 (62%), Gaps = 38/787 (4%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            FF   I  + +   L +   F C P   ++   PFC VSL I QR   L+  L L EK+
Sbjct: 11  LFFTSAI--AETFKNLDSHPQFPCKPPHFSS--YPFCNVSLSIKQRAISLVSLLMLPEKI 66

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             L + AA+VPRLGI  YEWWSE+LHG+++ GPG  F G    ATSFPQVI +A+SFN T
Sbjct: 67  GQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRT 126

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           LW  IG  V+ E RAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +Y   +
Sbjct: 127 LWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEF 186

Query: 187 VRGLQGSDGD----------------------RLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           VRG Q                           +L ++A CKHFTAYDL+ W    R+ FN
Sbjct: 187 VRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFN 246

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           A V++QD+EDT+  PF  C+ +GK + +MCSYN VNGVP CA  ++L++  R EW   GY
Sbjct: 247 AVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFEGY 305

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           I SDCD+V   +  Q +T +PEEA ADAI+AG+D++CG ++  HT+SA+++G +SE  ++
Sbjct: 306 ITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVD 365

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            AL+N   VQ+RLG+FDG+P    YG LG  D+C+ DH++LALEA RQGIVLLKN    L
Sbjct: 366 RALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLL 425

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACA 463
           PL+     ++A++GP ++    M G Y G  C   T    +  Y  +T +  GC DV+C 
Sbjct: 426 PLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCD 485

Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
            D  FG A+  ++ AD  I+V GLD S E E  DR  L LPG+Q++LVS V+  SK P I
Sbjct: 486 SDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVI 545

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           LVL  GGP+DV FAKNDPRI +IIW GYPG+ GG A+A+I+FG  NPGG+LP TWYP+ +
Sbjct: 546 LVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTWYPESF 605

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP-- 641
            T++ M++M MR + S+ YPGRTYRFY GP VY FG G+SYT F + + +AP  +++   
Sbjct: 606 -TDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRLSLSEL 664

Query: 642 LDGRHGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-P 694
           L  +         G+ ++      V    C  L   V+V V N G  DG+H +++FS  P
Sbjct: 665 LPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVMLFSKMP 724

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
           P     P KQL+ +++VHV +         I  CK LSV +  G R IPLG H + +G  
Sbjct: 725 PVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVLFLGDL 784

Query: 755 KHSVSLH 761
           +HS+S+ 
Sbjct: 785 QHSLSVE 791


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score =  725 bits (1871), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/749 (46%), Positives = 484/749 (64%), Gaps = 28/749 (3%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           + PFC  SL    R   L+  L+L EK++ L + A+++PRLGI  Y+WWSE LHG++  G
Sbjct: 29  SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 88

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           PG  F G    ATSFPQV+ TA+SFN TLW  IG  ++ EARAM+N G  GLT W+PN+N
Sbjct: 89  PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 148

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-----------------LKVA 201
           IFRDPRWGRGQETPGEDP+++  Y+  +VRGLQ  +  +                 L V+
Sbjct: 149 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 208

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKHFTAYDL+ WN   R+ F++ V++QD+ DT+  PFR C+ +GK + +MCSYN VNG
Sbjct: 209 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 268

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
           VP CA+P++LK+  R +W L GYI SDCD+V   Y+ Q +T TPE+A AD ++AG+D++C
Sbjct: 269 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINC 327

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G F+   T+SA+ +G + E ++++AL+N  +VQ RLG FDG P    +G LG +DVCT  
Sbjct: 328 GTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQ 387

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           H+ LALEAARQGIVLLKN+   LPL      ++ VIG  ++ +  ++G YAG+ C   + 
Sbjct: 388 HKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSL 447

Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           ++G   YA TI    GC DV CA D  F  AI  +++AD  I V GLD S E E LDR  
Sbjct: 448 VEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRVS 507

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG+Q +LVS V+  SK P ILVL+ GGP+D++FAK D R+A+I+W G PG+AGG A+
Sbjct: 508 LLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKAL 567

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           A+++FG  NPGG+LP+TWYPQ + TN+PM +M MRP+ S+ YPGRTYRFY G  +Y FG 
Sbjct: 568 AEVIFGDYNPGGRLPVTWYPQSF-TNVPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGE 626

Query: 621 GMSYTNFVHTVANAPTVVAV---PLDGRHGSINATISGKAIKVTHAK----CNRLTLGVQ 673
           G+SYT+F + + +AP  V +       R   I     G  +     +    C+ L   V+
Sbjct: 627 GLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVK 686

Query: 674 VDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           + V N+G  DG+H +++FS  P      P +QL+ F++++V         I +  C ++S
Sbjct: 687 LSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVS 746

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           + D  G R IPLG+H I +G  +H +S+ 
Sbjct: 747 LADEYGKRVIPLGDHTISLGDLEHVISIQ 775


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  725 bits (1871), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/754 (48%), Positives = 491/754 (65%), Gaps = 26/754 (3%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           PFACD  + ++R+ PFC   LPIPQR  DL+ RL+L EK+  L++ A  +PRLGI  Y+W
Sbjct: 25  PFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPRLGIPAYQW 84

Query: 87  WSEALHGVSNVGPGTKFGGD--FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           WSEALHGVS VGPG  F  +     ATSFPQVI TA+SF++ LW  IG  +  EARA++N
Sbjct: 85  WSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIGIEARAIFN 144

Query: 145 GGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLKVA 201
            G A GLT+W+PN+NIFRDPRWGRGQET GEDP+L+ +YA S+VRGLQG    G  L  +
Sbjct: 145 AGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSFKGAHLLAS 204

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKHFTAYDLDNW GVDRF F+A+VS QD+ DT+  PF+ CV +G+ + +MC+YN+VNG
Sbjct: 205 ACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNG 264

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
           VP CAD  +L +T R +W  NGYI SDC +VG  +D Q +  +PE+  AD +RAG+DL+C
Sbjct: 265 VPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVVADVLRAGMDLEC 324

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G +L  H +SAV +  L   +I+ AL N  +++MRLG+FDG P+   +G +G   VC+ +
Sbjct: 325 GSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSKE 384

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHR-TVAVIGPNSDVT-VTMIGNYAGIACGYT 439
           HQ LALEAAR GIVLLKN    LPL       ++AVIGPN++ + +T++GNYAG  C Y 
Sbjct: 385 HQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKYV 444

Query: 440 TPLQGIGRYART-IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           T LQG   Y +   +  GC             A++ +++ D  +LVMGLDQS E E  DR
Sbjct: 445 TILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERDR 504

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L LPG+Q EL++ V+ ASK P ILVL+SGGP+D+  AK + +I  I+WAGYPG+ GG 
Sbjct: 505 VHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGGI 564

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           A+A I+FG  NPGG+LP TWYP++YI  +PMT+M MR   S  YPGRTYRFYKGP VY F
Sbjct: 565 ALAQIIFGDHNPGGRLPTTWYPKDYI-KVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEF 623

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA---------TISGKAI-KVTHAKCNRL 668
           G+G+SY+ + +        V+V  D  H + ++         TIS K + ++    C  +
Sbjct: 624 GYGLSYSKYSY------EFVSVTHDKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSM 677

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPP-AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
           +L V V V+N GS  G H +L+F  P      +P KQLV FE V + AG    V   +  
Sbjct: 678 SLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSP 737

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           C++LS  + +G   I  G H + +   +H + ++
Sbjct: 738 CEHLSRANEAGAMIIEEGSHMLLVDDLEHPIDIY 771


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score =  725 bits (1871), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/742 (48%), Positives = 493/742 (66%), Gaps = 23/742 (3%)

Query: 18  SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVP 77
           S+S   A   F C P   T  T  FC  SLPI +R   LI  L+LQEK++ L   A+ +P
Sbjct: 20  SNSKSVANPQFPCKP--PTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIP 77

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKF--GGDFPGATSFPQVITTASSFNATLWEAIGRVV 135
           RLGI  YEWWSE+LHG+S  GPG  F  GG    AT FPQVI +A+SFN TLW  IG  +
Sbjct: 78  RLGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAI 137

Query: 136 SDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--- 192
           + EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+++  YA  +V+G QG   
Sbjct: 138 AIEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHW 197

Query: 193 --SDG----DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVME 246
              DG    D+L ++A CKH TAYDL+ W    R+ FNA V++QD+EDT+  PFR C+ +
Sbjct: 198 KNEDGEINDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQK 257

Query: 247 GKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE 306
           GK + +MCSYN+VNGVP CA  ++L++  R EW   GYI SDCD+V   ++ Q+++ +PE
Sbjct: 258 GKASCLMCSYNEVNGVPACAREDLLQKP-RTEWGFKGYITSDCDAVATIFEYQNYSKSPE 316

Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
           +A A A++AG+D++CG ++  + +SAV++G L E DI+ AL N  +VQ+RLG+FDG+P  
Sbjct: 317 DAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRK 376

Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
             +G LGPK+VCT +H+ LALEAARQGIVLLKN    LPL+     ++A+IGP +++  +
Sbjct: 377 GQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANS 436

Query: 427 MIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
           + G+Y G  C   +  +G+  Y  +T +  GC DVAC  D  F  AI  +++AD  I+V 
Sbjct: 437 LGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVA 496

Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
           GLD S E E  DR  LLLPG+Q  LVS V+ ASK P ILVL  GGP+DV+FAK DPRIA+
Sbjct: 497 GLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIAS 556

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
           I+W GYPG+AG  A+A+I+FG  NPGG+LPMTWYP+ + T + MT+M MRP+ S+ YPGR
Sbjct: 557 ILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESF-TEVSMTDMNMRPNPSRGYPGR 615

Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH--- 662
           TYRFY G  VY FG G+SYTNF + + +AP+ +++       S    +     ++++   
Sbjct: 616 TYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGERLSYINI 675

Query: 663 ---AKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQ 718
                C+ L   +Q+ V+NVG+ DG H +++FS  P     AP KQLV F++VH  +   
Sbjct: 676 NEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISHRS 735

Query: 719 QRVGINIHVCKYLSVVDRSGTR 740
             + I +  C++LSV +  G +
Sbjct: 736 TEMSILVDPCEHLSVANEQGKK 757


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/749 (46%), Positives = 484/749 (64%), Gaps = 28/749 (3%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           + PFC  SL    R   L+  L+L EK++ L + A+++PRLGI  Y+WWSE LHG++  G
Sbjct: 19  SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 78

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           PG  F G    ATSFPQV+ TA+SFN TLW  IG  ++ EARAM+N G  GLT W+PN+N
Sbjct: 79  PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 138

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-----------------LKVA 201
           IFRDPRWGRGQETPGEDP+++  Y+  +VRGLQ  +  +                 L V+
Sbjct: 139 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 198

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKHFTAYDL+ WN   R+ F++ V++QD+ DT+  PFR C+ +GK + +MCSYN VNG
Sbjct: 199 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 258

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
           VP CA+P++LK+  R +W L GYI SDCD+V   Y+ Q +T TPE+A AD ++AG+D++C
Sbjct: 259 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINC 317

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G F+   T+SA+ +G + E ++++AL+N  +VQ RLG FDG P    +G LG +DVCT  
Sbjct: 318 GTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQ 377

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           H+ LALEAARQGIVLLKN+   LPL      ++ VIG  ++ +  ++G YAG+ C   + 
Sbjct: 378 HKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSL 437

Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           ++G   YA TI    GC DV CA D  F  AI  +++AD  I V GLD S E E LDR  
Sbjct: 438 VEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRVS 497

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG+Q +LVS V+  SK P ILVL+ GGP+D++FAK D R+A+I+W G PG+AGG A+
Sbjct: 498 LLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKAL 557

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           A+++FG  NPGG+LP+TWYPQ + TN+PM +M MRP+ S+ YPGRTYRFY G  +Y FG 
Sbjct: 558 AEVIFGDYNPGGRLPVTWYPQSF-TNVPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGE 616

Query: 621 GMSYTNFVHTVANAPTVVAV---PLDGRHGSINATISGKAIKVTHAK----CNRLTLGVQ 673
           G+SYT+F + + +AP  V +       R   I     G  +     +    C+ L   V+
Sbjct: 617 GLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVK 676

Query: 674 VDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           + V N+G  DG+H +++FS  P      P +QL+ F++++V         I +  C ++S
Sbjct: 677 LSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVS 736

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           + D  G R IPLG+H I +G  +H +S+ 
Sbjct: 737 LADEYGKRVIPLGDHTISLGDLEHVISIQ 765


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  724 bits (1869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/633 (54%), Positives = 460/633 (72%), Gaps = 6/633 (0%)

Query: 131 IGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           IG+VVS E RAMYN G AGLTYWSPNVNI+RDPRWGRGQET GEDP LS +Y  +YV+GL
Sbjct: 2   IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61

Query: 191 QGSDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
           Q  D    D LKVA+ CKH+TAYD+D+W G+ R++FNAKV++QD++DTF+ PF+ CV++G
Sbjct: 62  QQRDDGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSCVLDG 121

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
            VASVMCSYNQV+G PTC D ++L   IRG+W+LNGYIV+DCDS+   Y  QH+T TPEE
Sbjct: 122 NVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTKTPEE 181

Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
            AA ++ AGL L+CG +LG +T+ AV +GL++E  I+ A+ N     MRLG FDG P +Q
Sbjct: 182 TAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGNPKNQ 241

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
            YG+LGPKD+CT DHQELA EAARQGIVLLKN   SLPLS    +++AVIGPN+++  TM
Sbjct: 242 LYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANLAYTM 301

Query: 428 IGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
           +G+Y G  C YTTPL G+G    T++QQGC D+ACA  Q+  A    +  ADA +LVMG 
Sbjct: 302 VGSYEGSPCKYTTPLDGLGASVSTVYQQGC-DIACATAQVDNAKK-VAAAADAVVLVMGS 359

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
           DQ+IE E+ DR  + LPG+Q  LV++V+  SKGP ILV+MSGG +DV FA ++P++ +I+
Sbjct: 360 DQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKVTSIL 419

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
           W G+PG+AGG A+AD++FG  NPGG+LPMTWYPQ Y+  + MT M MR      +PGR+Y
Sbjct: 420 WVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPKTGFPGRSY 479

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
           RFYKGP V+ FG G+SYT + H +  AP  V++PL+  H +  +T       V    CN 
Sbjct: 480 RFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGH-ACRSTKCKSIDAVNEQGCNN 538

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
           L L + + V+NVG   G+HT+L+F++PP+ H AP K L+ F+K+H+   ++  V  N+ V
Sbjct: 539 LGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQKHLLDFQKIHLTPQSEGVVKFNLDV 598

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           CK+LSVVD  G R++ LG H +HIG  KHS++L
Sbjct: 599 CKHLSVVDEVGNRKVALGLHVLHIGDLKHSLTL 631


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/792 (46%), Positives = 514/792 (64%), Gaps = 42/792 (5%)

Query: 5   IAFFFLGLI----LLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           I+F FL L     L+ A S       P++CD  +  T++LPFC ++L I QR  D++ RL
Sbjct: 11  ISFIFLFLTRYHRLVHADSP--THVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRL 68

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L EK+  L++ A ++PRLGI  Y+WW EALHGV+N G G +  G   GATSFPQVI TA
Sbjct: 69  TLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTA 128

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           +SF++ LW  I +V+  EAR +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 129 ASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 188

Query: 180 GKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS---- 228
            KY  SYVRGLQG         GDRLK +A CKHFTAYDLDNW G+DRF F+AKVS    
Sbjct: 189 SKYGVSYVRGLQGDSFEGGKLIGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFS 248

Query: 229 ------------KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
                        QD+ DT+  PF  C+++G+ + +MC+YN+VNGVP CAD N+L +T R
Sbjct: 249 MAYSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTAR 308

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            +W  NGYI SDC++V + YD Q +  TPE+A AD ++AG+D++CG +L  H ++AV + 
Sbjct: 309 QKWNFNGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQK 368

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
            +    I+ AL N  T+++RLG+FDG P+   YG +GP  VC+ ++ +LALEAAR GIVL
Sbjct: 369 KVPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVL 428

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI-GNYAGIACGYTTPLQGIGRYARTIH-Q 454
           LKN    LPL   R  T+ VIGPN++ +  ++ GNY G  C     L+G   YA   H +
Sbjct: 429 LKNTASILPLP--RVNTLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYR 486

Query: 455 QGCKD-VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
            GC D   CA  ++   A++ ++ +D  ILVMGLDQS E E+ DR  L LPG+QQEL++ 
Sbjct: 487 SGCLDGTKCASAEI-DRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINS 545

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           V+ ASK P ILVL+ GGP+D+ FAKN+ +I  IIWAGYPG+ GG A+A ++FG  NPGG+
Sbjct: 546 VAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGR 605

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VA 632
           LPMTWYP+++I  +PMT+M MR   S  YPGRTYRFY GP VY FG+G+SY+N+ +  ++
Sbjct: 606 LPMTWYPKDFI-KIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFIS 664

Query: 633 NAPTVVAVPLDGRHGSI--NATISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
                + +     H  +  + TI  K + ++    C  +++ V + + N GS  G H +L
Sbjct: 665 VKNNNLHINQSTTHSILENSETIYYKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVL 724

Query: 690 VFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
           +F  P  G +  P KQLV FE V V  G +  VG  + VC++LS  + SG + I  G H 
Sbjct: 725 LFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHL 784

Query: 749 IHIGGTKHSVSL 760
           + +G  ++S+++
Sbjct: 785 LVVGEEEYSINI 796


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/776 (46%), Positives = 506/776 (65%), Gaps = 23/776 (2%)

Query: 4   TIAFFFLGLIL---LSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           TI F FL L     L  + S      P++CD  +  T++ PFC ++L I QR  D++ RL
Sbjct: 7   TITFIFLFLTRYHRLVHADSLATNVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSRL 66

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L EK+  L++ A A+PRLGI  Y+WW+EALHGVS VG G +  G    ATSFPQ+I  A
Sbjct: 67  TLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILIA 126

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           +SF+  LW  I +V+  EAR +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+++
Sbjct: 127 ASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 186

Query: 180 GKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
            KY  SYVRGLQG         G RLK +A CKHFTAYDL+NW GV+R+ F+AKV+ QD+
Sbjct: 187 SKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDL 246

Query: 233 EDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSV 292
            DT+   F  CV++G+ + +MC+YN+VNGVP CAD N+L  T R +W  NGYI SDCD+V
Sbjct: 247 ADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAV 306

Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
              Y+ Q +  TPE+  AD +RAG+D++CG ++  H +SAV +  +    I+ AL N  T
Sbjct: 307 RFIYEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFT 366

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
           +++RLG+FDG P+   YG +GP  VC+ ++ +LALEAAR GIVLLKN    LPL   R  
Sbjct: 367 IRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVN 424

Query: 413 TVAVIGPNSD-VTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKD-VACADDQLFG 469
           T+ VIGPN++  ++ ++GNY G  C   + L+G   YA   H + GC D V CA  ++  
Sbjct: 425 TLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEI-D 483

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
            A++ ++ +D  ILVMGLDQS E E LDR  L LPG+QQ+L++ V+ ASK P ILV++ G
Sbjct: 484 RAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCG 543

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
           GP+D+ FAKN+ +I  IIWAGYPG+ GG A+A ++FG  NPGG+LPMTWYP+++I  +PM
Sbjct: 544 GPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFI-KIPM 602

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGS 648
           T+M MR   S  YPGRTYRFY GP VY FG+G+SY+N+ +  ++     + +     H  
Sbjct: 603 TDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNNNIHINQSTTHSI 662

Query: 649 I--NATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQ 704
           +  + TI  K +     K C  +++ V + + N GS  G H +L+F  P  G +  P KQ
Sbjct: 663 LENSETIRYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQ 722

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           LV FE V V  G +  VG  + VC++LS  + SG + I  G +   +G  ++S+++
Sbjct: 723 LVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 778


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/761 (46%), Positives = 491/761 (64%), Gaps = 29/761 (3%)

Query: 22  LAAREPFACDPKDAT-----TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAV 76
           LA    + CD           ++  FC  SL   +R  DL+ R++LQEKV   +  A+ V
Sbjct: 41  LAKNYTYVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGV 100

Query: 77  PRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
            RLG+  Y WWSEALHG+SN+GPG  F    PGATS P VI + ++FN TLW+ +GRVVS
Sbjct: 101 RRLGLPEYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVS 160

Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
            E RAMYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ  +G 
Sbjct: 161 TEGRAMYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGT 220

Query: 197 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
                     LKV++ CKH+ AYD+D+W  VDR  F+A+VS+QD+++TF  PF  CV EG
Sbjct: 221 ENVTDLNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREG 280

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE- 306
            V+SVMCS+N++NG+P C+DP +LK  IR EW L+GYIVSDC  + V  D Q++ +  + 
Sbjct: 281 DVSSVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKV 340

Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
           +A A  ++AGLDL+CG +       +V  G +S+ +++ AL N   + MR+G FDG P+ 
Sbjct: 341 DAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA- 399

Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
             Y  LG KD+C  DH ELA EAARQGIVLLKN    LPL     + +A++GP+++ T  
Sbjct: 400 --YESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKP--GKKIALVGPHANATEV 455

Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
           MIGNYAG+ C Y +PL+         +  GC D +C++D  F  A +A++ A+ TI+ +G
Sbjct: 456 MIGNYAGLPCKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVG 515

Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
            D SIEAE +DR   LLPG Q EL+ +V+  S GP ILV++SG  ID+ FAKN+PRI+AI
Sbjct: 516 TDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAI 575

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           +W G+PG+ GG AIAD++FG  NPGG+LP+TWY  +Y+  LPM+ M++RP     YPGRT
Sbjct: 576 LWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRT 635

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG--RHGSINATISGK-----AIK 659
           Y+F+ G  VYPFG+GMSYT F +++A +   + + L+   +  ++  T   K     A+ 
Sbjct: 636 YKFFDGSTVYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVL 695

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQ 718
           +    C+  T+  +V V NVG  DG+  L+V+S PP+G    H KQ++ F+KV V AG  
Sbjct: 696 LDDMSCDD-TIEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDT 754

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +RV  +++ CK L +VD +G   +P G H I +G   +S S
Sbjct: 755 ERVKFSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSAS 795


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/766 (47%), Positives = 490/766 (63%), Gaps = 19/766 (2%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           L+ L   +  +A+  P+ C         +PFC   LPI +RV+DL+ R+++ EK+  L  
Sbjct: 19  LLQLHGGAVVVASEPPYTCGA--GAPPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGD 76

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            + A+PRLG+  Y+WWSEALHG+SN G G    G    ATSFPQVI TA+SFN  LW  I
Sbjct: 77  QSPAIPRLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRI 136

Query: 132 GRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           G+V+  EARA+YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+
Sbjct: 137 GQVIGVEARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGV 196

Query: 191 QGS------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           QG       +   L+ +A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PF+ CV
Sbjct: 197 QGYGLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCV 256

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
            +G  + +MCSYN+VNGVPTCAD N+L  T R +W   GYI SDCD+V + +D Q +  T
Sbjct: 257 EDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKT 316

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
            E+A AD ++AG+D++CG ++  H  SA+Q+G ++E DIN AL N   V+MRLG+F+G+P
Sbjct: 317 AEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDP 376

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG--PSLPLSHIRHRTVAVIGPNSD 422
               YG +GP  VCT +HQ+LALEAA+ GIVLLKN G   +LPLS     ++AVIG N++
Sbjct: 377 RRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNAN 436

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADAT 481
             + + GNY G  C   TPLQ +  Y + T    GC   AC +      A+ A+  AD+ 
Sbjct: 437 DAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSV 495

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +L MGLDQ  E E +DR  L LPG+QQ L+  V+ A+K P ILVL+ GGP+DV+FAK +P
Sbjct: 496 VLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNP 555

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I AI+WAGYPG+AGG AIA +LFG  NPGG+LP+TWYPQ++ T +PMT+M MR   +  
Sbjct: 556 KIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATG 614

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIK 659
           YPGRTYRFY+GP V+ FG+G+SY+ + H  A    PT     L     +     S     
Sbjct: 615 YPGRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA 674

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAG 716
           +    C+RL     V V+N G  DG H++LVF   P    G   P  QL+ F+ +H+ A 
Sbjct: 675 IGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRAT 734

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
               V   +  CK+ S     G + I  G H + +G  +  +S  A
Sbjct: 735 QTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 780


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  720 bits (1858), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/750 (48%), Positives = 486/750 (64%), Gaps = 22/750 (2%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           PF+C        +  FC   LPI QR +DL+ RL+L+EK+  L   + AV RLG+  Y+W
Sbjct: 29  PFSC----GAPSSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKW 84

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVSN G G    G    ATSFPQVI TA+SFN  LW  IG+V+  EARA+YN G
Sbjct: 85  WSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNG 144

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------DGDRLK 199
            A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG       +   L+
Sbjct: 145 QAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLE 204

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
            +A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PFR CV +G  + +MCSYN+V
Sbjct: 205 ASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRV 264

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVPTCAD N+L +T RG+WR  GYI SDCD+V + +D Q +  T E+A AD ++AG+D+
Sbjct: 265 NGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDV 324

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG ++  H  SA+Q+G ++E DIN AL N   V+MRLG+F+G P    YG++GP  VCT
Sbjct: 325 NCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCT 384

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +HQ LALEAA+ G+VLLKN   +LPLS  +  ++AVIG N++    ++GNY G  C   
Sbjct: 385 QEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISV 444

Query: 440 TPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           TPLQ +  Y + T    GC   AC    + G A   +   D  +L MGLDQ  E E +DR
Sbjct: 445 TPLQVLQGYVKDTRFLAGCNSAACNVSSI-GEAAQLASSVDYVVLFMGLDQDQEREEVDR 503

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L LPG Q+ L++ V+ A+K P ILVL+ GGP+DV FAK +P+I AI+WAGYPG+AGG 
Sbjct: 504 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 563

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           AIA +LFG  NPGG+LP+TWYP+E+ T++PMT+M MR   S  YPGRTYRFY+G  VY F
Sbjct: 564 AIAQVLFGEHNPGGRLPVTWYPKEF-TSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKF 622

Query: 619 GHGMSYTNFVHT-VANAPTVVAV-PLDGRHGSINA---TISGKAIKVTHAKCNRLTLGVQ 673
           G+G+SY+ + H  VAN   + ++  +DG      A   T+S    ++    C++L     
Sbjct: 623 GYGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGTETCDKLKFPAL 682

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V V+N G  DG H +L+F   P G      P  QL+ F+ +H+ +     V   +  CK+
Sbjct: 683 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 742

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            S     G + I  G H + +G  +  +S 
Sbjct: 743 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 772


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  719 bits (1855), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/750 (48%), Positives = 486/750 (64%), Gaps = 22/750 (2%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           PF+C        +  FC   LPI QR +DL+ RL+L+EK+  L   + AV RLG+  Y+W
Sbjct: 35  PFSC----GAPSSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKW 90

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSEALHGVSN G G    G    ATSFPQVI TA+SFN  LW  IG+V+  EARA+YN G
Sbjct: 91  WSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNG 150

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS------DGDRLK 199
            A GLT+W+PN+N+FRDPRWGRGQETPGEDP ++GKYAA +VRG+QG       +   L+
Sbjct: 151 QAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLE 210

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
            +A CKHFTAYDL+NW GV R+ F+AKV+ QD+ DT++ PFR CV +G  + +MCSYN+V
Sbjct: 211 ASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRV 270

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVPTCAD N+L +T RG+WR  GYI SDCD+V + +D Q +  T E+A AD ++AG+D+
Sbjct: 271 NGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDV 330

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG ++  H  SA+Q+G ++E DIN AL N   V+MRLG+F+G P    YG++GP  VCT
Sbjct: 331 NCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCT 390

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +HQ LALEAA+ G+VLLKN   +LPLS  +  ++AVIG N++    ++GNY G  C   
Sbjct: 391 QEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISV 450

Query: 440 TPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           TPLQ +  Y + T    GC   AC    + G A   +   D  +L MGLDQ  E E +DR
Sbjct: 451 TPLQVLQGYVKDTRFLAGCNSAACNVSSI-GEAAQLASSVDYVVLFMGLDQDQEREEVDR 509

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L LPG Q+ L++ V+ A+K P ILVL+ GGP+DV FAK +P+I AI+WAGYPG+AGG 
Sbjct: 510 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 569

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           AIA +LFG  NPGG+LP+TWYP+E+ T++PMT+M MR   S  YPGRTYRFY+G  VY F
Sbjct: 570 AIAQVLFGEHNPGGRLPVTWYPKEF-TSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKF 628

Query: 619 GHGMSYTNFVHT-VANAPTVVAV-PLDGRHGSINA---TISGKAIKVTHAKCNRLTLGVQ 673
           G+G+SY+ + H  VAN   + ++  +DG      A   T+S    ++    C++L     
Sbjct: 629 GYGLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGPETCDKLKFPAL 688

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW---APHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V V+N G  DG H +L+F   P G      P  QL+ F+ +H+ +     V   +  CK+
Sbjct: 689 VRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKH 748

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            S     G + I  G H + +G  +  +S 
Sbjct: 749 FSRATEDGKKVIDHGSHFMMVGDDEFEMSF 778


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  713 bits (1840), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/748 (46%), Positives = 489/748 (65%), Gaps = 22/748 (2%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           C    + +   PFC  SL IP R   L+  L+L EK+  L + A  VPRLGI  Y+WWSE
Sbjct: 24  CASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 83

Query: 90  ALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
           +LHG+++ GPG  F  G    AT+FPQVI + ++FN +LW A+   V+ EA  M+N G A
Sbjct: 84  SLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 143

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKH 206
           GLTYW+PN+NIFRDPRWGRGQET GEDP ++  Y+  YV+G QG  G+  R++++A CKH
Sbjct: 144 GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGRIRLSACCKH 203

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +TAYD++ W G  R+ FNAKV+ QD+EDT+  PF+ C+ E + + +MC+YNQVNGVP CA
Sbjct: 204 YTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCA 263

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           + ++L++T R EW   GYI SDCD+V + ++ Q +T + E++ A  ++AG+D++CG FL 
Sbjct: 264 NKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDINCGSFLV 322

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
            HT+SAV++G + E DI+ AL N  +VQ+RLG+FD   ++Q    LGP +VCT +H+ELA
Sbjct: 323 RHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKEHRELA 382

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
            EA RQG VLLKN    LPL     R VA+IGP+++    M G+Y G+AC  TT L+GI 
Sbjct: 383 AEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTFLKGIQ 442

Query: 447 RYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
            YA +T    GCKDV+C   +LFG AI A+++AD  ++V GL+ + E E  DR  LLLPG
Sbjct: 443 AYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVSLLLPG 502

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           +Q  L+  V+  +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG  + +ILF
Sbjct: 503 KQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 562

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G  NPGGKL MTWYP+ + T +PMT+M MR   S+ YPGRTYRFY G VVY FG+G+SY+
Sbjct: 563 GEYNPGGKLAMTWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 621

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQV 674
            + +++ +AP  + +       S+   IS K   +             A C  L   V V
Sbjct: 622 KYSYSILSAPKKITM----SRSSVLDIISRKPSYIRRDGLDFVKTEDIASCEALAFSVHV 677

Query: 675 DVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
            V N GS DG+H +L+F+   +     P KQLV FE+VH  AG+   V I++  CK++S 
Sbjct: 678 AVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDPCKHMSA 737

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
            +  G R + LG+H + +G  +  + + 
Sbjct: 738 ANPEGKRVLLLGDHVLTVGDEEFELFIE 765


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  711 bits (1836), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/757 (48%), Positives = 491/757 (64%), Gaps = 26/757 (3%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           +ACDP        PFC  S+    RV DLI RL++QEK++ L++ AA V RLGI  Y+WW
Sbjct: 20  YACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIPPYQWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
            E LHGV+ + P   FGG  P ATSFP    +  S+N TLW  IG+VVS E RAMYN G 
Sbjct: 80  GEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAMYNQGR 138

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----------- 196
           +GLTYWSPN+NI RDPRWGR QETPGEDP LS  YA  +V+GLQ  D D           
Sbjct: 139 SGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQAVSRGP 198

Query: 197 -RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
            RLK++A CKHFTA+DLD W   DR HF++KV++QD+EDT++  F+ CV EG+ +SVMCS
Sbjct: 199 RRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSSSVMCS 258

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN++NG+P C    +L  T+R +W  +GYIVSDCD+V + +D  ++  T E+A +  + A
Sbjct: 259 YNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSEDAVSYVMLA 318

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           G+DL+CG    +H  +A+ + L+ E  I+  L N   V+MRLGMFDG PS+ PYG LGP+
Sbjct: 319 GMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPYGSLGPE 378

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
           D+CT D+Q LALEAARQ +VLLKN+  +LP        +AVIG ++D T  M+GNY G  
Sbjct: 379 DMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLGNYEGYP 438

Query: 436 CGYTTPLQGIGRYA-----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
           C + +PLQG  +       R  H++GC D AC D     AA +A+ QADA +LV+G+ Q+
Sbjct: 439 CKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLVLGISQA 498

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
            E E  DR  LLLPGRQ ELVS V  AS G P +LVL+SG P+DV+FA +DPRI +IIWA
Sbjct: 499 QEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRIQSIIWA 558

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           GYPGQ+GG AIA+ +FG  NPGG+L  +WY + Y TN+ M+ M MRP+ S  YPGRTYRF
Sbjct: 559 GYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENY-TNIDMSNMNMRPNASTGYPGRTYRF 617

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP-LDGRHGSINATISGKAIKVTHAK---C 665
           +    ++ FGHG+SY++F +T+ +AP  +  P L  +  S +  +    +   H +   C
Sbjct: 618 FTDTPLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLCSSDRAVMTSDLNCLHYEKEAC 677

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGI 723
              +  V+V V N G   G H++L+FS PP+      P KQLV+FE+VH+ AGA Q +  
Sbjct: 678 KESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLVSFERVHLEAGAGQEILF 737

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            ++ C+ L  V   G R + LGEH + +G  +H +++
Sbjct: 738 KVNPCEDLGTVGDDGIRTVELGEHTLMVGMVQHVLTV 774


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  709 bits (1831), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/742 (48%), Positives = 483/742 (65%), Gaps = 28/742 (3%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F C P  A  +   FC  +LP  QR  DL+ RL+  EKV  L   A  VPRLGI  Y+WW
Sbjct: 37  FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 95

Query: 88  SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           SEALHG++  G G  FG G    ATSFPQVI TA++F+  LW  IG+ +  E RA YN G
Sbjct: 96  SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 155

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GL  WSPNVNIFRDPRWGRGQETPGEDP  + KY A++V+GLQGS    L+ +A CK
Sbjct: 156 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 215

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y  +NGVP C
Sbjct: 216 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 275

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  ++L +T+RGEW+L+GY  SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 276 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 335

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
             +  SA+Q+G ++E D++ AL N   ++MRLG FDG+P  ++ YG LG  DVCTP H+ 
Sbjct: 336 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADVCTPVHKA 395

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAAR+G+VLLKN    LPL      + AVIG N++  + ++GNY G+ C  TTP  G
Sbjct: 396 LALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCETTTPFGG 455

Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           I +Y ++     GC   AC   A DQ    A   ++ +D   LVMGL Q  E E LDR  
Sbjct: 456 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 511

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 512 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 571

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NP GKLP+TWYP+E+ T   MT+M MRP  +  YPGR+YRFYKG  VY FG+
Sbjct: 572 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630

Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
           G+SY+ F    V    N+ +     L G   +   T  G A+    ++   +C RL   V
Sbjct: 631 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 688

Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V+V+N G  DG HT+L+F   S+   G   P +QL+ F   H+  G ++++ + I  C+
Sbjct: 689 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 746

Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
           +LS     G + I  G H + +
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMV 768


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  709 bits (1830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/742 (48%), Positives = 482/742 (64%), Gaps = 28/742 (3%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F C P  A  +   FC  +LP  QR  DL+ RL+  EKV  L   A  VPRLGI  Y+WW
Sbjct: 111 FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 169

Query: 88  SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           SEALHG++  G G  FG G    ATSFPQVI TA++F+  LW  IG+ +  E RA YN G
Sbjct: 170 SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 229

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GL  WSPNVNIFRDPRWGRGQETPGEDP  + KY A++V+GLQGS    L+ +A CK
Sbjct: 230 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 289

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y  +NGVP C
Sbjct: 290 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 349

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  ++L +T+RGEW+L+GY  SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 350 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 409

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
             +  SA+Q+G ++E D++ AL N   ++MRLG FDG+P  ++ YG L   DVCTP H+ 
Sbjct: 410 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 469

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAAR+G+VLLKN    LPL      + AVIG N++  + ++GNY G+ C  TTP  G
Sbjct: 470 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 529

Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           I +Y ++     GC   AC   A DQ    A   ++ +D   LVMGL Q  E E LDR  
Sbjct: 530 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 585

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 586 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 645

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NP GKLP+TWYP+E+ T   MT+M MRP  +  YPGR+YRFYKG  VY FG+
Sbjct: 646 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 704

Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
           G+SY+ F    V    N+ +     L G   +   T  G A+    ++   +C RL   V
Sbjct: 705 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 762

Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V+V+N G  DG HT+L+F   S+   G   P +QL+ F   H+  G ++++ + I  C+
Sbjct: 763 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 820

Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
           +LS     G + I  G H + +
Sbjct: 821 HLSRARVDGEKVIDRGSHFLMV 842


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  709 bits (1830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/756 (47%), Positives = 485/756 (64%), Gaps = 24/756 (3%)

Query: 22  LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           LA   PF+C P  A  +  L FC V+L   QR  DL+ RL+  EK+  L   A  VPRLG
Sbjct: 22  LAGDPPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLG 81

Query: 81  IKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
           + GY+WW+EALHG++  G G  F   G    ATSFPQV+ TA++F+  LW  IG+ +  E
Sbjct: 82  VPGYKWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGRE 141

Query: 139 ARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGD 196
           ARA++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S   
Sbjct: 142 ARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS 201

Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
            L+ +A CKH TAYDL++WNGV R+ F A+V+ QD+EDTF+ PFR CV+EGK + +MC+Y
Sbjct: 202 LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAY 261

Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
             +NGVP CA+ ++L  T+RG+W L+GY+ SDCD+V +  D Q +  TPE+A A +++AG
Sbjct: 262 TAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAG 321

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           LD+DCG ++  H  +A+Q+G L+E+DI+ ALVN   V+MRLG FDG+P    YG L   D
Sbjct: 322 LDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAAD 381

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           +CTP+H+ LALEAA+ GIVLLKN G  LPL      + AVIGPNS+  + +I NY G  C
Sbjct: 382 ICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPC 441

Query: 437 GYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
             TTPLQG+  Y   +    GC   AC D  +   A+  S   D   L MGL Q  E+E 
Sbjct: 442 ESTTPLQGLQSYVNNVRFLAGCSSAAC-DVAVTDQAVVLSGSEDYVFLFMGLSQQQESEG 500

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DR  LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++P+I AI+WAGYPGQA
Sbjct: 501 KDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQA 560

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG AIA +LFG  NP G+LPMTWYP+++ T +PMT+M MR   +  YPGR+YRFY+G  V
Sbjct: 561 GGLAIAKVLFGDHNPSGRLPMTWYPEDF-TKVPMTDMRMRADPTSGYPGRSYRFYQGNAV 619

Query: 616 YPFGHGMSYTNFVHTV---ANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CN 666
           Y FG+G+SY+ F   +    + P + +  L G    +  T++ +  +  H        C 
Sbjct: 620 YKFGYGLSYSTFSSRLLYGTSMPALSSTVLAG----LRETVTEEGDRSYHIDDIGTDGCE 675

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           +L     V+V+N G  DG H+ L+F   P      P  QL+ F   H+ AG    +  +I
Sbjct: 676 QLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQHLKAGETANLRFDI 735

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             C++ S V   G + I +G H + +    H++ + 
Sbjct: 736 SPCEHFSRVRADGMKVIDIGSHFLTV--DNHAIEIR 769


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  709 bits (1829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/741 (48%), Positives = 479/741 (64%), Gaps = 22/741 (2%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           P++C P+  ++    FC   LP  +R  DL+ RL+  EKV  L   A  VPRLG+  Y+W
Sbjct: 26  PYSCGPRSPSS-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYKW 84

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSE LHG+S  G G  F G     TSFPQV+ TA++F+  LW  IG+ +  EARA+YN G
Sbjct: 85  WSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLG 144

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GLT WSPNVNI+RDPRWGRGQETPGEDP  + KYA ++V+GLQGS    L+ +A CK
Sbjct: 145 QAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQTSACCK 204

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYDL+ WNGV R++FNAKV+ QD+ DTF+ PF+ CV++ K + VMC+Y  +NGVP C
Sbjct: 205 HATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVPAC 264

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  ++L +T RG+W L+GY+ SDCD+V +  D Q +  TPE+  A AI+AGLDL+CG + 
Sbjct: 265 ASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLNCGNYT 324

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
            +H  +A+Q+G + E D++ AL N   V+MRLG FDG+P S+  YGHLG  DVCT  H++
Sbjct: 325 QVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQAHRD 384

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAA+ GIVLLKN   +LPL     R+ AVIGPN++    + GNY G  C  TTPLQG
Sbjct: 385 LALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTPLQG 444

Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           + RY  ++    GC   AC       AA  AS  +D  I+ MGL Q  E E LDR  LLL
Sbjct: 445 VQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGLSQDQEKEGLDRTSLLL 503

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+QQ L++ V+ A++ P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAGG AIA +
Sbjct: 504 PGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIAKV 563

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG+G+S
Sbjct: 564 LFGDHNPSGRLPVTWYPEEF-TRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYGLS 622

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAK------CNRLTLGVQV 674
           Y+ F   +  A    A P       +   I   +G   +  H +      C RL     V
Sbjct: 623 YSKFSRRLVAA----AKPRRPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678

Query: 675 DVKNVGSKDGAHTLLVFSTPP---AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           +V N G  DG H++LVF   P   AG   P +QLV F   HV AG + R+ + I+ C++L
Sbjct: 679 EVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738

Query: 732 SVVDRSGTRRIPLGEHNIHIG 752
           S     GT+ I  G H + +G
Sbjct: 739 SRAREDGTKVIDRGSHFLKVG 759


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/767 (46%), Positives = 485/767 (63%), Gaps = 71/767 (9%)

Query: 5   IAFFFLGLILLSASSSG-----LAAREP------FACDPKDATT-----RTLPFCQVSLP 48
           IAFF +   + S    G     +AAR P      + CD           +   +C  S P
Sbjct: 17  IAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFAALGLDMKDFHYCDSSSP 76

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
              R  DL+ R++L EKV      A+ V R+G+  Y WWSEALHGVSN G    F    P
Sbjct: 77  YEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGRCVFFDEVVP 136

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GATSFP VI +A+SFN +LW+ +G+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR 
Sbjct: 137 GATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINVVRDPRWGRI 196

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ETPGEDP L G YA +Y                   H+ AYDLDNW G DR HF+A+VS
Sbjct: 197 LETPGEDPHLVGLYAVNY-------------------HYAAYDLDNWKGADRVHFDARVS 237

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            QD+ +TF +PF MCV EG V+SVMCSYN++NG+P+CAD  +LK+TIRGEW L+GYIVSD
Sbjct: 238 VQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSD 297

Query: 289 CDSVGVY-YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           CDSV V   D +   S+  ++AA A+ AG++LDCG F       AV +G  ++ D++++L
Sbjct: 298 CDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSL 357

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
                + MR+G FDG P+   +  LG  D+C+ +H ELA EAARQGIVLLKN   +LPL 
Sbjct: 358 RYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELAREAARQGIVLLKNDNATLPLK 414

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
            +++  +A++GP+++ T  MIGNYAGI C Y +PL          +++GC DV C ++  
Sbjct: 415 SVKN--IALVGPHANATDAMIGNYAGIPCYYVSPLDAFSSMGEVRYEKGCADVQCLNETY 472

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
              A++A+++ADATI+  G D SIEAEALDR  LLLPG Q +L+++V+  S GP +LV+M
Sbjct: 473 IFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIM 532

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SGG +D++FA+++P+IAAI+WAGYPG+ GG AIAD++ G  NPGG+LP+TWY  +Y+  L
Sbjct: 533 SGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDML 592

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           PMT MA+RP  S  YPGRTY+F+ G  VYPFG+GMSYTNF ++++               
Sbjct: 593 PMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSYTNFSYSLS--------------- 637

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLV 706
                        T   C   ++  +V VKNVG  DG+  ++V+S+PP G    H K++V
Sbjct: 638 -------------TSQSCKE-SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVV 683

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            FE+V V  G  ++V  +++VCK L +VD +G   +P G H I +GG
Sbjct: 684 GFERVFVKVGGTEKVKFSMNVCKSLGIVDSTGYALLPSGSHTIKVGG 730


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  707 bits (1825), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/742 (48%), Positives = 482/742 (64%), Gaps = 28/742 (3%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F C P  A  +   FC  +LP  QR  DL+ RL+  EKV  L   A  VPRLGI  Y+WW
Sbjct: 37  FTCGPASAQ-KGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWW 95

Query: 88  SEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           SEALHG++  G G  FG G    ATSFPQVI TA++F+  LW  IG+ +  E RA YN G
Sbjct: 96  SEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLG 155

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GL  WSPNVNIFRDPRWGRGQETPGEDP  + KY A++V+GLQGS    L+ +A CK
Sbjct: 156 QAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCK 215

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYD++ W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y  +NGVP C
Sbjct: 216 HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 275

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  ++L +T+RGEW+L+GY  SDCD+V + + ++HFT T EEA A A++AGLD++CG ++
Sbjct: 276 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 335

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
             +  SA+Q+G ++E D++ AL N   ++MRLG FDG+P  ++ YG L   DVCTP H+ 
Sbjct: 336 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 395

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAAR+G+VLLKN    LPL      + AVIG N++  + ++GNY G+ C  TTP  G
Sbjct: 396 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 455

Query: 445 IGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           I +Y ++     GC   AC   A DQ    A   ++ +D   LVMGL Q  E E LDR  
Sbjct: 456 IQKYVKSAKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTS 511

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           LLLPG+QQ L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AI
Sbjct: 512 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 571

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NP GKLP+TWYP+E+ T   MT+M MRP  +  YPGR+YRFYKG  VY FG+
Sbjct: 572 ADVLFGEFNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630

Query: 621 GMSYTNF----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGV 672
           G+SY+ F    V    N+ +     L G   +   T  G A+    ++   +C RL   V
Sbjct: 631 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAAT--TPEGDAVYRVDEIGDDRCERLRFPV 688

Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V+V+N G  DG HT+L+F   S+   G   P +QL+ F   H+  G ++++ + I  C+
Sbjct: 689 MVEVQNHGPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCE 746

Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
           +LS     G + I  G H + +
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMV 768


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  706 bits (1823), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/748 (47%), Positives = 480/748 (64%), Gaps = 15/748 (2%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           PF+C P  AT +   FC  +LP+ +R  DL+ RL+L EKV  L   A AVPRLG+  Y+W
Sbjct: 35  PFSCGPGSAT-QGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKW 93

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSE LHG+S  G G  F G     TSFPQV+ TA+SF+  +W  IG+ +  EARA+YN G
Sbjct: 94  WSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLG 153

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GLT WSPNVNI+RDPRWGRGQETPGEDP  + KYA ++V+GLQG+    L+ +A CK
Sbjct: 154 QAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSATTLQTSACCK 213

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYDL++WNGV R++FNAKV+ QD+ DTF+ PF+ CV EGK   VMC+Y  +NGVP C
Sbjct: 214 HATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTNINGVPAC 273

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  +++ +T +G+W LNGY+ SDCD+V +  D Q + +TPE+  A A++AGLDL+CG + 
Sbjct: 274 ASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLDLNCGNYT 333

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
            +H  SA+Q+G ++E D++NAL N   V+MRLG FDG+P +S  YG LG  DVC+P H+ 
Sbjct: 334 QVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADVCSPAHKN 393

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAA+ GIVLLKN    LPL      + A IG N++    + GNY G  C  TTPLQG
Sbjct: 394 LALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCETTTPLQG 453

Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           +  Y + +    GC   AC      G A+  +  +D  IL MGL Q  E E +DR  LLL
Sbjct: 454 LQGYVKNVKFLAGCDSAACGFAAT-GQAVTLASSSDYVILFMGLSQKEEQEGIDRTSLLL 512

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+QQ L++ V+ ASK P ILVL++GG +D+ FAK++P+I AI+WAGYPGQAGG AIA +
Sbjct: 513 PGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAIARV 572

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG G+S
Sbjct: 573 LFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDGLS 631

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINA-TISGKAIKVTHAK------CNRLTLGVQVDV 676
           Y+ F   + ++     VP       + A T +   +   H +      C++L     V+V
Sbjct: 632 YSKFSRQLVSSTNTHQVPNTNLLTGLTARTATDGGMSYYHVEEIGVEGCDKLKFPAVVEV 691

Query: 677 KNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           +N G  DG H++++F   P   G   P  QLV F   H+ AG +  +  ++  C++ +  
Sbjct: 692 QNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLVGFRSQHLKAGEKASLTFDVSPCEHFARA 751

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
              G + I  G H + +G  +  +S H+
Sbjct: 752 REDGKKVIDRGSHFLVVGKDEREISFHS 779


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  706 bits (1821), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/741 (48%), Positives = 478/741 (64%), Gaps = 22/741 (2%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           P++C P+ + +    FC   LP  +R  DL+ RL+  EKV  L   A  V RLG+  Y+W
Sbjct: 26  PYSCGPR-SPSLGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPYKW 84

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSE LHG+S  G G  F G     TSFPQV+ TA++F+  LW  IG+ +  EARA+YN G
Sbjct: 85  WSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLG 144

Query: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
            A GLT WSPNVNI+RDPRWGRGQETPGEDP  + KYA ++V+GLQGS    L+ +A CK
Sbjct: 145 QAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQTSACCK 204

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H TAYDL+ WNGV R++FNAKV+ QD+ DTF+ PF+ CV++ K + VMC+Y  +NGVP C
Sbjct: 205 HATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVPAC 264

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A  ++L +T RG+W L+GY+ SDCD+V +  D Q +  TPE+  A AI+AGLDL+CG + 
Sbjct: 265 ASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLNCGNYT 324

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP-SSQPYGHLGPKDVCTPDHQE 384
            +H  +A+Q+G + E D++ AL N   V+MRLG FDG+P S+  YGHLG  DVCT  H++
Sbjct: 325 QVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQAHRD 384

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LALEAA+ GIVLLKN   +LPL     R+ AVIGPN++    + GNY G  C  TTPLQG
Sbjct: 385 LALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTPLQG 444

Query: 445 IGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           + RY  ++    GC   AC       AA  AS  +D  I+ MGL Q  E E LDR  LLL
Sbjct: 445 VQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGLSQDQEKEGLDRTSLLL 503

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG+QQ L++ V+ A++ P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAGG AIA +
Sbjct: 504 PGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIAKV 563

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG+G+S
Sbjct: 564 LFGDHNPSGRLPVTWYPEEF-TRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYGLS 622

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAK------CNRLTLGVQV 674
           Y+ F   +  A    A P       +   I   +G   +  H +      C RL     V
Sbjct: 623 YSKFTRRLVAA----AKPRRPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678

Query: 675 DVKNVGSKDGAHTLLVFSTPP---AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           +V N G  DG H++LVF   P   AG   P +QLV F   HV AG + R+ + I+ C++L
Sbjct: 679 EVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738

Query: 732 SVVDRSGTRRIPLGEHNIHIG 752
           S     GT+ I  G H + +G
Sbjct: 739 SRARDDGTKVIDRGSHFLKVG 759


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  706 bits (1821), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/739 (46%), Positives = 488/739 (66%), Gaps = 22/739 (2%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PFC  SLP P R   L+  L+L EK+  L + AA VPRLGI  YEWWSE+LHG+++ GPG
Sbjct: 38  PFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGPG 97

Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
             F  G    AT FPQVI +A+SFN +LW A+   V+ EARAM+N G AGLTYW+PN+N+
Sbjct: 98  VNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNINV 157

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----RLKVAASCKHFTAYDLDNW 215
           FRDPRWGRGQETPGEDP +   Y+  YV+G QG  GD    R+ ++A CKH+ AYDL+ W
Sbjct: 158 FRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEGRMMLSACCKHYVAYDLEKW 217

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
               R+ FNAKV++QD EDT++ PF+ C+ EG+ + +MCSYNQVNGVP CA  ++L++ +
Sbjct: 218 GNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLLQK-V 276

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           R EW   GY+VSDCD+VG+ Y  Q++T++ E++ A  ++AG+D++CG FL  HT+SA+Q+
Sbjct: 277 RDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGMDINCGSFLIRHTKSAIQK 336

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           G ++E DIN+AL N  +VQ+RLG+FD    +Q +  LGP ++CT +H+ELA EAARQG V
Sbjct: 337 GKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAEAARQGTV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQ 454
           LLKN    LPL       +A+IGP ++    M G+Y G+ C  TT L+G+     +T   
Sbjct: 397 LLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAVVPQTTIA 456

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
            GCKD++C     FG AI+ +++AD  +L+ GL+ + E E LDR  LLLPG+Q +L++ +
Sbjct: 457 AGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQMDLINSI 516

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           +  +K P +LV+  GGP+DV+FAK D RIA+++W GYPG+ GG  + +ILFG  NPGGKL
Sbjct: 517 ASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGEYNPGGKL 576

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+TWYP+ + T +PM +M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + + +  A
Sbjct: 577 PITWYPESF-TAVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGLSYSKYSYNIIQA 635

Query: 635 PTVVAVPLDGRHGSINATISGKA---------IKVTH-AKCNRLTLGVQVDVKNVGSKDG 684
           PT +++    R  +++   + +A         ++V   A C  +   V + V N G+ DG
Sbjct: 636 PTKISL---SRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESIKFSVHISVANDGAMDG 692

Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           +H +L+F+   +     P KQLV FE+++  AG    V I +  CK +S  +  G R + 
Sbjct: 693 SHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKLMSSANTEGRRVLL 752

Query: 744 LGEHNIHIGGTKHSVSLHA 762
           LG H + +G  +H   + A
Sbjct: 753 LGSHLLMVGDEEHEFFMEA 771


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  705 bits (1820), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/746 (46%), Positives = 489/746 (65%), Gaps = 24/746 (3%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A   +  FC  SLP P R   L+  L+L EK+  L + AA VPRLG+  YEWWSE+LHG+
Sbjct: 33  AEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGL 92

Query: 95  SNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
           ++ GPG  F  G    AT FPQVI +A++FN +LW A+   V+ EARAM+N G AGLTYW
Sbjct: 93  ADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGLTYW 152

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----RLKVAASCKHFTA 209
           +PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G QG  GD    R+ ++A CKH+ A
Sbjct: 153 APNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDGREGRMMLSACCKHYIA 212

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YDL+ W    R+ FNA+V+ QD EDT++ PF+ C+ EG+ + +MCSYNQVNGVP CA  +
Sbjct: 213 YDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKD 272

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
           +L++ IR EW   GYIVSDCD+V + ++ Q +TS+ E++ A  ++AG+D++CG FL  HT
Sbjct: 273 LLQK-IRDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGMDVNCGSFLIRHT 331

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           +SA+++G + E DIN+AL N  +VQ+RLG+F+    +Q +  LGP +VCT +H+ELA EA
Sbjct: 332 KSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTKEHRELAAEA 391

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
            RQG VLLKN    LPL   +   +A+IG  ++    M G+Y G+ C   T L+G+  + 
Sbjct: 392 VRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPITFLKGMQAFV 451

Query: 450 -RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
            +T    GCKDV+C     FG AI+A+++AD  +++ GL+ + E+E LDR  LLLPGRQQ
Sbjct: 452 PQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRVTLLLPGRQQ 511

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           +LV+ ++  +K P +LV+  GGP+DVAFAK DPRIA+++W GYPG+ GG  + +ILFG  
Sbjct: 512 DLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQVLPEILFGEY 571

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NPGGKLPMTWYP+ + T +PM +M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + 
Sbjct: 572 NPGGKLPMTWYPESF-TAVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGYGLSYSKYS 630

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKA----------IKVTH-AKCNRLTLGVQVDVK 677
           + +  AP  +++     H  +   IS K           ++V   A C  L   V + V 
Sbjct: 631 YNIVQAPQRISL----SHSPVPGLISRKPAYTRRDGLDYVQVEDIASCESLVFSVHISVA 686

Query: 678 NVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           N G+ DG+H +L+F+   +     P KQLV FE+V+  AG+ + V I +  CKY+S  + 
Sbjct: 687 NDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCKYMSAANT 746

Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHA 762
            G R + LG H++ +G   H   + A
Sbjct: 747 EGRRVLLLGSHHLMVGDEVHEFVIEA 772


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score =  702 bits (1813), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/748 (46%), Positives = 470/748 (62%), Gaps = 34/748 (4%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           FC  +LPI  RV  LI  L++ EK+  L     ++PRLG+  YEWWSE+LHG++  GP  
Sbjct: 43  FCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGPAV 102

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 161
            F G   G TSFPQVI TA++FN TLW +I   ++ EARAMYN G AGLT+W+PN+NI R
Sbjct: 103 NFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINILR 162

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------------------SDGDR 197
           DPRWGRGQETPGEDP++   YA  YV G QG                        +DG+R
Sbjct: 163 DPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLKEDDNDGER 222

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           L ++A CKHFTAYDL+ W    R+ FNA V+KQD+EDTF  PFR C+ +GK + +MCSYN
Sbjct: 223 LMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCLMCSYN 282

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
            VNGVP CAD  +L + +R +W  +GYI SDCD+V   Y+ Q +T TPE+A A A++AG 
Sbjct: 283 SVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVALKAGT 341

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           +++CG ++  H +SA Q+G + E D++ AL    +VQ RLG+FDG P+   + + G +DV
Sbjct: 342 NINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANFGAQDV 401

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           CT +H  LAL+AARQGIVLLKN    LPL      T+A++GP ++V+ +  G Y+G+ C 
Sbjct: 402 CTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVS-SPGGTYSGVPCK 460

Query: 438 YTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
             +  +G  R+  RT++  GC DV C     F  AI   ++AD  I+V G D S E E  
Sbjct: 461 LKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSEETEDH 520

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  LLLPG+Q  LV+ ++ ASK P ILVL  GGP+DV+FA+ DPRIA+I+W  YPG+ G
Sbjct: 521 DRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAYPGETG 580

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+++I+FG  NPGGKLPMTWY + + T +PMT+M MR   S  YPGRTYRFY G V+Y
Sbjct: 581 GKALSEIIFGYQNPGGKLPMTWYLESF-TKVPMTDMNMRADPSNGYPGRTYRFYTGDVLY 639

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA----IKVTHAK-CNRLTLG 671
            FGHG+SYT+F   + +AP+ +++ L   +   +    G++    I V   + C+     
Sbjct: 640 GFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSILAKGRSRLGYIHVDEVESCHSSKFF 699

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V + V N G  DG+H L++FS        AP KQLV F++VHVPA       + +  C+ 
Sbjct: 700 VHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLLVDPCEL 759

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
            S  +  G R + LGEH   +   +H V
Sbjct: 760 FSFANDQGNRILALGEHTFILDDIEHVV 787


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/738 (47%), Positives = 481/738 (65%), Gaps = 22/738 (2%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PFC  +LP P R   L+  L+L EK+  L + AA  PRLG+  +EWWSE+LHGV + GPG
Sbjct: 39  PFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDNGPG 98

Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
             F  G    AT FPQVI +A++FN +LW A  R ++ EARAM+N G AGLT+W+PN+N+
Sbjct: 99  VNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPNINV 158

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
           FRDPRWGRGQETPGEDP +   Y+  YV+G Q   G+  R+ ++A CKH+ AYDL+ W G
Sbjct: 159 FRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDLEKWRG 218

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ FNAKV+ QD+EDT+  PF+ C+ EG+ + +MCSYNQVNGVP CA  +IL+R  R 
Sbjct: 219 FTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQRA-RD 277

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   GYI SDCD+V + ++ Q +T++ E++ A  ++AG+D++CG FL  HT+SA+++G 
Sbjct: 278 EWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGK 337

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           + E DIN+AL N  +VQ+RLG FD    +Q +  LGP +VCT +H+ELA EA RQG VLL
Sbjct: 338 VQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLL 397

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQG 456
           KN    LPL       +A+IGP ++    + G+Y G+ C  TT ++G+  Y  +T    G
Sbjct: 398 KNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAG 457

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
           CKDV C     FG AI+A+++AD  +L+ GL+ + E E  DR  LLLPGRQ +L+  V+ 
Sbjct: 458 CKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVAS 517

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
            +K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG  + +ILFG  NPGGKLP+
Sbjct: 518 VTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPI 577

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           TWYP+ + T +PM +M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + +++  AP 
Sbjct: 578 TWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPK 636

Query: 637 VVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVGSKDGA 685
            +++       S+   IS K                 A C  L   V + V N G+ DG+
Sbjct: 637 KISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGS 692

Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
           H +L+F S+ P+   +P KQLV FE+VH  AG    V I +  CK +S  +  GTR + L
Sbjct: 693 HAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFL 752

Query: 745 GEHNIHIGGTKHSVSLHA 762
           G H + +G  +H + + A
Sbjct: 753 GTHVLMVGDEEHELLIEA 770


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/752 (47%), Positives = 482/752 (64%), Gaps = 21/752 (2%)

Query: 27  PFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYE 85
           PF+C    A       FC  +LP  QR  DL+ RL+  EKV  L   AA VPRLG+  Y+
Sbjct: 36  PFSCGGAAAGGGQGYAFCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYK 95

Query: 86  WWSEALHGVSNVGPGTKFGGDFPG-----ATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           WWSEALHG++  G G  F  D PG     ATSFPQV+ TA++F+  LW  IG+ +  EAR
Sbjct: 96  WWSEALHGLATSGRGLHF--DAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEAR 153

Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
           A+YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++V+G+QG+    L+
Sbjct: 154 ALYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNSSAILQ 213

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
            +A CKH TAYDL++WNGV R++FNAKV+ QD+EDT++ PFR CV++ K   +MC+Y  +
Sbjct: 214 TSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGI 273

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVP CA+ ++L +T+RG+W L+GYI SDCD+V +  D Q +T TPE+A A A++AGLD+
Sbjct: 274 NGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVALKAGLDM 333

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVC 378
           +CG ++  H  +A+Q+G L+E DI+ AL N   ++MRLG FDG+P S   YG LG  D+C
Sbjct: 334 NCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADIC 393

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           TP+H+ LALEAA  GIVLLKN    LPL      + AVIGPN++  + +IGNY G  C  
Sbjct: 394 TPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCES 453

Query: 439 TTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
           TTPL GI  Y + +    GC   AC       AA  AS  +D   L MGL Q  E+E  D
Sbjct: 454 TTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS-SSDYVFLFMGLSQKQESEGRD 512

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R  LLLPG QQ L++ V+ A+K P ILVL++GGP+DV FA+ +P+I AI+WAGYPGQAGG
Sbjct: 513 RTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGG 572

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            AIA +LFG  NPGG+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY 
Sbjct: 573 LAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGKTVYK 631

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CNRLTLG 671
           FG+G+SY+++   + +     A        S+  T + +  +  H +      C +L   
Sbjct: 632 FGYGLSYSSYSRQLVSGGK-PAESYTNLLASLRTTTTSEGDESYHIEEIGTDGCEQLKFP 690

Query: 672 VQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
             V+V+N G  DG H++L++   P A    P  QL+ F   H+  G +  +  +I  C++
Sbjct: 691 AVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEH 750

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
            S V + G + I  G H + +   +  +   A
Sbjct: 751 FSRVRKDGKKVIDRGSHYLMVDKDELEIRFEA 782


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/732 (47%), Positives = 466/732 (63%), Gaps = 34/732 (4%)

Query: 62  LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
           L EK+  L + AA+VPRLGI  YEWWSE+LHG+++ GPG  F G    ATSFPQVI +A+
Sbjct: 2   LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61

Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
           SFN TLW  IG  V+ E RAMYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +
Sbjct: 62  SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121

Query: 182 YAASYVRGLQ----------------------GSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           Y   +VRG Q                           +L ++A CKHFTAYDL+ W    
Sbjct: 122 YGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFT 181

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ FNA V++QD+EDT+  PF  C+ +GK + +MCSYN VNGVP CA  ++L++  R EW
Sbjct: 182 RYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEW 240

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
              GYI SDCD+V   +  Q +T +PEEA ADAI+AG+D++CG ++  HT+SA+++G +S
Sbjct: 241 GFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVS 300

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
           E  ++ AL+N   VQ+RLG+FDG+P    YG LG  D+C+ DH++LALEA RQGIVLLKN
Sbjct: 301 EELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKN 360

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCK 458
               LPL+     ++A++GP ++    M G Y G  C   T    +  Y  +T +  GC 
Sbjct: 361 DHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCS 420

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           DV+C  D  FG A+  ++ AD  I+V GLD S E E  DR  L LPG+Q++LVS V+  S
Sbjct: 421 DVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVS 480

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P ILVL  GGP+DV FAKNDPRI +IIW GYPG+ GG A+A+I+FG  NPGG+LP TW
Sbjct: 481 KKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTW 540

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP+ + T++ M++M MR + S+ YPGRTYRFY GP VY FG G+SYT F + + +AP  +
Sbjct: 541 YPESF-TDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAPIRL 599

Query: 639 AVP--LDGRHGSINATISGKAIK------VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
           ++   L  +         G+ ++      V    C  L   V+V V N G  DG+H +++
Sbjct: 600 SLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHVVML 659

Query: 691 FST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
           FS  PP     P KQL+ +++VHV +         I  CK LSV +  G R IPLG H +
Sbjct: 660 FSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGSHVL 719

Query: 750 HIGGTKHSVSLH 761
            +G  +HS+S+ 
Sbjct: 720 FLGDLQHSLSVE 731


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/784 (45%), Positives = 504/784 (64%), Gaps = 39/784 (4%)

Query: 5   IAFFFLGLI----LLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           I+F FL L     L+ A S  L    P++CD  +  T++  FC ++L   QR  D++ RL
Sbjct: 11  ISFIFLFLTRYNQLVHADSPTLVP--PYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRL 68

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD--FPGATSFPQVIT 118
           +L EK+  L++ A A+PRLGI  Y+WWSEALHGV++ G G +  G+     AT FPQVI 
Sbjct: 69  TLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGKGIRLNGNVTIKAATIFPQVIL 128

Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
           TA+SF++ LW  I +V+  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+
Sbjct: 129 TAASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPL 188

Query: 178 LSGKYAASYVRGLQGS-------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           +S KYA S+VRGLQG        + DRLK +A CKHFTAYDLDNW GVDRF F+A V+ Q
Sbjct: 189 VSAKYAVSFVRGLQGDSFEGGKLNEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQ 248

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
           D+ DT+  PF  C+++G+ + +MC+YN+VNG+P CAD N+L  T R +W  NGYI SDC 
Sbjct: 249 DLADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCS 308

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
           +V + +D Q +   PE+A AD ++AG+D++CG +   H++SAV +  +    I+ AL N 
Sbjct: 309 AVDIIHDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNL 368

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
            ++++RLG+FDG P+   YG +GP  VC+  +  +ALEAAR GIVLLKN    LPL    
Sbjct: 369 FSIRIRLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPK-S 427

Query: 411 HRTVAVIGPNSDVTVTMI-GNYAGIACGYTTPLQGIGRYA-RTIHQQGCKD-VACADDQL 467
             ++ VIGPN++ +  ++ GNY G  C   T LQG   Y+   ++  GC D   C   ++
Sbjct: 428 TDSIVVIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEI 487

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
              A++ ++  D  +LVMGLDQS E+E  DR  L LPG+QQEL++ V+ ASK P ILVL 
Sbjct: 488 -DRAVEVAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLF 546

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
            GGP+D++FAK D +I  I+WAGYPG+ GG A+A ++FG  NPGG+LPMTWYP+++I  +
Sbjct: 547 CGGPVDISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDFI-KI 605

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           PMT+M MR   S  YPGRTYRFY GP VY FG+G+SY+N+ +        ++V  +  H 
Sbjct: 606 PMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSY------NFISVKNNNLHI 659

Query: 648 SINATIS-GKAIKVTHAK---------CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
           + + T S  +  +  H K         C  +++ V + + N GS  G H +L+F  P  G
Sbjct: 660 NQSTTYSILEKSQTIHYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKG 719

Query: 698 -HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
            +  P KQLV FE V V  G +  VG  + VC++LS  + SG + I  G +   +G  ++
Sbjct: 720 RNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEY 779

Query: 757 SVSL 760
           S+++
Sbjct: 780 SINI 783


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/771 (44%), Positives = 502/771 (65%), Gaps = 20/771 (2%)

Query: 6   AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           A  F+  +LL+      + + P++CD   + +   PFC   LPI +R  DL+ RL+L EK
Sbjct: 8   AAIFISFLLLTLHHHAESTQPPYSCD-SSSNSPYYPFCNTRLPISKRAQDLVSRLTLDEK 66

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           +  L++ A A+PRLGI  Y+WWSEALHGV++ G G +F G    ATSFPQVI TA+SF+ 
Sbjct: 67  LAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDP 126

Query: 126 TLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
            LW  I + +  EARA+YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP+++ KY  
Sbjct: 127 NLWYQISKTIGKEARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGV 186

Query: 185 SYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
           +YVRGLQG        G+RL+ +A CKHFTAYDLD+W G+DRF ++A+V+ QD+ DT+  
Sbjct: 187 AYVRGLQGDSFEGGKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQP 246

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
           PF+ C+ +G+ + +MC+YN+VNGVP CA+ N+L +T R +W+ +GYI SDC +V + +D 
Sbjct: 247 PFQSCIEQGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDE 306

Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
           Q +  T E+A AD  RAG+D++CG ++  H +SAV +  L    I+ AL N  ++++RLG
Sbjct: 307 QGYAKTAEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLG 366

Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           + DG P+  P+G +GP  VC+    +LALEAAR GIVLLKN    LPL    + T+A+IG
Sbjct: 367 LLDGNPTKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKT-NPTIALIG 425

Query: 419 PNSDVTV-TMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKD-VACADDQLFGAAIDAS 475
           PN++ +    +GNY G  C   T LQG   YA+ T++  GC D   CA  Q+ G A++ +
Sbjct: 426 PNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEG-AVEVA 484

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           ++ D  +LVMGLDQS E E+ DR  L LPG+Q+EL+  V+ ASK P +LVL+ GGP+D+ 
Sbjct: 485 KKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDIT 544

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
            AK D ++  I+WAGYPG+ GG A+A ++FG  NPGGKLP+TWYP+++I  +PMT+M MR
Sbjct: 545 SAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFI-KVPMTDMRMR 603

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRHGSI--NAT 652
              +  YPGRTYRFY GP VY FG+G+SYT + + + + +   + +     H +   + T
Sbjct: 604 ADPASGYPGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSET 663

Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFE 709
           I  K + ++    C  + L + + V N G+  G H +L+F        +  P KQLV F+
Sbjct: 664 IRYKLVSELAEETCQTMLLSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQ 723

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            V + AG   +VG  +  C++LSV + +G+  I  G + + +G  ++ + +
Sbjct: 724 SVKLNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGDQEYPIEI 774


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  701 bits (1808), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/773 (44%), Positives = 500/773 (64%), Gaps = 25/773 (3%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I+FF L L    A S+    R P++CD   + +    FC   LPI +R  DL+ RL+L E
Sbjct: 12  ISFFLLNLHHHHAEST----RPPYSCD-SSSNSPYYSFCNTKLPITKRAQDLVSRLTLDE 66

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  L++ A A+PRLGI  Y+WWSEALHGV++ G G +F G    ATSFPQVI TA+SF+
Sbjct: 67  KLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFD 126

Query: 125 ATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             LW  I + +  EARA+YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP+++ KY 
Sbjct: 127 PNLWYQISKTIGREARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYG 186

Query: 184 ASYVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
            +YVRGLQG         +RL+ +A CKHFTAYDLD W G+DRF F+A+V+ QD+ DT+ 
Sbjct: 187 VAYVRGLQGDSFEGGKLAERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQ 246

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PF+ C+ +G+ + +MC+YN+VNGVP CAD N+L +T R +W+ +GYI SDC +V + ++
Sbjct: 247 PPFQSCIEQGRASGIMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHE 306

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
            Q +  T E+A AD  RAG+D++CG ++  H +SAV +  L    I+ AL N  ++++RL
Sbjct: 307 KQGYAKTAEDAIADVFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRL 366

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+FDG P+  P+G +GP +VC+    +LALEAAR GIVLLKN    LPL    + T+A+I
Sbjct: 367 GLFDGNPTKLPFGTIGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKT-NPTIALI 425

Query: 418 GPNSDVTV-TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD-VACADDQLFGAAIDAS 475
           GPN++ +    +GNY G  C   T LQG   YA+T++  GC D   CA  Q+   A++ +
Sbjct: 426 GPNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQI-EEAVEVA 484

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           ++ D  +LVMGLDQS E E+ DR  L LPG+Q+EL+  V+ A+K P ++VL+ GGP+D+ 
Sbjct: 485 KKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDIT 544

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
            AK D ++  I+WAGYPG+ GG A+A ++FG  NPGGKLP+TWYP+++I  +PMT+M MR
Sbjct: 545 SAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFI-KVPMTDMRMR 603

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGRH--GSINAT 652
              +  YPGRTYRFY GP VY FG+G+SYT + + + + + + + +     H     + T
Sbjct: 604 ADPASGYPGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSET 663

Query: 653 ISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLVA 707
           I  K + ++    C  + L + + V N G+  G H +L+F          +  P KQLV 
Sbjct: 664 IRYKLVSELAEETCQTMLLSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVG 723

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           F+ V V AG   +VG  +  C++LSV + +G+  I  G +   +G  ++ + +
Sbjct: 724 FQSVKVNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQEYPIEV 776


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  698 bits (1802), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/754 (46%), Positives = 483/754 (64%), Gaps = 19/754 (2%)

Query: 22  LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           +A   PF+C P  A  +  L FC V+L   QR  DL+ RL+  EK+  L   A  VPRLG
Sbjct: 25  VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           + GY+WW+EALHG++  G G  F      ATSFPQV+ TA++F+  LW  IG+ +  EAR
Sbjct: 85  VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144

Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
           A++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S    L
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLL 204

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           + +A CKH TAYDL++WNGV R+ F A+V++QD+EDTF+ PFR CV+E K + VMC+Y  
Sbjct: 205 QTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTA 264

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
           +NGVP CA+ ++L  T+RG+W L+GY+ SDCD+V +  D Q +  TPE+A A +++AGLD
Sbjct: 265 INGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLD 324

Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
           +DCG ++  H  +A+Q+G L+E DI+ AL N   V+MRLG FDG+P    YG LG  D+C
Sbjct: 325 IDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADIC 384

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           TP+H+ LALEAA+ GIVLLKN G  LPL      + AVIGPN++  + +I NY G  C  
Sbjct: 385 TPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCES 444

Query: 439 TTPLQGIGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
           TTPL+G+  Y   +    GC   AC   A DQ    A+  +   D   L MGL Q  E+E
Sbjct: 445 TTPLKGLQSYVNDVRFLAGCNSAACDVAATDQ----AVALAGSEDYVFLFMGLSQKQESE 500

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR  LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++P+I AI+WAGYPGQ
Sbjct: 501 GKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQ 560

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
           AGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  
Sbjct: 561 AGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPTSGYPGRSYRFYQGNT 619

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGSINATISGKAIKVTH---AKCNRL 668
           VY FG+G+SY+ F   + +  +V A+    L G   ++      ++  V       C +L
Sbjct: 620 VYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDAIGTEGCEQL 679

Query: 669 TLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
                V+V+N G  DG H++L+F   P      P  QL+ F   H+ AG   ++  +I  
Sbjct: 680 KFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRFDISP 739

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           CK+ S V   G + I +G H + +   +  +   
Sbjct: 740 CKHFSRVRADGRKVIDIGSHFLMVDNHEMEIRFE 773


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  698 bits (1801), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/747 (47%), Positives = 480/747 (64%), Gaps = 17/747 (2%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C P         FC  +LP  QR  DL+ +L+L+EKV  L   A  VPR G+ GY WW
Sbjct: 14  FSCGPPQQAQYA--FCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWW 71

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SE LHGVS  G G  F G   G T+FPQV+ T +SF+ ++W  IG+ +  EARAM+N G 
Sbjct: 72  SEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQ 131

Query: 148 A-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
           A GLT WSPNVNI+RDPRWGRGQETPGEDP  + KYA ++VRGLQG+    L+ +A CKH
Sbjct: 132 ADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTTTLQTSACCKH 191

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
            TAYDLD+WN + R++FNAKV+ QD+E+TF+ PF+ CV+EGK   VMC+Y  VNG+P CA
Sbjct: 192 ATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVNGIPACA 251

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           D  +L +TI+GEW +NGYI SDCD+V + Y T+ ++ TPE+A A AI+AGLD++CG F  
Sbjct: 252 DSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLDMNCGNFSQ 310

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQEL 385
           +H  +A+Q+  +SE D++ AL N   ++MRLG FDG+P   P YG LG +DVC+P H++L
Sbjct: 311 VHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCSPAHKDL 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSH--IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ALEAA+ GIVLLKN   +LPLS       + AVIGPN++    ++GNY G  C  TTPLQ
Sbjct: 371 ALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCETTTPLQ 430

Query: 444 GIGR-YARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
            + + Y++ +    GC   AC     + A+  A+  +D TIL MGL Q  E E LDR  L
Sbjct: 431 ALQKFYSKNVRFVPGCDSAACNVADTYQASGLAA-TSDYTILFMGLSQKQEQEGLDRTSL 489

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+Q+ L++ V+ A+K P ILVL++GGP+D+ FAK +P+I AI+WAGYPGQAGG AIA
Sbjct: 490 LLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQAGGLAIA 549

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
            +LFG  NP G+LP+TWYP+EY T +PM +M MR   +  YPGR+YRFYKG  VY FG+G
Sbjct: 550 KVLFGEHNPSGRLPVTWYPEEY-TKVPMDDMRMRADPATGYPGRSYRFYKGNAVYKFGYG 608

Query: 622 MSYTNFV-HTVANAPTVVAVP----LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           +SY+ F    V N+ +    P    L        A+      ++    C RL     V+V
Sbjct: 609 LSYSKFSRQLVRNSSSNNRAPNTELLAAAAVDCGASRYYLVEEIGGEVCERLKFPAVVEV 668

Query: 677 KNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           +N G  DG  ++L+F   P A    P  QLV F    + AG +  V  +I  C++ S   
Sbjct: 669 ENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQDLRAGEKASVSFDISPCEHFSRTT 728

Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHA 762
             GT+ I  G H + +   +  +S  +
Sbjct: 729 VDGTKVIDRGSHFLMVDEDEMEISFDS 755


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/549 (61%), Positives = 415/549 (75%), Gaps = 6/549 (1%)

Query: 26  EPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           + FACD  K        FC  SL +  RV DL+GRL+LQEK+  L++ A  V RLGI  Y
Sbjct: 28  QTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRY 87

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN 144
           EWWSEALHGVSNVG GT+F    PGATSFP  I TA+SFN +L+E IGRVVS EA AMYN
Sbjct: 88  EWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYN 147

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRLKVA 201
            G AGLTYWSPN+NIFRDPRWGRG ETPGEDPVL+ KYAA YV+GLQ +DG   ++LKVA
Sbjct: 148 VGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVA 207

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH+TAYD+D W G+ R+ FNA ++KQD+EDTF  PF+ CV++G VASVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            PTCADP++LK  +RGEW+LNGY+VSDCDSV V Y  QH+T TPEEAAA +I AGLDL+C
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLDLNC 327

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G FLG +TE AV++GL+ E  INNA+ N     MRLG FDG+P  QPYG+LGPKDVCTP 
Sbjct: 328 GRFLGQYTEGAVKQGLIDE-SINNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTPA 386

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           +QELA EAARQGIV LKN   SLPL+    +++AVIGPN++ T  MIGNY GI C Y +P
Sbjct: 387 NQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISP 446

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           LQG+  +  T +  GC DV C +  L  A    S   DAT++V+G   +IEAE+LDR  +
Sbjct: 447 LQGLTAFVPTSYAAGCLDVRCPNPVLDDAK-KISASGDATVIVVGASLAIEAESLDRVNI 505

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           LLPG+QQ LV++V+ ASKGP ILV+MSGG +DV+FAK++ +I +I+W GYPG+AGG AIA
Sbjct: 506 LLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGAAIA 565

Query: 562 DILFGTSNP 570
           D++FG  NP
Sbjct: 566 DVIFGFHNP 574


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/775 (45%), Positives = 484/775 (62%), Gaps = 20/775 (2%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           MA+ ++   L   LL   +  +A   PF+C P  + ++ LPFC + LP  QR  DL+ R+
Sbjct: 24  MAAHVSVLLLVPALLMRVA--VAGAPPFSCGP-SSPSKGLPFCNMKLPASQRAADLVSRM 80

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +  EK   L   A  VPRLG+  Y+WW+EALHGV+  G G         ATSFPQV+ TA
Sbjct: 81  TPAEKASQLGDIANGVPRLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTA 140

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           +SFN  LW  IG+    EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++
Sbjct: 141 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 200

Query: 180 GKYAASYVRGLQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIE 233
            +Y A++VRGLQGS  +       L+ +A CKH TAYDL++W GV R+ F A V+ QD+ 
Sbjct: 201 SRYGAAFVRGLQGSSSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLA 260

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
           DTF+ PFR CV++GK + VMC+Y  VNGVP+CA+ ++L +T RG W L+GY+ +DCD+V 
Sbjct: 261 DTFNPPFRSCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVA 320

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
           +  ++Q +  T E+  A  ++AGLD+DCGP++  +  +A+Q+G L++ D++ A+ N LT 
Sbjct: 321 IMRNSQFYRPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTT 380

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
           +MRLG FDG+P +  YG+LG   +CT +H+ LALEAA  GIVLLKN    LPL      +
Sbjct: 381 RMRLGHFDGDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNS 440

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAI 472
            AVIG N++  + ++GNY G  C  TTPLQGI  Y + +    GC   AC       A  
Sbjct: 441 AAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATA 500

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
            AS  +DA IL MGL Q  E+E  DR  LLLPG QQ L++ V+ A+K P ILVL++GGP+
Sbjct: 501 LAS-SSDAVILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPV 559

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           D+ FA+ +P+I AI+WAGYPGQAGG AIA +LFG  NP GKLP TWYP+E+ T +PMT+M
Sbjct: 560 DITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEF-TRIPMTDM 618

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----NAPTVVAVPLDGRHGS 648
            MR + S  YPGRTYRFY G  +Y FG+G+SY+ F H V     N     ++   G    
Sbjct: 619 RMRAAGS--YPGRTYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAGLAAM 676

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
               +S     +    C++L     V V+N G  DG HT L+F   P A    P +QL+ 
Sbjct: 677 TEDNLSYHVEHIGDVVCDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIG 736

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           F+  H+ AG +  +   +  C++ S V + G + I  G H + +G  +  +S  A
Sbjct: 737 FQSQHIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVGKHELEISFGA 791


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  690 bits (1781), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/750 (46%), Positives = 477/750 (63%), Gaps = 19/750 (2%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C P  A+ R  PFC  SLP  +R  DL+ RL++ EKV  L   AA VPRLG+  Y+WW
Sbjct: 30  FSCGPSSAS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWW 88

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SE LHG++  G G +F G     TSFPQV+ T +SF+ +LW  IG+ +  EARA+YN G 
Sbjct: 89  SEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARALYNLGQ 148

Query: 148 A-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-----GDRLKVA 201
           A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++VRG+QGS+        L+ +
Sbjct: 149 AEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSNPAGAAAAPLQAS 208

Query: 202 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           A CKH TAYDL++WNGV R++F+A+V+ QD+ DTF+ PF+ CV++GK + VMC+Y  +NG
Sbjct: 209 ACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCAYTVING 268

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
           VP CA  ++L +T RG W L+GY+ SDCD+V +  D Q +  TPE+  A A++AGLDL+C
Sbjct: 269 VPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAVALKAGLDLNC 328

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTP 380
           G +   H  +A+Q+G ++E D++ AL N   V+MRLG FDG+P     YG LG  DVCT 
Sbjct: 329 GTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGAADVCTA 388

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           DH+ LALEAA+ GIVLLKN    LPL      + AVIG N++  + + GNY G AC  TT
Sbjct: 389 DHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGPACETTT 448

Query: 441 PLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
           PL+G+  Y R +    GC   AC       AA  AS  A+   L MGL Q  E E LDR 
Sbjct: 449 PLEGLQSYVRNVRFLAGCSSAACGYAATGQAAALAS-SAEYVFLFMGLSQDQEKEGLDRT 507

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            LLLPG+QQ LV+ V+ A+K P +LVL++GGP+D+ FA+++P+I AI+WAGYPGQAGG A
Sbjct: 508 SLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLA 567

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IA +LFG  NP G+LP+TWY +++ T +PMT+M MR   +  YPGRTYRFY+G  +Y FG
Sbjct: 568 IARVLFGDHNPSGRLPVTWYTEDF-TKVPMTDMRMRADPATGYPGRTYRFYRGKTIYKFG 626

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK------CNRLTLGVQ 673
           +G+SY+ F   +      +A P       ++A     A    H        C +L    +
Sbjct: 627 YGLSYSKFSRQLVTGDKNLA-PNTSLLAHLSAKTQHAATSYYHVDDIGTVGCEQLKFPAE 685

Query: 674 VDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V+V N G  DG H++L+F   P A    P +QL+ F   H+ AG +  V  ++  C++ S
Sbjct: 686 VEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAGEKANVRFHVSPCEHFS 745

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
                G + I  G H + +G  +  +S  A
Sbjct: 746 RTRADGKKVIDRGSHFLMVGKEELEISFEA 775


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  689 bits (1779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/738 (46%), Positives = 483/738 (65%), Gaps = 14/738 (1%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           +T   PFC  SL IP R   L+  L+L EK+  L + A  VPRLGI  Y+WWSE+LHG++
Sbjct: 33  STSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA 92

Query: 96  NVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
           + GPG  F  G    AT FPQVI + ++FN +LW A+   V+ EA  M+N G AGLTYW+
Sbjct: 93  DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWA 152

Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDL 212
           PN+NIFRDPRWGRGQET GEDP ++  Y+  YV+G QG +G+  R++++A CKH+TAYD+
Sbjct: 153 PNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIRLSACCKHYTAYDM 212

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
           + W G  R+ FNAKV+ QD+EDT+  PF+ C+ E + + +MC+YNQVNGVP CA  ++L+
Sbjct: 213 EKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAHKDLLQ 272

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
           +T R EW   GYI SDCD+V + ++ Q +T + E++ A  ++AG+D++CG FL  HT+SA
Sbjct: 273 KT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDINCGSFLVRHTKSA 331

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           +++G + E DI+ AL N  +VQ+RLG+FD   ++Q +  LGP  VCT +H+ELA EA RQ
Sbjct: 332 IEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELAAEAVRQ 391

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RT 451
           G VLLKN    LPL     R VA+IGP+++    M G+Y G+ C  TT L+GI  YA +T
Sbjct: 392 GAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQAYATQT 451

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
               GCKD +C    LFG A++A+++AD  +++ GL+ + E E  DR  LLLPG+Q  L+
Sbjct: 452 SFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPGKQMGLI 511

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
             ++  +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG  + +ILFG  NPG
Sbjct: 512 HAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPG 571

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           GKLP+TWYP+ + T +PMT+M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + +++
Sbjct: 572 GKLPITWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSI 630

Query: 632 ANAPTVVAVPLDGRHGSINATIS-------GKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
           ++AP  + V      G I+   +       G       A C  L   V V V N GS DG
Sbjct: 631 SSAPKKITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCEALVFSVHVAVSNHGSMDG 690

Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           +H +L+F+   +     P KQLV FE VH  AG+   V I +  CK +S  +  G R + 
Sbjct: 691 SHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQMSAANPEGKRVLL 750

Query: 744 LGEHNIHIGGTKHSVSLH 761
           LG H + +G  +  +S+ 
Sbjct: 751 LGAHVLTVGDEEFELSIE 768


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  689 bits (1777), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/750 (47%), Positives = 487/750 (64%), Gaps = 18/750 (2%)

Query: 22  LAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGI 81
           LA   PF+C      + +  FC  +LP+ QR  DL+ RL+  EKV  L   AA VPRLG+
Sbjct: 21  LAGDPPFSCG---QASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGV 77

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
            GY+WW+EALHG++  G G  F G    ATSFPQV  TA++F+  LW  IG+ +  EARA
Sbjct: 78  PGYKWWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARA 137

Query: 142 MYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP  + +YA ++VRG+QG+    L+ 
Sbjct: 138 LYNLGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTSLLQA 197

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           +A CKH TAYDL++WNGV R++F+AKV+ QD+EDTF+ PFR CV++GK + VMC+Y  +N
Sbjct: 198 SACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGIN 257

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVP CA+ ++L +T+RG+W L+GY  SDCD+V +  D Q +  +PE+A A A++AGLD+D
Sbjct: 258 GVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLDID 317

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CG ++  H  +A+Q+G ++E DI+ AL N   ++MRLG FDG+P +  YG LG  D+CT 
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           +H+ LAL+AA+ GIVLLKN    LPL      + AVIGPN++    +I NY G  C  TT
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437

Query: 441 PLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRA 499
           PL+GI  Y +      GC   AC       AA  AS  +D   L MGL Q  E+E  DR 
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAALAS-TSDYVFLFMGLGQRQESEGRDRT 496

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            LLLPG+QQ L++ V+ A++ P ILVL+SGGP+DV FA+ +P+I AI+WAGYPGQAGG A
Sbjct: 497 SLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 556

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IA +LFG  NP G+LP+TWYP+E+ TN+PMT+M MR   +  YPGR+YRFY+G  VY FG
Sbjct: 557 IARVLFGDHNPSGRLPVTWYPEEF-TNVPMTDMRMRADPANGYPGRSYRFYQGKTVYKFG 615

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI-SGKAI-------KVTHAKCNRLTLG 671
           +G+SY+++   + ++ T    P      S+  T+ S + I       ++    C  L   
Sbjct: 616 YGLSYSSYSRRLLSSGTSTPAPNADLLASLTTTMPSAENILGSYHVEQIGAQGCEMLKFP 675

Query: 672 VQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
             V+V+N G  DG  ++L++   P A    P +QL+ F+K H+ AG +  +   I  C++
Sbjct: 676 AVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHLKAGEKAHIKFEIRPCEH 735

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           LS V   G + I  G H + +   KH + +
Sbjct: 736 LSRVREDGNKVIDRGSHFLRV--DKHELEI 763


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/733 (48%), Positives = 467/733 (63%), Gaps = 19/733 (2%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           FC   L   QR  DL+  L+L EKV  L   AA V RLG+  YEWWSE LHG+S  G G 
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
           +F G     TSFPQVI TA++F+A LW  +G  V  EARA+YN G A GLT WSPNVNIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RDPRWGRGQETPGEDPV + +YA ++V GLQG  G+    +A CKH TAYDLD WN V R
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGGE---ASACCKHATAYDLDYWNNVVR 207

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
           +++++KV+ QD+EDT++ PF+ CV EGK   +MC YN +NGVP CA  ++L + +R EW 
Sbjct: 208 YNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWG 267

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
           +NGY+ SDCD+V    D  H+T +PE+  A +I+ G+D++CG +  +H  +AVQ+G L+E
Sbjct: 268 MNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTE 327

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
            DI+ ALVN   V+MRLG FDG+P S   YGHLG  DVC+P H+ LALEAA+ GIVLLKN
Sbjct: 328 KDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKN 387

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGC 457
              +LPL      ++AVIGPN+D    + GNY G  C  TTPLQGI  Y   R     GC
Sbjct: 388 DAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC 447

Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
              ACA      AA  AS  +D  +L MGL Q  E + LDR  LLLPG QQ L++ V+ A
Sbjct: 448 DSPACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANA 506

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           ++ P ILVL++GGP+DV FAK++P+I AI+WAGYPGQAGG AIA +LFG  NP G+LP+T
Sbjct: 507 ARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVT 566

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV------ 631
           WYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG+G+SY+ F   +      
Sbjct: 567 WYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFST 625

Query: 632 --ANAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
             A   +++A  +  R G     +S   +K +   +C+RL     V+V+N G  DG H++
Sbjct: 626 SNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSV 685

Query: 689 LVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
           L++   P      P +QL+ F   HV  G +  V   +  C++ S V   G R I  G H
Sbjct: 686 LMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAH 745

Query: 748 NIHIGGTKHSVSL 760
            + +G  +   S 
Sbjct: 746 FLMVGDEELETSF 758


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  684 bits (1764), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/745 (45%), Positives = 466/745 (62%), Gaps = 57/745 (7%)

Query: 22  LAAREPFACDPKDAT-----TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAV 76
           LA    + CD           ++  FC  SL   +R  DL+ R++LQEKV   +  A+ V
Sbjct: 41  LAKNYTYVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGV 100

Query: 77  PRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVS 136
            RLG+  Y WWSEALHG+SN+GPG  F    PGATS P VI + ++FN TLW+ +GRVVS
Sbjct: 101 RRLGLPEYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVS 160

Query: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
            E RAMYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ  +G 
Sbjct: 161 TEGRAMYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGT 220

Query: 197 RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
              V++ CKH+ AYD+D+W  VDR  F+A+VS+QD+++TF  PF  CV EG V+SVMCS+
Sbjct: 221 E-NVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCSF 279

Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRA 315
           N++NG+P C+DP +LK  IR EW L+GYIVSDC  + V  D Q++ +  + +A A  ++A
Sbjct: 280 NKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQA 339

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           GLDL+CG +       +V  G +S+ +++ AL N   + MR+G FDG P+   Y  LG K
Sbjct: 340 GLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLK 396

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
           D+C  DH ELA EAARQGIVLLKN    LPL     + +A++GP+++ T  MIGNYAG+ 
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLK--PGKKIALVGPHANATEVMIGNYAGLP 454

Query: 436 CGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
           C Y +PL+                        F A  + +     TI+ +G D SIEAE 
Sbjct: 455 CKYVSPLEA-----------------------FSAIGNVTYATGFTIIFVGTDLSIEAEF 491

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
           +DR   LLPG Q EL+ +V+  S GP ILV++SG  ID+ FAKN+PRI+AI+W G+PG+ 
Sbjct: 492 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 551

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG AIAD++FG  NPGG+LP+TWY  +Y+  LPM+ M++RP     YPGRTY+F+ G  V
Sbjct: 552 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 611

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+GMSYT F +++A +   + + L+                    KC       +V 
Sbjct: 612 YPFGYGMSYTKFSYSLATSKISIDIDLN-----------------KFQKCRTF----EVA 650

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           V NVG  DG+  L+V+S PP+G    H KQ++ F+KV V AG  +RV  +++ CK L +V
Sbjct: 651 VTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNACKSLRIV 710

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVS 759
           D +G   +P G H I +G   +S S
Sbjct: 711 DSTGYSLLPSGSHTIRVGDYSNSAS 735


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  682 bits (1759), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/766 (46%), Positives = 475/766 (62%), Gaps = 21/766 (2%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           L+L+   +   A   PF+C      +  LPFC   LP  QR  DL+ R++  EK   L  
Sbjct: 70  LLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLGD 127

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEA 130
            A  VPRLG+  Y+WW+EALHGV+  G G     G    ATSFPQV+ TA+SFN  LW  
Sbjct: 128 VANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFR 187

Query: 131 IGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
           IG+    EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VRG
Sbjct: 188 IGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRG 247

Query: 190 LQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           LQGS  +       L  +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR C
Sbjct: 248 LQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSC 307

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V++GK + VMC+Y  VNGVP+CA+ ++L +T RG W L+GY+ +DCD+V +  ++Q +  
Sbjct: 308 VVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRP 367

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           T E+  A  ++AGLD+DCGP++  H  +A+Q+G L++ D++ A+ N  T +MRLG FDG+
Sbjct: 368 TAEDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGD 427

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P +  YG+LG   +CT +H+ LALEAA  GIVLLKN    LPL      + AVIG N++ 
Sbjct: 428 PKAHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNAND 487

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATI 482
            + ++GNY G  C  TTPLQGI  Y + +    GC   AC       AA  AS  +D+ I
Sbjct: 488 VLALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVI 546

Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
           L MGL Q  E+E  DR  LLLPG QQ L++ V+ A+K P ILVL++GGP+D+ FA+ +P+
Sbjct: 547 LFMGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPK 606

Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
           I AI+WAGYPGQAGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR + S  Y
Sbjct: 607 IGAILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEF-TKVPMTDMRMRSAGS--Y 663

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP-----TVVAVPLDGRHGSINATISGKA 657
           PGR+YRFYKG  +Y FG+G+SY+ F H V  A          +   G   +    +S   
Sbjct: 664 PGRSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHV 723

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAG 716
             +    C +L     V V+N G  DG HT L+F   P A    P +QLV F+  H+ AG
Sbjct: 724 DHIGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAG 783

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
            +  +   +  C+  S V   G + I  G H + +G  +  +S  A
Sbjct: 784 EKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGKHELEISFGA 829


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/746 (46%), Positives = 480/746 (64%), Gaps = 23/746 (3%)

Query: 22  LAAREPFACDPKD-ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           LAA  PF+C P   A T+   FC  +LP+ QR  DL+ RL+  EKV  L   AA VPRLG
Sbjct: 32  LAADPPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLG 91

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           +  Y+WW+EALHG++  G G  F G    ATSFPQV  TA++F+  LW  IG+ +  EAR
Sbjct: 92  VPAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREAR 151

Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
           A+YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP  + +Y  ++V+GLQG+      
Sbjct: 152 ALYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNSTSSSL 211

Query: 200 VA--ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           +   A CKH TAYDL++W GV R++F+A+V+ QD+EDT++ PFR CV++GK + VMC+Y 
Sbjct: 212 LQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYT 271

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
            +NGVP CA+  +L  T+R +W L+GY+ SDCD+V +  D Q +  TPE+A A A++AGL
Sbjct: 272 AINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALKAGL 331

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           D+DCG ++  H  +A+Q+G ++E D++ AL N   ++MRLG FDG+P +  YG L    +
Sbjct: 332 DIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHI 391

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           CTP+H+ LALEAA+ GIVLLKN    LPL      + AVIGPN++    +IGNY G  C 
Sbjct: 392 CTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCE 451

Query: 438 YTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
             TPL+G+  Y + +    GC   AC       AA  A   +D  +L MGL Q  E+E  
Sbjct: 452 SVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAG-SSDYVLLFMGLSQQQESEGR 510

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  LLLPG+QQ L++ V+ A+K P ILVL++GGP+DV FAKN+P+I AI+WAGYPGQAG
Sbjct: 511 DRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 570

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G AIA +LFG  NPGG+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY
Sbjct: 571 GLAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGETVY 629

Query: 617 PFGHGMSYTNFVHTVANAPT--------VVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
            FG+G+SY+++   + ++ T        +  +P     G +    S     +    C +L
Sbjct: 630 KFGYGLSYSSYSRRLLSSGTPNTDLLAGLSTMPTPAEEGGV---ASYHVEHIGARGCEQL 686

Query: 669 TLGVQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
                V+V+N G  DG H++L++   +   AG   P KQL+ F + H+ AG +  +  +I
Sbjct: 687 KFPAVVEVENHGPMDGKHSVLMYLRWANATAGR--PAKQLIGFRRQHLKAGEKASLTFDI 744

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHI 751
             C++ S V + G + +  G H + +
Sbjct: 745 SPCEHFSRVRKDGNKVVDRGSHFLMV 770


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/693 (50%), Positives = 456/693 (65%), Gaps = 26/693 (3%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           MA     F + L L     + +A   PF+C     +  + PFC   LPI QR  DL  RL
Sbjct: 1   MAGRSIIFHVVLPLCLVLQATMATDPPFSC----GSPSSYPFCDRKLPIGQRAADLASRL 56

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV---GPGTKFG-GDFPGATSFPQV 116
           +++EKV LL   +  VPRLG+  Y+WWSEALHGV+N      G +F  G    ATSFPQV
Sbjct: 57  TVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSFPQV 116

Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGED 175
           + TA+SFN  LW  IG+V+  EAR +YN G A GLT+W+PN+N+FRDPRWGRGQETPGED
Sbjct: 117 LVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGED 176

Query: 176 PVLSGKYAASYVRGLQGS------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           P ++GKYAA +VRG+QG       +   L+ +A CKHFTAYDL+NWNGV RF FNAKVS+
Sbjct: 177 PTMTGKYAAVFVRGVQGYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAKVSE 236

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           QD+ DT++ PFR CV +G  + +MCSYN+VNGVPTCAD N+L +T RG+WR NGYI SDC
Sbjct: 237 QDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYITSDC 296

Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
           D+V + +D Q +   PE+A AD ++AG+D++CG ++  H  SA  +G ++E DI+ AL N
Sbjct: 297 DAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDRALQN 356

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
              ++MRLG+FDG P    YG++G   VC  +HQ+LALEAA+ GIVLLKN   +LPL   
Sbjct: 357 LFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQ 416

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLF 468
           +  ++AVIG N++    + GNY G  C   +PLQ +  Y R T    GC    C    + 
Sbjct: 417 KISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNVSDIA 476

Query: 469 GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           GAA  A+ +A+  +L MGLDQ  E E LDR  L LPG Q+ LV+ V+ A+K P +LVL+ 
Sbjct: 477 GAAK-AASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVLVLLC 535

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           GGP+DV FAK +P+I AIIWAGYPGQAGG AIA +LFG  NPGG+LP+TWYP+EY T + 
Sbjct: 536 GGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEYATAVA 595

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
           MT+M MR   S  YPGRTYRFYKG  VY FG+G+SY+ + H+  + P     P     G 
Sbjct: 596 MTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHSFVSKP-----PASMMIGE 650

Query: 649 INAT---ISGK-AIKVTHAKCNRLTLGVQVDVK 677
           + A+   +SGK A+  ++    R  +    DVK
Sbjct: 651 VRASLGQLSGKSALYCSNGSVARYLVARNWDVK 683


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/767 (46%), Positives = 475/767 (61%), Gaps = 22/767 (2%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           L+L+   +   A   PF+C      +  LPFC   LP  QR  DL+ R++  EK   L  
Sbjct: 70  LLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLGD 127

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWEA 130
            A  VPRLG+  Y+WW+EALHGV+  G G     G    ATSFPQV+ TA+SFN  LW  
Sbjct: 128 VANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFR 187

Query: 131 IGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
           IG+    EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VRG
Sbjct: 188 IGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRG 247

Query: 190 LQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
           LQGS  +       L  +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR C
Sbjct: 248 LQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSC 307

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG-YIVSDCDSVGVYYDTQHFT 302
           V++GK + VMC+Y  VNGVP+CA+ ++L +T RG W L+G Y+ +DCD+V +  ++Q + 
Sbjct: 308 VVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYR 367

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            T E+  A  ++AGLD+DCGP++  H  +A+Q+G L++ D++ A+ N  T +MRLG FDG
Sbjct: 368 PTAEDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDG 427

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
           +P +  YG+LG   +CT +H+ LALEAA  GIVLLKN    LPL      + AVIG N++
Sbjct: 428 DPKAHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNAN 487

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADAT 481
             + ++GNY G  C  TTPLQGI  Y + +    GC   AC       AA  AS  +D+ 
Sbjct: 488 DVLALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSV 546

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           IL MGL Q  E+E  DR  LLLPG QQ L++ V+ A+K P ILVL++GGP+D+ FA+ +P
Sbjct: 547 ILFMGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANP 606

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I AI+WAGYPGQAGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR + S  
Sbjct: 607 KIGAILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEF-TKVPMTDMRMRSAGS-- 663

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP-----TVVAVPLDGRHGSINATISGK 656
           YPGR+YRFYKG  +Y FG+G+SY+ F H V  A          +   G   +    +S  
Sbjct: 664 YPGRSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYH 723

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPA 715
              +    C +L     V V+N G  DG HT L+F   P A    P +QLV F+  H+ A
Sbjct: 724 VDHIGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKA 783

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           G +  +   +  C+  S V   G + I  G H + +G  +  +S  A
Sbjct: 784 GEKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGKHELEISFGA 830


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  676 bits (1744), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/755 (46%), Positives = 471/755 (62%), Gaps = 41/755 (5%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG- 100
           +C  SLP  +RV DL+GRLSL+EKV+ L   A   PR+G+  Y+WW EALHGVS+VGPG 
Sbjct: 68  YCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSDVGPGG 127

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           T FG   PGATSFP VI +A++FN +LW AIG VVS E RAMYN G A LTYWSPN+N+ 
Sbjct: 128 TWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHAELTYWSPNINVV 187

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------SDGDRLKVAASCKHFT 208
           RDPRWGR  ETPGEDP + G+YA ++VRG+Q                  +KV++ CKHF 
Sbjct: 188 RDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAATADPFSRPIKVSSCCKHFA 247

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AYD+D W   DR  F+A+V ++D+ +TF+ PF MC+ +G  + VMCSYN++NG+P CAD 
Sbjct: 248 AYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIPACADA 307

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCG----- 322
            +L  T+R +W+L+GYIVSDCDSV V   D +    T  EA A A++AGLDLDCG     
Sbjct: 308 RLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCGMFWEG 367

Query: 323 --PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
              F   +   AV++G + E D++NAL N  T  MRLG FDG P    +  LG  DVCT 
Sbjct: 368 ARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDGMPE---FESLGADDVCTR 424

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT--MIGNYAGIACGY 438
           DH+ELA +AARQG+VLLKN    LPL   +  +V+++G    +  T  M+G+Y G  C  
Sbjct: 425 DHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINATDVMLGDYRGKPCRI 484

Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
            TP   I +     +   C   AC+  +  G A   ++ ADATI++ GL+ S+E E+ DR
Sbjct: 485 VTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADATIVIAGLNMSVERESNDR 544

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             LLLP  Q   ++ V+ AS  P +LV+MS G +DV+FA+N+ +I AI+WAGYPG+ GGT
Sbjct: 545 EDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKIGAIVWAGYPGEEGGT 604

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP-VVYP 617
           AIAD+LFG  NPGG+LP+TW+  EY+  +PMT MA+RP  +  YPGRTY+FY GP V+YP
Sbjct: 605 AIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAHGYPGRTYKFYGGPAVLYP 664

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDG-RHGSINATISGK---------AIKVTHAKCNR 667
           FGHG+SYT+F +        V +P+    H  +    SGK         A+ V   +C+ 
Sbjct: 665 FGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAPSPSPACPALNVASHRCDE 724

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           + +   + V N G   G H + V++ PP   G  AP KQLV F +V VPAGA   V   +
Sbjct: 725 V-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGD-APRKQLVEFRRVFVPAGAAVDVPFAL 782

Query: 726 HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           +VCK  ++V+ +    +P G   + +G    ++S 
Sbjct: 783 NVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSF 817


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/461 (71%), Positives = 387/461 (83%), Gaps = 2/461 (0%)

Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
           ADAI+AGLDLDCGPFL +HTE+AV+RGL+S+++IN AL NT+TVQMRLGMFDGEPS+  Y
Sbjct: 1   ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60

Query: 370 GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
           G+LGP+DVCTP HQ+LALEAARQGIVLL+N+G SLPLS  RHRTVAVIGPNSDVTVTMIG
Sbjct: 61  GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120

Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
           NYAG+ACGYTTPLQGIGRY RTIHQ GC DV C  +QLFGAA  A+RQADAT+LVMGLDQ
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180

Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
           SIEAE +DRAGLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDPRI+AIIW 
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           GYPGQAGGTAIA++LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR   ++ YPGRTYRF
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 300

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
           Y GPVV+PFG G+SYT F H +A+ PT+V+VPL     + N+T+  K ++V+H  CN L+
Sbjct: 301 YIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHPDCNALS 360

Query: 670 -LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
            L V VDVKN GS DG HTLLVF++PP G WA  KQL+ F K+H+  G+++RV I +HVC
Sbjct: 361 PLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIATGSEKRVRIAVHVC 420

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           K+LSVVDR G RRIPLGEH + IG   H VSL    LG IK
Sbjct: 421 KHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQ-TNLGEIK 460


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  673 bits (1737), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/765 (46%), Positives = 479/765 (62%), Gaps = 29/765 (3%)

Query: 12  LILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
           ++LL+A   SS +A+   F+C P  + +R  PFC  SLP  +R  DL+ RL++ EKV  L
Sbjct: 1   MLLLAAVFFSSAVASDPLFSCGPS-SPSRAYPFCDRSLPAARRAADLVSRLTVAEKVSQL 59

Query: 70  ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
              AA VPRLG+  Y+WWSE LHG++  G G +F G   G TSFPQV+ T +SF+  LW 
Sbjct: 60  GDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLWF 119

Query: 130 AIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
            IG+ +  EARA+YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ KYA ++VR
Sbjct: 120 RIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVR 179

Query: 189 GLQGSDGDR----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           G+QGS        L+ +A CKH TAYDL++WNGV R++F+A+V+ QD+ DTF+ PF+ CV
Sbjct: 180 GIQGSSAAGAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCV 239

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           ++GK   VMC+Y  +NGVP CA  ++L +T RG W  +GY+ SDCD+V + +D Q +  T
Sbjct: 240 VDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPT 299

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
           PE+  A A++             H  +A+Q+G ++E D++ AL N   V+MRLG FDG+P
Sbjct: 300 PEDTVAVALKE------------HGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDP 347

Query: 365 SSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
                YGHLG  DVCT DH+ LALEAA+ GIVLLKN    LPL      + AVIG N++ 
Sbjct: 348 RGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNAND 407

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDASRQADATI 482
            + + GNY G AC  TTPLQG+  Y   +    GC   AC       AA  AS  ++   
Sbjct: 408 ALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCSSAACGYAATGQAAALAS-SSEYVF 466

Query: 483 LVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
           L MGL Q  E E LDR  LLLPG+QQ L++ V+ A+K P ILVL++GGP+D+ FA+++P+
Sbjct: 467 LFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPK 526

Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
           I AI+WAGYPGQAGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  Y
Sbjct: 527 IGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPANGY 585

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFV-HTVANAPTVVAVPLDGRHGSI---NATISGKAI 658
           PGR+YRFY+G  +Y FG+G+SY+ F    V      +A  L G   +    +AT      
Sbjct: 586 PGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQLASLLAGLSATTKDDDATSYYHVD 645

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGA 717
            +    C +L    +V+V+N G  DG H++L+F   P A    P  QL+ F   H+ AG 
Sbjct: 646 DIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPVSQLIGFTSQHIKAGE 705

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           +  V  ++  C++ S     G + I  G H + +G  +  VS  A
Sbjct: 706 KANVRFDVRPCEHFSRARADGKKVIDRGSHFLMVGKEEVEVSFEA 750


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/761 (46%), Positives = 467/761 (61%), Gaps = 47/761 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           FC   L   QR  DL+  L+L EKV  L   AA V RLG+  YEWWSE LHG+S  G G 
Sbjct: 32  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
           +F G     TSFPQVI TA++F+A LW  +G  V  EARA+YN G A GLT WSPNVNIF
Sbjct: 92  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RDPRWGRGQETPGEDPV + +YA ++V GLQG  G+    +A CKH TAYDLD WN V R
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGGE---ASACCKHATAYDLDYWNNVVR 208

Query: 221 FHFNAK----------------------------VSKQDIEDTFDVPFRMCVMEGKVASV 252
           +++++K                            V+ QD+EDT++ PF+ CV EGK   +
Sbjct: 209 YNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYNPPFKSCVAEGKATCI 268

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MC YN +NGVP CA  ++L + +R EW +NGY+ SDCD+V    D  H+T +PE+  A +
Sbjct: 269 MCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVS 328

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP-YGH 371
           I+ G+D++CG +  +H  +AVQ+G L+E DI+ ALVN   V+MRLG FDG+P S   YGH
Sbjct: 329 IKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGH 388

Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           LG  DVC+P H+ LALEAA+ GIVLLKN   +LPL      ++AVIGPN+D    + GNY
Sbjct: 389 LGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNY 448

Query: 432 AGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
            G  C  TTPLQGI  Y   R     GC   ACA      AA  AS  +D  +L MGL Q
Sbjct: 449 FGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAAALAS-SSDHVVLFMGLSQ 507

Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
             E + LDR  LLLPG QQ L++ V+ A++ P ILVL++GGP+DV FAK++P+I AI+WA
Sbjct: 508 KQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWA 567

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           GYPGQAGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR   +  YPGR+YRF
Sbjct: 568 GYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPATGYPGRSYRF 626

Query: 610 YKGPVVYPFGHGMSYTNFVHTV--------ANAPTVVAVPLDGRHGSINATISGKAIK-V 660
           Y+G  VY FG+G+SY+ F   +        A   +++A  +  R G     +S   +K +
Sbjct: 627 YQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEI 686

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQ 719
              +C+RL     V+V+N G  DG H++L++   P      P +QL+ F   HV  G + 
Sbjct: 687 GVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKA 746

Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            V   +  C++ S V   G R I  G H + +G  +   S 
Sbjct: 747 MVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGDEELETSF 787


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/763 (46%), Positives = 471/763 (61%), Gaps = 46/763 (6%)

Query: 30  CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           CDP       L      +C  SLP   RV DL+GRL+L+EKV+ L   A   PR+G+  Y
Sbjct: 63  CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122

Query: 85  EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           +WW EALHGVS+VGPG T FG   PGATSFP VI +A++FN +LW AIG VVS E RAMY
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------ 197
           N G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q  D DR      
Sbjct: 183 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVD-DRPYAAAA 241

Query: 198 ------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
                 +KV++ CKHF AYD+D W   DR  F+A+V ++D+ +TF+ PF MC+ +G  + 
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301

Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAA 310
           VMCSYN++NG+P CAD  +L  T+R +W+L+GYIVSDCDSV V   D +    T  EA A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361

Query: 311 DAIRAGLDLDCG-------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
            A++AGLDLDCG        F   +   AV++G + E D++NAL N  T  MRLG FDG 
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P    +  LG  +VCT  H+ELA +AARQG+VLLKN    LPL   +  +V+++G    +
Sbjct: 422 PE---FESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHI 478

Query: 424 TVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
             T  M+G+Y G  C   TP   I       +   C   AC   +  G A   ++ ADAT
Sbjct: 479 NATDVMLGDYRGKPCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADAT 538

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           I++ GL+ S+E E+ DR  LLLP  Q   ++ V+MAS  P +LV+MS G +DV+FA N+ 
Sbjct: 539 IVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNT 598

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I AI+WAGYPG+ GGTAIAD+LFG  NPGG+LP+TW+  EY+  +PMT MA+RP  +  
Sbjct: 599 KIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALG 658

Query: 602 YPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAV---------PLDGRHGSINA 651
           YPGRTY+FY GP V+YPFGHG+SYTNF +        V +          L  + G+ + 
Sbjct: 659 YPGRTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSP 718

Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAFE 709
           + +  A+ V    C+ + +   + V N G   G H + V++ PP   G  AP KQLVAF 
Sbjct: 719 SPACPALNVASHMCSEV-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGD-APLKQLVAFR 776

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           +V VPAGA   V   ++VCK  ++V+ +    +P G   + +G
Sbjct: 777 RVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVG 819


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/771 (46%), Positives = 469/771 (60%), Gaps = 54/771 (7%)

Query: 30  CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           CDP       L      +C  SLP   RV DL+GRL+L+EKV+ L   A    R+G+  Y
Sbjct: 46  CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105

Query: 85  EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
            WW EALHGVS+ GPG T+FG   PGATSFP VI +A++FN TLW AIG  VS E RAMY
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-------- 195
           N G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA S+VR +Q  DG        
Sbjct: 166 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGAD 225

Query: 196 ---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
                +KV++ CKH+ AYD+D W   DR  F+A+V ++D+ +TF+ PF MCV +G  + V
Sbjct: 226 PFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASCV 285

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAAD 311
           MCSYN++NGVP CA+  +L  T+RGEW+L+GYIVSDCDSV V   D +       EA A 
Sbjct: 286 MCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATAA 345

Query: 312 AIRAGLDLDCGPF----------LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           A++AGLDLDCG F           GL    AV++G L E +++NAL N     MRLG FD
Sbjct: 346 AMKAGLDLDCGMFWEGAQDFFTAFGL---DAVRQGKLRESEVDNALRNLYLTLMRLGFFD 402

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G P  +    LG  DVCT +H+ELA +AARQG+VL+KN    LPL   +  +++++G   
Sbjct: 403 GIPELE---SLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQ 459

Query: 422 DVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
            +  T  M+G+Y G  C   TP   I +       Q C   AC+            +  D
Sbjct: 460 HINATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQVCDHGACS-------TAANGKTVD 512

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           ATI++ GL+ S+E E  DR  LLLP  Q   ++ V+ AS  P ILV++S G +DV+FA+N
Sbjct: 513 ATIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQN 572

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           +P+I AI+WAGYPG+ GGTAIAD+LFG  NPGG+LP+TWY  EYI+ +PMT MA+RP   
Sbjct: 573 NPKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVAD 632

Query: 600 KRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG--------RHGSIN 650
           K YPGRTY+FY GP V+YPFGHG+SY+NF +        V V +          R     
Sbjct: 633 KGYPGRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTT 692

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFE 709
           A ++  A+ V    C    +   + V N GS+DGAH ++V++ PPA    AP KQLVAF 
Sbjct: 693 APLACPAVNVAGHGCKE-EVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFR 751

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           +V VPAGA  +V   ++VCK  ++V+ +    +P G   + +G    S S 
Sbjct: 752 RVFVPAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSFSF 802


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/772 (46%), Positives = 475/772 (61%), Gaps = 51/772 (6%)

Query: 30  CDPKDATTRTL-----PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           CDP       L     P+C  SLP   RV DL+GR++L+EKV  L   A   PR+G+  Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 85  EWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
            WW EALHGVS+VGPG T FG   PGATSFP VI +A+SFN TLW AIG VVS E RAMY
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------ 197
           N G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q  DG        
Sbjct: 167 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASAA 226

Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
                    +KV++ CKH+ AYD+D WNG DR  F+A+V ++D+ +TF+ PF MC+ +G 
Sbjct: 227 AATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDGD 286

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEE 307
            + VMCSYN++NGVP CAD  +L  T+R +W+L+GYIVSDCDSV V   D +    T  E
Sbjct: 287 ASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVE 346

Query: 308 AAADAIRAGLDLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           A A A++AGLDLDCG F  G+H         AV++G L E  ++NAL N     MRLG F
Sbjct: 347 ATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFF 406

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG P  +    LG  DVCT +H+ELA +AARQG+VLLKN    LPLS  +  +VA+ G  
Sbjct: 407 DGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQL 463

Query: 421 SDVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
             +  T  M+G+Y G  C   TP  G+ +   +     C   +C        A  A++  
Sbjct: 464 QHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC------DTAAAAAKTV 517

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           DATI+V GL+ S+E E+ DR  LLLP  Q   ++ V+ AS  P +LV+MS G +DV+FA+
Sbjct: 518 DATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQ 577

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
           ++P+I A++WAGYPG+ GGTAIAD+LFG  NPGG+LP+TWY  EY++ +PMT MA+RP  
Sbjct: 578 DNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDA 637

Query: 599 SKRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG----RHGSINATI 653
              YPGRTY+FY G  V+YPFGHG+SYTNF +  A A   V V +      +  +  A +
Sbjct: 638 EHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGV 697

Query: 654 SG----KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAF 708
           S      A+ V    C    +   V V N G +DG H + +++ PPA    AP KQLVAF
Sbjct: 698 SSPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAF 756

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            +V V AGA   V   ++VCK  ++V+ +    +P G   + +G    S+S 
Sbjct: 757 RRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSF 808


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  662 bits (1708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 302/374 (80%), Positives = 341/374 (91%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLG++GYEWWSEALHGVSNVGPGTKF G FP ATSFPQVITTA+SFNA+LW+AIG+ VSD
Sbjct: 1   RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
           EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP LS +YAASYV GLQG+ G+R
Sbjct: 61  EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGNR 120

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           LKVAA CKH+TAYDLDNWNG+DRFHFNAKVSKQD+EDT++VPF+ CV+EGKVASVMCSYN
Sbjct: 121 LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVASVMCSYN 180

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           QVNG PTCADP+IL+ TIRG+W LNGYIVSDCDSVGV YD QH+T TPEEAAAD I AGL
Sbjct: 181 QVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTINAGL 240

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           DLDCGPFL +HTE A+++GL++E  +N AL NT+TVQMRLGMFDGEPS+QP+G+LGP+DV
Sbjct: 241 DLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFGNLGPRDV 300

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           CTP HQ+LAL+AAR+GIVLLKNQ  SLPLS +RHR +AVIGPN+  T TMIGNYAGIACG
Sbjct: 301 CTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGNYAGIACG 360

Query: 438 YTTPLQGIGRYART 451
           YT+PLQGI RYART
Sbjct: 361 YTSPLQGISRYART 374


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  659 bits (1701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/643 (52%), Positives = 436/643 (67%), Gaps = 26/643 (4%)

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           TL E +G +V+ +  A+   G     +WS               ETPGEDP+L+ KYA  
Sbjct: 70  TLAEKVGFLVNKQP-ALGRLGIPAYEWWS---------------ETPGEDPLLASKYAVG 113

Query: 186 YVRGLQ-----GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           YV GLQ     G     LKVAA CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF
Sbjct: 114 YVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPF 173

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV++G VASVMCSYN+VNG PTCAD ++L+  IRG+W+LNGYIVSDCDSV V Y  QH
Sbjct: 174 KSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQH 233

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T TPEEAAA  I++GLDL+CG FL  HT +AVQ G LSE D++ A+ N   + MRLG F
Sbjct: 234 YTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFF 293

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           DG+P    +G LGPKDVCT  ++ELA E ARQGIVLLKN G +LPLS    +++AVIGPN
Sbjct: 294 DGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPN 352

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQAD 479
           ++ + TMIGNY G  C YTTPLQG+G    T++Q GC +V C+ + L    A+ A+  AD
Sbjct: 353 ANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASAD 412

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
            T+LV+G DQSIE E+LDR  LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK 
Sbjct: 413 VTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKA 472

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
             +IAAI+W GYPG+AGG A+ADILFG+ NP G+LP+TWYP  Y   + MT+M MRP  S
Sbjct: 473 SDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTS 532

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAI 658
             YPGRTYRFY G  V+ FG G+SYT   H++ +A P+ V++ L   H       +  ++
Sbjct: 533 TGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCRAEECA--SV 590

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
           +     C+ L   V++ V+N G   GAH++L+FS+PP  H AP K L+ FEKV +  G  
Sbjct: 591 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEA 650

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             V   + VC+ LSVVD  G R++ LG H +H+G  KH+V L 
Sbjct: 651 GTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 693



 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 35/62 (56%)

Query: 28 FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
          FACD  +AT     FC        R  DL+ RL+L EKV  L++   A+ RLGI  YEWW
Sbjct: 37 FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96

Query: 88 SE 89
          SE
Sbjct: 97 SE 98


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/756 (47%), Positives = 474/756 (62%), Gaps = 40/756 (5%)

Query: 30  CDPKDATT-----RTLPFCQVSLPIPQ-RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           CDP          +   +C  SLPI   RV DL+ R++L+EK   +I  AA V R+G+  
Sbjct: 17  CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76

Query: 84  YEWWSEALHGVSNV---GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           Y+WWSEALHGVS+V   GP T F    PGATSFP VI +A+SFN +LW+ I +VVS EAR
Sbjct: 77  YQWWSEALHGVSSVSINGP-TFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEAR 135

Query: 141 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--- 197
           A YN G AGLT+W PNVN+ RDPRWGR QET GEDP     YA SYVRGLQ  +G     
Sbjct: 136 ATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTT 195

Query: 198 ------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
                 LKV++S KHF AYDLDNW  VDR HFNA+VS+QD+ +TF  PF  CV EG V+ 
Sbjct: 196 DLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSG 255

Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAA 310
           VMCS+N +NG+P CADP + K TIR EW L+GYIVSDC S+    + Q F   T EEA A
Sbjct: 256 VMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVA 315

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
             ++AGLDL+CG +      SAV  G + + D++ +L N   V MRLG FDG P+     
Sbjct: 316 LNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPA---LA 372

Query: 371 HLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
            LG  D+C + +H ELA EAARQGIVLLKN   +LPL  +++  +A++GPN+D    M+G
Sbjct: 373 SLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPLKSVKN--LALVGPNADAYGAMMG 430

Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL-D 488
           NYAG  C   +P           ++ GC DV C +D     A++A++ AD TI+V+G+ D
Sbjct: 431 NYAGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITD 490

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM--SGGPIDVAFAKNDPRIAAI 546
            SI  E  DR  LLLPG Q  LV++++ A+  P ILV+    GGPID++FA+++P I  I
Sbjct: 491 VSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPI 550

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           +WAG+PG+ GG AIAD+++G  NPGG+LP+TWY   Y+  LPMT MA+R  +S  YPGR 
Sbjct: 551 LWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTSMALRSVESLGYPGRK 610

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA-------VPLDGRHGSINATI-SGKAI 658
           Y+F+ G  VYPFG G+SYTNF +++  APT           P      SI + I    A+
Sbjct: 611 YKFFSGSTVYPFGCGLSYTNFSYSLT-APTRSIHTHLKKLQPCRSMAYSICSVIPQCPAV 669

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGA 717
            V    CN  T   +V VK VGS DG+  ++V+S+PP+G    H KQ++ FE+V V  G 
Sbjct: 670 LVDDLSCNE-TFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQVIGFERVFVKVGX 728

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            ++V  +++VCK L +V  SG   +P G   I  GG
Sbjct: 729 VEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGG 764


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  654 bits (1686), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/743 (45%), Positives = 467/743 (62%), Gaps = 31/743 (4%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV-----S 95
           PFC  +LP P R   L+  L+L EK+  L+      P             + GV     +
Sbjct: 39  PFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALR----VVVGVPSTASA 94

Query: 96  NVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
             GPG T   G    AT FPQVI +A++FN +LW A  R ++ EARAM+N G AGLT+W+
Sbjct: 95  TTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWA 154

Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDL 212
           PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G Q   G+  R+ ++A CKH+ AYDL
Sbjct: 155 PNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDL 214

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
           + W G  R+ FNAKV+ QD+EDT+  PF+ C+ EG+ + +MCSYNQVNGVP CA  +IL+
Sbjct: 215 EKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQ 274

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
           R  R EW   GYI SDCD+V + ++ Q +T++ E++ A  ++AG+D++CG FL  HT+SA
Sbjct: 275 RA-RDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSA 333

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           +++G + E DIN+AL N  +VQ+RLG FD    +Q +  LGP +VCT +H+ELA EA RQ
Sbjct: 334 IEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQ 393

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RT 451
           G VLLKN    LPL       +A+IGP ++    + G+Y G+ C  TT ++G+  Y  +T
Sbjct: 394 GTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKT 453

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
               GCKDV C     FG AI+A+++AD  +L+ GL+ + E E  DR  LLLPGRQ +L+
Sbjct: 454 TFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLI 513

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
             V+  +K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG  + +ILFG  NPG
Sbjct: 514 HTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPG 573

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           GKLP+TWYP+ + T +PM +M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + +++
Sbjct: 574 GKLPITWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSI 632

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVG 680
             AP  +++       S+   IS K                 A C  L   V + V N G
Sbjct: 633 LQAPKKISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDG 688

Query: 681 SKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           + DG+H +L+F S+ P+   +P KQLV FE+VH  AG    V I +  CK +S  +  GT
Sbjct: 689 AMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGT 748

Query: 740 RRIPLGEHNIHIGGTKHSVSLHA 762
           R + LG H + +G  +H + + A
Sbjct: 749 RVLFLGTHVLMVGDEEHELLIEA 771


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/774 (45%), Positives = 466/774 (60%), Gaps = 49/774 (6%)

Query: 30  CDPKDATTRTLP-----FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGA-AAVPRLGIKG 83
           CDP    +  L      +C  SLP  +RV DL+GRL+L+EKV  L   A  A  R+G+  
Sbjct: 46  CDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPR 105

Query: 84  YEWWSEALHGVSNVGPG-TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
           Y WW EALHGVS+  PG T+FG   PGATSFP V+ +A++FN TLW AIG   S E RAM
Sbjct: 106 YMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRAM 165

Query: 143 YNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-------- 194
           YN G A LTYWSPN+N+ RDPRWGR  ETPGEDP L G++A S+VR +Q  D        
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGAG 225

Query: 195 -----GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
                  RLKV++ CKH+ AYD+D W G DR  F+A V ++D+ +TF+ PF MCV +G  
Sbjct: 226 AADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGDA 285

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEA 308
           + VMCSYN++NGVP CA+  +L  T+R +W+L+GYIVSDCDSV V   D +       +A
Sbjct: 286 SCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQA 345

Query: 309 AADAIRAGLDLDCG-------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
            A A++AGLDLDCG        F   +   AV++G L E +++ AL +     MRLG FD
Sbjct: 346 TAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFFD 405

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
           G P  Q    LG  DVCT +H+E+A EAARQG+VLLKN    LPL   +  ++A++G   
Sbjct: 406 GSPEFQ---SLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQ 462

Query: 422 DVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
            +  T  M+G+Y G  C   TP + I +       Q C   AC    L GAAI A++  D
Sbjct: 463 HINATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQACDKGACGTTAL-GAAI-AAKTVD 520

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           ATI++ GL+ S+E E  DR  LLLP  Q + ++ V+ AS+ P  LV++S G +D++FA+N
Sbjct: 521 ATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQN 580

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
           +P+I AI+WAGYPG+ GGT IAD+LFG  NPGG+LP+TWY  EYI  LPMT MA+RP   
Sbjct: 581 NPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVAD 640

Query: 600 KRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVP-----------LDGRHG 647
           K YPGRTY+FY GP V+YPFGHG+SYTNF +        V V            L  + G
Sbjct: 641 KGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKPG 700

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLV 706
           +  +T    AI V    C    +   + V N G   G+H + V++ PPA    AP KQLV
Sbjct: 701 TTASTAPCPAINVAGHGCQE-EVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQLV 759

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           AF ++ VPAG    V   + VCK  ++V+ +    +P G   + +G    S S 
Sbjct: 760 AFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSF 813


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/738 (44%), Positives = 456/738 (61%), Gaps = 54/738 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PFC  +LP P R   L+  L+L EK+  L + AA  PRLG+  +EWWSE+LHGV + GPG
Sbjct: 39  PFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDNGPG 98

Query: 101 TKFG-GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
             F  G    AT FPQVI +A++FN +LW A  R ++ EARAM+N G AGLT+W+PN+N+
Sbjct: 99  VNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPNINV 158

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
           FRDPRWGRGQETPGEDP +   Y+  YV+G Q   G+  R+ ++A CKH+ AYDL+ W G
Sbjct: 159 FRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMMLSACCKHYIAYDLEKWRG 218

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ FNAKV                                NGVP CA  +IL+R  R 
Sbjct: 219 FTRYTFNAKV--------------------------------NGVPACARKDILQRA-RD 245

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   GYI SDCD+V + ++ Q +T++ E++ A  ++AG+D++CG FL  HT+SA+++G 
Sbjct: 246 EWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGK 305

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           + E DIN+AL N  +VQ+RLG FD    +Q +  LGP +VCT +H+ELA EA RQG VLL
Sbjct: 306 VQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLL 365

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQG 456
           KN    LPL       +A+IGP ++    + G+Y G+ C  TT ++G+  Y  +T    G
Sbjct: 366 KNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAG 425

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
           CKDV C     FG AI+A+++AD  +L+ GL+ + E E  DR  LLLPGRQ +L+  V+ 
Sbjct: 426 CKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVAS 485

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
            +K P +LVLM GGP+DV+FAK+DPRIA+I+W GYPG+ GG  + +ILFG  NPGGKLP+
Sbjct: 486 VTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPI 545

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           TWYP+ + T +PM +M MR   S+ YPGRTYRFY G VVY FG+G+SY+ + +++  AP 
Sbjct: 546 TWYPESF-TAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPK 604

Query: 637 VVAVPLDGRHGSINATISGKAIKVTH-----------AKCNRLTLGVQVDVKNVGSKDGA 685
            +++       S+   IS K                 A C  L   V + V N G+ DG+
Sbjct: 605 KISL----SRSSVPDLISRKPAYTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGS 660

Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
           H +L+F S+ P+   +P KQLV FE+VH  AG    V I +  CK +S  +  GTR + L
Sbjct: 661 HAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFL 720

Query: 745 GEHNIHIGGTKHSVSLHA 762
           G H + +G  +H + + A
Sbjct: 721 GTHVLMVGDEEHELLIEA 738


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/795 (42%), Positives = 471/795 (59%), Gaps = 63/795 (7%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           STI  F   L+L+            + C P  +     PFC +SLPI  R   LI  L+L
Sbjct: 7   STIIIFLFSLLLIHLPK--FFTTPDYPCKPPHSH---YPFCNISLPISTRTTSLISLLTL 61

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
            +K+  L + A+++  LGI  Y+WWSEALHG++  GPG  F G    AT+FPQVI +A++
Sbjct: 62  SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 121

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FN +LW  IG  V  E RAM+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++   Y
Sbjct: 122 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 181

Query: 183 AASYVRGLQGSDG------------DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           A  +VRG+QG DG            D L V+A CKHFTAYDL+ W    R++FNA V   
Sbjct: 182 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVV--- 238

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY-IVSDC 289
              +T+  PFR CV +GK + +MCSYN+VNGVP CA  ++L   +R +W   G  I+   
Sbjct: 239 ---NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILPQT 294

Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRA-----------GLDLDCGPFLGLHTESAVQRGL 337
             + + +   +   + P+      ++             +D++CG F+  HTESA+++GL
Sbjct: 295 VMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQGL 354

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           + E D++ AL N  +VQMRLG+F+G+P    +G LGP+DVCTP+H++LALEAARQGIVLL
Sbjct: 355 VKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLL 414

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQG 456
           KN    LPL      ++A+IGP +  T  + G Y+GI C   +   G+  Y +TI +  G
Sbjct: 415 KNDNKFLPLDKKDRVSLAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFG 473

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
           C DV C  D  F  AID ++QAD  ++V GLD ++E E LDR  LLLPG+Q +LVS+V+ 
Sbjct: 474 CSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAA 533

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           ASK P ILVL  GGP+DV+FA+++  I +I+W GYP         D      +  G+LPM
Sbjct: 534 ASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYP--------VDF-----DAAGRLPM 580

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           TWYP+ + TN+PM +M MR   S+ YPGRTYRFY G  +Y FGHG+SY++F + V +AP+
Sbjct: 581 TWYPESF-TNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS 639

Query: 637 VVAVPLD---GRHGSINATISGKAIKVTHA------KCNRLTLGVQVDVKNVGSKDGAHT 687
            +++      G   S+   +     +V H        CN L+  V + V NVG  DG+H 
Sbjct: 640 KLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHV 699

Query: 688 LLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
           +++FS  P     +P  QLV   ++H  +       I    C++ S  D  G R +PLG 
Sbjct: 700 VMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGN 759

Query: 747 HNIHIGGTKHSVSLH 761
           H +++G  +H VS+ 
Sbjct: 760 HILNVGDVEHIVSIE 774


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/734 (46%), Positives = 448/734 (61%), Gaps = 32/734 (4%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           FC   L   QR  DL+  L+L EKV  L   AA V RLG+  YEWWSE LHG+S  G G 
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIF 160
           +F G     TSFPQVI TA++F+A LW  +G  V  EARA+YN G A GLT WSPNVNIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRG-LQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           RDP               SG       RG   G  G   + +A CKH TAYDLD WN V 
Sbjct: 151 RDP---------------SGTRPGDARRGPRHGEQGIGGEASACCKHATAYDLDYWNNVV 195

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+++++KV+ QD+EDT++ PF+ CV EGK   +MC YN +NGVP CA  ++L + +R EW
Sbjct: 196 RYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEW 255

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
            +NGY+ SDCD+V    D  H+T +PE+  A +I+ G+D++CG +  +H  +AVQ+G L+
Sbjct: 256 GMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLT 315

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           E DI+ ALVN   V+MRLG FDG+P S   YGHLG  DVC+P H+ LALEAA+ GIVLLK
Sbjct: 316 EKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLK 375

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQG 456
           N   +LPL      ++AVIGPN+D    + GNY G  C  TTPLQGI  Y   R     G
Sbjct: 376 NDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAG 435

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
           C   ACA D    AA  AS  +D  +L MGL Q  E + LDR  LLLPG QQ L++ V+ 
Sbjct: 436 CDSPACAVDATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVAN 494

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           A++ P ILVL++GGP+DV FAK++P+I AI+WAGYPGQAGG AIA +LFG  NP G+LP+
Sbjct: 495 AARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPV 554

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV----- 631
           TWYP+E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG+G+SY+ F   +     
Sbjct: 555 TWYPEEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFS 613

Query: 632 ---ANAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
              A   +++A  +  R G     +S   +K +   +C+RL     V+V+N G  DG H+
Sbjct: 614 TSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHS 673

Query: 688 LLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
           +L++   P      P +QL+ F   HV  G +  V   +  C++ S V   G R I  G 
Sbjct: 674 VLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGA 733

Query: 747 HNIHIGGTKHSVSL 760
           H + +G  +   S 
Sbjct: 734 HFLMVGDEELETSF 747


>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
          Length = 699

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 323/631 (51%), Positives = 406/631 (64%), Gaps = 71/631 (11%)

Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           +VVS EARAMYN G AGLT+WSPNVNIF+DPRWGRGQETPGEDP+LS KYA+ YVRGLQ 
Sbjct: 135 KVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQ 194

Query: 193 SDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
           SD    DRLKVAA CKH+TAYDLDNW GVD FHFNA V+ QD++DTF  PF+ CV++G V
Sbjct: 195 SDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNV 254

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
           ASV+                              YIVSDCDSV V+Y++QH+T TPEEAA
Sbjct: 255 ASVI------------------------------YIVSDCDSVDVFYNSQHYTKTPEEAA 284

Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
           A AI AGLDL+CG FLG HTE+AV+ GL+ E  ++ A+ N     MRLG FDG PS   Y
Sbjct: 285 AKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIY 344

Query: 370 GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG 429
           G LGPKDVCT +HQE A EA RQGIV                                  
Sbjct: 345 GKLGPKDVCTSEHQERAREAPRQGIV---------------------------------- 370

Query: 430 NYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
            +AG  C YTTPLQG+     T +  GC +VAC   Q+   A   +  ADAT+L++G+DQ
Sbjct: 371 -FAGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAAAADATVLIVGIDQ 428

Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
           SIEAE  DR  + LPG+Q  L+++V+  SKG  ILV+MSGG  D++FAKND +I +I W 
Sbjct: 429 SIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDISFAKNDDKITSIQWV 488

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           GYPG+AGG AIAD++FG  NP GKLPMTWYPQ Y+  +PMT M MRP  +  YPGRTYRF
Sbjct: 489 GYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRF 548

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
           Y G  +Y FG G+SYT F H +  AP  V++P++  H   ++    K++      C  L 
Sbjct: 549 YTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSCHSSKC--KSVDAVQESCQNLA 606

Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
             + + V N G+  G+HT+ +FS+PP+ H +P K L+ FEKV V A A+  V   + VCK
Sbjct: 607 FDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCK 666

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            LS+VD  GTR++ LG H +H+G  KHS+++
Sbjct: 667 DLSIVDELGTRKVALGLHVLHVGNLKHSLNV 697


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 346/765 (45%), Positives = 464/765 (60%), Gaps = 73/765 (9%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +C   LP   RV DLIG ++++EKV  L   AA  PR+G+  Y+WWSEALHG+S+ GP T
Sbjct: 62  YCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGPTT 121

Query: 102 KF------------GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
           KF               F G T F  VI +A+SFN +LW +IG+ +S EARAMYN G  G
Sbjct: 122 KFDDLKKPRLHSGRAAVFNG-TVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKGG 180

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-------GSDGDRL---- 198
           LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q       G +GD L    
Sbjct: 181 LTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFNGDPLSRPL 240

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K +A CKH+ AYD+D+W G  RF F+A+V+++D+ +TF  PF MCV +G  ++VMCSYN+
Sbjct: 241 KTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASAVMCSYNR 300

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
           VNG+P CAD  +L  T+R +W L+GYIVSDCD+V V  D   +   TP EA+A +++AGL
Sbjct: 301 VNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASAASLKAGL 360

Query: 318 DLDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
           DLDCG             FL  +  +AV++G + E DI+NALVN  T  MRLG FDG P 
Sbjct: 361 DLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLGYFDGMPR 420

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV-T 424
              Y  L  KD+C+  H+ LAL+ ARQ +VLLKN    LPL   +  +VAV GP+++   
Sbjct: 421 ---YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVRGPHAEAPE 477

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILV 484
             M G+Y G  C Y TP +GI +    I QQG                      D TI +
Sbjct: 478 KVMDGDYTGPPCRYITPREGISKDV-NISQQG---------------------GDVTIYM 515

Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
            G++  IE E  DR  LLLP  Q E + +V+ AS  P +LV++SGG IDV+FA++ P+I 
Sbjct: 516 GGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSFAQSHPKIG 575

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+WAGYPG  GG AIAD++FG  NPGG+LP+TW+  +YI  LPMT MA+RP     YPG
Sbjct: 576 AILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIHQLPMTSMALRPRPEHGYPG 635

Query: 605 RTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK------A 657
           RTY+FY GP V+YPFG+G+SYT F + + N  T V +    RH    +  +G       A
Sbjct: 636 RTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGRRHCRQLSYKTGSVGPDCPA 695

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAG 716
           + V    C   T+   V V N G  DGA+ +LV++ PPA    AP KQ+ AF +V V AG
Sbjct: 696 VDVASHACAE-TVSFNVSVVNAGKADGANAVLVYTAPPAELAGAPIKQVAAFRRVAVKAG 754

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI-GGTKHSVSL 760
           A + V   ++VCK   +V+++    +P G   + +  G   +VS 
Sbjct: 755 AAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIVENGDSSAVSF 799


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 321/677 (47%), Positives = 442/677 (65%), Gaps = 20/677 (2%)

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVN 158
           G    G  P ATSFPQVI TA+SF+  LW  I +V+  EAR +YN G A G+ +W+PN+N
Sbjct: 2   GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-------GDRLKVAASCKHFTAYD 211
           IFRDPRWGRGQET GEDP+++ KY  SYVRGLQG         G RLK +A CKHFTAYD
Sbjct: 62  IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYD 121

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
           L+NW GV+R+ F+AKV+ QD+ DT+   F  CV++G+ + +MC+YN+VNGVP CAD N+L
Sbjct: 122 LENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLL 181

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
             T R +W  NGYI SDCD+V   Y+ Q +  TPE+  AD +RAG+DL+CG ++  H +S
Sbjct: 182 TNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVLRAGMDLECGNYMTKHAKS 241

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           AV +  +    I+ AL N  T+++RLG+FDG P+   YG +GP  VC+ ++ +LALEAAR
Sbjct: 242 AVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAAR 301

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD-VTVTMIGNYAGIACGYTTPLQGIGRYAR 450
            GIVLLKN    LPL   R  T+ VIGPN++  ++ ++GNY G  C   + L+G   YA 
Sbjct: 302 SGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYTYAS 359

Query: 451 TIH-QQGCKD-VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
             H   GC D   CA  ++   A++ ++ +D  ILVMGLDQS E E LDR  L LPG+QQ
Sbjct: 360 QTHYHSGCTDGTKCASAEI-DRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQ 418

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           +L++ V+ ASK P ILVL+ GGP+D+ FAKN+ +I  IIWAGYPG+ GG A+A ++FG  
Sbjct: 419 KLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDY 478

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NPGG+LPMTWYP+++I  +PMT+M MR   S  YPGRTYRFY GP VY FG+G+SY+N+ 
Sbjct: 479 NPGGRLPMTWYPKDFI-KIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYS 537

Query: 629 H---TVANAPTVVAVPLDGRHGSINATISGKAI-KVTHAKCNRLTLGVQVDVKNVGSKDG 684
           +   +V N    +           + TI+ K + ++    C  +++ V + + N GS  G
Sbjct: 538 YNFISVKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSISVTLGITNTGSMAG 597

Query: 685 AHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            H +L+F  P  G +  P KQLV FE V V  G +  VG  + VC++LS  + SG + I 
Sbjct: 598 KHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIE 657

Query: 744 LGEHNIHIGGTKHSVSL 760
            G +   +G  ++S+++
Sbjct: 658 EGGYLFLVGQEEYSINI 674


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  627 bits (1618), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 347/766 (45%), Positives = 451/766 (58%), Gaps = 74/766 (9%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +C  SLP   RV DLIGR++++EKV  L        R+G+  Y WWSEALHG+S+ GP T
Sbjct: 70  YCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGPTT 129

Query: 102 KFG-----------GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           KF                 AT F  VI +A+SFN TLW++IG+ VS EARAMYN G  GL
Sbjct: 130 KFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKGGL 189

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------------L 198
           TYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q   G              L
Sbjct: 190 TYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNTRPL 249

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K +A CKH+ AYDLD+W+   RF F+A+V ++D+ +TF  PF MCV +G V+SVMCSYN+
Sbjct: 250 KTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCSYNR 309

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
           VNG+P CAD  +L +TIR +W L+GYIVSDCD+V V  D   +   T  EA+A A++AGL
Sbjct: 310 VNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALKAGL 369

Query: 318 DLDCGP-------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
           DLDCG              FL  +   AV +G + E DI+NAL N     MRLG FD   
Sbjct: 370 DLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD--- 426

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV- 423
               Y  LG +D+CT  H+ LAL+ ARQGIVLLKN    LPL   +   V V GP+    
Sbjct: 427 DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHVQAP 486

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
              M G+Y G  C Y TP QG+ +Y R  H+                       A+ TI 
Sbjct: 487 EKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-----------------------ANTTIY 523

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
             GL+ +IE E  DR  +LLP  Q E + +V+ AS  P ILV++SGG IDV+FA+N+P+I
Sbjct: 524 FGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNNPKI 583

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            AI+WAGYPG  GG AIAD++FG  NP G+LP+TW+  +YI  LPMT M +RP     YP
Sbjct: 584 GAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKHGYP 643

Query: 604 GRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG------- 655
           GRTY+FY GP V+YPFG+G+SYT F++ +    T + VP+ G H    +  SG       
Sbjct: 644 GRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVAGGHCKKLSYKSGVSTAPAC 703

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVP 714
            AI V    C   T+   V V N G   G+H ++VFS PPA    AP KQ+VAF+ V VP
Sbjct: 704 PAINVNGHVCTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDDAPMKQVVAFKSVFVP 762

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           A +   V   ++VCK   +V+++    +P G   I +     SVS 
Sbjct: 763 AWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDSSVSF 808


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  627 bits (1616), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 346/767 (45%), Positives = 451/767 (58%), Gaps = 75/767 (9%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +C  SLP   RV DLIGR++++EKV  L        R+G+  Y WWSEALHG+S+ GP T
Sbjct: 69  YCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGPTT 128

Query: 102 KFG-----------GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           KF                 AT F  VI +A+SFN TLW++IG+ VS EARAMYN G  GL
Sbjct: 129 KFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKGGL 188

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR------------L 198
           TYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q   G              L
Sbjct: 189 TYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNTRPL 248

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K +A CKH+ AYDLD+W+   RF F+A+V ++D+ +TF  PF MCV +G V+SVMCSYN+
Sbjct: 249 KTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCSYNR 308

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGL 317
           VNG+P CAD  +L +TIR +W L+GYIVSDCD+V V  D   +   T  EA+A A++AGL
Sbjct: 309 VNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALKAGL 368

Query: 318 DLDCGP-------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
           DLDCG              FL  +   AV +G + E DI+NAL N     MRLG FD   
Sbjct: 369 DLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD--- 425

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV- 423
               Y  LG +D+CT  H+ LAL+ ARQGIVLLKN    LPL   +   V V GP+    
Sbjct: 426 DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHVQAP 485

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
              M G+Y G  C Y TP QG+ +Y R  H+                       A+ TI 
Sbjct: 486 EKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-----------------------ANTTIY 522

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
             GL+ +IE E  DR  +LLP  Q E + +V+ AS  P ILV++SGG IDV+FA+N+P+I
Sbjct: 523 FGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNNPKI 582

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            AI+WAGYPG  GG AIAD++FG  NP G+LP+TW+  +YI  LPMT M +RP     YP
Sbjct: 583 GAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLPMTSMDLRPVAKHGYP 642

Query: 604 GRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH--------GSINATIS 654
           GRTY+FY GP V+YPFG+G+SYT F++ +    T + VP+ G H        G  +A  +
Sbjct: 643 GRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAGGHCKKLSYKSGVSSAAPA 702

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHV 713
             AI V    C   T+   V V N G   G+H ++VFS PPA    AP KQ+VAF  V V
Sbjct: 703 CPAINVNGHACTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDDAPIKQVVAFRSVFV 761

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           PA +   V   ++VCK   +V+++    +P G   + +     SVS 
Sbjct: 762 PAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVDSSVSF 808


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 342/770 (44%), Positives = 458/770 (59%), Gaps = 78/770 (10%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
            P+C  SLP   RV DLIG ++++EKV  L   +   PR+G+  Y+WWSEALHGVS+ GP
Sbjct: 61  FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWSEALHGVSSTGP 120

Query: 100 GTKFG------GDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
              F       G+  G      AT F  VI +A+SFN TLW++IG+ VS EARAMYN G 
Sbjct: 121 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNLGK 180

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ---GSDG-------DR 197
            GLTYWSPN+N+ RDPRWGR  ETPGEDP ++G+YA ++VRG+Q   G DG         
Sbjct: 181 GGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGDDPSTRP 240

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           +K +A CKH+ AYD+D+W+   RF F+A+VS++D+ +TF  PF MCV +G  + VMCSYN
Sbjct: 241 IKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSYN 300

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ---HFTSTPEEAAADAIR 314
           +VNG+P CAD  +L  TIRG+W+L+GYIVSDCD+V V  D     HFT    E++A +IR
Sbjct: 301 RVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGA--ESSAASIR 358

Query: 315 AGLDLDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
           AGLDLDC              FL  + ++AV +G + E DI++AL N     MRLG FD 
Sbjct: 359 AGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDN 418

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
            P    Y  L   D+CT +H+ LA + ARQG+VLLKN    LPL   +   VAV GP++ 
Sbjct: 419 IPR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHAR 475

Query: 423 V-TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
                M G+Y G  C Y TP QGI +  +  H+                       A+ T
Sbjct: 476 APEKIMDGDYTGPPCRYVTPRQGISKDVKISHR-----------------------ANTT 512

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           I + G++  IE E  DR  LLLP  Q E +   + AS  P ILV++SGG ID++FA   P
Sbjct: 513 IYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHP 572

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           +I AI+WAGYPG  GG AIAD++FG  NPGG+LP+TW+  +YI  +PMT M  RP   K 
Sbjct: 573 KIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEFRPVPEKG 632

Query: 602 YPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG-------SINATI 653
           YPGRTY+FY GP V+YPFG+G+SYT F++  +   T V +P  G H        S+  T 
Sbjct: 633 YPGRTYKFYDGPEVLYPFGYGLSYTKFLYETSTNGTAVTLPATGGHCKGLSYKPSVATTP 692

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
           + +A+ V    C   T+   + V N G + GAH +LV++ PP     AP KQ+ AF +V 
Sbjct: 693 ACQAVDVAGHACTE-TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIKQVAAFRRVF 751

Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI--GGTKHSVSL 760
           VPA +   V   ++VCK   +V+R+    +P G   + +  G +  SVS 
Sbjct: 752 VPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSVSF 801


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 343/766 (44%), Positives = 452/766 (59%), Gaps = 72/766 (9%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
            P+C  SLP   RV DLIG ++++EKV  L   +   PR+G+  Y+WWSEALHGVS+ GP
Sbjct: 68  FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVSSTGP 127

Query: 100 GTKFG------GDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
              F       G+  G      AT F  VI +A+SFN TLW +IG+ VS EARAMYN G 
Sbjct: 128 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMYNLGK 187

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--------LK 199
            GLTYWSPN+N+ RDPRWGR  ETPGEDP ++G+YA ++VRG+Q   G          +K
Sbjct: 188 GGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDPSARPIK 247

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
            +A CKH  AYD+DNW+   RF ++A+VS++D+ +TF  PF MCV EG V+SVMCSYN+V
Sbjct: 248 TSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVMCSYNRV 307

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-TPEEAAADAIRAGLD 318
           NGVP CAD  +L  T+RGEW LNGYIVSDCD+V V  D   + + T  E++A ++RAG+D
Sbjct: 308 NGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVSLRAGMD 367

Query: 319 LDCGP------------FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
           LDC              +L  +  +AV +G + E DI+NAL N     MRLG FD  P  
Sbjct: 368 LDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYFDNIPR- 426

Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV-TV 425
             Y  L   DVCT +H+ LAL+ ARQGIVLLKN    LPL   +   VAV GP++     
Sbjct: 427 --YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPHARAPEK 484

Query: 426 TMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVM 485
            M G+Y G  C Y TP QGI R  +  H+                       A  TI + 
Sbjct: 485 IMDGDYTGPPCRYVTPRQGISRDVKISHK-----------------------AKMTIYLG 521

Query: 486 GLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
           G++  IE E  DR  LLLP  Q E +   + AS  P ILV++SGG ID++FA+  P+I A
Sbjct: 522 GINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQKHPKIGA 581

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
           I+WAGYPG  GG AIAD++FG  NPGG+LP+TW+  +YI  +PMT M  RP   K YPGR
Sbjct: 582 ILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTSMEFRPVPEKGYPGR 641

Query: 606 TYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG-------SINATISGKA 657
           TY+FY GP V+YPFG+G+SYT F +  +     V++P  G H        S+    + +A
Sbjct: 642 TYKFYDGPEVLYPFGYGLSYTKFQYETSTDGVSVSLPAPGGHCKGLSYKPSVATVPACQA 701

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAG 716
           + V    C   T+   V V N G + GAH +LV++ PP     AP KQ+ AF +V V A 
Sbjct: 702 VNVADHACTE-TVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAPIKQVAAFRRVFVAAR 760

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI--GGTKHSVSL 760
           +   V   ++VCK   +V+R+    +P G   + +  G +  SVS 
Sbjct: 761 STATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVSF 806


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  602 bits (1551), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 294/631 (46%), Positives = 417/631 (66%), Gaps = 13/631 (2%)

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLK 199
           M+N G AGLTYW+PN+NIFRDPRWGRGQET GEDP ++  Y+  YV+G QG +G+  R++
Sbjct: 1   MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIR 60

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           ++A CKH+TAYD++ W G  R+ FNAKV+ QD+EDT+  PF+ C+ E + + +MC+YNQV
Sbjct: 61  LSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQV 120

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVP CA  ++L++T R EW   GYI SDCD+V + ++ Q +T + E++ A  ++AG+D+
Sbjct: 121 NGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDI 179

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG FL  HT+SA+++G + E DI+ AL N  +VQ+RLG+FD   ++Q +  LGP  VCT
Sbjct: 180 NCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCT 239

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +H+ELA EA RQG VLLKN    LPL     R VA+IGP+++    M G+Y G+ C  T
Sbjct: 240 KEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPT 299

Query: 440 TPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           T L+GI  YA +T    GCKD +C    LFG A++A+++AD  +++ GL+ + E E  DR
Sbjct: 300 TFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDR 359

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             LLLPG+Q  L+  ++  +K P +LVL+ GGP+DV+FAK DPRIA+I+W GYPG+ GG 
Sbjct: 360 VSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQ 419

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
            + +ILFG  NPGGKLP+TWYP+ + T +PMT+M MR   S+ YPGRTYRFY G VVY F
Sbjct: 420 VLPEILFGEYNPGGKLPITWYPESF-TAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGF 478

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-------GKAIKVTHAKCNRLTLG 671
           G+G+SY+ + +++++AP  + V      G I+   +       G       A C  L   
Sbjct: 479 GYGLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAYTRRDGLGSVKTEDIASCEALVFS 538

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V V V N GS DG+H +L+F+   +     P KQLV FE VH  AG+   V I +  CK 
Sbjct: 539 VHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQ 598

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +S  +  G R + LG H + +G  +  +S+ 
Sbjct: 599 MSAANPEGKRVLLLGAHVLTVGDEEFELSIE 629


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  601 bits (1549), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 294/615 (47%), Positives = 407/615 (66%), Gaps = 25/615 (4%)

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           WSPNVNIFRDPRWGRGQETPGEDP  + KY A++V+GLQGS    L+ +A CKH TAYD+
Sbjct: 2   WSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCKHITAYDI 61

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
           + W GV R++FNAKV+ QD+ DT++ PFR CV++GK + +MC+Y  +NGVP CA  ++L 
Sbjct: 62  EEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPACASSDLLT 121

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
           +T+RGEW+L+GY  SDCD+V + + ++HFT T EEA A A++AGLD++CG ++  +  SA
Sbjct: 122 KTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASA 181

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAAR 391
           +Q+G ++E D++ AL N   ++MRLG FDG+P  ++ YG L   DVCTP H+ LALEAAR
Sbjct: 182 LQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKALALEAAR 241

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
           +G+VLLKN    LPL      + AVIG N++  + ++GNY G+ C  TTP  GI +Y ++
Sbjct: 242 RGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGGIQKYVKS 301

Query: 452 IH-QQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQ 507
                GC   AC   A DQ    A   ++ +D   LVMGL Q  E E LDR  LLLPG+Q
Sbjct: 302 AKFLPGCSSAACDVAATDQ----ATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQ 357

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
           Q L++ V+ ASK P IL+L++GGP+D+ FA+ +P+I AI+WAGYPGQAGG AIAD+LFG 
Sbjct: 358 QALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVLFGE 417

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            NP GKLP+TWYP+E+ T   MT+M MRP  +  YPGR+YRFYKG  VY FG+G+SY+ F
Sbjct: 418 FNPSGKLPVTWYPEEF-TKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGYGLSYSKF 476

Query: 628 ----VHTVANAPTVVAVPLDGRHGSINATISGKAI----KVTHAKCNRLTLGVQVDVKNV 679
               V    N+ +     L G   +   T  G A+    ++   +C RL   V V+V+N 
Sbjct: 477 ACRIVSGAGNSSSYGKAALAGLRAA--TTPEGDAVYRVDEIGDDRCERLRFPVMVEVQNH 534

Query: 680 GSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           G  DG HT+L+F   S+   G   P +QL+ F   H+  G ++++ + I  C++LS    
Sbjct: 535 GPMDGKHTVLMFVRWSSTDGGR--PVRQLIGFRNQHLKVGEKKKLKMEISPCEHLSRARV 592

Query: 737 SGTRRIPLGEHNIHI 751
            G + I  G H + +
Sbjct: 593 DGEKVIDRGSHFLMV 607


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  599 bits (1545), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 313/748 (41%), Positives = 447/748 (59%), Gaps = 59/748 (7%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           CD  D    +LPFC  SLPI  RV+DL+ R+ L++ V LL++ A+A P + +  YEWW+E
Sbjct: 24  CD--DPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNE 81

Query: 90  ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
           ALHGV+ + PG  F G    ATSFPQV++TA+SFN TL+  I   +S EARA YN   AG
Sbjct: 82  ALHGVA-LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAG 140

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS---------DGDRLKV 200
           LT+W+PNVNIFRDPRWGRGQETPGEDP L+G+YA ++VRGLQG          D   LK+
Sbjct: 141 LTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDDNKFLKI 200

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           ++ CKHF+AY  +    V R   +A V+KQD  DT+   F  CV  G V+S+MCSYN VN
Sbjct: 201 SSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVN 256

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           G+P+CAD  +L   +R +W+ +GYI SDC++V       HFT +PE+  A  + AG+DL+
Sbjct: 257 GIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHFTQSPEQTCATTLDAGMDLN 316

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CG FL  H  SA+++G++S   ++NAL N   V MRLGMF  E  +QP+ ++    V T 
Sbjct: 317 CGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMF--EKGTQPFSNITKDAVDTA 374

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLS---HIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
            H++LALEAARQ +VLLKN+  +LPL+     +  ++A+IGP+ + +  ++GNY GI   
Sbjct: 375 AHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSH 434

Query: 438 YTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
             TPL+G+  Y   + +  GCK V+      F  AI+  ++AD  ++ MGLDQS E E +
Sbjct: 435 IVTPLKGVSSYVPNVAYSLGCK-VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEI 493

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  L LPG Q  L++++  A+  P +LVL+SGG +D++  KN P++ AI++ GY GQAG
Sbjct: 494 DRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAG 553

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+AD+LFG  +P G+L  T+Y  +Y+  +P+ +M MRP+     PGRTYRF+ G  VY
Sbjct: 554 GQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGAPVY 613

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
            FG G+SYT F                                  H  C       ++ V
Sbjct: 614 EFGFGLSYTTF----------------------------------HKACRSCVASFEITV 639

Query: 677 KNVGSKDGAHTLLVFSTPP-AGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
            N+G  +G   +L+++ PP AG    P + LVAFE+  +    +          K  ++ 
Sbjct: 640 TNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADFCLEAKAFALA 699

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           +  G+  +  G   IH+   +H V++ A
Sbjct: 700 NAEGSWVVEQGNWTIHVDTLQHRVNVQA 727


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 297/490 (60%), Positives = 362/490 (73%), Gaps = 7/490 (1%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT     FC        R  DL+ RL+L EKV  L++   A+ RLGI  YEWW
Sbjct: 37  FACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWW 96

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNA+L+ AIG VVS EARAM+N G 
Sbjct: 97  SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGL 156

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----GSDGDRLKVAA 202
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQ     G     LKVAA
Sbjct: 157 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGAGGVTDGALKVAA 216

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+TAYD+DNW GV+R+ F+AKVS+QD++DTF  PF+ CV++G VASVMCSYN+VNG 
Sbjct: 217 CCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGK 276

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           PTCAD ++L+  IRG+W+LNGYIVSDCDSV V Y  QH+T TPEEAAA  I++GLDL+CG
Sbjct: 277 PTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCG 336

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
            FL  HT +AVQ G LSE D++ A+ N   + MRLG FDG+P    +G LGPKDVCT  +
Sbjct: 337 NFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSN 396

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +ELA E ARQGIVLLKN G +LPLS    +++AVIGPN++ + TMIGNY G  C YTTPL
Sbjct: 397 RELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 455

Query: 443 QGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           QG+G    T++Q GC +V C+ + L    A+ A+  AD T+LV+G DQSIE E+LDR  L
Sbjct: 456 QGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSL 515

Query: 502 LLPGRQQELV 511
           LLPG+Q +LV
Sbjct: 516 LLPGQQTQLV 525


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  594 bits (1531), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 325/767 (42%), Positives = 456/767 (59%), Gaps = 58/767 (7%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           + F  L + L ++S +    + PF          T  +C  +  IP+RVNDL+ R+++ +
Sbjct: 3   MKFTVLLVFLFASSVADYCEKAPF---------NTYKYCDYTQSIPERVNDLLSRMTILD 53

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+  LI+ A A+P L I  Y+WWSE LHGV+   PG  FGG+FP ATSFPQVI   ++FN
Sbjct: 54  KIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG-SPGVHFGGNFPNATSFPQVIGLGATFN 112

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
            +L  A+ +V+S EARA  NGG AGLTY++PN+NIFRDPRWGRGQETPGEDP LS +YAA
Sbjct: 113 MSLVLAMAQVISTEARAFANGGQAGLTYFAPNINIFRDPRWGRGQETPGEDPYLSSQYAA 172

Query: 185 SYVRGLQ-GSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           ++V+G+Q G+D  R LK  A+CKH+ AYDL+N+  + R  FNA VS QD E+T+   FR 
Sbjct: 173 NFVKGMQEGADDTRYLKTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRS 232

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT 302
           CV EGKV S+MCSYN VNGVP+CA+  I     RG+W   GY+VSDC ++    ++  +T
Sbjct: 233 CVEEGKVGSIMCSYNAVNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYT 292

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
           S  ++  A  +R G DL+CG F   H ++A   G +++ DI+ A+    T +MRLGMFD 
Sbjct: 293 SNTDDTVAAGLRGGCDLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFD- 351

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
            PS QP+       V T  H+ LAL+A+R+ IVLL+N    LPLS   HR +A++GP+  
Sbjct: 352 PPSMQPFRDYTNDKVDTKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQ 411

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQ--AD 479
               M GNY G A    +P+QG+     ++    GC  VAC     F        +   +
Sbjct: 412 AQGAMQGNYKGTAPYLISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIE 471

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG--PTILVLMSGGPIDVAFA 537
           A I V+GLD+S E+E  DR  L LPG+Q +L+  +   +    P I+V+MSGGP+D++  
Sbjct: 472 AIIAVIGLDESQESEGHDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGV 531

Query: 538 KNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           K+   IA AI+WAGYPGQ+GG AIA++++G  NP G+LP+T+YP  YI  +P T M+MR 
Sbjct: 532 KD---IADAILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRV 588

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
                 PGR+Y+FY G  V+PFG G+SYT F     N P V                   
Sbjct: 589 P-----PGRSYKFYTGTPVFPFGFGLSYTTFEMKWKNPPNVT------------------ 625

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVP 714
            +K TH     + +  +V V N G + G+ ++L +  ST P    AP K+L  F+K+++ 
Sbjct: 626 HLKTTH----DVDVNYEVVVTNAGKRSGSVSVLAYITSTVPG---APMKELFGFQKIYLK 678

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT---KHSV 758
                 +   +   K  + VD+ G R+I  G + I IG T   KH+V
Sbjct: 679 PEQSMTLSF-VAEPKVFTTVDKHGERKIRPGTYKITIGDTSDLKHTV 724


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  593 bits (1528), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 329/748 (43%), Positives = 458/748 (61%), Gaps = 20/748 (2%)

Query: 13  ILLSASSSGLA----AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           + + A+ SGL     +  P  CD +    ++LPFC  +L    R+ DLI R++  +   L
Sbjct: 153 LTIRANDSGLCLDAGSPTPRTCDVEPG--KSLPFCNTALSYDDRIRDLISRINDSDLPGL 210

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
           L++ A  V  L +  Y+WWSEALHGV +  PG  FGGD P ATSFPQVI T ++FN TL+
Sbjct: 211 LVNSATGVEHLNLPAYQWWSEALHGVGH-SPGVHFGGDVPAATSFPQVIHTGATFNKTLY 269

Query: 129 EAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
             IG V+S EARAM N   AG T+W+PN+NI RDPRWGRGQETPGEDP  +G+YAA++V 
Sbjct: 270 RKIGTVISTEARAMNNVQRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVS 329

Query: 189 GLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
           G Q G D + +K ++ CKHF  Y+L+NW+GVDR H+NA  + QDI DT+   F  CV  G
Sbjct: 330 GFQDGEDMNYIKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYG 389

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
           + + +MCSYN VNGVP+CA+ +I+    R  W  +GYI SDC +V    ++  FT    E
Sbjct: 390 RASGLMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSE 449

Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
                + AG+D DCG F+  +   A+Q G++    +N AL     VQ RLG+FD   S Q
Sbjct: 450 TIRAVLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFD-PVSKQ 508

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
           PY +     V TP +Q+LALEAA+QGIVLLKN    LPL    H  VA+IGPN+D T  M
Sbjct: 509 PYTNYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGLH--VALIGPNADATTVM 566

Query: 428 IGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
            GNY G A    +P++G   Y+  + + +GC DVAC D   F AA+ A+++ADA ++V+G
Sbjct: 567 QGNYQGTAPFLISPVRGFKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVG 625

Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
           LDQ  E+E  DR  + LPG Q++LV++V+ A+K P ++ +M+GG +D++  K +  +A I
Sbjct: 626 LDQGQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGI 685

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           +W GYPGQ+GG A+AD++FG  +PGG+LP T YP  Y+    M +  MRP+++   PGRT
Sbjct: 686 LWCGYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRT 745

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
           YRFY G  VY +G G+SYT+F + +    T+       +    +A  + K I+    +  
Sbjct: 746 YRFYTGKPVYEYGTGLSYTSFSYHIHYLNTMDTSLATVQTYVQDAKQNHKFIRYDAPEFT 805

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
           R    V+V+V NVG   GA  + VF  P  PA   AP K L+ FE+V +  G    V  +
Sbjct: 806 R----VEVNVTNVGRVAGADVVQVFVEPKTPAELGAPIKTLIGFERVFLNPGQWTIVQFS 861

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           ++    L+ VD SG R    GE  +HIG
Sbjct: 862 VNAHD-LTFVDASGKRVARAGEWLVHIG 888


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  587 bits (1512), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 291/524 (55%), Positives = 366/524 (69%), Gaps = 10/524 (1%)

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V++G VASVMCSYNQVNG PTCADP++L   IRGEW+LNGYIVSDCDSV V Y  QH+T 
Sbjct: 3   VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           TP EAAA +I AGLDL+CG FLG HTE AV+ GL++E  I+ A+ N     MRLG FDG 
Sbjct: 63  TPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGN 122

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P +Q YG LGP DVCT  +QELA +AARQGIVLLKN G  LPLS    +T+AVIGPN++V
Sbjct: 123 PKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIGPNANV 181

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           T TMIGNY G  C YTTPLQG+     T +  GC +VACA   + GA   A+  AD ++L
Sbjct: 182 TKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TADVSVL 240

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           V+G DQSIEAE+ DR  L LPG+QQELV +V+ A+KGP +LV+MSGG  D+ FAKNDP+I
Sbjct: 241 VIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKI 300

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
           A I+W GYPG+AGG AIADI+FG  NP GKLPMTWYPQ Y+  +PMT M MRP ++  YP
Sbjct: 301 AGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGYP 360

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYRFY G  VY FG G+SYT F HT+  AP++V++ L+  H  +  +   +++     
Sbjct: 361 GRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENH--VCRSSECQSLDAIGP 418

Query: 664 KCNRLTLG------VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGA 717
            C     G      V + V+N G ++G HT+ +F+TPPA H +P K LV FEK+ +    
Sbjct: 419 HCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRLGKRE 478

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  V   + +CK LSVVD  G R+I LG+H +H+G  KHS+S+ 
Sbjct: 479 EAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 522


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  583 bits (1503), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 324/762 (42%), Positives = 451/762 (59%), Gaps = 51/762 (6%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           S IA F   L+ L AS +    + PF             +C  S  I +RV DL+ RL++
Sbjct: 31  SAIAVF---LLFLVASRADYCEKPPF---------NAYLYCNYSASITERVKDLLSRLTV 78

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
            EK+    + A+A+ RL I  Y+WWSE LHG++   PG  F  D   ATSFPQVI   ++
Sbjct: 79  LEKMSQTATNASAIERLDIPAYDWWSECLHGLAQ-SPGVFFENDLTSATSFPQVIGLGAT 137

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           FN +L  A+G+V+S EARA  N G +GLT+++PN+NI+RDPRWGRGQETPGEDP L+ +Y
Sbjct: 138 FNMSLVLAMGQVISTEARAFANNGQSGLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQY 197

Query: 183 AASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           AA++V+G+Q    DR  LK  A+CKH+ AY+L+ +  V R +FNA VS QD+E+T+   F
Sbjct: 198 AANFVKGIQEGSEDRRYLKAIATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAF 257

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + CV EG+V S+MCSYN +NGVP CA+  I  +  R  W   GYIVSDC ++       +
Sbjct: 258 KACVQEGQVGSIMCSYNAINGVPNCANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHN 317

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +TS      ADA++ G DL+CG F   + E A     ++E DI+ +L    T +MRLGMF
Sbjct: 318 YTSDTNITVADALKGGCDLNCGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMF 377

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           D  P  QP+     KDV TP+ Q+LAL AAR+GIVLL+N+G  LPL  ++H  +A IGPN
Sbjct: 378 D-PPEIQPFRQYSVKDVNTPEAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPN 436

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           +D T  M GNY GIA    +PLQG         +Q GC  VAC D + F  A+ A +  D
Sbjct: 437 ADATHIMQGNYHGIAPYLISPLQGFSNLGINATYQIGCP-VACNDTEGFPDAVKAVQGVD 495

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM-ASKG-PTILVLMSGGPIDVAFA 537
           A I V+GL+ + E E+ DR  + LPG Q++L+ ++   A+KG P I+V+MSGG +D+   
Sbjct: 496 AVIAVIGLNNTQEGESHDRTSIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGV 555

Query: 538 KNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
           K+   IA AI+WAGYPGQ+GG AIA++++G  NP G+LP+T+YP  YI  +P T M+MR 
Sbjct: 556 KD---IADAILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRV 612

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
                 PGR+Y+FY G  V+PFG G+SYT F     +                 +T    
Sbjct: 613 P-----PGRSYKFYTGTPVFPFGFGLSYTTFEIKWKDT----------------STAKDY 651

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAG 716
            +K TH +     +  +  V N GS+ G+ ++L F T      AP K+L AF+K+++   
Sbjct: 652 YLKTTHDE----VVNYEATVTNSGSRPGSVSVLAFITSSVPG-APMKELFAFKKIYLEPT 706

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
               V   +   K  + VD  G R+I  G + I IG   H +
Sbjct: 707 ESVDVSF-VAEPKVFTTVDIYGIRKIRPGAYKIIIGDDDHHI 747


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  573 bits (1478), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 283/509 (55%), Positives = 360/509 (70%), Gaps = 5/509 (0%)

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MCSYNQVNG PTCAD ++L   IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           I+AGLDL+CG FL  HT +AVQ G LSE D++ A+ N L   MRLG FDG+P   P+G+L
Sbjct: 61  IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           GP DVCTP +QELA EAARQGIVLLKN G  LPLS    +++AVIGPN++ + TMIGNY 
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYE 179

Query: 433 GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSI 491
           G  C YTTPLQG+G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQSI
Sbjct: 180 GTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSI 239

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
           E E+LDR  LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK+  +IAAI+W GY
Sbjct: 240 ERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGY 299

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PG+AGG AIAD+LFG  NP G+LP+TWYP+ + T +PMT+M MRP  S  YPGRTYRFY 
Sbjct: 300 PGEAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYT 358

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
           G  VY FG G+SYT+F H + +AP  +A+ L   H  +  T    +++   A C  L   
Sbjct: 359 GDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFD 416

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           V + V+N G + G HT+ +FS+PPA H AP K L+ FEKV +  G    V   + VCK L
Sbjct: 417 VHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDL 476

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           SVVD  G R++ LG H +H+G  KH+++L
Sbjct: 477 SVVDELGNRKVALGSHTLHVGDLKHTLNL 505


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 301/719 (41%), Positives = 423/719 (58%), Gaps = 57/719 (7%)

Query: 57  IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
           + R+ L + V LL++ AA  P + I  YEWW+EALHGV+ + PG  F G    ATSFPQV
Sbjct: 12  LTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGVTFKGSITAATSFPQV 70

Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
           ++TA+SFN +L+  I  V+S EARA +N   AGLT+W+PNVNIFRDPRWGRGQETPGEDP
Sbjct: 71  LSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGRGQETPGEDP 130

Query: 177 VLSGKYAASYVRGLQGS--------DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            L+G+YA ++VRGLQG         +   LK+++ CKHF+AY  +    V R   NA V+
Sbjct: 131 YLTGEYAVAFVRGLQGEGMEGREVENSKFLKISSCCKHFSAYSQE----VPRHRNNAMVT 186

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
           KQD  DT+   F  CV  G V+S+MCSYN VNG+P+CAD  +L   +RG+W+ +GYI SD
Sbjct: 187 KQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRGQWKFDGYIASD 246

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
           C++V    D  H+T +PE+  A  + AG+DL+CG FL  H   A+++G+++   I+NAL 
Sbjct: 247 CEAVADVIDHHHYTQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQGIVTTEMIHNALK 306

Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS- 407
           N   V MRLGMF+     +P+ ++    V T  H++LALEAARQ IVLLKN G +LPL+ 
Sbjct: 307 NQFRVLMRLGMFE---KVEPFANITKDSVDTTMHRQLALEAARQSIVLLKNDGNTLPLAT 363

Query: 408 --HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACAD 464
               R R++A+IGP+ + +  ++GNY GI     TPL+GI ++   + H  GCK V+   
Sbjct: 364 KDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHSLGCK-VSGEV 422

Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
              F  AI  +++AD  I+ +GLDQS E E +DR  + LP  Q  L+ +V   +  P + 
Sbjct: 423 LPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLLKRVLEVASHPIVF 482

Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           V++SGG +D++  KN P++ AI++ GY GQAGG A+AD+LFG  NP GKLP T+Y  EY+
Sbjct: 483 VVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPSGKLPQTFYDSEYV 542

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
             + + +M MRP+      GRTYRF+ G  VY FG G+SYT F                 
Sbjct: 543 NAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTF----------------- 585

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP-AGHWA-PH 702
                            H  C+       + V N G+  G   +L +  PP AG    P 
Sbjct: 586 -----------------HKNCHACVATFNITVTNAGAISGEDVILTYVEPPLAGEGGRPL 628

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           K LVAFE+  + A  Q+         K  ++ + +G   +  G   IH+   +H V + 
Sbjct: 629 KSLVAFERTPLIAAGQRATAKICLEAKAFALANEAGNWVVEPGNWTIHVDTLQHKVDIQ 687


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  563 bits (1452), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 305/726 (42%), Positives = 425/726 (58%), Gaps = 43/726 (5%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +C   L    RV DL+ RL+L+EK+  L + A+A+ RLGI GY+WWSE LHGV+ V PG 
Sbjct: 37  YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVA-VSPGL 95

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 161
             GG+    TSFPQ+ITTASSFN +L+  IG  VS EAR   + G  GLTY++PN+NI R
Sbjct: 96  HLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTPNINIVR 155

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW-NGVDR 220
           DPRWGRGQET GEDP L+ +YA + VRG QG+D +  K+ A+CKHF AYDL+++ NG  R
Sbjct: 156 DPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEYKKIIATCKHFAAYDLESYINGDVR 215

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA+V+KQD+E+T+   FR CV  G V S+MCSYN VNGVP+C D     +  R +W+
Sbjct: 216 DSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFNNKIARNKWK 275

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +GY+VSDC ++    +  H+TSTP +  A  ++ G DL+CG F   H   A   G ++E
Sbjct: 276 FDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKGGTDLNCGSFYQTHAMDAFLNGSITE 335

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +DI+ A+    T +MRLG+FD  P  QPY +     V T  HQ+LAL+AAR+ IVLL+N 
Sbjct: 336 VDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQDLALQAARESIVLLQNN 394

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCKD 459
           G  LPLS+  H  +AV+GPN    VTM G    IA    +P+ G   +     +  GC D
Sbjct: 395 G-KLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDGFKSKGLHVTYSLGC-D 452

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV-----SKV 514
           V C     F  A    + A A + VMGLDQ IE E +DR  + LPG Q + +     +  
Sbjct: 453 VKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDKFLLGLRDTLT 512

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           ++ S  P I+V+MSG  +D++ +K+     AI+W GYPGQ+GG AIA++++G  NP G+L
Sbjct: 513 NLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSGGQAIAEVIYGEVNPSGRL 570

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+T+YP EYI  +    M+MR       PGRTYRFY    V+PFGHG+SYT F  +  N 
Sbjct: 571 PLTFYPGEYIDLVAYRHMSMREP-----PGRTYRFYTENPVFPFGHGLSYTTFELSWTN- 624

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL--VFS 692
                     +  ++   +   ++ +         +   + V N G   GA ++L  V S
Sbjct: 625 ----------KMNNVTEIVISDSVDI--------NIDFDITVVNTGYLSGAVSVLGYVSS 666

Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             P    AP ++L  F+KV +     +++ +        + VD  G R I  GE++I I 
Sbjct: 667 NIPD---APLRELFDFDKVFIDKYESKKISL-FATNDAFTTVDEKGRRNILPGEYDIAIE 722

Query: 753 GTKHSV 758
              H +
Sbjct: 723 NLSHKI 728


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  563 bits (1450), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 317/732 (43%), Positives = 438/732 (59%), Gaps = 36/732 (4%)

Query: 33  KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           +DA  R LPFC  +L   QR +DL+GRL+LQEK+    + A  V RLG+  YEWWSEALH
Sbjct: 28  EDAALRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALH 87

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVI--------TTASSFNATLWEAIGRVVSDEARAMYN 144
           GV+   PG  F G+ P +T FPQ+I           ++FN     A+ +V+S EARA  N
Sbjct: 88  GVAE-SPGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAFAN 146

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAAS 203
            G AGLTY++PN+NIFRDPRWGRGQETPGEDP L+ +Y  + V+ LQ G D   LKV A+
Sbjct: 147 AGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGEDARYLKVVAT 206

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKH+TAYD+++W G+DRFHFNA VS QD+ +TF  PF  CV  GK AS+MCSYN VNG+P
Sbjct: 207 CKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNGIP 266

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           +CAD  I     R +W  +GYIVSDC ++     T ++T+T +   A  I+ G DLDCG 
Sbjct: 267 SCADDFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGIQGGCDLDCGD 326

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           F   H   A+    L E D++ +L      ++RLG FD   S QPY  +    + + +HQ
Sbjct: 327 FYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDAA-SIQPYRQIPVSAINSQEHQ 385

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           ELAL+ AR+ IVLL N   +LP S    R +A+IGPN+D   T++GNY G A    TPL+
Sbjct: 386 ELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITPLK 445

Query: 444 GIGRYARTI---HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
           G  +   T+     +GC DV   D   F AA  A++ ADATI+V+GL+Q++E+E LDR  
Sbjct: 446 GFQQLDPTLSITFVKGC-DVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRTT 504

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG Q EL+  ++ A++GP ILV+MSG PID++   +  R  A +W GYPGQAGG A+
Sbjct: 505 LVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIHPVR--AALWIGYPGQAGGRAL 562

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           A+ +FG  +P G+LP T YP +Y+  LPMT M MR       PGRTYRFY G  ++ FGH
Sbjct: 563 AEAVFGVFSPAGRLPFTVYPADYVNQLPMTNMDMRAG-----PGRTYRFYTGTPLFEFGH 617

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SY+ F +T +N+ +  +        S++            A    ++   +V V+N G
Sbjct: 618 GLSYSTFQYTWSNS-SSSSSSSATSQHSLSTAALAAQHLAARAPVEAVSF--RVLVQNTG 674

Query: 681 SKDGAHTLLVFSTPPAGHWA----------PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
                  +L F++  A              P + LV F ++H+  GA Q +   +   + 
Sbjct: 675 KMASDDVVLAFASFNASSIIDQSSSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQ- 733

Query: 731 LSVVDRSGTRRI 742
           L+ VD +G + +
Sbjct: 734 LAQVDSTGAQTL 745


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  549 bits (1414), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 264/411 (64%), Positives = 316/411 (76%), Gaps = 3/411 (0%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           FACD  +AT  +  FC  S     R  DL+ RL+L EKV  L+   AA+PRLG+  YEWW
Sbjct: 40  FACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWW 99

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHGVS VGPGT+F    PGATSFPQ I TA+SFNATL+ AIG VVS+EARAM+N G 
Sbjct: 100 SEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGL 159

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCK 205
           AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV GLQG  S    LKVAA CK
Sbjct: 160 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALKVAACCK 219

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+TAYD+DNW GV+R+ F+A VS+QD++DTF  PF+ CV++G VASVMCSYNQVNG PTC
Sbjct: 220 HYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNGKPTC 279

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           AD ++L   IRG+W+LNGYI SDCDSV V Y+ QH+T TPE+AAA +I+AGLDL+CG FL
Sbjct: 280 ADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGTFL 339

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             HT +AVQ G LSE D++ A+ N L   MRLG FDG+P   P+G+LGP DVCTP +QEL
Sbjct: 340 AQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTPSNQEL 399

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           A EAARQGIVLLKN G  LPLS    +++AVIGPN++ + TMIGNY G +C
Sbjct: 400 AREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGTSC 449


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 305/725 (42%), Positives = 416/725 (57%), Gaps = 52/725 (7%)

Query: 27   PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
            P  CD   A  +  PFC  SL +  R+ D+I RLS+Q+KV L  + A A    G+  Y+W
Sbjct: 913  PRPCDELPA--KNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQW 970

Query: 87   WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
            WSEALHGV    PG  F G    ATSFPQVI T++SFN TLW  IG  +S EARAM N  
Sbjct: 971  WSEALHGVG-FSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVN 1029

Query: 147  TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCK 205
             AGLT+W+PN+NI RDPRWGRGQETPGEDP  +G YAA++V G+Q G D   +K ++ CK
Sbjct: 1030 QAGLTFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGEDTRYIKASSCCK 1089

Query: 206  HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
            HF  Y+L++W+ VDR HFNA  + QDI DT+   F  CV  G+ +S+MCSYN VNGVP+C
Sbjct: 1090 HFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVPSC 1149

Query: 266  ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
            A+ +I+    R  W  +GYI SDC +V   Y    + +T        + AG+D+DCG FL
Sbjct: 1150 ANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGVLSAGMDVDCGSFL 1209

Query: 326  GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
              H   A+  G ++   ++ AL N   VQ RLGMFD     QPY +L    V TP+HQ+L
Sbjct: 1210 SQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFD-PAEDQPYLNLTTDAVNTPEHQQL 1268

Query: 386  ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            ALEAARQG+ LL+N+   LPL     + +A+IGPN++ T  M GNY G A    +P QG+
Sbjct: 1269 ALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQGV 1328

Query: 446  GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
             +Y   +              L   A+ A++ AD  ++V+GLDQ+ E+E  DR  + LPG
Sbjct: 1329 QQYVSNV-------------ALELGAVTAAKAADTVVMVIGLDQTQESEGHDREIIALPG 1375

Query: 506  RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
             Q ELV++V+ AS  P ++V+M+GG +D+   K+   +         GQAGG A+A+ LF
Sbjct: 1376 MQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAETLF 1426

Query: 566  GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
            G +NPGG+LP T YP + +  + M +  MRP+ +   PGRTYRFY G  VY +G G+SYT
Sbjct: 1427 GDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGLSYT 1486

Query: 626  NFVHTVANAPTVVA-------VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
            +F +  +     V+       V   G+   I   +  +               + V V+N
Sbjct: 1487 SFSYETSTPSLRVSAERVRAWVAARGQTSFIRDEVDAEDY-------------ITVTVQN 1533

Query: 679  VGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY-LSVVD 735
             G+  GA  + VF  +T P     P K L  FE+V +  G  +   I   V  + LSVV+
Sbjct: 1534 NGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPG--ETTSIQFPVTPHDLSVVN 1591

Query: 736  RSGTR 740
              G R
Sbjct: 1592 SRGER 1596


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  539 bits (1389), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 291/675 (43%), Positives = 390/675 (57%), Gaps = 41/675 (6%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R   L+  L+L EK+ L+   A+ V RL I  Y+WWSEALHGV+   PG  F    P A
Sbjct: 3   ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-ASPGVVFQEPTPFA 61

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
           T+FPQV  TA SF+  L+  I  ++S EAR M N   A LTYWSPNVN++RDPRWGRGQE
Sbjct: 62  TAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQE 121

Query: 171 TPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           TPGEDP L   YA  +VRGLQ G D   LKV+A CKH++AYDL+NW+GV+RF F+A VS 
Sbjct: 122 TPGEDPFLVATYAVEFVRGLQEGEDPRYLKVSACCKHYSAYDLENWHGVERFEFDAIVSD 181

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ DTF VPF  CV +G V+S+MCSYN +NG+P CAD  +L  T RG W   GYI SDC
Sbjct: 182 RDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFEGYITSDC 241

Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
            ++       H+T+  +  A   +RA  DLDCG F   H   +V+ G L E ++++AL N
Sbjct: 242 GAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGFYQQHILHSVESGRLKEAEVDDALAN 301

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
              VQMRLG+FD     Q Y H G   + T +HQ +AL AAR+GI LLKNQ   LPLS +
Sbjct: 302 LFKVQMRLGLFD-PVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQNDFLPLS-L 359

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFG 469
           + + V V+GP ++    M+GNY GI     T  QG+    R +    C  V         
Sbjct: 360 KDKHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQGL----RNV----CDHVDVVKSL--- 408

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS- 528
            A+      D  ++ +GL+Q IE E LDR  LLLP  Q+ L+  +   +  P +L L+S 
Sbjct: 409 EALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPVVLTLLSG 468

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           GG +D++  + +  +  ++  GY G  GG AIA+++ G  NP G+L  T Y  +Y+TNL 
Sbjct: 469 GGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYNDYVTNLD 528

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
             +M MRP +   +PGRTYRF+ GPV++PFG G+SYT F H V               G 
Sbjct: 529 YFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAHAV-------------EIGQ 575

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWA-PHKQLV 706
           +            H   + L + V V V N GS+ G  ++L+F  +P AG    P K L 
Sbjct: 576 MR----------NHRLRSALAIDVYVKVTNTGSRQGDESVLLFVKSPLAGKQGYPLKSLA 625

Query: 707 AFEKVHVPAGAQQRV 721
            F +V +  G  Q V
Sbjct: 626 DFSRVSLAPGETQTV 640


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  538 bits (1385), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 266/571 (46%), Positives = 364/571 (63%), Gaps = 8/571 (1%)

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           L+ +A CKHFTAYDL+NW GV RF F+AKV++QD+ DT++ PF+ CV +G  + +MCSYN
Sbjct: 5   LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYN 64

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           +VNGVPTCAD N+L +T RG+W  NGYI SDCD+V + +D Q +   PE+A AD ++AG+
Sbjct: 65  RVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAGM 124

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           D++CG ++  H  SA Q+G ++  DI+ AL N   ++MRLG+F+G P    YG++G   V
Sbjct: 125 DVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQV 184

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           C  +HQ+LAL+AA+ GIVLLKN   +LPLS  +  +VAVIGPN +    ++GNY G  C 
Sbjct: 185 CKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPCI 244

Query: 438 YTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
             TP Q +  Y +     QGC    C    + G A+ A+  AD  +L MGLDQ+ E E +
Sbjct: 245 SVTPFQALQGYVKDATFVQGCNAAVCNVSNI-GEAVHAASSADYVVLFMGLDQNQEREEV 303

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  L LPG Q+ LV+KV+ A+K P ILVL+ GGP+DV FAKN+P+I AI+WAGYPGQAG
Sbjct: 304 DRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAG 363

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G AIA +LFG  NPGG+LP+TWYP+E+ T +PMT+M MR   S  YPGRTYRFYKG  VY
Sbjct: 364 GIAIAQVLFGEHNPGGRLPVTWYPKEF-TAVPMTDMRMRADPSTGYPGRTYRFYKGKTVY 422

Query: 617 PFGHGMSYTNFVHTVANA----PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            FG+G+SY+ + H  A+     P++  +       S   T+S    ++    C+RL    
Sbjct: 423 NFGYGLSYSKYSHRFASEGTKPPSMSGIEGLKATASAAGTVSYDVEEMGAEACDRLRFPA 482

Query: 673 QVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            V V+N G  DG H +L+F   P A    P  QL+ F+ VH+ A     V   +  CK+ 
Sbjct: 483 VVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCKHF 542

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           S     G + I  G H + +G  +  +S  A
Sbjct: 543 SRAAEDGRKVIDQGSHFVKVGDDEFELSFMA 573


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 266/594 (44%), Positives = 378/594 (63%), Gaps = 12/594 (2%)

Query: 178 LSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
           ++ KYA ++V+G+QG+    L+ +A CKH TAYDL++WNGV R++FNAKV+ QD+EDT++
Sbjct: 1   MASKYAVAFVKGMQGNSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYN 60

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
            PFR CV++ K   +MC+Y  +NGVP CA+ ++L +T+RG+W L+GYI SDCD+V +  D
Sbjct: 61  PPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRD 120

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
            Q +T TPE+A A A++AGLD++CG ++  H  +A+Q+G L+E DI+ AL N   ++MRL
Sbjct: 121 AQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRL 180

Query: 358 GMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAV 416
           G FDG+P S   YG LG  D+CTP+H+ LALEAA  GIVLLKN    LPL      + AV
Sbjct: 181 GHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAV 240

Query: 417 IGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVACADDQLFGAAIDAS 475
           IGPN++  + +IGNY G  C  TTPL GI  Y + +    GC   AC       AA  AS
Sbjct: 241 IGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS 300

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
             +D   L MGL Q  E+E  DR  LLLPG QQ L++ V+ A+K P ILVL++GGP+DV 
Sbjct: 301 -SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVT 359

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           FA+ +P+I AI+WAGYPGQAGG AIA +LFG  NPGG+LP+TWYP+E+ T +PMT+M MR
Sbjct: 360 FAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEF-TKVPMTDMRMR 418

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              +  YPGR+YRFY+G  VY FG+G+SY+++   + +     A        S+  T + 
Sbjct: 419 ADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGK-PAESYTNLLASLRTTTTS 477

Query: 656 KAIKVTHAK------CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAF 708
           +  +  H +      C +L     V+V+N G  DG H++L++   P A    P  QL+ F
Sbjct: 478 EGDESYHIEEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGF 537

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
              H+  G +  +  +I  C++ S V + G + I  G H + +   +  +   A
Sbjct: 538 RSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMVDKDELEIRFEA 591


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  532 bits (1371), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 304/742 (40%), Positives = 418/742 (56%), Gaps = 59/742 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           ++  +C  SLPI  RV DL+ R++L EK+  L + A ++ RL I  Y+WWSE LHGV++ 
Sbjct: 30  QSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVAD- 88

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
            PG  F G F  ATSFPQVITTASSFN TL+  I  V+S EARA  N    G+ Y+  + 
Sbjct: 89  SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIVYFKQHQ 145

Query: 158 NIF--------RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
            +         RDPRWGR QETPGEDP L+ +YA  +V G QG D   LKV  +CKHF  
Sbjct: 146 QLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQG-DSKYLKVVTTCKHFAG 204

Query: 210 YDLDNW-NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           YDL+++ +G  R  FNAK++ QD E+T+   F+ CV E  VAS+MCSYN+VNGVP+CAD 
Sbjct: 205 YDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYNEVNGVPSCADG 264

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
            I  +  R  W  +G+I SDC ++    +  H+T+  ++  A A++ G DL+CG +   H
Sbjct: 265 QINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNCGSYYQSH 324

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            +SA   G ++  +IN AL    T +M+LGMFD  P  QPY  + P  V + +HQ LAL 
Sbjct: 325 AQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQPYNAISPDVVNSLEHQALALN 383

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
           AAR+ IVLL+N    LPL+  +H T+AV+GP++  T  M GNY G+A    +P++G    
Sbjct: 384 AARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISPVEGFENL 443

Query: 449 A--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
                +   GC DV C     F  A D + +ADA I V+GLDQS E+E  DR  L LP  
Sbjct: 444 GIDSVLTASGC-DVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDREDLFLPNL 502

Query: 507 QQELVSKV-----SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           Q + V  +     +  +  P I+V+MSG  +D+   K      AI+WAGYPGQ+GG AIA
Sbjct: 503 QDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTKK--HADAILWAGYPGQSGGQAIA 560

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           +I++G  NP G+LP+T+YP  YI  +    M+MR      YPGRTY+FY     + FG G
Sbjct: 561 EIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMR-----EYPGRTYKFYNDTPDFSFGDG 615

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT F    +    +  V                         +  T+   V V N G 
Sbjct: 616 LSYTTFYLEWSKPVNMSGV----------------------RSVSYPTVVYNVTVTNTGK 653

Query: 682 KDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV-CKYLSVVDRSGT 739
             GA ++L + S   +G  AP K+L  FEKV +     Q V +      K  S VD+SG 
Sbjct: 654 MPGAISVLAYISYNNSG--APKKKLFGFEKVFL--NPLQSVSVTFPADSKAFSTVDKSGK 709

Query: 740 RRIPLGEHNIHIGGTK-HSVSL 760
           R +  G++++ IG    H +SL
Sbjct: 710 RSVNPGDYHVTIGDQLIHKISL 731


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 259/481 (53%), Positives = 332/481 (69%), Gaps = 5/481 (1%)

Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
           YIVSDCDS+G+ Y +QH+T TPEEAAA +I AGLDL+CG FLG HTE+AV++GL+ E  I
Sbjct: 45  YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           N A+ N     MRLG FDG P +QPYG LGPKDVCT +++ELA+E ARQGIVLLKN   S
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVAC 462
           LPLS    +T+AVIGPN++VT TMIGNY G+AC YTTPLQG+ R   T  + +GC +V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224

Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
            +  L  A   A+  ADAT+LVMG DQ+IE E LDR  L LPG+QQELV++V+ A++GP 
Sbjct: 225 TEADLDSAKTLAA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPV 283

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
           +LV+MSGG  D+ FAKND +I +I+W GYPG+AGG AIAD++FG  NP GKLPMTWYPQ 
Sbjct: 284 VLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQS 343

Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
           Y+  +PMT M MRP +S  Y GRTYRFY G  VY FG G+SYTNF H +  AP  V++ L
Sbjct: 344 YVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNL 403

Query: 643 DGRHGSIN---ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
           D      +    ++         A   R    VQ+ V+NVG ++G  T+ +F+TPP  H 
Sbjct: 404 DESQSCRSPECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHG 463

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +P KQL+ FEK+ +    +  V   + VCK L VVD  G R++ LG H +H+G  KHS +
Sbjct: 464 SPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFN 523

Query: 760 L 760
           +
Sbjct: 524 I 524


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 281/724 (38%), Positives = 420/724 (58%), Gaps = 36/724 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+L+EK+    + +  VPRLGI  Y+WW+EALHGV+   PG  
Sbjct: 41  CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
               G+F  ATSFPQ I   ++F+  L   +  VVS EARA  NGG +GL YW+PN+N +
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNINPY 160

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVD 219
           +DPRWGRGQETPGEDP     Y  + +RGL+G+  D   K+ A+CKHFT YD++NWNG  
Sbjct: 161 KDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYKKIVATCKHFTGYDMENWNGNF 220

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A+++ +D+ + +  PF+ C  E KV + MCSYN VNGVPTCADP +L+  +R  W
Sbjct: 221 RYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADPWLLQTVLREHW 280

Query: 280 RLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
             N    ++VSDCD++   Y    +  + E+A AD + AG DL+CG +   +   A ++G
Sbjct: 281 GWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDLNCGTYYQRYLPGAYEQG 340

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L+++  ++ AL  T +  ++LG FD    SQPY  +G +DV +   QELAL+AA++GIVL
Sbjct: 341 LINDTTLDRALTRTYSSLIKLGYFDNA-DSQPYRQIGWQDVNSQHAQELALKAAQEGIVL 399

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQ 455
           LKN G  LPLS     ++A+IG  ++ T  M GNYAG+A    +PL    +    + + +
Sbjct: 400 LKNDG-LLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAAEQLGVKVNYAE 458

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
           G        DQ +GA   A+  +D  I+V G+D  IE+E LDR  +   G Q ++++K++
Sbjct: 459 GASQSNPTTDQ-WGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWSGPQLDMITKLA 517

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I+V M  G +D     ++  I+A++W GYPGQ GGTA+ DI+ G   P G+LP
Sbjct: 518 TYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDIITGAVAPAGRLP 576

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           +T YP  Y   + MT+M++RPS +    GRTY++Y G  V+PFG G+ YTNF   + + P
Sbjct: 577 ITQYPARYTKEVAMTDMSLRPSSTSA--GRTYKWYNGTAVFPFGFGLHYTNFSAAIPSPP 634

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL----GVQVDVKNVGSKDGAHTLLVF 691
                       + +  IS      +    ++L L     + VD+ N G++      L F
Sbjct: 635 ------------ASSFAISDLVASCSANDTSKLDLCPFTSLAVDIANDGTRASDFVALAF 682

Query: 692 ST---PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
            T    P+ H  P   LVA++++H + AG  Q   +N+ +   L  VD +G + +  G++
Sbjct: 683 LTGEFGPSPH--PKSSLVAYQRLHAIAAGETQTARLNLTLGS-LVRVDENGDKLLYPGDY 739

Query: 748 NIHI 751
           ++ I
Sbjct: 740 SVLI 743


>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
          Length = 349

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 247/349 (70%), Positives = 288/349 (82%), Gaps = 2/349 (0%)

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
           DVTVTMIGNYAG+ACGYTTPLQGIGRY RTIHQ GC DV C  +QLFGAA  A+RQADAT
Sbjct: 1   DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +LVMGLDQSIEAE +DRAGLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV FAKNDP
Sbjct: 61  VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
           RI+AIIW GYPGQAGGTAIAD+LFGT+NPGGKLPMTWYPQ Y+T+LPMT+MAMR   ++ 
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARG 180

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           YPGRTYRFY+GPVV+PFG G+SYT F H +A+ PT+V+VPL     + N+T+  KA++V+
Sbjct: 181 YPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVS 240

Query: 662 HAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
           HA CN L+ L V VDVKN GS DG HTLLVF++PP G WA  KQL+ F K+H+ AG+++R
Sbjct: 241 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIAAGSEKR 300

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           V I +HVCK+LSVVDR G RRIPLGEH + IG   H VSL    LG IK
Sbjct: 301 VRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQ-TNLGEIK 348


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  499 bits (1285), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 243/375 (64%), Positives = 291/375 (77%), Gaps = 4/375 (1%)

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG---DRL 198
           MYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+LS KYA+ YVRGLQ SD    DRL
Sbjct: 1   MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 60

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K+AA CKH+TAYDLDNW GVDRFHFNA V+KQD++DTF  PF+ CV++G VASVMCSYNQ
Sbjct: 61  KIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 120

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
           VNG P CADP++L   +RGEW+LNGYIVSDCDSV V+Y++QH+T TPEEAAA AI AGLD
Sbjct: 121 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 180

Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
           L+CG FLG HTE+AV+ GL+ E  ++ A+ N     MRLG FDG PS   YG LGPKDVC
Sbjct: 181 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 240

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           T +HQELA EAARQGI+LLKN   SLPLS    +T+A+IGPN++VT TMIGNY G  C Y
Sbjct: 241 TLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKY 300

Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           TTPLQG+     T +  GC +VAC+  Q+   A   +  ADAT+L++G+DQSIEAE  DR
Sbjct: 301 TTPLQGLMALVATTYLSGCSNVACSTAQI-DEAKKIAAAADATVLIVGIDQSIEAEGRDR 359

Query: 499 AGLLLPGRQQELVSK 513
             + LPG+Q  L+++
Sbjct: 360 VNIQLPGQQPLLITE 374


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 295/730 (40%), Positives = 417/730 (57%), Gaps = 29/730 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R N L+  L+L+E V   ++ +  VPRLG+  Y WWSEALHGV+ + PGT 
Sbjct: 38  CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPGTN 96

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G  F  ATSFPQ I   ++F+  L  +I  V+S EARA  N G AGL +++PN+N 
Sbjct: 97  FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTPNINP 156

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNG 217
           F+DPRWGRGQETPGEDP    +Y    V GLQG  S     KV A CKHF  YDL+NW G
Sbjct: 157 FKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQGGLSPDPYYKVIADCKHFAGYDLENWEG 216

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R  FNA +S QD+ + +   F+ CV +  V SVMCSYN VNG+P+CA+  +L+  IRG
Sbjct: 217 NSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSCANSYLLQDIIRG 276

Query: 278 EWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            + L +G+I SDCD+V   +    +T+T   A+A A++AG D+DCG         AV + 
Sbjct: 277 HFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDVDCGTTYSQTLVDAVDQN 336

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++E DI N+++      +RLG FD  P+ QP+  LG  DV TP  Q LAL AA +G+ L
Sbjct: 337 LVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNTPSSQALALTAAEEGVTL 395

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPLS    R +A++GP ++ T  M GNY GIA    +PLQ +      +    
Sbjct: 396 LKNDG-TLPLSSAIKR-IALVGPWANATTQMQGNYQGIAPFLVSPLQALQDAGFQVTFAN 453

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
              +   DD  F AA+ A + ADA I   G+D++IE+E  DR  +  PG Q +LVS+++ 
Sbjct: 454 GTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDREIITWPGNQLDLVSQLAA 513

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P +++ M GG +D +  K++  + A+IW GYPGQ+GG AI +IL G   P G+LP+
Sbjct: 514 VGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAAIVNILTGKIAPAGRLPI 572

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANAP 635
           T YP +Y+  +PMT+MA+RP+ +   PGRTY+++ G  ++ FG G+ YT F +      P
Sbjct: 573 TQYPADYVNEIPMTDMALRPNGTS--PGRTYKWFTGTPIFGFGFGLHYTTFSLDWAPTPP 630

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTP 694
           +  A+          +T+  +A     +  N   L   +V+VKN G     +  L+FS  
Sbjct: 631 SSFAI----------STLVSEANTAGVSFTNLAPLFTFRVNVKNTGKVGSDYVALLFSNT 680

Query: 695 PAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI-- 751
            AG   AP KQLV++ +V   A  Q            ++ +D +G   +  G +NI +  
Sbjct: 681 TAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVTLGSIARIDENGDSALYPGRYNIWVDT 740

Query: 752 -GGTKHSVSL 760
            G   HS  L
Sbjct: 741 TGDIVHSFEL 750


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  497 bits (1279), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 286/736 (38%), Positives = 402/736 (54%), Gaps = 80/736 (10%)

Query: 33  KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           +DA  + LPFC  SL   QR +DL+ RL+L + +  +   A AVP LGI  Y WW+E LH
Sbjct: 6   QDAPLKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLH 65

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY 152
           GV      TK G + P  TSFP      ++FN  L   + R +S+EARA+ N G  GL +
Sbjct: 66  GVL-----TKCGTNCP--TSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGLDF 118

Query: 153 WSPNV-----------------------NIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
           W+PN+                       +I RDPRWGR  E PGEDP ++ +Y A ++RG
Sbjct: 119 WAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRG 178

Query: 190 LQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
           LQ G D    +V  +CKHF AY L+ W   DRF F+A VS  D  +T+   F+ C++EG+
Sbjct: 179 LQEGEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCIVEGR 238

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
             S+MCSYN VNGVP+CA+  +L+  +R  W  +GY+VSDCD+V   Y+  HFT TPE A
Sbjct: 239 ARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHFTKTPEGA 298

Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
            A A+ AG DL+CG F   H   A   G ++E ++  A+      +M LGM+D  P+ QP
Sbjct: 299 CAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWD-PPAEQP 357

Query: 369 YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
           Y    P  V + +H +LAL+AAR+ +VLL+N+   LPL     R VAVIGPN++ T TM+
Sbjct: 358 YKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKSVRR-VAVIGPNANATETML 416

Query: 429 GNYAGIACGYTTPLQGIGRY---------ARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           GNY G  C   T    +  Y         A   +  GC DV   +      A+ A++ AD
Sbjct: 417 GNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIPEAVKAAQAAD 475

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
             I+V+GL+ S+E+E  DR  + LPG Q  L+  + +A+  PT++V+M GG + + + K+
Sbjct: 476 VAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSI-VATNTPTVVVMMHGGAVAIEWIKD 534

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPG----------GKLPMTWYPQEYITNLPM 589
             ++  I+ A YPG+ GG AIAD+LFG  NPG          G+LP+T  P  Y+  +P+
Sbjct: 535 --QVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANYVDMVPL 592

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPV-VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
           T M+MR S +   PGRTYR+Y GP  ++ FG G+SYT F     + P   A+    R  +
Sbjct: 593 TNMSMRASGNN--PGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLSTPQPSALKSYARDEA 650

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
           ++                      +V V NVG   G   +L F T       P KQL AF
Sbjct: 651 VS---------------------FRVRVTNVGPVAGDEVVLAFVTRDNADRGPLKQLFAF 689

Query: 709 EKVHVPAGAQQRVGIN 724
           E+VH+  G  + +  N
Sbjct: 690 ERVHLNPGESKEIFFN 705


>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 343

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 238/344 (69%), Positives = 273/344 (79%), Gaps = 1/344 (0%)

Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
           MIGNYAG+ACGYTTPLQGI RYA+T+H  GC DV C  +Q F AA  A+R ADATILVMG
Sbjct: 1   MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60

Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
           LDQSIEAE  DR GLLLPG QQELVS+V+ AS+GPTILVLMSGGPIDV+FAKNDPRI AI
Sbjct: 61  LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           +W GYPGQAGG AIAD+LFGT+NPGGKLPMTWYP +Y+  +PMT M MR   S+ YPGRT
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGRT 180

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
           YRFYKGPVV+PFGHGMSYT F H++  AP  V+VPL   H S N T +  AI+V+HA C 
Sbjct: 181 YRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNAIRVSHANCE 240

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
            L LGV +DVKN G  DG HTLLVFS+PP G W+  KQL+ FEKVH+  G+Q+RV I+IH
Sbjct: 241 ALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIH 300

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           VCK+LSVVDR G RRIP GEH ++IG  KHS+SL  ATL  IKS
Sbjct: 301 VCKHLSVVDRFGIRRIPNGEHYLYIGDLKHSISLQ-ATLEEIKS 343


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 309/783 (39%), Positives = 412/783 (52%), Gaps = 115/783 (14%)

Query: 30  CDPKDATTRTL-----PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGY 84
           CDP       L     P+C  SLP   RV DL+GR++L+EKV  L   A   PR+G+  Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 85  EWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--- 141
                        G G +       A         A            RVV  +A A   
Sbjct: 107 ------------CGGGRRCTACPTSARRDVVWRRRARRHQLPARHQQRRVVQRDAVARHR 154

Query: 142 ---------MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
                    MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q 
Sbjct: 155 RRGVDGDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQD 214

Query: 193 SDGDR---------------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
            DG                 +KV++ CKH+ A                            
Sbjct: 215 IDGATTAASAAAATDAFSRPIKVSSCCKHYAA---------------------------- 246

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY- 296
                         VMCSYN++NGVP CAD  +L  T+R +W+L+GYIVSDCDSV V   
Sbjct: 247 -------------CVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVR 293

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVN 349
           D +    T  EA A A++AGLDLDCG F  G+H         AV++G L E  ++NAL N
Sbjct: 294 DAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTN 353

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
                MRLG FDG P  +    LG  DVCT +H+ELA +AARQG+VLLKN    LPLS  
Sbjct: 354 LYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPE 410

Query: 410 RHRTVAVIGPNSDVTVT--MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
           +  +VA+ G    +  T  M+G+Y G  C   TP  G+ +   +     C   +C     
Sbjct: 411 KVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC----- 465

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
              A  A++  DATI+V GL+ S+E E+ DR  LLLP  Q   ++ V+ AS  P +LV+M
Sbjct: 466 -DTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIM 524

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           S G +DV+FA+++P+I A++WAGYPG+ GGTAIAD+LFG  NPGG+LP+TWY  EY++ +
Sbjct: 525 SAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKI 584

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG-- 644
           PMT MA+RP     YPGRTY+FY G  V+YPFGHG+SYTNF +  A A   V V +    
Sbjct: 585 PMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWE 644

Query: 645 --RHGSINATISG----KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG- 697
             +  +  A +S      A+ V    C    +   V V N G +DG H + +++ PPA  
Sbjct: 645 YCKQLTYKAGVSSPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEV 703

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
             AP KQLVAF +V V AGA   V   ++VCK  ++V+ +    +P G   + +G    S
Sbjct: 704 DGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALS 763

Query: 758 VSL 760
           +S 
Sbjct: 764 LSF 766


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 295/741 (39%), Positives = 424/741 (57%), Gaps = 39/741 (5%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           ++ P C  +     R   LI   +  E ++   + +  VPRLG+  YEWWSEALHGV + 
Sbjct: 33  KSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH- 91

Query: 98  GPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP 155
            PG  F   GDF  ATSFPQ I   ++F+  L +A+  VVS EARA  N G AGL Y++P
Sbjct: 92  SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFTP 151

Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLD 213
           N+N F+DPRWGRGQETPGEDP    +Y    V GLQG       +KVAA CKHF AYDL+
Sbjct: 152 NINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVAADCKHFAAYDLE 211

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           NW G+DRFHF+A+VS+QD+ + +  PF+ CV + K ASVMCSYN VNGVP CA   +L+ 
Sbjct: 212 NWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACASTYLLQD 271

Query: 274 TIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
            +R  W  +   ++ SDC ++   +D+ +FT +  EAAA +++AG D+DCG     H  +
Sbjct: 272 ILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPA 331

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAA 390
           A+ + L+S  D+  A +   T  +RLG FD  PS SQ Y      DV TP+ Q L+  AA
Sbjct: 332 ALNQSLISRDDLTRAFIRQYTSLIRLGYFD--PSDSQTYRQFDWSDVNTPEAQALSRRAA 389

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
            +G+VLLKN G  LPL+    +T+A+IGP ++ T +M GNY G A   T+P QG      
Sbjct: 390 VEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPIITSPFQGAQDVGF 447

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
            +       V       F  AI+ ++ AD  + V G+D ++E E LDR+ +  PG Q +L
Sbjct: 448 KVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDL 507

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V  ++   K P I+V   GG +D      + ++ AIIWAGYPGQ+GGTAI DI+ G++ P
Sbjct: 508 VKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTAIFDIIVGSTAP 566

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
            G+LP+T YP +Y   + MT+M++RPS     PGRTY++YK PV+  +GHG+ +T F  +
Sbjct: 567 AGRLPVTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYKTPVL-EYGHGLHFTTFDFS 623

Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GVQVDVKNVGSKDGAHTL 688
               P   A   D +          + I+ +H+K   L      ++ V+N G+    +  
Sbjct: 624 WQRQP---AAEYDIQ----------ELIRASHSKFLDLAHFDTFEICVRNTGNITSDYVG 670

Query: 689 LVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
           L+F   +T P  H  P K LVA+ +VH +  G    + + + +   ++ VD++G   +  
Sbjct: 671 LLFLSGNTGPGPH--PIKSLVAYSRVHDIQGGTSATLTLKVTLGS-VARVDKNGDLWLFP 727

Query: 745 GEHNIHIGGTKHSVSLHAATL 765
           G + + +  TK  V  H   L
Sbjct: 728 GPYRL-VLDTKDGVLTHPFRL 747


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  494 bits (1273), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 274/710 (38%), Positives = 398/710 (56%), Gaps = 23/710 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   LI   +L EK+    S +  VPRLG+  YEWW EALHGV++  PG  
Sbjct: 39  CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVAS-SPGVN 97

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   ++F+  L   +  V+S EARA  N   AGL +W+PN+N F
Sbjct: 98  FSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNINPF 157

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           +DPRWGRGQETPGEDP     Y  S +RGLQG +    KV A+CKHF AYD++NWNG  R
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDNPSYKKVVATCKHFVAYDVENWNGNFR 217

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
           +  +A ++ QD+ + +  PFR C  +  V + MCSYN +NGVPTCADP +L+  +R  W 
Sbjct: 218 YQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPYLLQTVLREHWN 277

Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
                 ++ SDCDSV   +   ++ S+ EEAAA +++AG D++CG +   H   A ++GL
Sbjct: 278 WTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDINCGTYYQEHLPRAYEQGL 337

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E D++ +L+      +RLG FDG+    PY +L   DV TP  Q+LAL+AA  GI LL
Sbjct: 338 INETDVDTSLIRQYGSLIRLGYFDGD--RVPYRNLTWNDVSTPYAQDLALKAATSGITLL 395

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQG 456
           KN G  LPL       +A+IG  ++ T  M+GNY GI   + +PL    +  A   + QG
Sbjct: 396 KNDG-ILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQTGAEVTYVQG 454

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
               +      +     A+ ++D  I + G+D+ +EAE  DR  +   G Q +++ +++ 
Sbjct: 455 PGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGPQLDVIGQLAD 514

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
               PTI+V M GG +D +    +P I A++W GYPGQ GG AI DIL G S P G+LP+
Sbjct: 515 YYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQGISAPAGRLPI 574

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T Y  +YI+ +PMT+ ++RP+ +   PGRTY +     V+ FG+G+ YTNF  T+     
Sbjct: 575 TQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGLHYTNFTATIP---- 630

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPP 695
                 D        +I   A   T +  +R       +DV N GS    +  L F T  
Sbjct: 631 ------DAESSDTTYSIDSLASDCTESYLDRCPFKTFSIDVTNTGSVTSDYVTLGFLTGA 684

Query: 696 AG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            G    P+K+LV+++++H + AG+ Q   +N+ +     V D+  T   P
Sbjct: 685 HGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLGSLSRVDDKGNTVLFP 734


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  494 bits (1271), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 296/748 (39%), Positives = 425/748 (56%), Gaps = 39/748 (5%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D  +   ++ P C  +     R   LI   +  E ++   + +  VPRLG+  YEWWSEA
Sbjct: 26  DCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85

Query: 91  LHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
           LHGV +  PG  F   GDF  ATSFPQ I   ++F+  L +A+  VVS EARA  N G A
Sbjct: 86  LHGVGH-SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKH 206
           GL Y++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG       +KVAA CKH
Sbjct: 145 GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVAADCKH 204

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           F AYDL+NW G+DRFHF+A+VS+QD+ + +  PF+ CV + K ASVMCSYN VNGVP CA
Sbjct: 205 FAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACA 264

Query: 267 DPNILKRTIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
              +L+  +R  W  +   ++ SDC ++   +D+ +FT +  EAAA +++AG D+DCG  
Sbjct: 265 STYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGST 324

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS-SQPYGHLGPKDVCTPDHQ 383
              H  +A+ + L+S  D+  A +   T  +RLG FD  PS SQ Y      DV TP+ Q
Sbjct: 325 FADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFD--PSHSQTYRQFDWSDVNTPEAQ 382

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            L+  AA +G+VLLKN G  LPL+    +T+A+IGP ++ T +M GNY G A   T+P Q
Sbjct: 383 ALSRRAAVEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPFITSPFQ 440

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
           G       +       V       F  AI+ +R AD  + V G+D ++E E LDR+ +  
Sbjct: 441 GAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDRSSISW 500

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG Q +LV  ++   K P I+V   GG +D      + ++ AIIWAGYPGQ+GGTAI DI
Sbjct: 501 PGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGTAIFDI 559

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           + G + P G+LP+T YP +Y   + MT+M++RPS     PGRTY++YK PV+  +GHG+ 
Sbjct: 560 IVGATAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYKTPVL-EYGHGLH 616

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GVQVDVKNVGS 681
           +T F  +    P   A   D +          + I+ +H+K   L      ++ V+N G+
Sbjct: 617 FTTFDFSWQRQP---AAEYDIQ----------ELIRASHSKFLDLAHFDTFEICVRNTGN 663

Query: 682 KDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRS 737
               +  L+F   ++ P  H  P K LVA+ +VH +  G    + + + +   ++ VD++
Sbjct: 664 ITSDYVGLLFLSGNSGPGPH--PIKSLVAYSRVHDIQGGTSATLTLKVTLGS-VARVDKN 720

Query: 738 GTRRIPLGEHNIHIGGTKHSVSLHAATL 765
           G   +  G + + +  TK  V  H   L
Sbjct: 721 GDLWLFPGPYRL-VLDTKDGVLTHPFRL 747


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 289/735 (39%), Positives = 417/735 (56%), Gaps = 34/735 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   +T   C  S     R   L+   +L+EK+    SG+  VPRLGI  Y+WWSE 
Sbjct: 23  DCQNGPLQTETICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEG 82

Query: 91  LHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           LHG++  GP T F   G ++  +TSFPQ I   ++F+  L   + +V+S EARA  N   
Sbjct: 83  LHGIA--GPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANR 140

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKH 206
            GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQG   D  K V A+CKH
Sbjct: 141 TGLDFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYKRVVATCKH 200

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           F  YD+++WNG  R+ F+A++S+QD+ + +  PF+ CV +  V + MCSYN VNGVPTCA
Sbjct: 201 FAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCA 259

Query: 267 DPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           DP +L+  +R  W       ++ SDCD+V   Y    +++T E+A ADA+ AG DLDCG 
Sbjct: 260 DPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDLDCGT 319

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           ++  H   A  +GL++E  ++ ALV   +  +RLG FD + + QPY   G   V T   Q
Sbjct: 320 YMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVATDASQ 378

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA  AA +GIVLLKN G  LPLS     ++ V G  ++ T  ++GNYAG+     +PL 
Sbjct: 379 ALARRAAVEGIVLLKNDG-VLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLW 437

Query: 444 GIGRYARTIHQQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
            + +   TI+  G       D     + +   A   +D  I + G+D SIE E  DR  L
Sbjct: 438 ALQQENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHDRTSL 497

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
              G Q +++ +++   K PTI+V+M GG ID A   N+  I+AI+WAGYPGQ GG AI 
Sbjct: 498 AWTGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIV 556

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           DIL G S P G+LP T YP  Y + +PMT+M +RPS++   PGRTY++Y G   Y FGHG
Sbjct: 557 DILTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSENN--PGRTYKWYNGTATYEFGHG 614

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK-AIKVTHAKCNRLTLGVQVDVKNVG 680
           + YTNF  TV +       P+   +   +   + K A  +T  +C   +  V + V N G
Sbjct: 615 LHYTNFSATVTS-------PMQQSYRIADLMSTCKNATSITLERCAFTS--VDISVTNTG 665

Query: 681 SKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDR 736
           +    +  L +   S  PA H  P K LV ++++  + AGA     I++ + + L+ VD 
Sbjct: 666 AVASDYVTLCYISGSHGPAPH--PKKSLVGYQRLFGIAAGASDTARIDLTL-ESLARVDE 722

Query: 737 SGTRRIPLGEHNIHI 751
            G + +  GE+++ +
Sbjct: 723 VGNKVLYPGEYSLMV 737


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 289/734 (39%), Positives = 411/734 (55%), Gaps = 25/734 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  ++L+E V   ++ +  VPRLG+  YEWWSEALHGV++  PG  
Sbjct: 36  CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS-SPGVT 94

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   GDF GATSFP+ I  +++F+  L  ++   +S EARA  N   +GL +++PN+N F
Sbjct: 95  FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFTPNINPF 154

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP+ + +Y    + GLQG  G     K+ A CKHF AYDL+NW G 
Sbjct: 155 KDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGPSPYYKIIADCKHFAAYDLENWEGN 214

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
           +R  FNA VS QD+ + +   F+ CV + KV SVMCSYN VNGVP C  P +L+  +R  
Sbjct: 215 NRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPACGSPYLLQDLVRDY 274

Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           + L  + +I SDCD+VG  +D  ++T+T   A+A A+ AG D+DCG         AV  G
Sbjct: 275 FELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDVDCGTSYSETLGEAVSEG 334

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L+S+ D+  ALV      +RLG FD E  S PY  LG  DV TP  Q LA  AA +GIVL
Sbjct: 335 LVSKSDVERALVRLYGSLVRLGYFDPE-DSVPYRALGASDVNTPAAQTLAYTAAVEGIVL 393

Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LKN G  LPLS ++ H  +A+IGP ++ T  M GNY GIA    +PL G       +   
Sbjct: 394 LKNDG-LLPLSSNVSH--IALIGPWANATTQMQGNYEGIAPLLISPLDGFTSAGFNVSFT 450

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
               ++      F  A+  +  AD  + + G+D ++EAE  DR  +  PG Q EL+ ++ 
Sbjct: 451 NGTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSITWPGNQLELIGELG 510

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P +++ M GG +D    K +  + A++W GYPGQAGG A+ADI+ G   P G+L 
Sbjct: 511 AFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALADIITGVQAPAGRLT 569

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANA 634
            T YP  Y+  + MT+M++RPS S   PGRTY++Y G  V+ FG G+ YT F V     +
Sbjct: 570 TTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGLHYTTFDVEWAEGS 629

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFST 693
           P       D     + A+ +  +  V H     L T  VQ  V N G+    +  L+FS 
Sbjct: 630 PAASYSIQD-----LVASANSSSSAVAHVDSAILDTFTVQ--VTNTGNVTSDYVALLFSN 682

Query: 694 PPAG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
             AG   AP ++LV++ +V  +  G      +N+ +   ++ VD  G   I  G +N+ +
Sbjct: 683 TTAGPSPAPLQELVSYARVKGITPGVSATASLNVTLGT-IARVDEDGNSIIYPGVYNLWV 741

Query: 752 GGTKHSVSLHAATL 765
             T  + ++ +  L
Sbjct: 742 DTTGQAKAVTSFEL 755


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 284/727 (39%), Positives = 402/727 (55%), Gaps = 31/727 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C   LP  QR   ++   ++ E +  + + +  V RLG+  Y+WWSEALHG++     + 
Sbjct: 61  CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIAR----SN 116

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G++  ATSFPQ I    +FN  L++ +G V+  EARA  N G AGL ++SPN+N F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPF 176

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-----SDGDRLKVAASCKHFTAYDLDNW 215
           RD RWGRGQE   E PVL G YA +YV+GLQG      + D L+VAA+CKHF  YD+++W
Sbjct: 177 RDARWGRGQEVASESPVLVGNYALNYVQGLQGGLDSNQNDDTLQVAATCKHFVGYDMESW 236

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
           N   R  +NA +S QD+ D +   F+ CV + K A  MCSYN VNGVP CA    L   +
Sbjct: 237 NQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFLNTVL 296

Query: 276 RGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
           R  +   NG I SDCD++   ++   +      AAADAI+AG+D++CG     +   A+ 
Sbjct: 297 RDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGVDVNCGDTYQNNLGYALG 356

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
              ++E  I  ++    +  +RLG FD  P +  Y      DV TP   +LA +AA +GI
Sbjct: 357 NKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQAAVEGI 415

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
            LLKN G +LP +  + R VAVIGP ++ T  M+G+YAG      +PLQG       +  
Sbjct: 416 ALLKNDG-TLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEGFQVEY 474

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
                +   D   + AA++A++ ADA +   G+D S+E EALDR  L  PG Q +LVSK+
Sbjct: 475 ALGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQLDLVSKL 534

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           S   K P +++   GG ID    KN+  + AI++AGYPGQ+GGTAI DIL G   P G+L
Sbjct: 535 S-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKYAPAGRL 593

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
             T YP  Y   +PMT+M +RP Q   YPGRT+ +Y G  VY FG+G+ YT F  ++ANA
Sbjct: 594 TTTQYPASYADQVPMTDMTLRPRQG--YPGRTFMWYNGEPVYEFGYGLHYTTFSASLANA 651

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
           P        G H S N      A K +      L     V++KN G     +  L++S  
Sbjct: 652 PR-------GGHQSFNIEQVVAAAKRSQYVDTGLITTFDVNIKNTGKTTSDYAALLYSKT 704

Query: 695 PAG---HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
            AG   H  P+K LV+F+K+H + AG  Q   + + +   L   D +G + +  G +   
Sbjct: 705 TAGPGPH--PNKILVSFDKLHQIHAGQTQTAKLPVTIGSLLQ-TDTNGNKWLYPGTYTFF 761

Query: 751 IGGTKHS 757
           +   K +
Sbjct: 762 VDNDKKA 768


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 288/757 (38%), Positives = 429/757 (56%), Gaps = 43/757 (5%)

Query: 10  LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
           L L+L +A   G+ A+  F  D K+   +    C  S     R   L+   +L+EK+   
Sbjct: 8   LSLLLFAA---GVFAQGNFP-DCKNGPLKNETICDPSASTLARAKSLVALYTLEEKINAT 63

Query: 70  ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATL 127
            + A  V RLG+  Y+WW+E LHG++  GP T F   GD+  +TSFPQ I   ++F+  L
Sbjct: 64  SNSAPGVARLGVPPYQWWNEGLHGIA--GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDL 121

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
              + +V+S EARA  N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + +
Sbjct: 122 ITEVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALI 181

Query: 188 RGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVME 246
            GLQG+  D   +V A+CKH+  YD++NWNG  R+  + ++S+QD+ + +  PF  CV +
Sbjct: 182 HGLQGNATDPYRRVVATCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-Q 240

Query: 247 GKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTS 303
             V + MCSYN VNG P CADP +L+  +R  W  +    ++ SDCD++   Y    ++S
Sbjct: 241 ANVGAFMCSYNAVNGAPPCADPYLLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQWSS 300

Query: 304 TPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
           T E AAAD++ AG DLDCG +L  H   AV++GL  E  ++ AL+   +  ++LG FD  
Sbjct: 301 TREGAAADSLNAGTDLDCGTYLQTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFDA- 359

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P +QPY  LG   V T   Q LAL+AA +GIVLLKN G  LP+ ++  + V + G  ++ 
Sbjct: 360 PENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLPI-NLGSKQVGIYGDWANA 417

Query: 424 TVTMIGNYAGIACGYTTP---LQGIG---RYARTIHQQGCKDVACADDQLFGAAIDASRQ 477
           T  + GNY G+A   T+P   LQ +G   +YA  +          A   L G        
Sbjct: 418 TSQLQGNYFGVAKFLTSPLMALQNLGVDVKYAGNLPGGQGDPTTGAWSSLSGVIT----T 473

Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
           +D  I V G+D  +E+E  DR+ L L G Q +++ +++   K P I+V+M GG ID +  
Sbjct: 474 SDVHIWVGGIDNGVESEDRDRSWLTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPL 532

Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
             +P+I+A++WAGYPGQ GGTAI +IL G + P G+LP T YP +Y++ +PMT+MAMRPS
Sbjct: 533 IRNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPS 592

Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
              + PGRTY++Y G  ++ FG+G+ YTNF  ++ N P         +  +I+  + G  
Sbjct: 593 D--KNPGRTYKWYTGEPIFEFGYGLHYTNFSASITNQPK--------QSYAISDLVKG-- 640

Query: 658 IKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKV-HVP 714
              T     R    G+ V V+N G     +  L F T   G    P K LVA++++ ++ 
Sbjct: 641 CNSTGGFLERCPFTGITVSVQNTGKISSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIA 700

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           AG+     +N+ +   L+ VD SG + +  G++ + I
Sbjct: 701 AGSSSTATLNLTLAS-LARVDESGNKVLYPGDYELQI 736


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 293/725 (40%), Positives = 416/725 (57%), Gaps = 38/725 (5%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
           R   LI   + +E ++   + +  VPRLG+  Y+WWSEALHGV+   PG  F   G+F  
Sbjct: 47  RATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSGEFSS 105

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           ATSFPQ I   S+F+  L +A+  V+S EARA  N   AGL Y++PN+N F+DPRWGRGQ
Sbjct: 106 ATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRWGRGQ 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           ETPGEDP    +Y  S + GLQG    R   KVAA CKH+ AYDLD+W G+DRFHF+AKV
Sbjct: 166 ETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVAADCKHYAAYDLDSWEGIDRFHFDAKV 225

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YI 285
           S QD+ + +   F+ CV + KVASVMCSYN VNG+P CA+P +L+  +R  W  +   ++
Sbjct: 226 SLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWV 285

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
            SDCD++G  + T +FT T  EA ADA++AG D+DCG     H   A+ + L++  D+  
Sbjct: 286 TSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLER 345

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           AL    T  MRLG FD  P SQP   L   DV  PD Q LA  AA +G+VLLKN G  LP
Sbjct: 346 ALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKNDG-FLP 403

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI--GRYARTIHQQGCKDVACA 463
           +S    +T+A+IGP ++ T  M GNY G A    TP QG     +   +   G   +   
Sbjct: 404 VS-ASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGTS-INGT 461

Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
            +  F AAI  +  +D  I   G++ SIE+EA DR  +   G Q  LV +++   K P +
Sbjct: 462 SEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVV 520

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +V   GG +D +   ++  + A+IWAGYPGQ+GGTAI D++ G   P G+L +T YP+++
Sbjct: 521 VVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDF 580

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           +  + MT+MA+RP  +   PGRTY++Y G  V  FGHG+ +T F  +    P        
Sbjct: 581 VNQVGMTDMALRPGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRP-------- 630

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
           GR  +I   +     K      + + L    V+++N G+    +  L+F    AG +APH
Sbjct: 631 GRKYNIQHLLHTADKKFP----DLIPLDTFHVNIRNTGNITSDYVALLFLKSNAG-FAPH 685

Query: 703 --KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI----GGTK 755
             K LV+F + H + AG+   V + +++   ++ VD  G   +  G++ + +    G   
Sbjct: 686 PKKSLVSFARAHRIDAGSSATVDLGVNLGS-IARVDEHGDSWLFAGDYQLVLDIGDGVLS 744

Query: 756 HSVSL 760
           HS SL
Sbjct: 745 HSFSL 749


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  485 bits (1249), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 294/725 (40%), Positives = 416/725 (57%), Gaps = 38/725 (5%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
           R   LI   + +E ++   + +  VPRLG+  Y+WWSEALHGV+   PG  F   G+F  
Sbjct: 47  RAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSGEFSS 105

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           ATSFPQ I   S+F+  L +A+  V+S EARA  N   AGL Y++PN+N F+DPRWGRGQ
Sbjct: 106 ATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRWGRGQ 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           ETPGEDP    +Y  S + GLQG    R   KVAA CKH+ AYDLD+W G+DRFHF+AKV
Sbjct: 166 ETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVAADCKHYAAYDLDSWEGIDRFHFDAKV 225

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG--YI 285
           S QD+ + +   F+ CV + KVASVMCSYN VNG+P CA+P +L+  +R  W  +   ++
Sbjct: 226 SLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWV 285

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
            SDCD++G  + T +FT T  EA ADA++AG D+DCG     H   A+ + L++  D+  
Sbjct: 286 TSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLER 345

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           AL    T  MRLG FD  P SQP   L   DV  PD Q LA  AA +G+VLLKN G  LP
Sbjct: 346 ALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKNDG-FLP 403

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI--GRYARTIHQQGCKDVACA 463
           +S    +T+A+IGP ++ T  M GNY G A    TP QG     +   +   G   +   
Sbjct: 404 VS-ASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGTS-INGT 461

Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
            +  F AAI  +  +D  I   G++ SIE+EA DR  +   G Q  LV +++   K P +
Sbjct: 462 SEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVV 520

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +V   GG +D +   ++  + A+IWAGYPGQ+GGTAI D++ G   P G+L +T YP+++
Sbjct: 521 VVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDF 580

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           +  + MT+MA+RP  +   PGRTY++Y G  V  FGHG+ +T F  +    P        
Sbjct: 581 VNQVGMTDMALRPGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRP-------- 630

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
           GR  +I   +     K      + + L    V+++N G+    +  L+F    AG +APH
Sbjct: 631 GRKYNIQHLLHTADKKFP----DLIPLDTFHVNIRNTGNITSDYVALLFLRSNAG-FAPH 685

Query: 703 --KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH----NIHIGGTK 755
             K LV+F + H + AG+   V + +++   ++ VD  G   +  G++    +I  G   
Sbjct: 686 PKKSLVSFARAHRIDAGSSATVDLGVNLGS-IARVDEHGDSWLFAGDYQLVLDIGDGVLS 744

Query: 756 HSVSL 760
           HS SL
Sbjct: 745 HSFSL 749


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 300/741 (40%), Positives = 419/741 (56%), Gaps = 44/741 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LI   + +E     ++ +  VPRLG+  Y WWSE LHGV+   PG  
Sbjct: 41  CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   ++F+  L +A+  VVS E RA  N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWTPNINPF 159

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP     Y  + + GLQG        KV A CKHF AYD+DNW G 
Sbjct: 160 KDPRWGRGQETPGEDPFHLQGYVYNLILGLQGGLDPTPYFKVVADCKHFAAYDMDNWEGN 219

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R+ FNA V++QD+ + +   F+ CV + KVASVMCSYN VNG+P+CA+  +L+  +R  
Sbjct: 220 VRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDILRDY 279

Query: 279 WRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W  +   ++ SDCD+V   Y   ++T  P +AAADA+ AG D+DCG F   +   A+ +G
Sbjct: 280 WGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAADALLAGTDIDCGTFSSTYLPDALSQG 339

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++  D+  A +      +RLG FD  P SQPY  LG  DV TP+ Q+LA  AA +G+VL
Sbjct: 340 LVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQLGWSDVNTPEAQQLAHTAAVEGMVL 398

Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LKN G +LPLS H+R   +A+IGP ++ T  M GNYAGIA    +PL G  +    +   
Sbjct: 399 LKNDG-TLPLSKHVRK--LALIGPWANATTLMQGNYAGIAPYLISPLLGAQQAGFDVEYV 455

Query: 456 GCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
              +V   +D   F AA+ A+++ADA I   GLD+++E E +DR  +  PG Q +LV+++
Sbjct: 456 FGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRLNVTWPGNQLDLVAEL 515

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           +   K P I+    GG +D +  K+   + AIIW GYPGQ+GGTA+ DIL G + P G+L
Sbjct: 516 ASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTALFDILTGKAAPAGRL 574

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+T YP EY   +PMT+M +RPS +   PGRTY++Y G  V+ FG G+ YT F    A+ 
Sbjct: 575 PITQYPAEYANQVPMTDMTLRPSATN--PGRTYKWYTGTPVFEFGFGLHYTTFSFAWAS- 631

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR----LTLG----VQVDVKNVGSKDGAH 686
                      +   N   +  +I    A  N+    L L       V V N G     +
Sbjct: 632 -----------NAHANTPAASYSIDALMASGNKSAAFLDLAPLDTFAVRVTNTGKMTSDY 680

Query: 687 TLLVFST---PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
             L+F++    PA H  P+KQLVA+ +VH  A  Q  +         ++  D SG + + 
Sbjct: 681 VALLFASGTFGPAPH--PNKQLVAYTRVHGVAPKQSTIAELTVTLGAIARADESGAKWVY 738

Query: 744 LGEHNIHIGGT---KHSVSLH 761
            G + + +  T   KH+ +L 
Sbjct: 739 PGTYTLALDTTEQLKHTFTLE 759


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 291/687 (42%), Positives = 402/687 (58%), Gaps = 28/687 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C ++     R   LIG  + +E     ++ +  VPRLG+  Y WWSE LHGV+   PG  
Sbjct: 41  CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   ++F+ TL +AI  +VS E RA  N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP    +Y  + + GLQG    +   KV A CKHF AYDL+NW G+
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVVADCKHFAAYDLENWEGI 219

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  F+A VS+QD+ + +  PF+ CV + KVASVMCSYN VNG+P+CA+  +L+  +R  
Sbjct: 220 VRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDH 279

Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W    + ++ SDCD+V        +T+ P +AAADA+ AG D+DCG F   +   A+QRG
Sbjct: 280 WGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRG 339

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++  D+  A +      +RLG FD +P++QPY  LG  DV TP  Q+LA  AA +GIVL
Sbjct: 340 LVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTPQAQQLAHTAAVEGIVL 398

Query: 397 LKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LKN G  LP S H+R   +A+IGP ++ T  + G+Y G+A    +PLQG       +   
Sbjct: 399 LKNDG-VLPFSKHVRK--LALIGPWANATSLLQGSYIGVAPYLVSPLQGAQEAGFEVEYV 455

Query: 456 GCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
              +V   +D   F AA+ A R+ADA +   GLD+++E E  DR  +  PG Q +LV+++
Sbjct: 456 LGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVTWPGNQLDLVAEL 515

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
               K P I+    GG +D    K+   + AIIW GYPGQ+GGTA+ DIL G + P G+L
Sbjct: 516 ERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRL 574

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+T YP  Y   +PMT+M++RPS +   PGRTY++Y G  V+ FG G+ YT FV + A  
Sbjct: 575 PITQYPAAYTKQVPMTDMSLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFVFSWAAP 632

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV------QVDVKNVGSKDGAHTL 688
               AV      GS+  + S   + V H + +   L +       V V N G     +  
Sbjct: 633 SAAAAVDSTASFGSLAKSYSISQL-VAHGQESTAFLDLAPLDTFAVRVTNTGRVASDYVA 691

Query: 689 LVFST---PPAGHWAPHKQLVAFEKVH 712
           L+F +    PA H  P KQLVA+ +VH
Sbjct: 692 LLFVSGAFGPAPH--PKKQLVAYTRVH 716


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 279/728 (38%), Positives = 415/728 (57%), Gaps = 34/728 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   LI   +LQEK+    S +  VPRLG+  Y WW EALHGV++  PG  
Sbjct: 39  CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G F  ATSFPQ I   ++F+  L   +  V+S EARA  N   AGL +W+PN+N F
Sbjct: 98  FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVD 219
           +D RWGRGQETPGEDP     Y A+ + GLQGS  D+ K V A+CKHF AYD+++WNG  
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYKRVVATCKHFVAYDMESWNGNF 217

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A+VS QD+ + +  PF+ C  +  V + MCSYN +NGVPTCADP +L+  +R +W
Sbjct: 218 RYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLLQTVLREKW 277

Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
                  ++ SDCD+V   +    + ST EEAAA +++AG D++CG +   H  +A  +G
Sbjct: 278 NWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTDINCGTYYQDHLPAAYDQG 337

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++  D++ +L+   +  +RLG FDG   + PY +L   DV TP  Q+LA +AA +GI L
Sbjct: 338 LINTTDLDISLIRQYSSLVRLGYFDGL--AVPYRNLTWNDVSTPHAQQLAYKAAAEGITL 395

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH--- 453
           LKN G  LPL+     ++A+IG  ++ T  M+GNY GI   + +PL    +   T++   
Sbjct: 396 LKNDG-VLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTGATVNFAT 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
             G +     D  L      A+ ++D  I   G+D S+E+E +DR  L   G Q +++ +
Sbjct: 455 GPGGQGDPTTDHWL--PVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQLDMIGQ 512

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           ++M  K P I++ M GG ID +   N+P ++A+IW GYPGQ GG A+ DI+ G + P G+
Sbjct: 513 LAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRGITAPAGR 571

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP T YP +YI+ +PMT+M +RP+ +   PGRTY +Y    V+P+G G+ YTNF  T A 
Sbjct: 572 LPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVFPYGLGLHYTNF--TAAI 629

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT------LGVQVDVKNVGSKDGAHT 687
            P+    P      S N+  +   I    + C              V + N G     + 
Sbjct: 630 KPS---FPSTYDSSSSNSGSASYDISTLTSNCTATYKDLCPFTSFSVSITNTGEIMSDYV 686

Query: 688 LLVFST---PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            L F      PA H  P+K+LV+++++H + AG+ Q   +N+ +   L+ VD  G + + 
Sbjct: 687 TLGFLAGIHGPAPH--PNKRLVSYQRLHNITAGSSQTAWLNLTLGS-LARVDEMGNKVLY 743

Query: 744 LGEHNIHI 751
            G++ + +
Sbjct: 744 PGDYALLV 751


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  481 bits (1238), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 279/715 (39%), Positives = 411/715 (57%), Gaps = 39/715 (5%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPG 109
           R   L+   +L+EK+    + A  V RLGI  Y+WW+E LHG++  GP T F   GD+  
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +TSFPQ I   ++F+  L   +  V+S EARA  N    GL +W+PN+N FRDPRWGRGQ
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           ETPGED      Y  + + GLQG++ D   +V A+CKH+  YD++NWNG  R+  + ++S
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQGNETDPYRRVVATCKHYAGYDIENWNGNLRYQNDVQIS 291

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YI 285
           +QD+ + +  PF  CV +  V + MCSYN VNG P CADP +L+  +R  W  +    ++
Sbjct: 292 QQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWGWSSDEHWV 350

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
            SDCDS+   Y    ++ST E AAAD++ AG DLDCG +L  H   AV++GL +E  ++N
Sbjct: 351 TSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQGLTNETTLDN 410

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           AL+   +  ++LG FD  P +QPY  LG   V T   Q LAL+AA +GIVLLKN G  LP
Sbjct: 411 ALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLP 468

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP---LQGIG---RYARTIHQQGCKD 459
           + +   + V + G  ++ T  + GNY G+A   T+P   L+ +G   RYA  +   G  D
Sbjct: 469 I-NFGSKNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAGNL-PGGQGD 526

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
                       I  S   D  I V G+D  IE+E  DR+ L L G Q +++ +++   K
Sbjct: 527 PTTGSWPRLSGVITTS---DVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQLADTGK 583

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P I+++M GG ID +    +P+I+A++WAGYPGQ GGTAI +IL G + P G+LP T Y
Sbjct: 584 -PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQY 642

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
             +Y++ +PMT+MAMRP  S + PGRTY++Y G  ++ FG+G+ YTNF  ++ N P    
Sbjct: 643 LYKYVSEVPMTDMAMRP--SNKNPGRTYKWYTGKPIFEFGYGLHYTNFSASITNQPK--- 697

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG- 697
                +  +I+  + G     T     R    G+ V V+N G     +  L F T   G 
Sbjct: 698 -----QSYAISDLVKG--CNSTGGFLERCPFTGINVSVQNTGKTSSDYVTLGFLTGSFGP 750

Query: 698 HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
              P K LVA++++ ++ A +     +N+ +   L+ VD SG + +  G++ + I
Sbjct: 751 KPYPKKSLVAYDRLFNIAASSSSTATLNLTLAS-LARVDESGNKVLYPGDYELQI 804


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 241/447 (53%), Positives = 309/447 (69%), Gaps = 5/447 (1%)

Query: 315 AGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           AGLDL+CG FL  HT +AVQ G LSE D++ A+ N L   MRLG FDG+P   P+G+LGP
Sbjct: 31  AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
            DVCTP +QELA EAARQGIVLLKN G  LPLS    +++AVIGPN++ + TMIGNY G 
Sbjct: 91  SDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSMAVIGPNANASFTMIGNYEGT 149

Query: 435 ACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEA 493
            C YTTPLQG+G    T++Q GC +V C+ + L   AA  A+  AD T+LV+G DQSIE 
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E+LDR  LLLPG+Q +LVS V+ AS GP ILV+MSGGP D++FAK+  +IAAI+W GYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           +AGG AIAD+LFG  NP G+LP+TWYP+ + T +PMT+M MRP  S  YPGRTYRFY G 
Sbjct: 270 EAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMRMRPDPSTGYPGRTYRFYTGD 328

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
            VY FG G+SYT+F H + +AP  +A+ L   H  +  T    +++   A C  L   V 
Sbjct: 329 TVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TEQCPSVEAEGAHCEGLAFDVH 386

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
           + V+N G + G HT+ +FS+PPA H AP K L+ FEKV +  G    V   + VCK LSV
Sbjct: 387 LRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSV 446

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           VD  G R++ LG H +H+G  KH+++L
Sbjct: 447 VDELGNRKVALGSHTLHVGDLKHTLNL 473


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 287/755 (38%), Positives = 410/755 (54%), Gaps = 34/755 (4%)

Query: 10  LGLILLSASSSGLAAREPFACDP-KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
            GL+L + + + L A    A  P KD        C  +     R   LI   + +E V  
Sbjct: 8   FGLVLAALTPATLGAFPDCANGPLKDNLV-----CNTTANFMDRAKALIDEFTTEELVNN 62

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
            ++G+  VPRLG+  Y+WWSEALHGV+   PG  F   G DF  ATSFPQ I   ++F+ 
Sbjct: 63  TVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVHFAPAGEDFDHATSFPQPILMGAAFDD 122

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            L   +  V+S EARA  N G +G+ +++PN+N FRDPRWGRGQETPGEDP+   +Y   
Sbjct: 123 ELIHEVATVISTEARAFNNFGFSGIDFFTPNINPFRDPRWGRGQETPGEDPLHISRYVFQ 182

Query: 186 YVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
            V  LQG  G     K+ A CKHF  YDL++W G+DRFHF+A ++ QD+ + +   F+ C
Sbjct: 183 LVTALQGGLGPSPYYKIVADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSC 242

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL-NGYIVSDCDSVGVYYDTQHFT 302
           V + KV SVMCSYN VNGVP CA   +L+  +R  + L +G+I SDCD+V   + T +FT
Sbjct: 243 VRDAKVGSVMCSYNSVNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFT 302

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
           +T   A+A +++AG D+DCG         A+ +GL+ E D+  ALV      +R G FD 
Sbjct: 303 TTQANASAISLKAGTDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD- 361

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
            P  QP+  LG  DV TP  + LAL AA +GIVLLKN G  LPLS      V ++GP  +
Sbjct: 362 SPEEQPFRQLGWADVDTPASRRLALLAAEEGIVLLKNDG-LLPLSSRDVPNVIMVGPWGN 420

Query: 423 VTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADAT 481
            T  M GNY G A    +P QG       +    G       D   F  A+ A+   D  
Sbjct: 421 ATTMMQGNYFGNAPYLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLI 480

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           + V G D  +E E+ DR  +  PG Q +L+ +++   K P I++ M  G +D  + K   
Sbjct: 481 VFVGGPDNVVERESRDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESD 539

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
            I A+IW GYPGQ+GGTA+A+I+ G + P  +LP+T YP++YI+ LPMT+M +RPS S  
Sbjct: 540 AINALIWGGYPGQSGGTALANIVTGKTAPAARLPITQYPEDYIS-LPMTDMNVRPSNSS- 597

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
            PGRTY+++ G  ++ FG G+ Y+ F    A  P     P     G + A  S      T
Sbjct: 598 -PGRTYKWFTGEPIFEFGFGLHYSKFDFAWAEEP-----PASFAIGDLVANASSPVDLAT 651

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKV-HVPAGAQQ 719
                      QV+V N+G        ++F    AG   AP K+LV + ++ ++P GA  
Sbjct: 652 FHT-------FQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATV 704

Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
              + + +   ++  D  G   +  G++++ +  T
Sbjct: 705 TASVPVTLGT-IARADEDGNSVLFPGQYSVWLDTT 738


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 218/251 (86%), Positives = 237/251 (94%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           A +PFACDP+D T+R  PFCQV LPI  RV+DLIGR++LQEKV LL++ AAAVPRLGIKG
Sbjct: 23  ALDPFACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKG 82

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           YEWWSEALHGVSNVGPGT+FGG FPGATSFPQVITTA+SFNATLWEAIGRVVSDEARAM+
Sbjct: 83  YEWWSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMF 142

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
           NGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV++GKYAASYVRGLQG+DGDRLKVAA 
Sbjct: 143 NGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDGDRLKVAAC 202

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           CKHFTAYDLDNWNGVDRFHFNA+VSKQD+EDTFDVPFRMCV EGKVASVMCSYNQVNG+P
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262

Query: 264 TCADPNILKRT 274
           TCADP +LK+T
Sbjct: 263 TCADPKLLKKT 273


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 274/713 (38%), Positives = 394/713 (55%), Gaps = 33/713 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   LI   +++EK+    S A  VPRLG+  Y WW EALHGV+   PG  
Sbjct: 39  CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVAQ-SPGVN 97

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   GDF  ATSFPQ I   ++F+  L + +  V+S EARA  N   +GL YW+PN+N F
Sbjct: 98  FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWTPNINPF 157

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           +D RWGRGQETPGEDP     Y  S + GLQG DG   KV A+CKHF AYDL+ WNG  R
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVKSLIAGLQG-DGKYKKVVATCKHFVAYDLETWNGNFR 216

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
           + F+  V  Q++ + +  PF+ C  +  V + MCSYN +NG+PTCADP +L+  +R  W 
Sbjct: 217 YQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLLQTILREHWN 276

Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
                 ++ SDCDS+   Y    +TST EEA A +++AG D++CG +       A+  GL
Sbjct: 277 WTSEEQWVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLPGALSLGL 336

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E DI+ AL+   +  +RLG FDG  ++  Y  L  KDV TP  Q+LAL+AA +GI LL
Sbjct: 337 VTEKDIDMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAAVEGITLL 394

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           KN G  LPL+  +   +AVIG  ++ T  M+GNY GI     +PL    +    +   G 
Sbjct: 395 KNDG-ILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGANVTYSGN 453

Query: 458 K----DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                D    +      A+D   +AD  +   G+D  +EAE +DR  +   G Q +++ +
Sbjct: 454 PGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRVSIAWTGAQLDVIGQ 510

Query: 514 VSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
             +AS+G P I+  M    +D     N+  I+A++W GYPGQ GG A+ DI+ G S P G
Sbjct: 511 --LASRGKPVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKSAPAG 568

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
           +LP T YP  YI+ +PMT+M +RP+ +  +PGRTY +Y    V+ FG+G+ YTNF  T++
Sbjct: 569 RLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFSATIS 628

Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS 692
                   P D    SI A ++    +    +C    +  ++ V N G+    +  L F 
Sbjct: 629 --------PTDTTSFSI-ADLTKDCTEHYMDRCPFADM--KIAVTNTGNVTSDYVTLGFL 677

Query: 693 TPPAGHW-APHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
               G    P+K+LV ++++H + AGA Q   +N+ +     V D   T   P
Sbjct: 678 AGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNLTLASLARVDDMGNTVLYP 730


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 280/722 (38%), Positives = 402/722 (55%), Gaps = 28/722 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   LI   +L + V   ++G+  V RLG+  Y+WW+EALHGV +  PG  
Sbjct: 39  CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS-SPGVN 97

Query: 103 FG----GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           +G     +F  ATSFP  I   ++FN +L  +I  V+S EARA  N   AGLT+++PN+N
Sbjct: 98  WGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPNIN 157

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWN 216
            FRDPRWGRGQETPGEDP    +Y   YV GLQG  S     KV A+CKH  AYD++NW 
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQGGLSPDPYYKVLANCKHVLAYDVENWE 217

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           G DR  FNA V+ QD+ + +   F+ C+ + + AS MCSYN VNGVP+CA   ILK  +R
Sbjct: 218 GNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCASSYILKDLVR 277

Query: 277 GEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W L    G+I  DC +V   Y    +T T   A A A+ AG DLDCG     +  +AV
Sbjct: 278 DFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDAGTDLDCGDVYSPNLWTAV 337

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
             GL++   I  AL+      +RLG FD     QPY      +V TP  Q+LA  AA QG
Sbjct: 338 VEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWSNVNTPSSQDLAYNAAVQG 396

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLL+N G  LPLS    + +A+IGP ++ T+++ GNYAGIA    +P Q        + 
Sbjct: 397 IVLLENDG-LLPLS-TNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQAFETAGYNVT 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 ++ +D+  +  A++A++ AD  + V G+D SIEAE  DR  +  PG Q +L+ +
Sbjct: 455 FAFGTGISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSIEWPGSQLDLIGQ 514

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +    K P ++V M GG  D +  K +  + A++WAGYPGQ+GGTA+ DI+ G  +P G+
Sbjct: 515 LGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALVDIISGKQSPSGR 573

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP+T YP  Y++ + MT+MA+RP+ S   PGRTY++Y G  +YPFG+G+ YT F    ++
Sbjct: 574 LPVTQYPSSYVSEIDMTDMAIRPNSSGS-PGRTYKWYTGAPIYPFGYGIHYTTFRLAWSD 632

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK-DGAHTLLVFS 692
           + +      D         I   A K        +     + V N GS     +  L+F+
Sbjct: 633 SSSTTYNIQD---------IVSSANKSGGFADTEILDTFSLLVTNTGSNYTSDYVALLFA 683

Query: 693 TPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
              +G   AP ++LV + +V H+  G      +N+ +   +S VD +G   +  G +N+ 
Sbjct: 684 NSTSGPSPAPLQELVGYTRVPHITPGGTATAELNVTLGS-ISRVDENGNWILYPGTYNLW 742

Query: 751 IG 752
           +G
Sbjct: 743 VG 744


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 292/725 (40%), Positives = 411/725 (56%), Gaps = 28/725 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C V+     R   LI   + +E     ++ +  VPRLG+  Y WWSE LHGV+   PG  
Sbjct: 41  CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   ++F+  L +AI  +VS E RA  N G AGL YW+PN+N F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP    +Y  + + GLQG    +   KV A CKHF AYD+DNW GV
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVVADCKHFAAYDMDNWEGV 219

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R+ FNA VS+QD+ + +  PF+ CV + KVASVMCSYN VNG+P+CA+  +L+  +R  
Sbjct: 220 VRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDH 279

Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W    + ++ SDCD+V   +   ++T+ P +AAADA+ AG D+DCG F   +   A+QRG
Sbjct: 280 WGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRG 339

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++  D+  A +      +RLG FD +P++QPY  LG  DV T   Q+LA  AA +G+VL
Sbjct: 340 LVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAVEGMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G  LPLS  R R +A+IGP ++ T  + GNY GIA    +P+QG  +    +    
Sbjct: 399 LKNDG-LLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFEVEYVF 456

Query: 457 CKDVACADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
             +V   +D   F AA+ A+++ADA +   GLD+++E E +DR  +  PG Q +LV+++ 
Sbjct: 457 GTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDLVAELE 516

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I+    GG +D    K    + AIIW GYPGQ+GGTA+ DIL G + P G+LP
Sbjct: 517 RVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLP 575

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           +T YP  Y   +PMT+M +RPS +   PGRTY++Y G  V+ FG G+ YT F    A   
Sbjct: 576 ITQYPAAYAEQVPMTDMTLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFAFAWAAPG 633

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV------QVDVKNVGSKDGAHTLL 689
                       + + +IS     V H + +   L +       V V N G     +  L
Sbjct: 634 AAADSTASFGGPAKSYSISQ---LVAHGQESAAFLDLAPLDTFAVRVTNTGKVASDYVAL 690

Query: 690 VF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
           +F   S  PA H  P K LVA+ ++H  A     VG        ++  D +G + +  G 
Sbjct: 691 LFVSGSFGPAPH--PKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADENGEKWVHPGT 748

Query: 747 HNIHI 751
           + + +
Sbjct: 749 YTLAL 753


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 275/727 (37%), Positives = 414/727 (56%), Gaps = 33/727 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL   QR   L+   +++E +   ++G+  VPRLG+  Y+WWSE LHGV++  PG  
Sbjct: 37  CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G F  ATSFPQ I  +++F+  L +A+G VV  E R+  N G AGL +W+PN+N F
Sbjct: 96  FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP    +Y  + ++GLQG        +V A+CKHF  YDL++W   
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEPYFQVVATCKHFAGYDLEDWENN 215

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R+ F+A ++ QD+ + +   F+ C  + +  + MCSYN VNG+PTCAD  +L+  +R  
Sbjct: 216 FRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYLLQDILRDY 275

Query: 279 WRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W  +   ++ SDCD+V   Y+  ++T+ P++AAADA+RAG DLDCG F   +   A  + 
Sbjct: 276 WNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDLDCGTFYTEYLPLAYNQS 335

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L++E ++  AL       +RLG FD   + QPY   G  +V TP  Q+LA  AA +GI L
Sbjct: 336 LITETELRAALTRQYASLVRLGYFD-PAAQQPYRQYGWSNVDTPYAQQLAYTAATEGITL 394

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL     + +A+IGP ++ T  M GNY G+A    +PLQG       +    
Sbjct: 395 LKNDG-TLPLPSTL-KNIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAAGYNVTYVF 452

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
             ++       F AAI A+R+ADA +   G+D ++EAEA+DR  +  PG Q +L+ +++ 
Sbjct: 453 GTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGNQLQLIGELAA 512

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P ++    GG +D    K +  + ++IWAGYPGQ+GG A+ DI+ G   P G+L  
Sbjct: 513 LGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISGKVAPAGRLVT 571

Query: 577 TWYPQEYITNLPMTEMAMRPSQS-KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T YP +Y+  +PMT+M +RP+ +    PGRTY++Y G  VY FG+G+ YTNF +T   AP
Sbjct: 572 TQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYGLHYTNFTYTWTKAP 631

Query: 636 TVVAVPLDGRHGSINATISGK--AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-- 691
                       +I   +S    A  +  A  + L+    V V N G+    ++ L+F  
Sbjct: 632 ASTY--------NIQTLVSAASGAAHIDLAPFDTLS----VAVTNAGAVTSDYSALLFVN 679

Query: 692 -STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            +  PA +  P+K L A+ ++H V AGA Q    ++ V   ++  D  G   +  G + +
Sbjct: 680 GTYGPAPY--PNKALAAYTRLHSVAAGAAQTATFDV-VLNQIARADAYGNFWLYPGAYEL 736

Query: 750 HIGGTKH 756
            +  T+ 
Sbjct: 737 ALDTTRE 743


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  467 bits (1202), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 278/734 (37%), Positives = 411/734 (55%), Gaps = 48/734 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL I +RV+ L+  L+L+EK+  L+  +A   RLG+  YEWWSEA HGV +  PG
Sbjct: 78  PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 136

Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
            +F     +F  ATSFP  I TA+SF+ TL   I  V+  E RA  N G +G  +W+PN+
Sbjct: 137 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 196

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N FRDPRWGRGQETPGEDP+++  Y  ++V GLQG D    +V A+CKH+  YDL+    
Sbjct: 197 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 252

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+  N   ++QD+ D F  PF+ CV +  V S+MCSYN V+G+P CA+  +L   +R 
Sbjct: 253 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRK 312

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
            W  N    Y+VSDC +V   +   +FT T E AA+ A+ AG+DL+CG  +L L+   A 
Sbjct: 313 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 372

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
            +  +  +D   +L    +    +G FDG      Y  L   DV TPD Q LA EAA +G
Sbjct: 373 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 426

Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
           + LLKN    LPL S  ++++VAVIGP ++ T  M G+Y+G A    +PL+  G     +
Sbjct: 427 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 485

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
           +      +   +   F  A+ A+ ++D  I + G+D S+E+E LDR  L  PG Q +L++
Sbjct: 486 NYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLIT 545

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
            +S  SK P ++V   GG +D +    +  I A++WAGYP Q+GGTA+ D+L G  +P G
Sbjct: 546 SLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 604

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
           +LP+T YP  Y   + + ++ +RP+ S  YPGRTY++Y G  V PFG+G+ YT F     
Sbjct: 605 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 662

Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
             ++   N   +VA   +   G IN               N     V+V VKNVG K   
Sbjct: 663 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKVRVKNVGHKTSD 708

Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           + +LL  S+  AG    P+K LV++ + +++  G+ Q   + + +   L+  D +G+  I
Sbjct: 709 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 767

Query: 743 PLGEHNIHIGGTKH 756
             G + I +  ++ 
Sbjct: 768 FPGRYKIALDHSEE 781


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 276/734 (37%), Positives = 409/734 (55%), Gaps = 48/734 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL I +RV+ L+  L+L+EK+  L+  +A   RLG+  YEWWSEA HGV +  PG
Sbjct: 78  PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 136

Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
            +F     +F  ATSFP  I TA+SF+ TL   I  V+  E RA  N G +G  +W+PN+
Sbjct: 137 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 196

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N FRDPRWGRGQETPGEDP+++  Y  ++V GLQG D    +V A+CKH+  YDL+    
Sbjct: 197 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 252

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+  N   ++QD+ D F  PF+ CV +  V S+MCSYN V+G+P CA+  +L   +R 
Sbjct: 253 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRK 312

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
            W  N    Y+VSDC +V   +   +FT T E AA+ A+ AG+DL+CG  +L L+   A 
Sbjct: 313 HWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 372

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
            +  +  +D   +L    +    +G FDG      Y  L   DV TPD Q LA EAA +G
Sbjct: 373 NQTSVKVMD--RSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 426

Query: 394 IVLLKNQGPSLPLSHI-RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
           + LLKN    LPL    ++++VAVIGP ++ T  M G+Y+G A    +PL+  G     +
Sbjct: 427 MTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 485

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
           +      +   +   F  A+ A+ ++D  I + G+D S+E+E LDR  L  PG Q +L++
Sbjct: 486 NYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLIT 545

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
            +S  SK P ++V   GG +D +    +  I A++WAGYP Q+GGTA+ D+L G  +P G
Sbjct: 546 SLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 604

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
           +LP+T YP  Y   + + ++ +RP+ S  YPGRTY++Y G  V PFG+G+ YT F     
Sbjct: 605 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 662

Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
             ++   N   +VA   +   G IN               N     V+  VKNVG K   
Sbjct: 663 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKARVKNVGHKTSD 708

Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           + +LL  S+  AG    P+K LV++ + +++  G+ Q   + + +   L+  D +G+  I
Sbjct: 709 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 767

Query: 743 PLGEHNIHIGGTKH 756
             G + I +  ++ 
Sbjct: 768 FPGRYKIALDNSEE 781


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 278/729 (38%), Positives = 409/729 (56%), Gaps = 48/729 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL I +RV+ L+  L+L+EK+  L+  +A   RLG+  YEWWSEA HGV +  PG
Sbjct: 138 PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 196

Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
            +F     +F  ATSFP  I TA+SF+ TL   I  V+  E RA  N G +G  +W+PN+
Sbjct: 197 VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFWAPNI 256

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N FRDPRWGRGQETPGEDP+++  Y  ++V GLQG D    +V A+CKH+  YDL+    
Sbjct: 257 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 312

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+  N   ++QD+ D F  PF+ CV +  V S+MCSYN V+G+P CA+  +L   +R 
Sbjct: 313 TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRK 372

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
            W  N    Y+VSDC +V   +   +FT T E AA+ A+ AG+DL+CG  +L L+   A 
Sbjct: 373 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 432

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
            +  +  +D   +L    +    +G FDG      Y  L   DV TPD Q LA EAA +G
Sbjct: 433 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 486

Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
           + LLKN    LPL S  ++++VAVIGP ++ T  M G+Y+G A    +PL+  G     +
Sbjct: 487 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 545

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
           +      +   +   F  A+ A+ ++D  I + G+D S+E+E LDR  L  PG Q +L++
Sbjct: 546 NYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLIT 605

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
            +S  SK P ++V   GG +D +    +  I A++WAGYP Q+GGTA+ D+L G  +P G
Sbjct: 606 SLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 664

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
           +LP+T YP  Y   + + ++ +RP+ S  YPGRTY++Y G  V PFG+G+ YT F     
Sbjct: 665 RLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 722

Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
             ++   N   +VA   +   G IN               N     V+V VKNVG K   
Sbjct: 723 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKVRVKNVGHKTSD 768

Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           + +LL  S+  AG    P+K LV++ + +++  G+ Q   + + +   L+  D +G+  I
Sbjct: 769 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 827

Query: 743 PLGEHNIHI 751
             G + I +
Sbjct: 828 FPGRYKIAL 836


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 290/784 (36%), Positives = 424/784 (54%), Gaps = 56/784 (7%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            F  G+ L S +   L+   P +C P    ++    C  ++P   R   L+ + + +E +
Sbjct: 2   LFRRGITLSSLARMHLSFSFP-SCAPDSPLSKIPDICDPTIPFYTRATSLVNQFTTEELL 60

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-----GDFPGATSFPQVITTAS 121
              I+ A  VPRLGI  Y+WW+EALHGV+   PG  F       +F  AT FPQ I   +
Sbjct: 61  NNTINYAPGVPRLGIPNYQWWTEALHGVAK-SPGVNFDLSDPHAEFTSATQFPQTINLGA 119

Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP-NVNIFRDPRWGRGQETPGEDPVLSG 180
           +F+  L++ I  V++ E RA  N G AGL  +SP N+N FRDPRWGRGQET GEDP+   
Sbjct: 120 TFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPLNINCFRDPRWGRGQETVGEDPLHMS 179

Query: 181 KYAASYVRGLQG------SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
           ++A S V GLQG      ++G++L VAA+CKHF AYDL+ ++  +R+ F+A VSKQD+ D
Sbjct: 180 RFAVSIVHGLQGPHAQNEAEGNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSD 239

Query: 235 TFDVP-FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN---GYIVSDCD 290
            F +P FR CV +G   ++M SYN VN VP  A    L+   R  W L+    Y+ SDCD
Sbjct: 240 -FHLPQFRACVRDGGATTLMTSYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCD 298

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNT 350
           +V   YD   +     EAAA +I AG DLDCG     +  +A+++ L     I  A++  
Sbjct: 299 AVANVYDGHRYAQNYVEAAAKSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRM 358

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
               +RLG FD +P+SQP   L  KDV +P  Q LA  +A   I LLKN   +LP+   +
Sbjct: 359 YASLVRLGYFD-DPASQPLRQLTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPIKQ-K 416

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCKDVAC----A 463
              +A+IGP ++V+ +  GNYAG A    T +    +    A+ +   G  D++     +
Sbjct: 417 PTKIAIIGPYTNVSTSFSGNYAGPAAFNMTMVHAASQVFPDAKIVWVNGT-DISGPYIPS 475

Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA----SK 519
           D Q    A+  +  AD+ +   G+D SIE E+ DR  +  P  Q  L+ ++S +     K
Sbjct: 476 DAQ---DAVKLTSDADSVVFAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKK 532

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              ++V   GG +D A  K+D  + A++WAGYPGQ+   A+ DIL G + P G+LP+T Y
Sbjct: 533 SKLVVVQFGGGQLDGASLKSDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQY 592

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           P  YI  LP + M++RP     YPGRTY++YKG   YPFGHG+ YT F  ++A  P   A
Sbjct: 593 PASYIDGLPESAMSLRPKAG--YPGRTYKWYKGVPTYPFGHGLHYTTFSASLAK-PQPYA 649

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF---STPPA 696
           +P          T + K  +  HA+   +   VQ ++KN G     +T L+F   S  PA
Sbjct: 650 IP---------TTPAAKGPEGVHAEHISVA-DVQANIKNTGKVASDYTALLFARHSNGPA 699

Query: 697 GHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            +  P K LV + KV ++ AG +  V I I     L+  D  G + +  G + + +   +
Sbjct: 700 PY--PRKTLVGYTKVKNLSAGEESSVTIKITQAA-LARADEEGNQFLYPGSYQLELDTEE 756

Query: 756 HSVS 759
           H ++
Sbjct: 757 HRLA 760


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 226/467 (48%), Positives = 311/467 (66%), Gaps = 16/467 (3%)

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR---- 197
           MYN G AGLT+WSPN+N+ RD RWGR QET  EDP + G++A +YVRGLQ  +G      
Sbjct: 1   MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60

Query: 198 -----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
                LKV++ CKH+ AYD+D+W  +DR  F+A+VS+QD+++TF  PF  CV EG V+SV
Sbjct: 61  LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAAD 311
           MCS+N++NG+P C+DP +LK  IR EW L+GYIVSDC  + V  D Q++ +  + +A A 
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180

Query: 312 AIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
            ++AGLDL+CG +        V  G +S+ +++ AL N   + MR+G FDG P+   Y  
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YES 237

Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           LG KD+C  DH ELA EAARQGIVLLKN     PL     + +A++GP+++ T  MIGNY
Sbjct: 238 LGLKDICAADHIELAREAARQGIVLLKNDYEVFPLK--PGKKLALVGPHANATEVMIGNY 295

Query: 432 AGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
           AG+   Y +PL+         +  GC D +C++D  F  A +A++ A+ TI+ +G D SI
Sbjct: 296 AGLPRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSI 355

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
           EAE +DR   LLPG Q EL+ +V+  S GP ILV++SG  ID+ FAKN+PRI+AI+W G+
Sbjct: 356 EAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGF 415

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM-AMRPS 597
           PG+ GG AIAD++FG  NPGG+LP+TWY  +Y+  L    M A  PS
Sbjct: 416 PGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIMDAKTPS 462


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 276/726 (38%), Positives = 397/726 (54%), Gaps = 49/726 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   L+   + QE +   I+ A  VPRLGI  Y+WW+EALHGV+   PG  
Sbjct: 36  CDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHGVAG-SPGVN 94

Query: 103 FGGD----FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSP-NV 157
           F  D    +  AT+FPQ+I   ++F+  L+E +   +++E RA  N G AGL  +SP N+
Sbjct: 95  FADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAGKAGLNMYSPLNI 154

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N FRDPRWGRGQET GEDP+   +YA   V+GLQG + D L++AA+CKH+ AYDL+ W+G
Sbjct: 155 NCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPNQDELRLAATCKHYLAYDLEKWDG 214

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
           V+R+ F+A+VS+Q++ + +   FR CV +GK  ++M SYN VN VP  A    L+   R 
Sbjct: 215 VERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAVNNVPPSASRYYLETLARK 274

Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
           EW L+    Y+ SDCD+V   +D  H+  +  +AAAD+I AG DL+CG     +   A++
Sbjct: 275 EWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSINAGTDLNCGATYSDNLGQALE 334

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           + L     I  A+      Q+RLG+FD +   QP   LG + V T   Q+LA  +A   +
Sbjct: 335 QNLTDVETIRTAVARMYASQVRLGLFDPK-QGQPLRELGWEHVNTKAAQDLAYSSAAASV 393

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG---IACGYTTPLQGIGRYART 451
            LLKN G +LP+       VAVIGP S+ T  + GNYAG    A   T   Q +   A  
Sbjct: 394 TLLKNNG-TLPVDGATK--VAVIGPYSNATFALRGNYAGPGPFAITMTEAAQRVFSQATI 450

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
               G       +     AA+  +++AD  I   G+D +IE+E LDRA +  P  Q +L+
Sbjct: 451 SSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPTIESEELDRATIAWPPNQLQLI 510

Query: 512 SKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
             +  MA K    +V   GG ID A  K D  I A++WAGYPGQ+G  A+ D++ G + P
Sbjct: 511 HALGGMAKK--MAVVQFGGGQIDGASIKADGNIGALLWAGYPGQSGALAVMDVIAGNTAP 568

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
            G+LP+T YP EYI  L  T MA+RP+ +  YPGRTY++Y G   YP+ HG+ YT F   
Sbjct: 569 AGRLPITQYPAEYIDGLAETTMALRPNAT--YPGRTYKWYSGTPTYPYAHGLHYTEFKAE 626

Query: 631 VAN-APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
           +A  AP  +A                      +A+  R+   VQ  + N G +   +  L
Sbjct: 627 LAQPAPYTIAT-------------------AGYAEFERVAT-VQATITNAGQRTSDYAAL 666

Query: 690 VF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK-YLSVVDRSGTRRIPLG 745
           VF   +  PA H  P+K LV ++KV   A  + R  + + + +  L+  D  G   +  G
Sbjct: 667 VFARHTNGPAPH--PNKTLVGYKKVKAIAPGESR-SVEVEITQAALARGDEEGNLVLYPG 723

Query: 746 EHNIHI 751
           ++ + +
Sbjct: 724 KYELEL 729


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 291/717 (40%), Positives = 394/717 (54%), Gaps = 28/717 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL   +R   L+  L++ E +   +  A  VPRLG+  Y WW+EALHGV+   PG  
Sbjct: 35  CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA-ASPGVV 93

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G +F  ATSFP  I   S+F+  L  A+G V S EARA  N G AGL YW+PN+N 
Sbjct: 94  FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPNINP 153

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGV 218
           F+DPRWGRG ETPGEDP+ + +Y  + V GLQG  D   LKVAA CKH+ AYDL++W GV
Sbjct: 154 FKDPRWGRGAETPGEDPLHAARYVRTLVEGLQGGIDPPSLKVAADCKHWAAYDLEDWGGV 213

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R+ F+A V+ QD+ + +  PF+ CV + + ASVMCSYN VNGVP CA P +LK  +R  
Sbjct: 214 ARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACASPYLLKTVLRDA 273

Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W L  + ++ SDCD+VG  YD   +T      +A +++AG DLDCG     +   A  RG
Sbjct: 274 WGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSAVSLKAGSDLDCGTTYSQYLPEAYDRG 333

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L+ E D+  AL       + LG FD  P  QPY  +   DV TP  Q LA  AA +  VL
Sbjct: 334 LIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQISWADVNTPAAQALAYTAAIESFVL 392

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVT-VTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LKN G +LPL+     ++A+IGP ++ + V + GNY GI      PLQG       +   
Sbjct: 393 LKNDG-TLPLTD-SSLSIALIGPMANASAVQLQGNYNGIPPFAIAPLQGFLDAGFNVTYV 450

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
              +V   D      A+ A+  AD  I V G+D ++E EA DR  +  P  Q  L+S + 
Sbjct: 451 LGTNVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEISWPDNQLALLSALE 510

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
            A K P ++V M GG +D    K    + AI+WAGYPGQ+GGTAIAD + G   P G+L 
Sbjct: 511 EAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVMGKVAPAGRLS 569

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANA 634
           +T YP  Y+  + MT+M +RP  S   PGRTY++Y G  VYP+G+G+ YTNF V   ++A
Sbjct: 570 ITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVYPYGYGLHYTNFSVAWASDA 629

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
           P            SI    S     V  A  +      +V V N G        L+F + 
Sbjct: 630 PEACY--------SIQDLTSSADGFVDLAPLDTF----RVTVTNDGDVASDFVALLFVST 677

Query: 695 PAGHW-APHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            AG   AP K+LVA+ +   V  G    V + + +   L+  D SG   +  G++ +
Sbjct: 678 QAGPAPAPMKELVAYARASDVQPGDSTDVDLEVTLGA-LARSDESGDASLYPGDYEL 733


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 282/730 (38%), Positives = 409/730 (56%), Gaps = 31/730 (4%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           C     TT T+  C  S    +R   LI   +L+EK+    + A  VPRLG+  Y+WW E
Sbjct: 23  CANGPLTTNTV--CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQE 80

Query: 90  ALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           ALHGV+   PG  F   G++  ATSFPQ I   ++F+  L   +  +VS EARA  N   
Sbjct: 81  ALHGVAE-SPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANR 139

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKH 206
           +G+ +W+PN+N F+DPRWGRGQETPGEDP     Y  + + GLQG  D +  ++ A+CKH
Sbjct: 140 SGIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVATCKH 199

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           F AYDL+NW G  R+ F+A VS QD+ + +   FR C  +  V S MCSYN VNGVP+CA
Sbjct: 200 FAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCA 259

Query: 267 DPNILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           +  +L+  +R  W     + YI SDCD++   Y+  ++T+T  E  ADA+ AG DLDCG 
Sbjct: 260 NSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTDLDCGE 319

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDH 382
           +   +  +A  +GL +E  +N AL+      ++LG FD  P+  QPY  +G  +V TP+ 
Sbjct: 320 YYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD--PADIQPYRQIGWANVSTPEA 377

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +ELA  AA +GI LLKN G +LPLS    +T+A+IGP ++ T  M GNY G+A    +PL
Sbjct: 378 EELAYTAAVEGITLLKNDG-TLPLSP-SIKTIALIGPWANATTQMQGNYYGVAPYLISPL 435

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
                   T++      V       F AA  A+  ADA I   G+D ++EAEA+DR  L 
Sbjct: 436 MAAEELGFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRYTLD 495

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
            PG Q + + ++S+  K P I++   GG ID +    +P + A++W GYPGQ+GG AI D
Sbjct: 496 WPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKAIMD 554

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           I+ G + P G+LP+T YP +Y+  + MT+M++RPS +   PGRTY +Y G  +  FG G+
Sbjct: 555 IIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTN--PGRTYMWYTGTPIVEFGFGL 612

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
            YT F  +++  P+  +  +        AT+      V H            +V N GS 
Sbjct: 613 HYTTFTASLSQ-PSAPSYDI--------ATLVSLCSGVAHPDLCPFA-SYTANVTNTGSS 662

Query: 683 DGAH--TLLVFSTPPAGHWAPHKQLVAFEKVHVPAG-AQQRVGINIHVCKYLSVVDRSGT 739
             +   +LL  +        P+K LVA++++H  A  A Q   +N+ +   LS VD  G 
Sbjct: 663 VTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGS-LSRVDDYGN 721

Query: 740 RRIPLGEHNI 749
             +  GE+ +
Sbjct: 722 TILYPGEYTL 731


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 283/751 (37%), Positives = 404/751 (53%), Gaps = 38/751 (5%)

Query: 22  LAAREPFACDPKDATTRTL---PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR 78
           LAA +  A    D TT  L     C  SL   QR   LI   ++ E +   ++ A  VPR
Sbjct: 13  LAASQVSAYGFPDCTTAPLVNNSVCDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPR 72

Query: 79  LGIKGYEWWSEALHGVSN-VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           LG+  YEWWSE LHGV+N  G      G F  ATSFPQ I  +++F+  L +A+G V+  
Sbjct: 73  LGLPAYEWWSEGLHGVANSAGVTWSITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGM 132

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDG 195
           E RA  N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + ++GLQG     
Sbjct: 133 EGRAFNNYGHAGLDFWTPNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGLDPE 192

Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
              +V A+CKHF  YDL++W+   R+ +NA +S QD+ + +   F+ C  +    + MCS
Sbjct: 193 PYFQVVATCKHFAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCS 252

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNG--YIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
           YN +NG+PTCAD  +L+  +RG W  +   ++  DCDSV   YD  H+T+ P++AAADA+
Sbjct: 253 YNAINGIPTCADTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADAL 312

Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
           +AG D+DCG F       A    L++E D+  AL       +RLG FD   S QPY    
Sbjct: 313 KAGSDIDCGIFYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYN 371

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
             +V T   QELA  AA +GI LLKN G +LP S    + +A+IGP +  T  M GNY G
Sbjct: 372 WSNVDTSYAQELAYTAAVEGITLLKNDG-TLPFSSAI-KNIALIGPWTFATTQMQGNYYG 429

Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            A    +P QG       I      +V       + AA  A++ ADA + V G+D ++EA
Sbjct: 430 NAPYLISPYQGAQLAGYNISYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEA 489

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           EA+DR  +  P  Q  L+ ++    K P ++V   GG +D      +P + A++W GYPG
Sbjct: 490 EAMDRNDITWPAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPG 548

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP-SQSKRYPGRTYRFYKG 612
           Q+GG A+ DI+ G   P G+L  T YP +Y+  +PMT M +RP +     PGRTY++Y G
Sbjct: 549 QSGGQALFDIISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTG 608

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG- 671
             VY FG+G+ YTNF +    AP               AT S +A+         + L  
Sbjct: 609 TPVYEFGYGLHYTNFTYAWTKAPA--------------ATYSIEALVAAGQGSAHIDLAP 654

Query: 672 ---VQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGIN 724
              + V+V N G+    ++ L+F   +  PA +  P+K L A+ ++H V AGA Q     
Sbjct: 655 FDTLSVEVTNAGAVTSDYSALLFVNGTYGPAPY--PNKSLAAYTRLHNVTAGASQTATFE 712

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           + V   ++  D  G   +  G + + +  T+
Sbjct: 713 V-VLNQIARADVQGNFWLYPGAYEVALDTTR 742


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 272/713 (38%), Positives = 395/713 (55%), Gaps = 37/713 (5%)

Query: 24  AREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKG 83
           A+ PF  D ++        C  +L    R   L+  ++L+EK+      +  VPRLG+  
Sbjct: 25  AQSPFP-DCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPA 83

Query: 84  YEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
           Y WWSEALHGV+   PG  F   G+F  ATSFP  IT  ++F+  L + I  V+  E RA
Sbjct: 84  YNWWSEALHGVAG-SPGVHFADSGNFSYATSFPSPITLGAAFDDDLVKQIATVIGTEGRA 142

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKV 200
             N G AGL YW+PN+N +RDPRWGRGQETPGEDP  + +Y    + GLQ   G ++ K+
Sbjct: 143 FGNAGHAGLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHLIDGLQDGIGPEKPKI 202

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            A+CKHF  YD+++W G +R+ F+A +S QD+ + +  PF+ C  + KV +VMCSYN VN
Sbjct: 203 VATCKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVN 262

Query: 261 GVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           G+PTCADP +L+  +R  W   G   ++ SDC ++   Y    + +    AAA A+ AG 
Sbjct: 263 GIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYKDHKYVADGAHAAAVAVNAGT 322

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           DLDCG        SA+ +GLL    ++ AL    +  ++LG FD   + QPY  +G  DV
Sbjct: 323 DLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKLGYFD-PAADQPYRSIGWSDV 381

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
            TPD ++LA  AA +G VLLKN G +LPL   ++ TVA++GP ++ T  + GNY G A  
Sbjct: 382 ATPDAEQLAHTAAVEGTVLLKNDG-TLPLK--KNGTVAIVGPYANATTQLQGNYEGTAKY 438

Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
             T L    +    +       +       F  A++A++ +D  I   G+D  +EAEALD
Sbjct: 439 IHTMLSAAAQQGYKVKYAPGTGINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALD 498

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R  +  PG Q +L+ ++S   K P ++V   GG +D +   ++  +  ++WAGYP QAGG
Sbjct: 499 RTSIAWPGNQLDLIQQLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGG 557

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            A+ DIL G + P G+LP+T YP+EY+  +PMT+M +RP  S   PGRTYR+Y   V+ P
Sbjct: 558 AAVFDILTGKTAPAGRLPVTQYPEEYVDQVPMTDMNLRPGPSN--PGRTYRWYDKAVI-P 614

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+GM YT F  +               +G  N      A+K  +A     +L     VK
Sbjct: 615 FGYGMHYTTFDVSWKRK----------NYGPYNTA----AVKAENAVLETFSL----QVK 656

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
           N G     +  LVF T       P+  K LV +++V  +  G ++ V I++ V
Sbjct: 657 NTGKVTSDYVALVFLTTTDAGPKPYPIKTLVGYQRVKAIRPGERKVVDIDVTV 709


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 274/739 (37%), Positives = 404/739 (54%), Gaps = 43/739 (5%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D  +   + +  C  S     R   L+   +L+EK+    SGA  VPRLG+  Y+WW+E 
Sbjct: 25  DCTNGPLKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEG 84

Query: 91  LHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           LHG++  GP T F   G ++  +TSFPQ I   ++F+  L   + +V+S EARA  N   
Sbjct: 85  LHGIA--GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANR 142

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKH 206
            GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQG   D  K V A+CKH
Sbjct: 143 TGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGEATDPYKRVVATCKH 202

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           F  YD+++WNG  R+  + ++++QD+ + +  PF+ CV +  V + MCSYN VNG P CA
Sbjct: 203 FAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCA 261

Query: 267 DPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           DP +L+  +R  W  N    ++  DCD+V   Y    ++ST   AAAD++ AG D+ CG 
Sbjct: 262 DPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDITCGT 321

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           ++  H  +A ++ LL+E  ++ AL+   +  +RLG FD  P +QPY  LG   V T   Q
Sbjct: 322 YMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVATNASQ 380

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA  AA +GIVLLKN G +LPLS     TV + G  ++ T  ++GNYAG+A    +PL 
Sbjct: 381 ALARRAAAEGIVLLKNDG-TLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHSPLY 439

Query: 444 GIGRYARTIHQQGCKDVACAD------DQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
            + +    I+  G K     D        L+G    A   +D  I V G+D  +E E  D
Sbjct: 440 ALKQTGVKINYAGGKPGGQGDPTTNRWSNLYG----AYSTSDVLIYVGGIDNGVEEEGHD 495

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R  L   G Q +++ +++   K P I+V+  GG ID +   N+P I+AI+WAGYPGQ GG
Sbjct: 496 RGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGG 554

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           +AI DI+ G + P G+LP T YP  Y   + M  M +RP ++   PGRTY++Y G  V+ 
Sbjct: 555 SAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGENN--PGRTYKWYNGSAVFE 612

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG-VQVDV 676
           FG+GM YTNF   ++            +  +I++  SG     T     R     V V V
Sbjct: 613 FGYGMHYTNFSAAISTQMQ--------QSYAISSLASG--CNSTGGFLERCPFASVDVQV 662

Query: 677 KNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLS 732
            N G     +  L +   +  PA H  P K LV+++++H +  GA     +N+ +   ++
Sbjct: 663 HNTGKVTSDYVTLGYMAGTFGPAPH--PRKTLVSYKRLHNIAGGATSTAKLNLTLAS-VA 719

Query: 733 VVDRSGTRRIPLGEHNIHI 751
            VD  G + +  G +++ I
Sbjct: 720 RVDEYGNKVLYPGHYSLQI 738


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 271/721 (37%), Positives = 398/721 (55%), Gaps = 31/721 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C   LP  QR   ++   ++ E +  + + +  V RLG+  Y+WWSE LHG++     + 
Sbjct: 61  CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIAR----SN 116

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G++  ATSFPQ I    +FN+ L++ +G V+  EARA  N G AGL Y+SPN+N F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-----DGDRLKVAASCKHFTAYDLDNW 215
           +DPRWGRGQE   E PVL G YA +YV+GLQG      + D L+VAA+CKHF  YD+++W
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQGGIDSNPNDDTLQVAATCKHFAGYDMESW 236

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
               R  +NA +S QD+ D +   F+ CV + K A  MCSYN +NG+P CA    L   I
Sbjct: 237 KQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFLGTVI 296

Query: 276 RGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
           R  +   NG I SDCDS+   ++   +      AAAD I+AG+D++CG     +   A+ 
Sbjct: 297 REGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAADGIKAGVDVNCGDTYQNNLGYALG 356

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
              ++E  I  ++    +  +RLG FD  P +  Y      DV T    +LA +AA +GI
Sbjct: 357 NKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVSTSQANQLAYQAAVEGI 415

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
            LLKN G +LP +  + + VAVIGP ++ T  M+G+YAG      +PLQG       +  
Sbjct: 416 TLLKNDG-TLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSGFKVQY 474

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
                +       + AA++A++ ADA +   G+D SIE EALDR  L  PG Q +LVSK+
Sbjct: 475 AYGTQINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQLDLVSKL 534

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           S  +K P ++V    G +D    KN+  + +I++AGYPGQ+GGTAI D+L G   P G+L
Sbjct: 535 SGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGIYAPAGRL 593

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
             T YP  Y   +PMT+M +RP     YPGRT+ +Y G  VY FG+G+ YT F  ++ANA
Sbjct: 594 STTQYPASYADQVPMTDMTLRPRDG--YPGRTFMWYNGEPVYEFGYGLHYTTFSVSLANA 651

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
           P   A     +  +I+  I+ K+ +        L     V++KN G     +  L++S  
Sbjct: 652 PPKGAP----QSFNIDQFIAAKSSQYVDTS---LITTFDVNIKNTGKVTSDYAALLYSNT 704

Query: 695 PAG---HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
            +G   H  P+K LV+F+K+H +  G  Q   + + +   L   D +G + +  G +   
Sbjct: 705 TSGPGPH--PNKILVSFDKLHQIHPGQIQTASLPVTIGSLLQ-TDTNGNKWLYPGAYTFF 761

Query: 751 I 751
           +
Sbjct: 762 V 762


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 274/734 (37%), Positives = 408/734 (55%), Gaps = 48/734 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL I +RV+ L+  L+L+EK+  L+  +A   RLG+  YEWWSEA HGV +  PG
Sbjct: 31  PVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS-APG 89

Query: 101 TKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
            +F     +F  ATSFP  I TA+SF+ TL   I  V+  E R   N G +G  +W+PN+
Sbjct: 90  VQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFSGFDFWAPNI 149

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N FRDPRWGRGQETPGEDP+++  Y  ++V GLQG D    +V A+CKH+  YDL+    
Sbjct: 150 NGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIATCKHYAVYDLE---- 205

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+  N   ++QD+ + F  PF+ CV +  V S+MCSYN V+G+P CA+  +L   +R 
Sbjct: 206 TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRK 265

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
            W  N    Y+VSDC +V   +   +FT T E AA+ A+ AG+DL+CG  +L L+   A 
Sbjct: 266 HWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNESLAA 325

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
            +  +  +D   +L    +    +G FDG      Y  L   DV TPD Q LA EAA +G
Sbjct: 326 NQTSVKVMD--QSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAVEG 379

Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
           + LLKN    LPL S  ++++VAVIGP ++ T  M G+Y+G A    +PL+  G     +
Sbjct: 380 MTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRWKV 438

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
           +      +   +   F  A+ A+ ++D  I + G+D S+E+E LDR  L  PG Q +L++
Sbjct: 439 NYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLIT 498

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
            +S  SK P ++V   GG +D +    +  I A++WAGYP Q+GGTA+ D+L G  +P G
Sbjct: 499 SLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAG 557

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF----- 627
           +LP+T YP  Y   + + ++ +RP+    YPGRTY++Y G  V PFG+G+ YT F     
Sbjct: 558 RLPVTQYPASYADQVNIFDINLRPTD--LYPGRTYKWYTGKPVLPFGYGLHYTKFMFDWE 615

Query: 628 --VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
             ++   N   +VA   +   G IN               N     V+  VKNVG K   
Sbjct: 616 KTLNREYNIQDLVASCRNSSGGPIND--------------NTPLTTVKARVKNVGHKTSD 661

Query: 686 H-TLLVFSTPPAGHWA-PHKQLVAFEK-VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           + +LL  S+  AG    P+K LV++ + +++  G+ Q   + + +   L+  D +G+  I
Sbjct: 662 YVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTLGS-LARADENGSLVI 720

Query: 743 PLGEHNIHIGGTKH 756
             G + I +  ++ 
Sbjct: 721 FPGRYKIALDNSEE 734


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 270/716 (37%), Positives = 403/716 (56%), Gaps = 27/716 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LIG  +L+EK+    + +  VPRLG+  Y+WW EALHGV+   PG  
Sbjct: 34  CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G++  ATSFPQ I   ++F+  L   +  +VS EARA  N   +GL +W+PN+N F
Sbjct: 93  FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
           +DPRWGRGQETPGEDP     Y  + + GLQG  D +  ++ A+CKH+  YDL+NW G  
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVATCKHYAGYDLENWEGNV 212

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A +S QD+ + +   F  C  +  V + MCSYN VNGVP+CA+  +L+  +RG W
Sbjct: 213 RYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLLQDILRGHW 272

Query: 280 RL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
                + +I SDCD++   Y+  ++  T E   ADA+ AG DLDCG +   +  +A   G
Sbjct: 273 NWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADLDCGTYYPENLGAAYDEG 332

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +E  ++ AL+      ++LG FD    +QPY  +G  +V TP+ +ELA  AA +GI L
Sbjct: 333 LFAESTLDRALIRQYASLVKLGYFD-PAENQPYRQIGWANVSTPEAEELAYRAAVEGITL 391

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           +KN G +LPLS    +++A+IGP ++ T  M GNY G      +PL        T++   
Sbjct: 392 IKNDG-TLPLSP-SIKSLALIGPWANATTQMQGNYYGQPPYLISPLMAAEALNYTVYYSP 449

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
              V       F AA  A++ ADA I + G+D ++EAEA+DR  L  PG Q + + ++S 
Sbjct: 450 GPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQPDFIDQLSQ 509

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P +++ M GG +D +    +  + A+IW GYPGQ+GGTA+ DI+ G + P G+LP 
Sbjct: 510 FGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGNAAPAGRLPT 568

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP +Y+  + MT+M++RPS +   PGRTY +Y G  +  FG G+ YTNF   ++  P+
Sbjct: 569 TQYPLDYVYQVAMTDMSLRPSATN--PGRTYMWYTGTPIVEFGFGLHYTNFSAELSQ-PS 625

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK-DGAHTLLVFSTPP 695
             +  +        A++ G    V H           V+V N+GSK    +  L+F    
Sbjct: 626 APSYDI--------ASLVGACEGVAHLDLCAFE-SYTVNVTNIGSKVTSDYVALLFVAGE 676

Query: 696 AGHWA-PHKQLVAFEKVHVPAG-AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            G    P+K L A++++H  A  + Q+  +N+ +   LS VD  G R +  GE+ +
Sbjct: 677 HGPAPIPNKVLAAYDRLHTIAPLSSQQATLNLTLGS-LSRVDEYGNRVLYPGEYTL 731


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 270/722 (37%), Positives = 399/722 (55%), Gaps = 38/722 (5%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
            S P   R   L+   +L+EK+    SGA  VPRLG+  Y+WWSE LHG++  GP T F 
Sbjct: 3   TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFS 60

Query: 105 --GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G++  +TSFPQ I   ++F+  L   + +V+S EARA  N    GL +W+PN+N FRD
Sbjct: 61  DSGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRD 120

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVDRF 221
           PRWGRGQETPGED      Y  + + GLQG   D  K V A+CKHF  YD+++WNG  R+
Sbjct: 121 PRWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYKRVVATCKHFAGYDVEDWNGNLRY 180

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
             + ++++Q++ + +  PF+ CV +  V + MCSYN VNG P CADP +L+  +R  W  
Sbjct: 181 QNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGW 239

Query: 282 NG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
                ++  DCD+V   Y    ++ T   AAAD++ AG D+ CG ++  H  +A Q+ LL
Sbjct: 240 TNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDVTCGTYMQEHLPAAFQQKLL 299

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +E  ++ AL+   +  +RLG FD    +QPY  LG   V T   Q LA  AA +GIVLLK
Sbjct: 300 NESSLDQALIRQYSSLVRLGYFDAS-ENQPYRQLGFDAVATNASQALARRAAAEGIVLLK 358

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS     TV + G  ++ T  ++GNYAG+A    +PL  + +    I+  G  
Sbjct: 359 NDG-TLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGVKINYAGGN 417

Query: 459 DVACAD------DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
                D        L+G    A   +D  I V G+D S+E E  DR  L   G Q +++ 
Sbjct: 418 PGGQGDPTTNRWSNLYG----AYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGAQLDVIG 473

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
           +++   K P I+V+  GG ID +   N+P I+AI+WAGYPGQ GG+AI DI+ G + P G
Sbjct: 474 QLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGGKTAPAG 532

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
           +LP T YP  Y   + M  M +RP ++   PGRTY++Y G   + FG+GM YTNF   + 
Sbjct: 533 RLPQTQYPANYTAAVSMMNMNLRPGENS--PGRTYKWYNGSATFEFGYGMHYTNFSAEIT 590

Query: 633 NAPTVVAVPLDGRHGSINATISG-KAIKVTHAKCNRLTLGVQV-DVKNVGSKDGAHTLLV 690
                       +  +I++  SG  +      +C   ++ VQV +  NV S       + 
Sbjct: 591 TQMQ--------QSYAISSLASGCNSTGGFLERCPFASVNVQVHNTGNVTSDYITLGYMA 642

Query: 691 FSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            +  PA H  P K LV+++++H +  GA     +N+ +   L+ VD  G + +  G++++
Sbjct: 643 GTFGPAPH--PRKTLVSYKRLHSIAGGATSTATLNLTLAS-LARVDEHGNKVLYPGDYSL 699

Query: 750 HI 751
            I
Sbjct: 700 QI 701


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 273/718 (38%), Positives = 413/718 (57%), Gaps = 43/718 (5%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           FL L+L    ++GLA+   F  D  +        C  SL    R   ++   ++ E +  
Sbjct: 8   FLPLVL----TAGLASAYSFP-DCTNGPLAQNAICDTSLDPISRATAVVDLFTIDELINN 62

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG--GDFPGATSFPQVITTASSFNAT 126
            +S +  VPRLG+  Y+WWSE LHGV++  PG  F   G+F  ATSFPQ I   ++F+  
Sbjct: 63  TVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVNFSASGEFSYATSFPQPIIMGAAFDDE 121

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASY 186
           L +++G +V  E R+  N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + 
Sbjct: 122 LIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNL 181

Query: 187 VRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           V+GLQG    +   +V ++CKHF AYDL++W+G  R+ F+A V+ QD+ + +   F+ C 
Sbjct: 182 VQGLQGGLDPKPYYQVISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCY 241

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFT 302
            + KV + MCSYN VNG+P+CA+  +L+  +R  W    + ++ SDCD+V   YD  ++T
Sbjct: 242 RDAKVGAAMCSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYT 301

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            TPEEA ADA++AG D+DCG F   +   A  + L++E ++  AL+      +RLG FD 
Sbjct: 302 KTPEEAVADALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD- 360

Query: 363 EPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
            P+  QPY      +V TP  Q+LA +AA +GIVLLKN G +LPLS    + +A+IGP  
Sbjct: 361 -PTDIQPYRQYNWNNVDTPQAQQLAYQAAAEGIVLLKNDG-TLPLSS-DIKNIALIGPWG 417

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT 481
           + T  M GNY G+A    +PL G       +      ++   D   F AAI A++ AD  
Sbjct: 418 NATGEMQGNYYGVAPYLISPLMGAVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVV 477

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           I   G+D+++E+E  DR  +  PG Q +LV +++   K P ++V   GG +D    K + 
Sbjct: 478 IYAGGIDETVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANS 536

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
            + A++WAGYPGQ+GG+A+ DI+ G   P G+LP+T YP +Y+  +PMT+M +RP+ +  
Sbjct: 537 TVNALLWAGYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS- 595

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
            PGRTY++Y G  +Y FG+G+ YT F +  A AP+              +T + + +  +
Sbjct: 596 -PGRTYKWYTGTPIYDFGYGLHYTTFSYKWAKAPS--------------STYNIQTLVQS 640

Query: 662 HAKCNRLTLG----VQVDVKNVG--SKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVH 712
               + L L       V+V N G  + D A  L V  T  P+ +  P+K L+ + ++H
Sbjct: 641 GNLYSYLDLAPFDTFTVNVTNTGNVTSDFASLLFVNGTYGPSPY--PNKSLITYARLH 696


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 271/697 (38%), Positives = 389/697 (55%), Gaps = 35/697 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   L+  ++L+EK+      ++  PRLG+  Y WW+EALHGV+  G G  
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFP  I   ++F+  L + +  V+S EARA  NGG AGL YW+PN+N F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRGQETPGEDP+   +Y    V GLQ   G +R KV A+CKHF AYDL+NW G++
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 219

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A VS QD+ + +   F+ C  + KV +VMCSYN +NG+PTCAD  +L+  +R  W
Sbjct: 220 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 279

Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
              +   ++  DC ++   Y   H+ +    AAA A+ AG DLDCG     +  SA+Q+G
Sbjct: 280 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLRSALQQG 339

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +   +NNAL+   +  ++LG FD     QPY  +G  +V TP  +ELA +A  +GIV+
Sbjct: 340 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL    + TVA+IGP ++ T  + GNY G      T +         +    
Sbjct: 399 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 455

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
             D+       F  AI A+++AD  I   G+D +IE E+ DR  ++ PG Q +L+ ++S 
Sbjct: 456 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 515

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I+V   GG +D +    +  + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 516 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 574

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP  Y+  +PMT+M +RP  +   PGRTYR+Y   V+ PFG G+ YT F  +  +A  
Sbjct: 575 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 629

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTP- 694
                    +G  N      A   T+A  +  L     + V N G+    +  L+F T  
Sbjct: 630 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 679

Query: 695 ---PAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHV 727
              P  +  P K LV + +   +  G  Q+V +++ V
Sbjct: 680 RVGPEPY--PIKTLVGYSRAKGIEPGQSQQVKLDVSV 714


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 271/731 (37%), Positives = 418/731 (57%), Gaps = 40/731 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   ++   ++ E +   +S +  VPRLG+  Y+WWSE LHGV++  PG  
Sbjct: 22  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   ++F+  L +++G +V  E R+  N G AGL +W+PN+N F
Sbjct: 81  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR--LKVAASCKHFTAYDLDNWNGV 218
           +DPRWGRGQETPGEDP    +Y  + V+GLQG    +   +V ++CKHF AYDL++W+G 
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKPYYQVISTCKHFAAYDLEDWDGN 200

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R+ F+A V+ QD+ + +   F+ C  + KV + MCSYN VNG+P+CA+  +L+  +R  
Sbjct: 201 YRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQSILRDF 260

Query: 279 WRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
           W    + ++ SDCD+V   YD  ++T TPEEA ADA++AG D+DCG F   +   A  + 
Sbjct: 261 WGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPGAYNQS 320

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIV 395
           L++E ++  AL+      +RLG FD  P+  QPY      +V TP  Q+LA +AA +GIV
Sbjct: 321 LITETELRQALIRQYASLVRLGYFD--PTDIQPYRQYNWNNVDTPQAQQLAYQAAAEGIV 378

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LLKN G +LPLS    + +A+IGP  + T  M GNY G+A    +PL G       +   
Sbjct: 379 LLKNDG-TLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGYNVTYV 436

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
              ++   D   F AAI A++ AD  I   G+D+++E+E  DR  +  PG Q +LV +++
Sbjct: 437 FGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDLVGELA 496

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P ++V   GG +D    K +  + A++WAGYPGQ+GG+A+ DI+ G   P G+LP
Sbjct: 497 AVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAPAGRLP 555

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           +T YP +Y+  +PMT+M +RP+ +   PGRTY++Y G  +Y FG+G+ YT F +  A AP
Sbjct: 556 VTQYPADYVYEIPMTDMDLRPNATS--PGRTYKWYTGTPIYDFGYGLHYTTFSYKWAKAP 613

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----VQVDVKNVG--SKDGAHTLL 689
           +              +T + + +  +    + L L       V+V N G  + D A  L 
Sbjct: 614 S--------------STYNIQTLVQSGNLYSYLDLAPFDTFTVNVTNTGNVTSDFASLLF 659

Query: 690 VFST-PPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
           V  T  P+ +  P+K L+ + ++H + +G    V + + +   ++  D  G   +  G +
Sbjct: 660 VNGTYGPSPY--PNKSLITYARLHDIASGDTASVALGVTLGS-IARADTYGNMWLYPGTY 716

Query: 748 NIHIGGTKHSV 758
            + +    +SV
Sbjct: 717 QVTLDTLGNSV 727


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 270/695 (38%), Positives = 388/695 (55%), Gaps = 31/695 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   L+  ++L+EK+      ++  PRLG+  Y WW+EALHGV+  G G  
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFP  I   ++F+  L + +  V+S EARA  NGG AGL YW+PN+N F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRGQETPGEDP+   +Y    V GLQ   G +R KV A+CKHF AYDL+NW G++
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 240

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A VS QD+ + +   F+ C  + KV +VMCSYN +NG+PTCAD  +L+  +R  W
Sbjct: 241 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 300

Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
              +   ++  DC ++   Y   H+ +    AAA A+ AG DLDCG     +  SA+Q+G
Sbjct: 301 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 360

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +   +NNAL+   +  ++LG FD     QPY  +G  +V TP  +ELA +A  +GIV+
Sbjct: 361 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 419

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL    + TVA+IGP ++ T  + GNY G      T +         +    
Sbjct: 420 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 476

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
             D+       F  AI A+++AD  I   G+D +IE E+ DR  ++ PG Q +L+ ++S 
Sbjct: 477 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 536

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I+V   GG +D +    +  + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 537 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 595

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP  Y+  +PMT+M +RP  +   PGRTYR+Y   V+ PFG G+ YT F  +  +A  
Sbjct: 596 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 650

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                    +G  N      A   T+A  +  L     + V N G+    +  L+F T  
Sbjct: 651 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 700

Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
                P+  K LV + +   +  G  Q+V +++ V
Sbjct: 701 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 735


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 270/695 (38%), Positives = 388/695 (55%), Gaps = 31/695 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   L+  ++L+EK+      ++  PRLG+  Y WW+EALHGV+  G G  
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFP  I   ++F+  L + +  V+S EARA  NGG AGL YW+PN+N F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRGQETPGEDP+   +Y    V GLQ   G +R KV A+CKHF AYDL+NW G++
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 219

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A VS QD+ + +   F+ C  + KV +VMCSYN +NG+PTCAD  +L+  +R  W
Sbjct: 220 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 279

Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
              +   ++  DC ++   Y   H+ +    AAA A+ AG DLDCG     +  SA+Q+G
Sbjct: 280 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 339

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +   +NNAL+   +  ++LG FD     QPY  +G  +V TP  +ELA +A  +GIV+
Sbjct: 340 LYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL    + TVA+IGP ++ T  + GNY G      T +         +    
Sbjct: 399 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 455

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
             D+       F  AI A+++AD  I   G+D +IE E+ DR  ++ PG Q +L+ ++S 
Sbjct: 456 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 515

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I+V   GG +D +    +  + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 516 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 574

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP  Y+  +PMT+M +RP  +   PGRTYR+Y   V+ PFG G+ YT F  +  +A  
Sbjct: 575 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 629

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                    +G  N      A   T+A  +  L     + V N G+    +  L+F T  
Sbjct: 630 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 679

Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
                P+  K LV + +   +  G  Q+V +++ V
Sbjct: 680 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 714


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 293/753 (38%), Positives = 405/753 (53%), Gaps = 37/753 (4%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           +AF  L  I LS ++   A   P   D  +    + P C  SL   +R   L+   ++ E
Sbjct: 1   MAFLLLTSIALSLAAGSRAQGFP---DCVNGPLASNPVCDASLGHVERARALVEEFTVPE 57

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSF 123
            +   ++ A  VPRLG+  YEWW+EALHGV  + PG  F    P  ATSFP  I   S+F
Sbjct: 58  MINNTVNAAFGVPRLGLPPYEWWNEALHGVG-LSPGVVFFEPEPAVATSFPMPINMGSAF 116

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           +  L  A+G V+S EARA  N G AGL YW+PN+N F+DPRWGRG ETPGEDP+ + +Y 
Sbjct: 117 DDALMLAMGDVISTEARAFSNAGRAGLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYV 176

Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
            S V GLQG  D   LKVAA+CKH+ AYDL+NW GV R+ F+A V+ QD+ + +  PFR 
Sbjct: 177 RSLVEGLQGGIDPPSLKVAAACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRS 236

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQH 300
           CV + + AS MCSYN VNGVP CA P +LK  +R  W L  + ++ SDC +VG  YD   
Sbjct: 237 CVRDARAASAMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHG 296

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +T     A+  +++AG DL+CG     +   A  RGL+ E D+  AL       + LG F
Sbjct: 297 YTEDLVNASTVSLKAGTDLNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYF 356

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
           D  P  QPY  +   DV TP+ Q LA  AA +  VLLKN G +LPL+     ++A+IGP 
Sbjct: 357 DA-PEDQPYRQITWADVNTPEAQALAYTAAIKSFVLLKNDG-TLPLTD-STLSLALIGPM 413

Query: 421 SDVT-VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           ++ + + M+GNY GI      PLQG       +      +V   D   F AA+ A+  AD
Sbjct: 414 ANASALQMLGNYFGIPPFVIAPLQGFLDAGFNVTYVLGTNVTGNDAGSFDAAVAAAEAAD 473

Query: 480 ATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
             I V G+D ++E E  DR  +  P  Q  L+S +    K P ++V M GG +D    K 
Sbjct: 474 VVIYVGGIDNTLEMEEKDRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKE 532

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS 599
              + AI+WAGYPGQ+GGTAIAD + G   P G+L        Y+  + MT+M +RP  +
Sbjct: 533 SDAVNAILWAGYPGQSGGTAIADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNA 584

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAI 658
              PGRTY++Y G  VYP+G+G+ YTN  V   ++AP            SI       + 
Sbjct: 585 TGNPGRTYKWYTGTPVYPYGYGLHYTNISVAWASDAPEACY--------SIQDLTGEASG 636

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV-HVPAG 716
            V  A  +      +V V N G        L+F +  AG   AP K++VA+ +   V  G
Sbjct: 637 FVDLAPLDTF----RVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPG 692

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
               V + + +   L+  D SG   +  G++ +
Sbjct: 693 NSTEVELEVTLGA-LARTDESGDASLYPGKYEL 724


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 276/735 (37%), Positives = 399/735 (54%), Gaps = 79/735 (10%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  VSL    R  DL+ R++L EK+  +I  A A+PRL I  Y WW+E LHGV+  G   
Sbjct: 13  YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FPQ I  A++FN  L   +   +SDEARA ++        G   GLT+W
Sbjct: 71  --------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYKGLTFW 122

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPN+NIFRDPRWGRG ET GEDP L+ +   ++V+GLQG D   LKV A+ KH+  +   
Sbjct: 123 SPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDDPKYLKVVATPKHYAVH--- 179

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +     R  F+A+VS++D+ +T+   F  CV EGK  S+M +YN+ NG P CA   +LK 
Sbjct: 180 SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPCCASKTLLKD 239

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R EW  +GY+VSDC ++   +     T T  E+AA A+  G +L+CG       + AV
Sbjct: 240 ILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCGKTYEYLCQ-AV 298

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQ 392
           ++GL+SE  I+ A++   T +MRLGMFD  P    Y H+ P DV  +P+H+ELALE ARQ
Sbjct: 299 EQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHI-PYDVNDSPEHRELALETARQ 356

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--- 449
            IVLLKN    LPLS  + +T+AVIGPN+D    ++ NY G    Y TPL+GI       
Sbjct: 357 SIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLEGIKNKVSPD 415

Query: 450 -RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
            + ++ +GC+    + D  F  A++ +  AD  I+ +GL   IE E           DR 
Sbjct: 416 TKVLYAKGCEVTGNSVDG-FDEAVNIAEMADIVIMCLGLSPRIEGEEGDVADSDGGGDRL 474

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            + LPG Q++L+  +    K P +LVL++G  I + +A     + AII A YPG+ GGTA
Sbjct: 475 HIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHE--HVPAIIEAWYPGEEGGTA 531

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IAD+LFG  NP G+LP+T+     + +LP       P       GRTYR+++   +YPFG
Sbjct: 532 IADVLFGDYNPAGRLPITFV--RSLDDLP-------PFTDYNMKGRTYRYFEKEPLYPFG 582

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G+SYT+F ++                   N  +S   +   +       L + VDV+N 
Sbjct: 583 YGLSYTSFKYS-------------------NLRLSAMRLPAGN------NLDINVDVENT 617

Query: 680 GSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G   G   + L  S   A    P +QL   + + +  G +Q V   +   +++S+ D  G
Sbjct: 618 GKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQTVSFTVEP-QHMSLFDYDG 676

Query: 739 TRRIPLGEHNIHIGG 753
            R +  G+  I +GG
Sbjct: 677 KRILEPGQFIIAVGG 691


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 235/518 (45%), Positives = 319/518 (61%), Gaps = 10/518 (1%)

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADA 312
           MCSYN+VNGVPTCAD N+L  T R +W   GYI SDCD+V + +D Q +  T E+A AD 
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           ++AG+D++CG ++  H  SA+Q+G ++E DIN AL N   V+MRLG+F+G+P    YG +
Sbjct: 61  LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120

Query: 373 GPKDVCTPDHQELALEAARQGIVLLKNQG--PSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
           GP  VCT +HQ+LALEAA+ GIVLLKN G   +LPLS     ++AVIG N++  + + GN
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180

Query: 431 YAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
           Y G  C   TPLQ +  Y + T    GC   AC +      A+ A+  AD+ +L MGLDQ
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQ 239

Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
             E E +DR  L LPG+QQ L+  V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WA
Sbjct: 240 DQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWA 299

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           GYPG+AGG AIA +LFG  NPGG+LP+TWYPQ++ T +PMT+M MR   +  YPGRTYRF
Sbjct: 300 GYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATGYPGRTYRF 358

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
           Y+GP V+ FG+G+SY+ + H  A    PT     L     +     S     +    C+R
Sbjct: 359 YRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCDR 418

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
           L     V V+N G  DG H++LVF   P    G   P  QL+ F+ +H+ A     V   
Sbjct: 419 LKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFE 478

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           +  CK+ S     G + I  G H + +G  +  +S  A
Sbjct: 479 VSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 516


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 269/695 (38%), Positives = 387/695 (55%), Gaps = 31/695 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL    R   L+  ++L+EK+      ++  PRLG+  Y WW+EALHGV+  G G  
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFP  I   ++F+  L + +  V+S EARA  NGG AGL YW+PN+N F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRGQETPGEDP+   +Y    V GLQ   G +R KV A+CKHF AYDL+NW G++
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVATCKHFAAYDLENWEGIE 240

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+ F+A VS QD+ + +   F+ C  + KV +VMCSYN +NG+PTCAD  +L+  +R  W
Sbjct: 241 RYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHW 300

Query: 280 ---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
              +   ++  DC ++   Y   H+ +    AAA A+ AG DLDCG     +  SA+Q+G
Sbjct: 301 GWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQG 360

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +   + NAL+   +  ++LG FD     QPY  +G  +V TP  +ELA +A  +GIV+
Sbjct: 361 LYNNQTLYNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVM 419

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL    + TVA+IGP ++ T  + GNY G      T +         +    
Sbjct: 420 LKNDG-TLPLK--SNGTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQ 476

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
             D+       F  AI A+++AD  I   G+D +IE E+ DR  ++ PG Q +L+ ++S 
Sbjct: 477 GTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD 536

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I+V   GG +D +    +  + A++WAGYP QAGG A+ DIL G S P G+LP+
Sbjct: 537 LEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAPAGRLPV 595

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP  Y+  +PMT+M +RP  +   PGRTYR+Y   V+ PFG G+ YT F  +  +A  
Sbjct: 596 TQYPASYVDEVPMTDMTLRPGSNN--PGRTYRWYDKAVL-PFGFGLHYTTFNVSWNHA-- 650

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                    +G  N      A   T+A  +  L     + V N G+    +  L+F T  
Sbjct: 651 --------EYGPYNT--DSVASGTTNAPVDTELFDTFSITVTNTGNVASDYIALLFLTAD 700

Query: 696 AGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHV 727
                P+  K LV + +   +  G  Q+V +++ V
Sbjct: 701 GVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVSV 735


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 278/743 (37%), Positives = 397/743 (53%), Gaps = 92/743 (12%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           S    +R  DL+ R++++EKV  ++  + A+ RLGI  Y WW+EALHGV+  G       
Sbjct: 11  SYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT------ 64

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
               AT FPQ I  A++F+  L   +  V+S E RA Y+  +         GLT+WSPN+
Sbjct: 65  ----ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFWSPNI 120

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG+    LK AA  KHF  +     +G
Sbjct: 121 NIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAAACAKHFAVH-----SG 175

Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
            +  R  FNA VSK+D+ +T+   F+  V E KV SVM +YN+ NG P C    +L   +
Sbjct: 176 PESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTLLSDIL 235

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           RGEW   G++VSDC ++  ++   H T+T  E+AA A+R G DL+CG   G +   A++ 
Sbjct: 236 RGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLLIALKE 294

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGI 394
           GL++E +I+ A+   +  +M+LGMFD  P  Q PY  +    V   +H+ELAL+ A++ I
Sbjct: 295 GLITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISYDFVDCKEHRELALDVAKKSI 352

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----R 450
           VLLKN G  LPL   + R++AVIGPN+D    +IGNY G A  Y T L GI   A    R
Sbjct: 353 VLLKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMAGDDVR 411

Query: 451 TIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL------- 496
             +  GC       +++    D++   A+  +  AD  I+ +GLD +IE E +       
Sbjct: 412 IYYSVGCHLYKDRVENLGEPGDRI-AEAVTCAEHADVVIMCLGLDSTIEGEEMHESNIYG 470

Query: 497 --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             D+  L LPG+QQEL+  V    K P +LVL++G  + V +A  D  I AI+ A YPG 
Sbjct: 471 SGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAWYPGA 527

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGP 613
            GG AIA +LFG +NP GKLP+T+Y       LP  T+ +M          RTYRF K  
Sbjct: 528 LGGRAIASVLFGETNPSGKLPVTFY--RTTEELPDFTDYSME--------NRTYRFMKNE 577

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
            +YPFG G+SYT F ++                   +  +S   I+            V 
Sbjct: 578 ALYPFGFGLSYTTFDYS-------------------DLKLSKDTIRAGEG------FNVS 612

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V V N G   G   + V+       W  P+ QL   ++V + +G    +   I   + L+
Sbjct: 613 VKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEITFEIR-PEQLA 671

Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
           VV   G   I  GE  I++GG++
Sbjct: 672 VVTDEGKSVIEPGEFEIYVGGSQ 694


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 267/718 (37%), Positives = 404/718 (56%), Gaps = 27/718 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   ++ EK+ L  + +  VPRLG+  Y+WW EALHGV++  PG  
Sbjct: 44  CDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS-SPGVT 102

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G F  ATSFPQ I   ++F+  L +++  VVS EARA  N G AGL +W+PN+N +
Sbjct: 103 FNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWTPNINPY 162

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRGQETPGEDP     Y  S + GLQG  D +  K+ A+CKHF  YD+++WNG  
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGEDPEIRKITATCKHFAGYDIESWNGNL 222

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+  + ++ ++D+ + +   FR C  +  V + MC+Y+ +NGVPTCADP +L   +R  W
Sbjct: 223 RYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTCADPWLLNDVLREHW 282

Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
                  ++ SDCDS+   +   +F+ T + AAA A+ AG DLDCG +   H   A  +G
Sbjct: 283 GWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDLDCGTYYQHHLPLAYSQG 342

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L+++  ++ ALV   T  +R G FDG P++  Y +L   DV T   Q+LAL+AA +G+VL
Sbjct: 343 LINQTTVDQALVRLYTSLVRTGYFDG-PNAM-YRNLTWSDVGTTHAQQLALQAAEEGMVL 400

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY-ARTIHQQ 455
           LKN G  LPLS      +A+IG  ++ T  M GNY G+     +PL    +  A+  + Q
Sbjct: 401 LKNDG-LLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLYAAQQTGAQVFYAQ 459

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
           G           +     A+ +AD  I + G+D S+EAE +DR  +   G Q +++ +++
Sbjct: 460 GPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDINWTGAQLDIIGELA 519

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
           M  K P +L  M G  +D     N+  I+A+IW GYPGQ GG A+ +I+ G + P G+LP
Sbjct: 520 MYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFNIITGKTAPAGRLP 577

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           +T YP  YI ++PMT+M +RP+ +   PGRTY++Y G  V+ FG+GM YT F   ++   
Sbjct: 578 VTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWYNGTAVFEFGYGMHYTKFSADIS--- 634

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV-DVKNVGSKDGAHTLLVFSTP 694
                P+      I++ +SG   +    +C   ++ V V +  NV S   A   +     
Sbjct: 635 -----PMSKSSYDISSLLSG-CNETYKDRCAFESISVNVHNTGNVTSDYAALGFIAGQFG 688

Query: 695 PAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           P+ +  P K LV ++++H +  G+ Q   +N+ +   LS VD  G   +  G++ + I
Sbjct: 689 PSPY--PKKSLVNYQRLHNIAGGSSQTATLNLTLGS-LSRVDDHGNTYLYPGDYALMI 743


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 276/698 (39%), Positives = 391/698 (56%), Gaps = 30/698 (4%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL    R   L+  ++ +EKV+   +G+    RLG+  Y+WW+EALHGV+   PG
Sbjct: 163 PICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG-SPG 221

Query: 101 TKF--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
             F   G+F  ATSFPQ I  +++F+  L + +G VVS E RA  N G AGL +W+PN+N
Sbjct: 222 VTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGLDFWTPNIN 281

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNG 217
            FRDPRWGRGQETPGEDP    +Y  + V GLQ G      +V A+CKHF  YD+++W G
Sbjct: 282 PFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQNGIAPANPRVVATCKHFAGYDIEDWEG 341

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ FNA +S QD+ + +  PF+ C  + +V ++MCSYN VNG+PTCAD  +L   +R 
Sbjct: 342 NSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLLDTILRD 401

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            W  N    ++ SDCD+V   Y    +TS+   AAADA+ AG +LDCG  +  +  +A  
Sbjct: 402 HWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNLDCGTTMSNNLAAAAA 461

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           + L     +N+ALV   +  +RLG FD E S   Y  LG  DV T   Q+LA  AA +GI
Sbjct: 462 QDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGTTASQQLANRAAVEGI 519

Query: 395 VLLKNQGPS-LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           VLLKN     LPLS    +T+A+IGP ++ T  + GNY G      T + G  +   T+ 
Sbjct: 520 VLLKNDHKKVLPLSQ-HGQTIALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQMGYTVQ 578

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
            +    +   D   F AA+ A++ AD  I   G+D SIEAEA+DR  +   G Q +L+ +
Sbjct: 579 YEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQLQLIDQ 638

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P +++   GG +D +    +  + A++W GYP Q GG A+ DIL G S P G+
Sbjct: 639 LSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQSAPAGR 697

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP+T YP  Y   +PMT+M++RP+ S   PGRTYR+Y   V+ PFG G+ YT F  + A 
Sbjct: 698 LPVTQYPANYTNAIPMTDMSLRPNGST--PGRTYRWYDDAVI-PFGFGLHYTTFDASWA- 753

Query: 634 APTVVAVPLDGRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF- 691
                    D + G  N A++  KA K  +           V+VKN G        L+F 
Sbjct: 754 ---------DKKFGPYNTASLVAKASKSKYQDTAPFD-SFHVNVKNTGKVTSDFVALLFA 803

Query: 692 STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHV 727
           ST  AG    P K L+++ +   +  G  + V I++ +
Sbjct: 804 STDNAGPKPYPIKTLISYARASSIKPGETRTVSIDVTI 841


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 283/734 (38%), Positives = 413/734 (56%), Gaps = 39/734 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +  +  R   L+  L+ +EK  L  + +  VPRLG+  Y+WW EALHGV++  PG  
Sbjct: 35  CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS-SPGVN 93

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   GDF  ATSFPQ I  +++F+  L  A+  VVS EARA  N   +GL +W+PN+N +
Sbjct: 94  FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWTPNINPY 153

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK-VAASCKHFTAYDLDNWNGVD 219
           +DPRWGRGQETPGED      Y A+ + GLQG     +K V A+CKHF AYDL++W   D
Sbjct: 154 KDPRWGRGQETPGEDTFHLKSYVAALIDGLQGGLNPPIKKVIATCKHFVAYDLEDWITTD 213

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R++F+A VS QD+ + +  PF+ C  + +V S+MCSYN +NGVPTCADP IL+  +R  W
Sbjct: 214 RYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTCADPYILQTVLREHW 273

Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
                  Y+ SDCD++   Y   ++  T E+A ADA+ AG DL+CG +   H  +A   G
Sbjct: 274 NWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDLNCGTYYQTHLPAAFSEG 333

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L ++  I+  +    +  ++LG FD  PS+ PY  L   DV TP  + LAL+AA +GIVL
Sbjct: 334 LFNQTVIDQTITRLYSALIKLGYFD-PPSATPYRSLNWSDVSTPAAEALALKAAEEGIVL 392

Query: 397 LKNQGPSLPLSH--IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIH 453
           LKN G  LPLS    ++ TVA+IG  ++ T TM GNY GIA    +PL  + +       
Sbjct: 393 LKNDG-LLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHSPLYALQQLPNINAV 451

Query: 454 QQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
             G   V   D  D+L GAA     +AD  I+  GL  S E+E+ DR  +       +++
Sbjct: 452 YGGGFGVPTTDGWDELLGAA----GEADLIIIADGLTTSDESESNDRYTIGWQPAAIDII 507

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
           +++S   K PT+ + M G  +D     N+P I+A+IW GYPG AGG A+ +IL G + P 
Sbjct: 508 NQLSGMGK-PTVFLQM-GDQLDNTPLLNNPNISALIWGGYPGMAGGDALINILTGKAAPA 565

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           G+LP+T YP +Y+  + MT+M +RP+ +   PGRTY++Y   V+ PFG+G+ YTNF    
Sbjct: 566 GRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRTYKWYNNAVL-PFGYGLHYTNF-SVA 623

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR-----LTL----GVQVDVKNVGSK 682
           A+A             S     +   I    + C+R     L L       V+V N GSK
Sbjct: 624 ASAQGQAQTQSGPSSNSSQGQGTSYNISSLVSSCDRSQYAYLDLCPFESFNVNVTNTGSK 683

Query: 683 DGAHTLLVFSTPPAGHWAPH----KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRS 737
             +  + +     +G + P     KQLVA++++ ++ AGA     +N+ +   L+  D +
Sbjct: 684 LASDFVALGFI--SGSYGPQPYPIKQLVAYQRLFNISAGASATATLNLTLGS-LARHDEN 740

Query: 738 GTRRIPLGEHNIHI 751
           G   +  G++ + I
Sbjct: 741 GNAVLYPGDYGLLI 754


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 257/594 (43%), Positives = 351/594 (59%), Gaps = 15/594 (2%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L  C  SL +  R   L+  ++ +EKV      +  VPRLG+  Y WWSEALHGV+   P
Sbjct: 40  LAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98

Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G +F   G F  ATSFPQ I   ++F+  L + +  VVS E RA  N G +GL +W+PN+
Sbjct: 99  GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRSGLDFWTPNI 158

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
           N FRD RWGRGQETPGEDP+   +Y    V GLQ   G    KV A+CKHF AY L++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYGLEDWN 218

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           GV R  FNA+VS QD+ + +  PF+ C  + +V +VMCSYN +NGVP CAD  +L+  +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQTILR 278

Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W+ +    +I SDC ++   Y+  +FT+TP EAAA A+ AG DLDCG     +   A 
Sbjct: 279 EHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
             GL S   ++ ALV   +  ++LG FD     QPY  +G  DV TP  + LA +AA +G
Sbjct: 339 DEGLYSNQTLDRALVRLYSSFVKLGYFD-PAEDQPYRSIGWTDVDTPAVEALAHKAAGEG 397

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLLKN   +LPL      T+A+IGP ++ T  M GNY G A    T L    +    + 
Sbjct: 398 IVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVK 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +       F AA+ A++QAD  +   G+D +IEAE  DR  +  PG Q  L+ +
Sbjct: 455 YAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQ 514

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P ++V   GG +D +   ++PR+ A++WAGYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGR 573

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           LP+T YP +Y+  +PMT+MA+RP  +   PGRTYR+Y   V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPMTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 287/697 (41%), Positives = 389/697 (55%), Gaps = 32/697 (4%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L  C  S  +  R   L+  +S  EKV      A  VPRLG+  Y WWSEALHGV+   P
Sbjct: 40  LAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-P 98

Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G  F   G F  ATSF Q I   +SF+  L + +  VV  E RA  N G AGL YW+PN+
Sbjct: 99  GVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRAGLDYWTPNI 158

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
           N FRDPRWGRGQETPGEDP+   +Y    V GLQG  G  R ++AA+CKHF AYD+++WN
Sbjct: 159 NPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAATCKHFAAYDMEDWN 218

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           GV R  F+A+VS QD+ + +   F+ CV + +V +VMCSYN +NGVPTCADP +L+  +R
Sbjct: 219 GVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLR 278

Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W  +    ++VSDC ++   Y   ++T T  EAAA A+ AG DLDCG     H   A 
Sbjct: 279 EHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAA 338

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
           ++GL +   ++ ALV   +  ++LG FD     QPYG +G KDV TP  ++LA +AA +G
Sbjct: 339 EQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAAVEG 397

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLLKN   +LPL      T+A+IGP ++ T  M GNY G      T      ++   + 
Sbjct: 398 IVLLKND-QTLPLK--AKGTLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQ 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +  +    F AA+ A++ AD  +   G+D +IE+E LDR  +  PG Q  L+S+
Sbjct: 455 YSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISE 514

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P I++   GG +D      +P + A++WAGYP Q GG AI DIL G + P G+
Sbjct: 515 LSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGR 573

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP+T YP  Y   +PMTEM +R       PGRTYR+Y   VV PFG G+ YT+F      
Sbjct: 574 LPITQYPAAYTAQVPMTEMGLRAGGDN--PGRTYRWYDKAVV-PFGFGLHYTSF------ 624

Query: 634 APTVVAVPLD-GRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
                 V  D GR G  N A +  +A   +H     L    +V V+N G+    +  L+F
Sbjct: 625 -----EVSWDRGRLGPYNTAALVNRAPGGSHVD-RALFDTFRVQVQNTGTVTSDYVALLF 678

Query: 692 -STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
             T  AG    P K LV + +V  V  G ++ V I +
Sbjct: 679 VKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEV 715


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 292/713 (40%), Positives = 390/713 (54%), Gaps = 64/713 (8%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L  C  S  +  R   L+  +S  EKV      A  VPRLG+  Y WWSEALHGV+   P
Sbjct: 72  LAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-P 130

Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G  F   G F  ATSF Q I   +SF+  L + +  VV  E RA  N G AGL YW+PN+
Sbjct: 131 GVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRAGLDYWTPNI 190

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
           N FRDPRWGRGQETPGEDP+   +Y    V GLQG  G  R ++AA+CKHF AYD+++WN
Sbjct: 191 NPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAATCKHFAAYDMEDWN 250

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           GV R  F+A+VS QD+ + +   F+ CV + +V +VMCSYN +NGVPTCADP +L+  +R
Sbjct: 251 GVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLR 310

Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W  +    ++VSDC ++   Y   ++T T  EAAA A+ AG DLDCG     H   A 
Sbjct: 311 EHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAA 370

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
           ++GL +   ++ ALV   +  ++LG FD     QPYG +G KDV TP  ++LA +AA +G
Sbjct: 371 EQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAAVEG 429

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLLKN   +LPL      T+A+IGP ++ T  M GNY           QG  +Y RT+ 
Sbjct: 430 IVLLKND-QTLPLK--AKGTLALIGPYANATKQMQGNY-----------QGPPKYIRTLE 475

Query: 454 ----QQGCKDVACADDQLFGAAIDASR------------QADATILVMGLDQSIEAEALD 497
               Q G +          G AI+ S              AD  +   G+D +IE+E LD
Sbjct: 476 WAATQHGYQVQYSP-----GTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLD 530

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R  +  PG Q  L+S++S   K P I++   GG +D      +P + A++WAGYP Q GG
Sbjct: 531 RTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGG 589

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            AI DIL G + P G+LP+T YP  Y   +PMTEM +R       PGRTYR+Y   VV P
Sbjct: 590 AAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDN--PGRTYRWYDKAVV-P 646

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSIN-ATISGKAIKVTHAKCNRLTLGVQVD 675
           FG G+ YT+F            V  D GR G  N A +  +A   +H     L    +V 
Sbjct: 647 FGFGLHYTSF-----------EVSWDRGRLGPYNTAALVNRAPGGSHVD-RALFDTFRVQ 694

Query: 676 VKNVGSKDGAHTLLVF-STPPAG-HWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
           V+N G+    +  L+F  T  AG    P K LV + +V  V  G ++ V I +
Sbjct: 695 VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERRSVEIEV 747


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 253/594 (42%), Positives = 350/594 (58%), Gaps = 15/594 (2%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L  C  SL +  R   L+  ++ +EKV      +  VPRLG+  Y WWSEALHGV+   P
Sbjct: 40  LAVCDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98

Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G +F   G F  ATSFPQ I   ++F+  L + +  VVS E RA  N G AGL +W+PN+
Sbjct: 99  GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRAGLDFWTPNI 158

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
           N FRD RWGRGQETPGEDP+   +Y    V GLQ   G    KV A+CKHF AYDL++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYDLEDWN 218

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           GV R  FNA+VS QD+ + +  PF+ C  + KV +VMCSYN +NGVP CAD  +L+  +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGVPACADSYLLQTILR 278

Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W+ +    +I  DC ++   Y+  ++T TP EAAA A+ AG DLDCG     +   A 
Sbjct: 279 EHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
             GL +   ++ ALV   +  ++LG FD     QPY  +G KDV +P  + LA +AA +G
Sbjct: 339 DEGLYTNKTLDKALVRLYSSLVKLGYFD-PAEDQPYRSIGWKDVDSPAAEALAHKAAVEG 397

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLLKN   +LPL      T+A+IGP ++ T  M GNY G      T L    +    + 
Sbjct: 398 IVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEGPPKYIRTLLWAATQAGYDVK 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +       F AA+ A++QAD  +   G+D +IEAE  DR  ++ PG Q +L+ +
Sbjct: 455 YVAGTAINANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGHDRTTIVWPGNQLDLIDQ 514

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P ++V   GG +D +   ++P + A++W GYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGGSAIFDILTGKTAPAGR 573

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           LP+T YP +Y+  +P+T+MA+RP  +   PGRTYR+Y   V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 258/594 (43%), Positives = 352/594 (59%), Gaps = 15/594 (2%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L  C  SL +  R   L+  ++ +EKV      +  VPRLG+  Y WWSEALHGV+   P
Sbjct: 40  LAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SP 98

Query: 100 GTKFG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G +F   G F  ATSFPQ I   ++F+  L + +  VVS E RA  N G +GL +W+PN+
Sbjct: 99  GVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRSGLDFWTPNI 158

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWN 216
           N FRD RWGRGQETPGEDP+   +Y    V GLQ   G    KV A+CKHF AYDL++WN
Sbjct: 159 NPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVATCKHFAAYDLEDWN 218

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           GV R  FNA+VS QD+ + +  PF+ C  + +V +VMCSYN +NGVP CAD  +L+  +R
Sbjct: 219 GVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQTILR 278

Query: 277 GEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W+ +    +I SDC ++   Y+  +FT+TP EAAA A+ AG DLDCG     +   A 
Sbjct: 279 EHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAA 338

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
             GL S   ++ ALV   +  ++LG FD     QPY  +G  DV TP  + LA +AA +G
Sbjct: 339 DEGLYSNQTLDRALVRLYSSLVKLGYFD-PAEDQPYRSIGWTDVDTPAAEALAHKAAGEG 397

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           IVLLKN   +LPL      T+A+IGP ++ T  M GNY G A    T L    +    + 
Sbjct: 398 IVLLKNDK-TLPLKA--KGTLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVK 454

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +       F AA+ A++QAD  +   G+D +IEAE  DR  +  PG Q  L+ +
Sbjct: 455 YAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQ 514

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P ++V   GG +D +   ++PR+ A++WAGYP Q GG+AI DIL G + P G+
Sbjct: 515 LSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGR 573

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           LP+T YP +Y+  +PMT+MA+RP  +   PGRTYR+Y   V+ PFG G+ YT F
Sbjct: 574 LPVTQYPADYVNQVPMTDMALRPGSNT--PGRTYRWYDKAVL-PFGFGLHYTTF 624


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 274/741 (36%), Positives = 394/741 (53%), Gaps = 88/741 (11%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           S    +R  DL+ R++++EKV  ++  + A+ RLGI  Y WW+EALHGV+  G       
Sbjct: 11  SYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT------ 64

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
               AT FPQ I  A++F+  L   +  V+S E RA Y+  +         GLT+WSPN+
Sbjct: 65  ----ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFWSPNI 120

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG+    LK    CK+   + +   + 
Sbjct: 121 NIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAGGMCKNILPFTVVPESL 180

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R  FNA VSK+D+ +T+   F+  V E KV SVM +YN+ NG P C    +L   +RG
Sbjct: 181 --RHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTLLSDILRG 238

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   G++VSDC ++  ++   H T+T  E+AA A+R G DL+CG   G +   A++ GL
Sbjct: 239 EWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLLIALKEGL 297

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++E +I+ A+   +  +M+LGMFD  P  Q PY  +     C  +H+ELAL+ A++ IVL
Sbjct: 298 ITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISSFVDCK-EHRELALDVAKKSIVL 354

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTI 452
           LKN G  LPL   + R++AVIGPN+D    +IGNY G A  Y T L GI   A    R  
Sbjct: 355 LKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMAGDDVRIY 413

Query: 453 HQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL--------- 496
           +  GC       +++    D++   A+  +  AD  I+ +GLD +IE E +         
Sbjct: 414 YSVGCHLYKDRVENLGEPGDRI-AEAVTCAEHADVVIMCLGLDSTIEGEEMHESNIYGSG 472

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           D+  L LPG+QQEL+  V    K P +LVL++G  + V +A  D  I AI+ A YPG  G
Sbjct: 473 DKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAWYPGALG 529

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           G AIA +LFG +NP GKLP+T+Y       LP  T+ +M          RTYRF K   +
Sbjct: 530 GRAIASVLFGETNPSGKLPVTFY--RTTEELPDFTDYSME--------NRTYRFMKNEAL 579

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG G+SYT F ++                   +  +S   I+            V V 
Sbjct: 580 YPFGFGLSYTTFDYS-------------------DLKLSKDTIRAGEG------FNVSVK 614

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           V N G   G   + V+       W  P+ QL   ++V + +G    +   I   + L+VV
Sbjct: 615 VTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGETAEITFEIR-PEQLAVV 673

Query: 735 DRSGTRRIPLGEHNIHIGGTK 755
              G   I  GE  I++GG++
Sbjct: 674 TDEGKSVIEPGEFEIYVGGSQ 694


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 262/712 (36%), Positives = 398/712 (55%), Gaps = 35/712 (4%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG---DF 107
           QRV  LI  L+L+EK+  L+  +A   RLG+  YEWW+EA HGV +  PG +F     +F
Sbjct: 24  QRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGS-APGVQFTEKPVNF 82

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGR 167
             ATSFP  I TA+SF+  L   I  V+  E RA  N G +G  +W+PN+N FRDPRWGR
Sbjct: 83  SYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPFRDPRWGR 142

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           GQETPGED  +   Y  +++ GLQG D +  +V A+CKH+ AYDL+      R+  +   
Sbjct: 143 GQETPGEDSFVVQSYIRNFIPGLQGDDPEDKQVIATCKHYAAYDLE----TGRYGNDYNP 198

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---Y 284
           ++QD+ D F  PF+ CV +  V S+MC+YN V+G+PTCA   +L + +R  W       Y
Sbjct: 199 TQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHWNFTADYNY 258

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDI 343
           +VSDC +V   +   +FT T E AA+ ++ AG+DL+CG  +L L+   A  +  +  +D 
Sbjct: 259 VVSDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLECGSSYLKLNESLAANQTTVQALD- 317

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
             AL    +    +G FDG      Y  LG  DV TP+ Q LA EAA +G+ LLKN    
Sbjct: 318 -QALTRLYSALFTVGFFDG----GKYTALGFADVSTPEAQSLAYEAAVEGMTLLKNDKRL 372

Query: 404 LPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVAC 462
           LP+ S  ++++VA+IGP ++ T  M G+Y+GI     +PL+    +   ++      +  
Sbjct: 373 LPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAMGTGINN 432

Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
                F +A+ A+ ++D  I + G+D SIEAE LDR  L  PG Q +LV+++S   K P 
Sbjct: 433 QTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLSKLHK-PL 491

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
           I+V   GG +D +    +  + A++WAGYP Q+GG+A+ D+L G  +  G+LP+T YP  
Sbjct: 492 IVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLPVTQYPAS 551

Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
           Y   + + ++ +RP+ S  YPGRTY++Y G  V PFG+G+ YT F    A         L
Sbjct: 552 YADQVSIFDINIRPNDS--YPGRTYKWYTGMPVVPFGYGLHYTKFEFEWAQT-------L 602

Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT-LLVFSTPPAGHWA- 700
           +  +       S ++           T  V+  VKN+G +   +  LL  S+P AG    
Sbjct: 603 NHEYNIQQLVASCQSTGPISDNTPFTT--VKAHVKNIGPEASDYVGLLFLSSPDAGPAPR 660

Query: 701 PHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           P+K LV++ ++H + +G+Q  + + + +   ++  D +G   I  G + I +
Sbjct: 661 PNKSLVSYLRLHNITSGSQGTLDLPLTLGS-MARADENGNLVIFPGHYKIAL 711


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 264/701 (37%), Positives = 385/701 (54%), Gaps = 35/701 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +L    R   L+G L++ EK    ++ +  VPRLG+  Y WWSE LHGV++  PG  
Sbjct: 42  CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G +F  ATSFP+ I   ++F+  L   I  ++S EARA  N   +GL +W+PN+N 
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRLKVAASCKHFTAYDLDNWNG 217
            RDPRWGR  ETPGEDP     Y A  V GLQ  G D    K+ A+CKH+  YDL+NW G
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVTGLQFGGDDPKYQKLVATCKHYAGYDLENWGG 220

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ F+A +S QD+ + F  PF+ C  +  V SVMCSYN VNG+P+CA+  +L+  +R 
Sbjct: 221 YARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYLLQSLLRT 280

Query: 278 EW-------RLNG-YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
            W        LN  Y+ SDCD+V   Y   ++T TPE+A A +++AG DLDCG F     
Sbjct: 281 YWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSLKAGTDLDCGTFYAEWL 340

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
            S+ ++GL  + DI+ AL+ +      LG FD     Q Y      ++ T   Q+LA  A
Sbjct: 341 PSSYEQGLFHQTDIDRALIRSYAALFLLGYFD-PAEGQIYRQYNWANINTDYAQQLAYTA 399

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
           A +GI LLKN    LPL       +A+IGP ++ T  M GNY GIA    +PL  + +  
Sbjct: 400 AWEGITLLKNIDDMLPLPSTM-TNIALIGPWANATTQMQGNYQGIAPFLHSPLYALQQRG 458

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
             +      ++       F AA+ A++ AD T+ + G+D ++EAEA+DR  +  PG Q +
Sbjct: 459 INVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNITWPGNQLD 518

Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
           L+++++  S    I+  M GG ID      +P++  ++W GYPGQ GGTA+ DIL+G+  
Sbjct: 519 LIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMIDILYGSRA 577

Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
           P G+LP++ YP  +I  +PMT+M + P+     PGRTY++Y G +V PFG+G+ YT F  
Sbjct: 578 PAGRLPLSQYPANFINEVPMTDMRLHPALGT--PGRTYKWYSGDLVLPFGYGLHYTTFAK 635

Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTL 688
                    A+       S  AT+  +A K + A  ++    V   +V N GS    +  
Sbjct: 636 --------AALKDHSPRSSDIATLVNEA-KQSSAWLDKAFFDVFAAEVTNTGSLTSDYVA 686

Query: 689 LVFST---PPAGHWAPHKQLVAFEKV-HVPAGAQQRVGINI 725
           L + T    PA +  P   LV++ ++  V  G  Q V  ++
Sbjct: 687 LGYLTGEFGPAPY--PKSSLVSYTRLSQVTPGETQVVNFDL 725


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 275/734 (37%), Positives = 402/734 (54%), Gaps = 51/734 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL I +RVN L+  L+L+EK+  L+  AA   RLG+  YEWW+EA HGV +  PG +
Sbjct: 163 CDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGVGS-APGVQ 221

Query: 103 FG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F     +F  ATSFP  I  A+SF+  L   I  V+  E RA  N G +G  +W+PN+N 
Sbjct: 222 FTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFWAPNING 281

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           FRDPRWGRGQETPGED  ++  Y  +++ GLQG D    +V A+CKH+  YDL+      
Sbjct: 282 FRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDDPKNKQVIATCKHYAVYDLE----TG 337

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R+  N   ++QD+ D F  PF+ CV +  V S+MCSYN V+G+P CA+  +L   +R  W
Sbjct: 338 RYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHW 397

Query: 280 RLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQR 335
             N    Y+VSDC++V   +   +FT T E AAA A+ AG+DL+CG  +L L+   A  +
Sbjct: 398 GFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAGVDLECGSSYLKLNESLAANQ 457

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
             +  +D   +L    +    +G FDG      Y HL   DV  P  Q LA EAA +G+ 
Sbjct: 458 TSVKAMD--QSLARLYSALFTIGFFDG----GKYDHLDFSDVSIPAAQALAYEAAVEGMT 511

Query: 396 LLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIH 453
           LLKN G  LPL S  ++++VAVIGP ++ T  M G Y+G A    +PL       R  ++
Sbjct: 512 LLKNDG-LLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLISPLVAFESDHRWKVN 570

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +   +   F A++ A++++D  + + G+D SIE+E +DR  L  PG Q +L+  
Sbjct: 571 YAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDRTSLAWPGNQLDLIKS 630

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S  SK P ++V   GG +D +    +  I A+IWAGYP Q+GGTA+ DIL G  +P G+
Sbjct: 631 LSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGTALLDILVGKRSPAGR 689

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP+T YP  Y   + + ++ +RP+    +PGRTY++Y G  V PFGHG+ YT F      
Sbjct: 690 LPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPFGHGLHYTKFKF---- 745

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----------VQVDVKNVGSKD 683
                     G   ++N   S   I+   A C R + G          V+  V+NVG + 
Sbjct: 746 ----------GWEETLNREYS---IQELVASCQRSSGGPIKDNTPFTTVKARVRNVGHET 792

Query: 684 GAH-TLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
             + +LL  S+  AG    P+K LV+++++H  A    RV         L+  D +G+  
Sbjct: 793 SDYVSLLFLSSKNAGPAPRPNKSLVSYKRLHNIAPGSDRVADLPLTLGSLARADENGSLY 852

Query: 742 IPLGEHNIHIGGTK 755
           I  G + I +  ++
Sbjct: 853 IFPGRYKIALDNSE 866


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 278/735 (37%), Positives = 401/735 (54%), Gaps = 87/735 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++  +L+ +++L EK   L   A A+PRL I  Y WW+E+LHGV+  G           A
Sbjct: 12  KQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------A 61

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+SF+  + E IG  ++ E RA YN            GLT+W+PNVNIFRD
Sbjct: 62  TVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRD 121

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP LS +   SY+RGLQG DG+ +K AA  KHF  +     +G +  R
Sbjct: 122 PRWGRGHETYGEDPYLSSRLGVSYIRGLQG-DGETMKAAACAKHFAVH-----SGPEALR 175

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+VS++D+ +T+   F+ CV EG V +VM +YN VNG P C    +LK+ +R EW 
Sbjct: 176 HEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKILREEWG 235

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +G++VSDC ++  +++    T TP ++AA A+ AG DL+CG    LH   A Q GL++E
Sbjct: 236 FDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCG-VTYLHLVHACQEGLVTE 294

Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             I  A +   T +  LGMFDG E  S PY  +  K     +H++L+  AAR+ IVLLKN
Sbjct: 295 AQITEAAIRLFTTRFLLGMFDGSEYDSVPYTVVECK-----EHRDLSERAARESIVLLKN 349

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
            G  LPL   + +T+ +IGPN+D    +IGNY G +  Y T L+G+ R      R ++  
Sbjct: 350 NGI-LPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRILYSD 408

Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           GC       +++A   D+L  A I  +R++D  IL +GLD+++E E         + D+ 
Sbjct: 409 GCHLYENKTENLAREQDRLSEARI-VARESDVVILCLGLDETLEGEEGDTGNSYASGDKV 467

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q+ L+  V+M  K PT+L LM+G  ID++FA  +    AI+   YPG  GG A
Sbjct: 468 DLRLPKSQRMLMEAVAMEKK-PTVLCLMAGSDIDLSFA--EKHFDAIVDLWYPGAYGGAA 524

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPF 618
            ADILFG  +P GKLP+T+Y  E +  LP  E  +MR        GRTYR+ +    YPF
Sbjct: 525 AADILFGKCSPSGKLPITFY--ESLEVLPSFEDYSMR--------GRTYRYLEQKAQYPF 574

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G+G++YT           +  V L+     +     G+  +     C         +V+N
Sbjct: 575 GYGLTYTKM--------KIRNVWLENAEKDMKEVTDGENAEAAVIVC--------AEVEN 618

Query: 679 VGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
            G  D    L ++     + H  PH  L  FE++ V  G ++ V I ++   + +VVD S
Sbjct: 619 CGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKGVKKLVKIPVNRSAF-TVVDES 677

Query: 738 GTRRIPLGEHNIHIG 752
           G R    G++ I  G
Sbjct: 678 GRRFTDSGKYEIFAG 692


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 276/775 (35%), Positives = 393/775 (50%), Gaps = 101/775 (13%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           +AT   LPFC  +L I  R  D + R+++ EK+  L +    +  LG+  Y WWSEA  G
Sbjct: 31  NATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASSG 90

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
           V    P TKF        ++P  +TTA SFN TLW A G  +  EARA+ N G A  TYW
Sbjct: 91  VMGSRPTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAGAAYSTYW 140

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYD 211
           +P VN+ R+PRWGR  E PGEDP L+G+YA  +V G Q +  D   L+ +A CKH+ A +
Sbjct: 141 APVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYHLQASACCKHYVANE 200

Query: 212 LDN--------WNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           L+N        W   DR H ++ V+++D+ D++ VPF+ CV +GKV+S+MCSYN VNGVP
Sbjct: 201 LENTRQPDGEQW---DRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNAVNGVP 257

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           +CA+  +L+   R  W  +GYI SDCD+    YD  H+ +TPEEA AD ++AG D+DC  
Sbjct: 258 SCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVADVLKAGTDVDCQS 317

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKD----VC 378
           F+G H  SA+ +GL++E D++  LVN   V++RLG FD    +++P G L   D    VC
Sbjct: 318 FVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEIDADAVVC 377

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           +  H + ++E   Q   LLKN G +LPL      T AV+GPN+ ++    G Y       
Sbjct: 378 SDAHLDASMEGLAQSATLLKNDG-ALPLKP--SGTAAVVGPNALLSKADAGYYG------ 428

Query: 439 TTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
                                               +  ADA +L +G D +  AE  D 
Sbjct: 429 -----------------------------------PTDAADAVVLAVGTDLTWAAEGKDA 453

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA--FAKNDPRIAAIIWAGYPGQAG 556
             ++    Q EL+  V+ AS  P ++V+ S  P+D+    A++D ++ A++  G P    
Sbjct: 454 TSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVGQP-SVT 512

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK---------------- 600
              + D+L+G  +  G+   T YP  Y   + + +  MRP  S                 
Sbjct: 513 VKGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFARPDCATNESACPRG 572

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV-PLDGRHGSINATIS--GKA 657
             PGRTYRFY    V PFG G+SYT F + V +APT V + PL   +  + A     G A
Sbjct: 573 TNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRSAPTTVDLAPLRAAYAGVAAARGDGGPA 632

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPA 715
               H      T    VDV N G  D    +L F TPP       P K+L  FE+VHV A
Sbjct: 633 FLSLHDDAAAATYA--VDVTNTGDIDADDVVLGFVTPPGAGVDGVPLKELFGFERVHVKA 690

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG--GTKHSVSLHAATLGVI 768
           G  + V +   + K+ +V +       P G++ I  G   T+  +   AATL   
Sbjct: 691 GETKTVYLYPALSKFKTVAEDGALAARP-GDYAIEFGIPETRGRMGYAAATLAAF 744


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 265/745 (35%), Positives = 396/745 (53%), Gaps = 85/745 (11%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +  PF   +L I +RVNDL+ +L+L+EKV  +++   A+ RL I  Y WW+E LHG+   
Sbjct: 23  QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGRT 82

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
                   D+   T FPQ I  A+++N  L + +   +SDE RA+YN  T+        G
Sbjct: 83  --------DYK-VTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYG 133

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           LTYW+PN+NIFRDPRWGRGQET GEDP L+G    S+V GLQG D   LK AA  KH+  
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDDTKYLKAAACAKHYAV 193

Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
           +     +G +  R  FN  V+  D+ DT+   FR  V+E KVA VMC+YN  NG P C +
Sbjct: 194 H-----SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEPCCGN 248

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFL 325
             +++  +R +W   GY+ SDC ++  +Y  QH  + P+   AAADA+  G D+DCG   
Sbjct: 249 NFLMQEILREKWNFTGYVTSDCGAIDDFY--QHHKTHPDAKYAAADAVYNGTDIDCGNEA 306

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 AV+ G+++E  I+ +L    T++ RLGMFD   + + Y  +    + +  H++L
Sbjct: 307 YKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVK-YSQISTSVLESQKHKDL 365

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+  R+ IVLLKN+  +LPLS  + + VAV+GPN++  V+++GNY G      TP + +
Sbjct: 366 ALKITRESIVLLKNENNTLPLSK-KLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYEAV 424

Query: 446 GRY---ARTIHQQGCKDV--ACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
            +    A  I+++G   V  +    +   A +   +  D  I V G+   +E E +    
Sbjct: 425 KQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPVKI 484

Query: 497 ------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
                 DR  + LP  Q + + K  +A K PT+ V+M+G  I   +   +  I AI+ A 
Sbjct: 485 EGFTGGDRTSIKLPKIQTDFM-KALVAEKIPTVFVMMTGSAIATEWESQN--IPAIVNAW 541

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           Y GQ  GTAIAD+LFG  NP GKLP+T+Y ++  ++LP          S     RTYR++
Sbjct: 542 YGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD--SDLPAF-------NSYEMKNRTYRYF 592

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
            G V+YPFG+G+SYT F ++    P+ +               +G   KV+         
Sbjct: 593 NGEVLYPFGYGLSYTKFEYSPIQVPSTID--------------TGNNAKVS--------- 629

Query: 671 GVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
              V +KN G  +G   + L  S P      P   L  F +V + AG  + V  N+   +
Sbjct: 630 ---VSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFNRVSLKAGESKTVEFNLS-PR 685

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGT 754
            L +VD +G  ++  G+  I IGG+
Sbjct: 686 ELGLVDDAGILKVSAGKRKIFIGGS 710


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 271/729 (37%), Positives = 389/729 (53%), Gaps = 40/729 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL   QR   L+  L++ EK+  L++ A  +PRL +  YEWWSE LHGV+   PGTK
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVAR-SPGTK 228

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G+F  ATSFPQ I   S+F+  L  A+G VVS EARA  N G +GL  +SPN+N F
Sbjct: 229 FTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAF 288

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           +DPRWGRGQETPGED     KY ++ + GL+G D D+ K+ A+CKH+ A D +N+ GVDR
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDK-KLIATCKHYAANDFENYKGVDR 347

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA +S QD+ + +  PF+ C +E  V S MCSYN +NG P CA+  +++  +R  W 
Sbjct: 348 SGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDILRKHWG 407

Query: 281 LNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT-ESAVQRG 336
            NG   Y+ +DCD V +     H+      AAA +++AG DL+C  F G    +SA  + 
Sbjct: 408 WNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQSAWNQS 467

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L+SE D++ AL    T  + +G+FD +    P   LG  +V T + Q+LA  AA +G VL
Sbjct: 468 LISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAAVEGAVL 526

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           +KN G  LPLS    +  A+IGP    T  M GNY G A    +P            ++ 
Sbjct: 527 MKNDG-ILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISP------------RKA 573

Query: 457 CKDVACADDQLFGAAIDASRQA-----------DATILVMGLDQSIEAEALDRAGLLLPG 505
            KD+        G+  + S  +           D  I + G+D ++E E LDR  L  P 
Sbjct: 574 AKDLGLDFTYFLGSRTNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPE 633

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            Q +L+  +S   K P +++   GG +D      +  + AI+W GYPGQ+GG AI DI+F
Sbjct: 634 PQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVF 692

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G + P G+L +T YP  Y   +P T+M +RP       GRTYR+Y G    P+G G+ YT
Sbjct: 693 GRAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYT 752

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
            F   +  A  V  + +       N   + + I        R  + V V  KN G+    
Sbjct: 753 KFSVDMKPASNVHNIDIAQMAAEANDDAASE-IPSWQRGLERRMVTVTVSAKNEGNVISD 811

Query: 686 HTLLVFSTPPAG--HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           +  LVF    AG   W P K LV + ++ ++  G +++  I I + + L  VD  G R +
Sbjct: 812 YVALVFLRSEAGPKPW-PQKTLVGYTRLRNIKPGEERKEEIIIKM-EQLVRVDEVGNRVL 869

Query: 743 PLGEHNIHI 751
             G +++ +
Sbjct: 870 YEGLYSLFL 878


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 216/410 (52%), Positives = 278/410 (67%), Gaps = 5/410 (1%)

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
           MRLG FDG P +QPYG LGPKDVCT +++ELA+E ARQGIVLLKN   SLPLS    +T+
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAID 473
           AVIGPN++VT TMIGNY G+AC YTTPLQG+ R   T  + +GC +V C +  L  A   
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120

Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPID 533
           A+  ADAT+LVMG DQ+IE E LDR  L LPG+QQELV++V+ A++GP +LV+MSGG  D
Sbjct: 121 AA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179

Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMA 593
           + FAKND +I +I+W GYPG+AGG AIAD++FG  NP GKLPMTWYPQ Y+  +PMT M 
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239

Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN--- 650
           MRP +S  Y GRTYRFY G  VY FG G+SYTNF H +  AP  V++ LD      +   
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRSPEC 299

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
            ++         A   R    VQ+ V+NVG ++G  T+ +F+TPP  H +P KQL+ FEK
Sbjct: 300 QSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHGSPRKQLLGFEK 359

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           + +    +  V   + VCK L VVD  G R++ LG H +H+G  KHS ++
Sbjct: 360 IRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNI 409


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 266/730 (36%), Positives = 390/730 (53%), Gaps = 36/730 (4%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL    R   L+   +  E V    +    V RLG+  Y+ W EALHGV      
Sbjct: 59  PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRAN-- 116

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
               G+F  ATSFP  IT  ++ N TL   IG +VS + RA  N G  G+  +SPN+N F
Sbjct: 117 FVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 176

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
           R P WGRGQETPGED  L+  Y   Y+  LQG  D + LK+ A+ KH+  YD+++WN   
Sbjct: 177 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGGVDPETLKIIATAKHYAGYDIESWNNHS 236

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG-- 277
           R   + ++++Q++ + +  PF +   + KV SVMCSYN VNGVP+CA+   L+  +R   
Sbjct: 237 RLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTF 296

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           E+  +GY+  DC +V   ++   + S    A+AD+I AG D+DCG     H+E A +  L
Sbjct: 297 EFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSL 356

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +S  DI   ++   +  ++ G FDGE +  PY  +   DV + D   +A EAA +GIVLL
Sbjct: 357 VSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAAVEGIVLL 414

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           KN   +LPLS    ++VAVIGP ++VT  + GNY G A    +PL G       +H    
Sbjct: 415 KND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALG 472

Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
            ++       F  A+ A++QADA I   G+D +IEAEA+DR  +  PG Q +L+SK+S  
Sbjct: 473 TNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSEL 532

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P +++ M GG +D +  K++  + A+IW GYPGQ+GG A+ADI+ G   P G+L  T
Sbjct: 533 GK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTT 591

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----- 632
            YP EY    P  +M +RP+++   PG+TY +Y G  VY FGHG+ YT F  +       
Sbjct: 592 QYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEESTETTDAG 651

Query: 633 --NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
             N  TV+  P  G                 HA+   L L     VKN G ++  +T LV
Sbjct: 652 SFNIQTVLTTPHSGYE---------------HAQQKTL-LNFTATVKNTGERESDYTALV 695

Query: 691 FSTPPAGHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
           +    AG    P K +V F+++  +  G  Q + + + V + ++  D  G R +  G + 
Sbjct: 696 YVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTV-ESVARTDEQGNRVLYPGSYE 754

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 755 LALNNERSVV 764


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 253/619 (40%), Positives = 344/619 (55%), Gaps = 22/619 (3%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           TLP C  SL    R   L+ + +  E +   ++ A  VPRLGI  Y+WW+EALHGV+   
Sbjct: 31  TLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGVAR-S 89

Query: 99  PGTKF----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWS 154
           PG  F     G+F  ATSFPQVI   ++F+  L+EA+   +++E RA  N G AGL  +S
Sbjct: 90  PGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRAGLNMYS 149

Query: 155 P-NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-----RLKVAASCKHFT 208
           P N+N FRDPRWGRGQET GEDP+   +YA   VRGLQG         RL +AA+CKH+ 
Sbjct: 150 PLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEANPRLTLAATCKHYL 209

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AYDL+   GV+R+ F+A VS QD+ D     FR CV +G   ++M SYN VNGVP  A  
Sbjct: 210 AYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVNGVPPSASK 269

Query: 269 NILKRTIRGEWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
             L+   R  W L+    Y+ SDCD+V   YD  H+ +    AAA ++ AG DLDCG   
Sbjct: 270 YYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLNAGTDLDCGATY 329

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                +A+ + L     I  A+       +RLG FD    +QP   LG KDV  P  Q+L
Sbjct: 330 RDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGWKDVNAPAAQKL 388

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           A EAA   I LLKN+  +LPL     +T+A+IGP ++ T  + GNYAG +    TP    
Sbjct: 389 AYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPLVITPFDAA 448

Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
            R    A  +   G       D     AA+  ++ AD  +   G+D ++E E+LDR  + 
Sbjct: 449 RRTFSDAHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEGESLDRRDIA 508

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
            P  Q  L+ +++   K   ++V   GG +D A  K D  + A++WAGYPGQ+G  A+ D
Sbjct: 509 WPANQLRLIQELAALGKV-LVVVQFGGGQVDGALLKGDDGVGALVWAGYPGQSGALALMD 567

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           IL G   P G+LP+T YP  Y   L  T MA+RP+ +  YPGRTY++Y G   +PFG G+
Sbjct: 568 ILAGKRAPAGRLPITQYPANYTHALRETTMALRPTAT--YPGRTYKWYTGTPTFPFGFGL 625

Query: 623 SYTNFVHTVANAPTVVAVP 641
            YT F  ++A  P    +P
Sbjct: 626 HYTTFRASIA-PPATYTIP 643


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 274/712 (38%), Positives = 392/712 (55%), Gaps = 62/712 (8%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   L+   +L EK+    + +  VPR+G+  Y+WW+EALHG++    GT 
Sbjct: 34  CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY---GTH 90

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G ++  ATSFPQ I   ++F+  L   +   +S EARA  N    GL +W+PN+N 
Sbjct: 91  FAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINP 150

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
           ++DPRWGRGQETPGEDP     Y  + V GLQG   D    K  A+CKH+  YDL+N  G
Sbjct: 151 YKDPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYKKGVATCKHYAGYDLENGGG 210

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
           + R+ F+A ++ QD+ D +   F+ C  +  V S+MCSYN VNGVPTCAD  +L+  +R 
Sbjct: 211 IQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDWLLQSLLRE 270

Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            W     + ++ SDCD+V   +D+ ++TSTPE+AAADA+ AG DLDCG F   +  SA  
Sbjct: 271 HWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTDLDCGGFWPTYLGSAYN 330

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
           + L +   ++ +L       +RLG FD  P+S QPY  LG  DV TP  ++LAL+AA  G
Sbjct: 331 QSLYNISTLDRSLTRRYASLVRLGYFD--PASIQPYRQLGWSDVSTPSAEQLALQAAEDG 388

Query: 394 IVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYA 449
           IVLLKN G  LPL S+I +  VA+IGP ++ T  M GNY G A    +PL   Q  G + 
Sbjct: 389 IVLLKNDG-ILPLPSNITN--VALIGPWANATTQMQGNYYGQAPYLHSPLIAAQNAGFHV 445

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQE 509
             +  QG  D+   +   F AAI A+++AD  I + G+D SIEAEA DR  +  P  Q  
Sbjct: 446 TYV--QGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPSSQIS 502

Query: 510 LVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
           LV++++  S  P I+  M G  ID +    +  +  IIWAGYPGQ GGTAI +IL G + 
Sbjct: 503 LVNQLANLSI-PLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNILTGKTA 560

Query: 570 PGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
           P G+LP+T YP +Y+  + M  M + P  +   PGRTY+++ G  ++ FG G+ YT F  
Sbjct: 561 PAGRLPITQYPSDYVNEVSMNNMNLHPGANN--PGRTYKWFNGTSIFDFGFGLHYTTF-- 616

Query: 630 TVANAPTVVAVPLDGRHGSINATI---SGKAIKVTHAKCNRLT-------LGVQVDVKNV 679
                               NA I   S    +++H   N  T       L + + + N 
Sbjct: 617 --------------------NAKITPPSSNTFEISHLTSNTSTHKDLTPFLTLPISISNT 656

Query: 680 GSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
           G+    +  L+F T   G    P K LVA+ ++H +  GA     + +++  
Sbjct: 657 GTTTSDYVALLFLTGSFGPTPYPKKSLVAYTRLHDIKGGASSTAQLKLNLAS 708


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 265/730 (36%), Positives = 391/730 (53%), Gaps = 36/730 (4%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SL    R   L+   +  E V    +    V RLG+  Y+ W EALHGV      
Sbjct: 58  PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRAN-- 115

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
               G+F  ATSFP  IT  ++ N TL   IG +VS + RA  N G  G+  +SPN+N F
Sbjct: 116 FVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 175

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD 219
           R P WGRGQETPGED  L+  Y   Y+  LQG+ D +  K+ A+ KH+  YD+++WN   
Sbjct: 176 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGAVDPETSKIIATAKHYAGYDIESWNNHS 235

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG-- 277
           R   + ++++Q++ + +  PF +   + KV SVMCSYN VNGVP+CA+   L+  +R   
Sbjct: 236 RLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTF 295

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           E+  +GY+  DC +V   ++   + S    A+AD+I AG D+DCG     H+E A +  L
Sbjct: 296 EFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSL 355

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +S  DI   ++   +  ++ G FDGE +  PY  +   DV + D   +A EAA +GIVLL
Sbjct: 356 VSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAAVEGIVLL 413

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           KN   +LPLS    ++VAVIGP ++VT  + GNY G A    +PL G       +H    
Sbjct: 414 KND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALG 471

Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
            ++       F  A+ A++QADA I   G+D +IEAEA+DR  +  PG Q +L+SK+S  
Sbjct: 472 TNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSEL 531

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P +++ M GG +D +  K++  + A+IW GYPGQ+GG A+ADI+ G   P G+L  T
Sbjct: 532 GK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTT 590

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA----- 632
            YP EY    P  +M +RP+++   PG+TY +Y G  VY FGHG+ YT F  +       
Sbjct: 591 QYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFEESTETTDAG 650

Query: 633 --NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
             N  TV+  P  G                 HA+   L L     VKN G ++  +T LV
Sbjct: 651 SFNIQTVLTTPHSGYE---------------HAQQKTL-LNFTATVKNTGERESDYTALV 694

Query: 691 FSTPPAGHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
           +    AG    P K +V F+++  +  G  Q + + + V + ++  D  G R +  G ++
Sbjct: 695 YVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTV-ESVARTDEQGNRVLYPGSYD 753

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 754 VALNNERSVV 763


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 263/735 (35%), Positives = 392/735 (53%), Gaps = 87/735 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R  +L+G+++L+EKV   +  A AV RL IK Y WW+EALHGV+  G           A
Sbjct: 15  KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG--------LTYWSPNVNIFRD 162
           T FPQ I  A++F+  L E +G  VS EARA +N    G        LT+W+PNVNIFRD
Sbjct: 65  TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +    Y+ GLQG D + LK AA  KHF  +     +G +  R
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFAVH-----SGPEAVR 179

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+V++QD+ +T+   F  CV EGKV +VM +YN+ NGVP C +  +L   +R EW 
Sbjct: 180 HEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKEWG 239

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +G++ SDC ++  +++  H T T  E+ A A+  G DL+CG   G   + AV++GL+ E
Sbjct: 240 FSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQGLVKE 298

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ A+ N    +M+LG+FD +    PY  +      + + ++L    AR+ +VLLKN+
Sbjct: 299 ERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLKNK 357

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
              LPL   + +TV VIGPN+D    ++GNY G A  Y T L+GI  Y     R ++ +G
Sbjct: 358 EHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYSEG 417

Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           C        ++A  +D++    +   +++D  + V+GLD  IE E         + D+  
Sbjct: 418 CHLYKDRTSNLAQENDRM-SEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKPD 476

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L LPG Q+E++       K P ILVL+SG  + V +A  D  + AI+   YPG  GG AI
Sbjct: 477 LNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAAI 533

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           ADILFG +NP GKLP+T+Y          T   +   +     GRTYR+ +   +YPFG+
Sbjct: 534 ADILFGEANPEGKLPVTFY---------RTTEELPDFEDYSMQGRTYRYMEQEALYPFGY 584

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT + +          V  +G                       +T+G+   VKN G
Sbjct: 585 GLSYTEYAYQNVRFLEQEPVVSEG-----------------------VTIGLS--VKNTG 619

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
             DG  T+ V+         PH QL    K+ + AG ++ + I +    ++ + D +G +
Sbjct: 620 KMDGTETVQVYVKAEHSK-MPHGQLKKIVKLPLCAGEEKEINIRLESEAFM-LYDENGEK 677

Query: 741 RIPLGEHNIHIGGTK 755
            +P G   I +GG +
Sbjct: 678 ILPSGHFEIFVGGMQ 692


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 262/735 (35%), Positives = 392/735 (53%), Gaps = 87/735 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R  +L+G+++L+EKV   +  A AV RL IK Y WW+EALHGV+  G           A
Sbjct: 15  KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG--------LTYWSPNVNIFRD 162
           T FPQ I  A++F+  L E +G  VS EARA +N    G        LT+W+PNVNIFRD
Sbjct: 65  TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +    Y+ GLQG D + LK AA  KHF  +     +G +  R
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFAVH-----SGPEAVR 179

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+V++QD+ +T+   F  CV EGKV +VM +YN+ NGVP C +  +L   +R EW 
Sbjct: 180 HEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKEWG 239

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +G++ SDC ++  +++  H T T  E+ A A+  G DL+CG   G   + AV++GL+ E
Sbjct: 240 FSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQGLVKE 298

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ A+ N    +M+LG+FD +    PY  +      + + ++L    AR+ +VLLKN+
Sbjct: 299 ERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLKNK 357

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
              LPL   + +T+ VIGPN+D    ++GNY G A  Y T L+GI  Y     R ++ +G
Sbjct: 358 EHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYSEG 417

Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           C        ++A  +D++    +   +++D  + V+GLD  IE E         + D+  
Sbjct: 418 CHLYKDRTSNLAQENDRM-SEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKPD 476

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L LPG Q+E++       K P ILVL+SG  + V +A  D  + AI+   YPG  GG AI
Sbjct: 477 LNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAAI 533

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           ADILFG +NP GKLP+T+Y          T   +   +     GRTYR+ +   +YPFG+
Sbjct: 534 ADILFGEANPEGKLPVTFY---------RTTEELPDFEDYSMQGRTYRYMEQEALYPFGY 584

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT + +          V  +G                       +T+G+   VKN G
Sbjct: 585 GLSYTEYAYQNVRFLEQEPVVSEG-----------------------VTIGLS--VKNTG 619

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
             DG  T+ V+         PH QL    K+ + AG ++ + I +    ++ + D +G +
Sbjct: 620 KMDGTETVQVYVKAEHSK-MPHGQLKKIVKLPLCAGEEKEINIRLESEAFM-LYDENGEK 677

Query: 741 RIPLGEHNIHIGGTK 755
            +P G   I +GG +
Sbjct: 678 ILPSGHFEIFVGGMQ 692


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 271/740 (36%), Positives = 393/740 (53%), Gaps = 40/740 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   +L+E +    + A  VPRLG+  Y+ W+EALHG+        
Sbjct: 64  CNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDRANFSDS 123

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G++  ATSFP  I + +SFN TL   I  +++ +ARA  N G  GL  ++PN+N FR 
Sbjct: 124 --GEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGFRS 181

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   LS  YA  Y+ GLQG  D + +K+ A+ KHF  YDL+NW  V R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYAYEYITGLQGGVDPEHVKIVATAKHFAGYDLENWGNVSR 241

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              NA +++QD+ + +   F       K  S+MCSYN VNGVP+C++   L+  +R  + 
Sbjct: 242 LGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPSCSNSFFLQTLLRESFN 301

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+ SDCD+V   ++   +      AAAD++ AG D+DCG  +  H   +     +
Sbjct: 302 FVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADSLLAGTDIDCGQTMPWHLNESFYERYV 361

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  DI  +L       +RLG FDG  S   Y +L   DV T D   ++ EAA +GI LLK
Sbjct: 362 SRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNLNWNDVVTTDAWNISYEAAVEGITLLK 419

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS  + R++A+IGP ++ TV M GNY G      +PL+       T++     
Sbjct: 420 NDG-TLPLSK-KVRSIALIGPWANATVQMQGNYYGTPPYLISPLEAAKASGFTVNYAFGT 477

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++    Q F  AI A++++D  I   G+D +IEAE  DR  L  PG Q +L+ ++S   
Sbjct: 478 NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDLKWPGNQLDLIEQLSKVG 537

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K +  + A++W GYPGQ+GG A+ DIL G   P G+L  T 
Sbjct: 538 K-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALFDILTGKRAPAGRLVSTQ 596

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YT F  + A A T  
Sbjct: 597 YPAEYATQFPANDMNLRPNGSN--PGQTYIWYTGTPVYEFGHGLFYTEFQESAA-AGTNK 653

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTP 694
              LD              +   H     +     L V VDVKNVG     +T L+F+  
Sbjct: 654 TSTLDIL----------DLVPTPHPGYEYIELVPFLNVTVDVKNVGHTPSPYTGLLFANT 703

Query: 695 PAG-HWAPHKQLVAFEK---VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
            AG    P+K LV F++   +H    AQ    + +     ++  D +G + I  GE+ + 
Sbjct: 704 TAGPKPYPNKWLVGFDRLATIHPAKTAQVTFPVPLGA---IARADENGNKVIFPGEYELA 760

Query: 751 IGGTK-----HSVSLHAATL 765
           +   +      S++ +AATL
Sbjct: 761 LNNERSVVVSFSLTGNAATL 780


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 276/704 (39%), Positives = 393/704 (55%), Gaps = 43/704 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LI   +L EKV    + +  VPR+G+  YEWW+EALHG++   PGT 
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGTT 92

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G ++  +TSFPQ I   ++F+  L   +   VS EARA  N    GL +W+PN+N 
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
           ++DPRWGRGQETPGEDP  +  Y  + + GLQG   D    K  A+CKHF  YDL++ +G
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKHFAGYDLESSDG 212

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ F+A +  QD+ D +  PF+ C  +  V SVMCSYN +NGVPTCAD  +L+  +R 
Sbjct: 213 AIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQTLLRE 272

Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            W     + ++ SDCD+V   +D  ++T TPE++AADA+ AG DLDCG F   +  SA  
Sbjct: 273 HWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYLGSAYD 332

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           +GL     ++ +L       +RLG FD  PS QPY  L   +V TP  Q+LAL+AA  GI
Sbjct: 333 QGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQQLALQAAEDGI 391

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYART 451
           VLLKN G  LPLS      VA+IGP ++ T  M GNY G A    +PL   Q  G   + 
Sbjct: 392 VLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG--FKV 447

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
            + QG  D+   +   F AAI A++ AD  I V G+D SIEAE +DR  +  P  Q  L+
Sbjct: 448 TYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWPSSQLSLI 506

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
           ++++  S  P I+  M G  ID +   ++  + A++WAGYPGQ GGTAI +IL G + P 
Sbjct: 507 NQLANLST-PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTAPA 564

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           G+LP+T YP  Y+  + MT+M ++PS+    PGRTY++Y G  V+ +G+G+ YT F   +
Sbjct: 565 GRLPITQYPSNYVNQVTMTDMNLQPSRFN--PGRTYKWYNGEPVFEYGYGLQYTTFDAKI 622

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLL 689
             +             S N T     +    +    LT  + + + V N G+    +  L
Sbjct: 623 TPS-------------SPNNTFEISELLANASNYKDLTPFVKIPITVSNTGTTTSDYVAL 669

Query: 690 VF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
            F   +  PA H  P K LVA+ ++H +  GA     +++++  
Sbjct: 670 FFLSGTFGPAPH--PKKSLVAYTRLHDITGGANATAEVSLNLAS 711


>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
 gi|224035251|gb|ACN36701.1| unknown [Zea mays]
 gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 405

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 219/407 (53%), Positives = 282/407 (69%), Gaps = 5/407 (1%)

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
           MRLG FDG+P   P+G+LGP DVCTP +QELA EAARQGIVLLKN G  LPLS    +++
Sbjct: 1   MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTG-KLPLSATSIKSM 59

Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-FGAAID 473
           AVIGPN++ + TMIGNY G  C YTTPLQG+G    T++Q GC +V C+ + L   AA  
Sbjct: 60  AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119

Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPID 533
           A+  AD T+LV+G DQSIE E+LDR  LLLPG+Q +LVS V+ AS GP ILV+MSGGP D
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179

Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMA 593
           ++FAK+  +IAAI+W GYPG+AGG AIAD+LFG  NP G+LP+TWYP+ + T +PMT+M 
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESF-TKVPMTDMR 238

Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
           MRP  S  YPGRTYRFY G  VY FG G+SYT+F H + +AP  +A+ L   H  +  T 
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL--TE 296

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
              +++   A C  L   V + V+N G + G HT+ +FS+PPA H AP K L+ FEKV +
Sbjct: 297 QCPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSL 356

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             G    V   + VCK LSVVD  G R++ LG H +H+G  KH+++L
Sbjct: 357 EPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 403


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 262/723 (36%), Positives = 393/723 (54%), Gaps = 37/723 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LI   +L+E +    + A  VPRLG+  Y+ WSEALHG+      T 
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFATS 123

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            G ++  ATSFP  I + ++ N TL   I  ++  +ARA  N G  GL  ++PN+N FR 
Sbjct: 124 -GDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNINGFRS 182

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   LS  YA  Y+ GLQG  D D LKV A+ KHF  YDL+NW G  R
Sbjct: 183 PLWGRGQETPGEDANFLSSSYAYEYITGLQGGVDPDHLKVVATAKHFAGYDLENWGGNSR 242

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S MCSYN VNGVP+C+   +L+  +R  W 
Sbjct: 243 LGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLLRDNWD 302

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+V   ++   + S    AAAD++RAG D+DCG     +   +   G +
Sbjct: 303 FPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDIDCGQTYPWNLNQSFIEGSV 362

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I  ++V   +  ++LG FDG+ S   Y  LG  DV T D   ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSIVRLYSNLVKLGYFDGDKSE--YRQLGWNDVVTTDAWNISYEAAVEGIVLLK 420

Query: 399 NQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           N G  LPLS H+  +++A+IGP ++ T  + GNY G A    TPLQG       ++    
Sbjct: 421 NDG-ILPLSKHV--KSIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAGYKVNYALG 477

Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMA 517
            ++     + F  A+ A++++D  + + G+D +IEAE  DR  +  PG Q +L+ ++S  
Sbjct: 478 TNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQLSQT 537

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P +++ M GG +D +  K + ++ A++W GYPGQ+GGTAI DIL G   P G+L  T
Sbjct: 538 GK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGKRVPAGRLVTT 596

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
            YP EY T  P T+M +RP  +   PG+TY +Y G  VY FG+G+ YT F  T       
Sbjct: 597 QYPAEYATQFPATDMNLRPDGASN-PGQTYMWYTGTPVYDFGYGLFYTTFKETAQ----- 650

Query: 638 VAVPLDGRHGSINATISG--KAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLLVFST 693
                  + GS +  IS    A +    + + L   + +   +KN G     +T ++F+ 
Sbjct: 651 -------KLGSSSFDISEIVAAPRSPSYEYSELVPFVNITATIKNTGKTASPYTAMLFAN 703

Query: 694 P----PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
                PA +  P+K LV ++++  +  G    + I + +   ++ VD +G R +  G++ 
Sbjct: 704 TTNAGPAPY--PNKWLVGYDRLASIEPGKSADLVIPVPIGA-IARVDENGNRIVYPGDYQ 760

Query: 749 IHI 751
           + +
Sbjct: 761 LAL 763


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 270/752 (35%), Positives = 398/752 (52%), Gaps = 85/752 (11%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           D       +   +L   QR  DL+ R++L+EKV  ++  + A+PRLG+  Y WWSEALHG
Sbjct: 24  DERMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHG 83

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------G 145
           V+  G           AT FPQ I  A++F+  L   +  V+S E RA ++        G
Sbjct: 84  VARAGV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHG 133

Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLT+WSPNVNIFRDPRWGRGQET GEDP L+G+   S+++GLQG D   L+ AA  K
Sbjct: 134 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAK 193

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           HF  +        +R  F+A VS +D+ +T+   F+ CV E  V +VM +YN+VNG P C
Sbjct: 194 HFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCC 250

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
               +LK T+R EW   G++VSDC ++  +++    TS+  E+ A A+  G DL+CG  +
Sbjct: 251 GSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN-M 309

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A Q GL++E  IN A+   +  +M+LG+FD    + PY ++G       +H+E 
Sbjct: 310 YLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD-TAENVPYTNIGFHQNDCQEHREF 368

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           ALE +++ +VLLKN+   LPL      ++AVIGPN++    + GNY G A  Y T L+GI
Sbjct: 369 ALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGI 428

Query: 446 ----GRYARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
               G+     + QGC       +++  A D+ F  A+  + +AD  ++ MGLD SIE E
Sbjct: 429 REAVGKDTIVSYAQGCHLYRDKAENLGEARDR-FAEAVSTAERADIVVMCMGLDASIEGE 487

Query: 495 ---------ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
                    + D+ GL LPG QQEL+  +    K P ILVL++G  + V +A    ++ A
Sbjct: 488 EGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KVPA 544

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPG 604
           II A YPG  GG A+A  +FG  +P GKLP+T+Y       LP  T+ +M+         
Sbjct: 545 IIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRT--TEELPEFTDYSMK--------N 594

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           RTYR+     +YPFG+G+ YT F +               R   +N T       V   +
Sbjct: 595 RTYRYMTKEALYPFGYGLGYTTFAY---------------RQLQLNRTQISAGENV---Q 636

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
           C+ L       VKN G+     T+ L      A    P  +L   +KVH+  G +Q V  
Sbjct: 637 CSVL-------VKNTGNFASDETVQLYIKDVKASVEVPILELQGIQKVHLLPGTEQEVFF 689

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            +   + L++++  G   +  G   I++GG +
Sbjct: 690 TL-TPRQLALINEEGNCILEPGAFEIYVGGCQ 720


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 277/726 (38%), Positives = 404/726 (55%), Gaps = 51/726 (7%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +  I +R   ++  ++L EKV  + S A+   RLG+  Y+W +EALHGV+    G +
Sbjct: 75  CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 133

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    G +F  ATSFP  I  +++F+  L +++   +S EARA  N G AGL +W+PN+N
Sbjct: 134 FQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTPNIN 193

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
            FRDPRWGRG ETPGED      Y  + V GLQG  D D  +  ++CKHF AYD++N   
Sbjct: 194 PFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQGGIDPDFYRTLSTCKHFAAYDIEN--- 250

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R   N   ++QD+ D +   F  CV + KVAS+MC+YN V+GVP CAD  +L+  +R 
Sbjct: 251 -GRTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSYLLQDVLRD 309

Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            +       Y+VSDCD+V   +D  H+ +   +AAA +I AG DLDCG    +   S VQ
Sbjct: 310 TYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQAAAMSINAGTDLDCGSSYNVLNAS-VQ 368

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
            GL +E  ++ +L+   +  +++G FD +P+   Y  LG  +V T   Q LA +AA +G+
Sbjct: 369 AGLTTEATLDKSLIRLYSALVKVGYFD-QPAE--YNSLGWGNVNTTQSQALAHDAATEGM 425

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
            LLKN G +LPLS      VAVIGP ++VT  M GNYAG A     PL    +  R +  
Sbjct: 426 TLLKNDG-TLPLSRTLSN-VAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQKWRNVKY 483

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
                +   D   F AA+ A+  +D  + + G+D S+E E  DR+ +  PG Q  L+S++
Sbjct: 484 AQGTAINSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQLNLISQL 543

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           +   K P ++V   GG ID +   ++ ++ +I+WAGYPGQ GG AI D+L G + P G+L
Sbjct: 544 ANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGANPPAGRL 602

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+T YP  Y+ N  + +M +RPS     PGRTY +Y G  V PFG+G+ YTNF  +  + 
Sbjct: 603 PVTQYPANYVNNNNIQDMNLRPSNG--IPGRTYAWYTGTPVLPFGYGLHYTNFSLSFQST 660

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNR---LTLGVQVDVKNVGSK-----DGAH 686
            T          GS  AT+      V +A  N+       + V+VKN G K     D   
Sbjct: 661 KTA---------GSDIATL------VNNAGSNKDLATFATIVVNVKNTGGKANLASDYVG 705

Query: 687 TLLVFSTP--PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            L + ST   PA H  P+KQL A+ +V +V  GA Q++ + +++   L+  D +G R I 
Sbjct: 706 LLFLKSTNAGPAPH--PNKQLAAYGRVRNVGVGATQQLTLTVNLGS-LARADTNGDRWIY 762

Query: 744 LGEHNI 749
            G + +
Sbjct: 763 PGAYTL 768


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 275/766 (35%), Positives = 403/766 (52%), Gaps = 48/766 (6%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           F G  L SA    +     F  +  D    T   C  +L  P+R   L+  L+ +EK++ 
Sbjct: 6   FWGASLTSAVVRAIDLPFQFYPNCVDDLLSTNQVCNTTLSPPERAAALVAALTPEEKLQN 65

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
           ++S +   PR+G+  Y WWSEALHGV+   PGT+F    G F  +TSFP  +  A++F+ 
Sbjct: 66  IVSKSLGAPRIGLPAYNWWSEALHGVA-YAPGTQFWQGDGPFNSSTSFPMPLLMAATFDD 124

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            L E I  V+  E RA  N G +GL YW+PNVN F+DPRWGRG ETPGED +L  +YAA+
Sbjct: 125 ELLEKIAEVIGIEGRAFGNAGFSGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAA 184

Query: 186 YVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
            ++GL+G   ++  +V A+CKH+ A D ++WNG  R +FNAK+S QD+ + + +PF+ CV
Sbjct: 185 MIKGLEGPVPEKERRVVATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCV 244

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHF 301
            + +V S+MC+YN VNGVP+CA P +L+  +R  W     N YI SDC++V        +
Sbjct: 245 RDSRVGSIMCAYNAVNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKY 304

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
            +T  E  A +  AG+D  C          A  +GLL E  ++ AL+      +R G FD
Sbjct: 305 AATNAEGTAISFEAGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFD 364

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI----RHRTVAVI 417
           G+ S   Y  LG  DV  P  Q+L+L+AA  G VLLKN G +LPLS +    R + VA+I
Sbjct: 365 GKQSL--YSSLGWADVNKPSAQKLSLQAAVDGTVLLKNDG-TLPLSDLLDKSRPKKVAMI 421

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--------ARTIHQQGCKDVACADDQLFG 469
           G  SD    + G Y+G A    TP     +            +H     + +  D+    
Sbjct: 422 GFWSDAKDKLRGGYSGTAAYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDN---- 477

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
            A+ A++ AD  +   G+D S   E  DR  L  PG Q  L++ ++  SK   ++VL  G
Sbjct: 478 -AMAAAKDADYILYFGGIDTSAAGETKDRYDLDWPGAQLSLINLLTTLSK--PLIVLQMG 534

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
             +D     ++P+I AI+WA +PGQ GGTA+ +++ G  +P G+LP+T YP  +   +PM
Sbjct: 535 DQLDNTPLLSNPKINAILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPM 594

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGS 648
           T+MA+RPS      GRTYR+YK P V  FG G+ YT F        P V+          
Sbjct: 595 TDMALRPSAGNSQLGRTYRWYKTP-VQAFGFGLHYTTFSPKFGKKFPAVI---------D 644

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQL 705
           ++  + G   K     C    L V V+  N G++   +  L F + P    G W P K L
Sbjct: 645 VDEVLEGCDDKYLDT-CPLPDLPVVVE--NRGNRTSDYVALAFVSAPGVGPGPW-PIKTL 700

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
            AF ++    G ++R G        L+  D  G   +  G++ + +
Sbjct: 701 GAFTRLRGVKGGEKREGGLKWNLGNLARHDEEGNTVVYPGKYEVSL 746


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 267/739 (36%), Positives = 395/739 (53%), Gaps = 83/739 (11%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           +L   QR  DL+ R++++EKV  ++  + A+ RLGI  Y WWSEALHGV+  G       
Sbjct: 10  TLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARAGV------ 63

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNV 157
               AT FPQ I  A++F+  L   +  ++S EARA ++        G   GLT+WSPNV
Sbjct: 64  ----ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYKGLTFWSPNV 119

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP L+G+   S+++GLQG D   L+ AA  KHF  +       
Sbjct: 120 NIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAKHFAVHSGPE--- 176

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +R  F+A VS +D+ +T+   F+ CV E  V +VM +YN+VNG P C    +LK T+R 
Sbjct: 177 SERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNILLKETLRQ 236

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   G++VSDC ++  +++    TS+  E+ A A+  G DL+CG  + L+   A Q GL
Sbjct: 237 EWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN-MYLNLLIAYQEGL 295

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E  IN A+   +  +M+LG+FD    + PY ++G       +H+E ALE +++ +VLL
Sbjct: 296 VTEEAINTAVTRLMLTRMKLGLFDA-AENVPYTNIGFHQNDCQEHREFALEVSKKTLVLL 354

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
           KN+   LPL      ++AVIGPN++    + GNY G A  Y T L+GI    G+     +
Sbjct: 355 KNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLEGIREAVGKDTMVSY 414

Query: 454 QQGC---KDVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDR 498
            QGC   +D A     +   F  A+  + +AD  ++ MGLD SIE E         + D+
Sbjct: 415 AQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEGEEGDVSNEYASGDK 474

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
            GL LPG QQEL+  +    K P ILVL++G  + V +A    ++ AII A YPG  GG 
Sbjct: 475 LGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAAE--KVPAIIQAWYPGAEGGK 531

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           A+A  +FG  +P GKLP+T+Y       LP  T+ +M+         RTYR+     +YP
Sbjct: 532 ALASAIFGEYSPVGKLPITFYRT--TEELPEFTDYSMK--------NRTYRYMTKEALYP 581

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+ YT F +               R   +N T   K     + +C+ L       VK
Sbjct: 582 FGYGLGYTTFAY---------------RQLQLNRT---KICAGENVQCSIL-------VK 616

Query: 678 NVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           N G+     T+ L      A    P   L   +K+H+  GA+Q +   +   + L++++ 
Sbjct: 617 NTGNFASDETVQLYIKDVKASVEVPIWALQGIQKIHLLPGAEQEISFTL-TSRQLALINE 675

Query: 737 SGTRRIPLGEHNIHIGGTK 755
            G   +  G   I++GG +
Sbjct: 676 KGNCILEPGIFEIYVGGCQ 694


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/740 (36%), Positives = 393/740 (53%), Gaps = 85/740 (11%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           +L   QR  DL+ R++L+EKV  ++  + A+PRLG+  Y WWSEALHGV+  G       
Sbjct: 10  TLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV------ 63

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNV 157
               AT FPQ I  A++F+  L   +  V+S E RA ++        G   GLT+WSPNV
Sbjct: 64  ----ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYKGLTFWSPNV 119

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP L+G+   S+++GLQG D   L+ AA  KHF  +       
Sbjct: 120 NIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAACAKHFAVHSGPE--- 176

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +R  F+A VS +D+ +T+   F+ CV E  V +VM +YN+VNG P C    +LK T+R 
Sbjct: 177 SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNMLLKETLRR 236

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   G++VSDC ++  +++    TS+  E+ A A+  G DL+CG  + L+   A Q GL
Sbjct: 237 EWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGN-MYLNLLIAYQEGL 295

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E  IN A+   +  +M+LG+FD    + PY  +G       +H+E ALE +++ +VLL
Sbjct: 296 VTEEAINTAVTRLMLTRMKLGLFD-TAENVPYTKIGFHQNDCQEHREFALEVSKKTLVLL 354

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
           KN+   LPL      ++AVIGPN++    + GNY G A  Y T L+GI    G+     +
Sbjct: 355 KNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGIREAVGKDTMVSY 414

Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
            QGC       +++  A D+ F  A+  + +AD  ++ MGLD SIE E         + D
Sbjct: 415 AQGCHLYRDKAENLGEARDR-FAEAVSTAERADIVVMCMGLDASIEGEEGDVSNEYASGD 473

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           + GL LPG QQEL+  +    K P ILVL++G  + V +A    +I AII A YPG  GG
Sbjct: 474 KLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KIPAIIQAWYPGAEGG 530

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
            A+A  +FG  +P GKLP+T+Y       LP  T+ +M+         RTYR+     +Y
Sbjct: 531 KALASAIFGEYSPVGKLPITFY--RTTEELPEFTDYSMK--------NRTYRYMTKEALY 580

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG+G+ YT F +          + L+    S+   + G  +                 V
Sbjct: 581 PFGYGLGYTTFAYR--------QLQLNRTQISVGENVQGSVL-----------------V 615

Query: 677 KNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           KN G+     T+ L      A    P   L   +KVH+  G +Q V   +   + L++++
Sbjct: 616 KNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKVHLLPGTEQEVFFTL-TPRQLALIN 674

Query: 736 RSGTRRIPLGEHNIHIGGTK 755
             G   +  G   I++GG +
Sbjct: 675 EEGNCILEPGVFEIYVGGCQ 694


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 258/709 (36%), Positives = 389/709 (54%), Gaps = 32/709 (4%)

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD---FPGATSFPQV 116
           +S++EKV+ L+  +A V  LG+  + WW+E LHGV    PG  F  D   F  ATSFP  
Sbjct: 1   MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVG-FSPGVLFAQDSEPFGYATSFPLP 59

Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
           I TA+SF+  L+ AIG+V+  E RA  N G AG  +W+PN+N FRDPRWGRGQETPGED 
Sbjct: 60  ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119

Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
           ++   Y  SYV GLQGSD     + A+CKHF AYD++      R   N   ++QD++D +
Sbjct: 120 LVVSNYVQSYVTGLQGSDPTDKVIIAACKHFAAYDIE----TARRANNYNPTQQDLQDYY 175

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVG 293
              FR CV +  V +VMCSYN V+G+P C+   +LK  +R  W       ++VSDC +V 
Sbjct: 176 LPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQFVVSDCGAVT 235

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
             +   +FT+T ++AA+ ++ AG DL+CG    LH   ++    +++  ++ AL      
Sbjct: 236 DVWLLHNFTNTEQDAASVSMAAGTDLECGSSY-LHLNGSLADKQVTQERVDEALTRLYKA 294

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
              +G FDG   S     LG  DV T D Q++A EAAR G+ LLKN G  LPL+  ++++
Sbjct: 295 LFTVGYFDGSSHSS----LGWSDVSTIDAQQIACEAARAGMTLLKNDG-VLPLADGKYKS 349

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAI 472
           VA+IGP ++ T  M GNY G A    +PL    + +   ++     D+    D  F  A+
Sbjct: 350 VALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGTDINSTSDSGFADAL 409

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
            A++ +D  I   G+D +IEAE LDR  +  PG Q +L+S++SM  K P ++    GG +
Sbjct: 410 AAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLGK-PLVVAQFGGGQV 468

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           D     ++  + A+ WAG PGQAGG A+ D++ G ++  G+LP T YP  Y   + +  +
Sbjct: 469 DDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASYADLVSIFNI 528

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
            +RP+ +  +PGRTY++Y G  V+PFG G+ YT F  T  +        L+  +   N  
Sbjct: 529 NLRPNGT--FPGRTYKWYIGEPVFPFGFGLHYTKFNFTWKDT-------LEPTYDISNII 579

Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT-LLVFSTPPAGHWA-PHKQLVAFEK 710
              ++    H         V V VKNVG+    +  LL  S+  AG    P+K L ++ +
Sbjct: 580 SWARSQNNGHVTDTTPFTSVNVTVKNVGNVRSDYVGLLFLSSKNAGPVPRPNKSLASYSR 639

Query: 711 VH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
            H +  GA  ++ + + +  + +  D  G   I  G++ + +   K  V
Sbjct: 640 AHDIETGASDQLTLKLTLGSF-ARSDSQGNLTIFPGDYKLELDNDKSLV 687


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/719 (37%), Positives = 386/719 (53%), Gaps = 29/719 (4%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV-G 98
           LP C  SL   +R   L+  L+L+EK+      AA   RLG+  Y WW+EALHGV+   G
Sbjct: 36  LPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVAEKHG 95

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
              +  GDF  ATSFP  I   ++FN  L   +  ++S EARA  N   AG+ YW+PNVN
Sbjct: 96  VSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHAGIDYWTPNVN 155

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
            F+DPRWGRGQETPGEDP+   +Y   +V GLQG D ++ KV A+CKH  AYDL+ W GV
Sbjct: 156 PFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQGDDPEKPKVVATCKHLAAYDLEEWGGV 215

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            RF F+AKVS  D+ + +  PF+ C ++  V + MCSYN +NGVP CAD  +L+  +R  
Sbjct: 216 SRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREH 275

Query: 279 WRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           W   G   ++  DC +V       H+  +  EAAA A+ AG+DLDCG +L  +   A ++
Sbjct: 276 WGWEGPGHWVTGDCGAVERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQ 335

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GL+S   ++ AL    T  ++LG FD     QP   LG  DV T + +ELA   A QG V
Sbjct: 336 GLISNETLDAALTRLYTSLVQLGYFD-PAEGQPLRSLGWDDVATSEAEELAKTVAIQGTV 394

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LLKN   +LPL    + T+A+IGP  + T  +  NYAG A    T ++   R    +   
Sbjct: 395 LLKNIDWTLPLK--ANGTLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTA 452

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
              +V       F  A+  + +ADA I   G+D ++E E+LDR  +  PG Q+EL+ +++
Sbjct: 453 PGTEVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELA 512

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              + P  +V   GG +D +       + AI+WAGYP QAGG  + D+L G + P G+LP
Sbjct: 513 ELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAPAGRLP 571

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           +T YP+ Y+  +PMT+M ++P      PGRTYR+Y+  V+ PFG G+ YT F  + A   
Sbjct: 572 ITQYPKSYVDEVPMTDMNLQPGTDN--PGRTYRWYEDAVL-PFGFGLHYTTFNVSWAKK- 627

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                      G  +A    +    +    +  +L     V N G     +  LVF++ P
Sbjct: 628 ---------AFGPYDAATLARGKNPSSNIVDTFSLA----VTNTGDVASDYVALVFASAP 674

Query: 696 --AGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
                 AP K LV + +   +  G  ++V + + V   L+     G   +  GE+ + +
Sbjct: 675 ELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVTVAP-LTRATEDGRVVLYPGEYTLLV 732


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 249/604 (41%), Positives = 357/604 (59%), Gaps = 63/604 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ ++++ EK+  ++  + A+ RLGI  Y WW+EALHGV+  G           AT FPQ
Sbjct: 12  LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
            I  A++F+A L E IG VVS E R  +N        G   GLT+W+PNVNIFRDPRWGR
Sbjct: 62  AIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
           G ET GEDP L+GK   +Y+RGLQG D D LK AA  KHF  +     +G +  R  F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
           K SK D+ DT+   F+ CV + KV +VM +YN+VNG P C    +LK  +R E+   G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSDC ++  +++  H T T EE+AA A+  G DL+CG    LH + A  +GL+S+  I  
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGLVSDEAITA 295

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           A+   + V++RLGM    PS  PY  +  + V   +H EL++EAAR+ +VLLKN+   LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
           L     +T+AVIGPN++    +IGNY G +  Y TPL+G+ +Y     R ++ +GC   K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRVLYAEGCHLYK 413

Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
           D       +   F  A+  + Q+D  ++ +GLD +IE E         + D+ GL+LPG 
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           Q+EL+  V+   K P ILVL +G  ID+++A+    + AII + YPG  GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530

Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
             +P GKLP+T+Y  +   NLP  T+ +M          RTYR+    V+YPFG+G+ Y 
Sbjct: 531 EYSPSGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580

Query: 625 -TNF 627
            TN+
Sbjct: 581 ETNY 584


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 275/729 (37%), Positives = 397/729 (54%), Gaps = 37/729 (5%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P C  SLPI  RV  L+G+L+LQEK+  L+ GA+   R+G+  YEWWSEALHGV+   PG
Sbjct: 163 PACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPG 221

Query: 101 TKFGGD----FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
             F G     F  ATSFP  IT +++F+  L   I  VV  E RA  N G +G  +W+PN
Sbjct: 222 VTFAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSGFDFWTPN 281

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +N FRDPRWGRG ETPGED     +Y    + GLQGSD    ++ A+CKH+  YD++   
Sbjct: 282 INPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQGSDPLDKQIIATCKHYAVYDVE--- 338

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
              R+ ++      D+ + +  PF+ CV +  + SVMCSYN V+G+P CA   +L+  +R
Sbjct: 339 -TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASEYLLQSVLR 397

Query: 277 GEWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
             W       Y+VSDCD+V   Y   +FT +P  AAA A+ AG DL+CG    L+   ++
Sbjct: 398 DHWGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTY-LNLNQSL 456

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
              + +E  ++ AL    T    +G FDG   S  YG LG   V T D Q LA +AA  G
Sbjct: 457 ASNMTTEAALDRALTRLYTALHTIGFFDG---SARYGGLGWDAVGTGDAQVLAYQAAVDG 513

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ART 451
            VLLKN+   LPL   R R +AVIGP ++ T  M GNY G A    +PL        A  
Sbjct: 514 AVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAAFQSAWGADN 573

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
           +       +A      F AA+ A++ ADA + + G+D S+E+E+LDR  +  PG Q +L+
Sbjct: 574 VLFANGTGIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISWPGNQLDLI 633

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
           ++++   K P ++V   GG +D +    +PR+ A++WAGYPGQAGG AIAD+L G   P 
Sbjct: 634 AQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADLLTGKQAPA 692

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQS------KRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G+LP+T Y   Y + + + + ++RP +S        +PGRTY++Y G  V PFG+G+ YT
Sbjct: 693 GRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLPFGYGLHYT 752

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
            F    A+ P   A  + G   + N T +  A     A      L V V V N G     
Sbjct: 753 TFRTAWADEPRGRAYDIAGLFPA-NTTTTSSAFS---AADTYPVLNVSVTVTNTGRGASD 808

Query: 686 HTLLVF----STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           +  L+F    +  PA +  P+K LV + +   +  G+  R+ + + +   L+  D  G R
Sbjct: 809 YVGLLFLRTRNAGPAPY--PNKWLVGYARARGLAPGSSARLELAVALGS-LARADEDGRR 865

Query: 741 RIPLGEHNI 749
            +  G++ +
Sbjct: 866 VVYPGDYEL 874


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 273/727 (37%), Positives = 402/727 (55%), Gaps = 40/727 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLI------SGAAAVPRLGIKGYEWWSEALHGVSN 96
           C  +   P+R   L+  +++ EK+  L+      S +   PRLG+  YEWWSEALHGV+ 
Sbjct: 11  CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69

Query: 97  VGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
             PG  F   GG F  ATSF   IT +++F+  L + +  V+S EARA  N G+AGL +W
Sbjct: 70  ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+N +RDPRWGRG ETPGEDPV    Y  S +RGL+G +  + KV A+CKH+ AYDL+
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEESIK-KVIATCKHYAAYDLE 188

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
            W+ + R+ F+A VS QD+ + +  PF+ C  + KV S+MCSYN +NG P CA+  ++  
Sbjct: 189 RWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPACANTYLMDD 248

Query: 274 TIRGEWRL---NGYIVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDC---GPFLG 326
            +R  WR    N YI SDC+++  +   +H FT T  EAAA A  AG D  C   G    
Sbjct: 249 ILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTVCEVAGSPPY 308

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
                A  + LLSE  I+ AL       +R G FD   S+ PY  +G  DV T + Q LA
Sbjct: 309 TDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGWSDVNTAEAQALA 367

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
           L++A  G+VLLKN G +LP+  +  +TVA+IG  +  T +M+G Y+GI   Y +P+   G
Sbjct: 368 LQSASDGLVLLKNDG-TLPI-KLEGKTVALIGHWASGTRSMLGGYSGIPPYYHSPVYAAG 425

Query: 447 RYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
           +   T           + A D     A+ A+ ++D  +   GLDQS+ +E  DR  +  P
Sbjct: 426 QLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVASEDKDRDSIAWP 485

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
             Q  L+   ++A  G  ++V+  G  +D      +P ++AI+WAGYPGQ+GGTA+ + +
Sbjct: 486 PAQLTLIQ--TLAGLGKPLVVIQLGDQVDDTPLLTNPNVSAILWAGYPGQSGGTAVLNAI 543

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGHGMS 623
            G S P G+LP+T YP  Y + LP+T+M++RP  +   PGRTYR+  +   V PFG+G+ 
Sbjct: 544 TGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTYRWLPRNATVLPFGYGLH 603

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YTNF           A P   ++ ++  +      K+ H     L   V V+V N G++ 
Sbjct: 604 YTNF----------TARPNPAQNFTLTPSALLAPCKLAHRDLCPLPYPVTVEVTNTGART 653

Query: 684 GAHTLLVFSTPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTR 740
             +  LVF+T       PH  K LVA+ ++  +  G   R  + + +   L+ VD +G R
Sbjct: 654 SDYVGLVFATTRDAGPPPHPLKTLVAYARLRGIAPGRTARAQVQVALGD-LARVDAAGNR 712

Query: 741 RIPLGEH 747
            +  G +
Sbjct: 713 VLYPGRY 719


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 248/604 (41%), Positives = 356/604 (58%), Gaps = 63/604 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ ++++ EK+  ++  + A+ RLGI  Y WW+EALHGV+  G           AT FPQ
Sbjct: 12  LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
            I  A++F+  L E IG VVS E R  +N        G   GLT+W+PNVNIFRDPRWGR
Sbjct: 62  AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
           G ET GEDP L+GK   +Y+RGLQG D D LK AA  KHF  +     +G +  R  F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
           K SK D+ DT+   F+ CV + KV +VM +YN+VNG P C    +LK  +R E+   G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSDC ++  +++  H T T EE+AA A+  G DL+CG    LH + A  +GL+S+  I  
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGLVSDEAITA 295

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           A+   + V++RLGM    PS  PY  +  + V   +H EL++EAAR+ +VLLKN+   LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
           L     +T+AVIGPN++    +IGNY G +  Y TPL+G+ +Y     R ++ +GC   K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHLYK 413

Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
           D       +   F  A+  + Q+D  ++ +GLD +IE E         + D+ GL+LPG 
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           Q+EL+  V+   K P ILVL +G  ID+++A+    + AII + YPG  GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530

Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
             +P GKLP+T+Y  +   NLP  T+ +M          RTYR+    V+YPFG+G+ Y 
Sbjct: 531 EYSPSGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580

Query: 625 -TNF 627
            TN+
Sbjct: 581 ETNY 584


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 255/720 (35%), Positives = 392/720 (54%), Gaps = 29/720 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LI   +L+E +    +    VPRLG+  YE WSE LHG+       K
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLDRAH-FVK 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            G ++  ATSFP  I + ++ N TL   I  +++ +ARA  N G  GL  ++PN+N FR 
Sbjct: 123 SGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ GLQG  D D LK+AA+ KHF  YDL+NW G  R
Sbjct: 183 PLWGRGQETPGEDANFLTSSYAYEYITGLQGGIDPDNLKIAATAKHFAGYDLENWGGNSR 242

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A++++QD+ + +   F       K  S MCSYN VN +P+C+   +L+  +R +W 
Sbjct: 243 LGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSSSFLLQTLLREQWD 302

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+V   ++   + S    AAA+++RAG D+DCG     H   +   G +
Sbjct: 303 FPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDIDCGQTYSWHLNQSFIEGSV 362

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I  +++   +  ++LG FDG+ +   Y  LG  DV T D   ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSILRLYSNLVKLGYFDGDKNE--YRQLGWNDVVTTDAWNISYEAAVEGIVLLK 420

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G  LPLS    ++VA++GP ++ T  + GNY G A    TPLQG       ++     
Sbjct: 421 NDG-VLPLSK-NVKSVALVGPWANATKQLQGNYFGTAPYLITPLQGASDAGYKVNYALGT 478

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A++++D  + + G+D +IEAE  DR  +  P  Q +L+ ++S   
Sbjct: 479 NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPRNQLDLIQQLSQTG 538

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++ ++ A+IW GYPGQ+GG AI DIL G   P G+L  T 
Sbjct: 539 K-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDILKGKRAPAGRLVSTQ 597

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P T+M++RP   K  PG+TY +Y G  VY FG+G+ YT F  T        
Sbjct: 598 YPAEYATQFPATDMSLRP-DGKSNPGQTYMWYIGKPVYEFGYGLFYTTFKET-------- 648

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLLVFSTP-- 694
           A  L     S + +    + +    + + L   L V   +KN G     +T ++F+    
Sbjct: 649 AKKLGSSSSSFDISEIVSSPRSPSYEYSELVPFLNVTATIKNTGKTASPYTAMLFANTTN 708

Query: 695 --PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
             PA +  P+K LV ++++  +  G    + I + +   ++ VD++G R +  G++ + +
Sbjct: 709 AGPAPY--PNKWLVGYDRLPSIEPGKSADLVIPVPIGA-IARVDKNGNRIVYPGDYQLTL 765


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 256/712 (35%), Positives = 386/712 (54%), Gaps = 23/712 (3%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           R   LI   +L+E V    + +  VPRLG+  Y+ W+EALHG+       +  G F  +T
Sbjct: 72  RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRAYFTDE--GQFSWST 129

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQET 171
           SFP  I T S+ N TL   +  ++S + RA  N G  GL  +SPN+N FR P WGRGQET
Sbjct: 130 SFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRHPVWGRGQET 189

Query: 172 PGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           PGED   LS  YA  Y+ G+QG  D   LK+ A+ KH+  YD++NW+G  R   +  +++
Sbjct: 190 PGEDAYCLSSAYAYEYITGIQGGVDPKSLKLVATAKHYAGYDIENWDGHSRLGNDMNITQ 249

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVS 287
           QD+ + +   F +   + KV SVMCSYN VNGVP+CA+   L+  +R  +    +GYI S
Sbjct: 250 QDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISS 309

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCDS    ++   + +    AAAD+IRAG D+DCG     + + AV + LLS  DI   +
Sbjct: 310 DCDSAYNVFNPHEYAANVSSAAADSIRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGV 369

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
           +   +  MRLG FDG  S+  Y +L   DV T +   ++ E   +G VLLKN G +LPLS
Sbjct: 370 IRLYSNLMRLGYFDGNSSA--YRNLTWNDVVTTNSWNISYEV--EGTVLLKNDG-TLPLS 424

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL 467
               R++A++GP  +V+  + GNY G A    +PL         ++     +++      
Sbjct: 425 E-SIRSIALVGPWMNVSTQLQGNYFGPAPYLISPLDAFRDSHLDVNYAFGTNISSNSTDG 483

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           F  A+ A++++DA I   G+D S+EAE LDR  +  PG+Q EL+ ++S   K P I++ M
Sbjct: 484 FSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQM 542

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
            GG +D +  K++  + ++IW GYPGQ+GG A+ DI+ G   P G+L +T YP EY T  
Sbjct: 543 GGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQF 602

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F   V++A  V   P      
Sbjct: 603 PATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF--RVSHARAVKIKPT----Y 654

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLV 706
           +I   ++       H +     L   VD+ N G     +T ++F+   AG    P K LV
Sbjct: 655 NIQDLLAQPHPGYIHVEQMPF-LNFTVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLV 713

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
            F+++     +  ++         ++  D  G R +  G++ + +   +  V
Sbjct: 714 GFDRLPTLGPSTSKLMTIPVTINSMARTDELGNRVLYPGKYELALNNERSVV 765


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 266/747 (35%), Positives = 400/747 (53%), Gaps = 96/747 (12%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +SL   +    L+ +++L+EK+  +   + A+ RL I  Y WW+EALHGV+  G      
Sbjct: 1   MSLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV----- 55

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPN 156
                AT FPQ I  A++F+  L E IG VVS E RA +         G   GLT+W+PN
Sbjct: 56  -----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPN 110

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +NIFRDPRWGRG ET GEDP L+ K   +Y+RG+QG D D LK AA  KHF  +     +
Sbjct: 111 INIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGKDPDHLKAAACAKHFAVH-----S 165

Query: 217 GVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRT 274
           G +  R  F+AKVS  D+ DT+   F+ CV +  V +VM +YN+VNG P C    +L+  
Sbjct: 166 GPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSKTLLQDI 225

Query: 275 IRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
           +R ++   G++VSDC ++  +++  H T T EE+AA A+  G DL+CG    L+   A +
Sbjct: 226 LREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAF-LYLSRACE 284

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           +GL+ E  I  A+   + V++RLGM +  PS  PY ++    V  P+H  L+LEA+++ +
Sbjct: 285 QGLVEEKTITEAVERLMDVRIRLGMMEDYPS--PYANIPYDVVECPEHIALSLEASKRSM 342

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----AR 450
           VLLKN    LPL   +  T+AVIGPN++    ++GNY G +  Y TPL+GI  Y     R
Sbjct: 343 VLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYTGEKTR 402

Query: 451 TIHQQGCKDVACADDQL---------FGAAIDASRQADATILVMGLDQSIEAE------- 494
            ++ QGC       DQ+         F  A+ A+ +AD  ++ +GLD  IE E       
Sbjct: 403 VLYAQGCH---LYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAGNE 459

Query: 495 --ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
             + D+ GL LPG QQEL+  V+   K P +L +++G  +D+++A+   +I AI+   YP
Sbjct: 460 YASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQIRAILDCWYP 518

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYP-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           G  GG AIA+ LFG  +P GKLP+T+Y   E++ +     MA          GRTYR+  
Sbjct: 519 GARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDYSMA----------GRTYRYTD 568

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
             V+YPFG+G++Y+   ++ A+A               + T  G    VT          
Sbjct: 569 RHVLYPFGYGLTYSQIRYSDAHA---------------DVTDFGILEPVT---------- 603

Query: 672 VQVDVKNVGS---KDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
           V V V+N G+   ++     + FS   A  + P  QL     V +  G ++ V I +   
Sbjct: 604 VHVTVENTGTYPVQEAVQVYVRFSEREA--YDPGYQLKGIRSVALECGEKKEVCITLSP- 660

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           +  +++   G   +  G + I +GG +
Sbjct: 661 RDFALISEEGKCLVHPGSYEIAVGGQQ 687


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 247/604 (40%), Positives = 356/604 (58%), Gaps = 63/604 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ ++++ EK+  ++  + A+ RLGI  Y WW+EALHGV+  G           AT FPQ
Sbjct: 12  LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGR 167
            I  A++F+  L E IG VVS E R  +N        G   GLT+W+PNVNIFRDPRWGR
Sbjct: 62  AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
           G ET GEDP L+GK   +Y+RGLQG D D LK AA  KHF  +     +G +  R  F+A
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAACAKHFAVH-----SGPEAIRHEFDA 176

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
           K SK D+ DT+   F+ CV + KV +VM +YN+VNG P C    +LK  +R E+   G++
Sbjct: 177 KASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGHV 236

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSDC ++  +++  H T T EE+AA A+  G DL+CG    LH + A  +G++S+  I  
Sbjct: 237 VSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAF-LHLKDAYDKGMVSDEAITA 295

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           A+   + V++RLGM    PS  PY  +  + V   +H EL++EAAR+ +VLLKN+   LP
Sbjct: 296 AVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNFLP 353

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC---K 458
           L     +T+AVIGPN++    +IGNY G +  Y TPL+G+ +Y     R ++ +GC   K
Sbjct: 354 LDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHLYK 413

Query: 459 DVA---CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPGR 506
           D       +   F  A+  + Q+D  ++ +GLD +IE E         + D+ GL+LPG 
Sbjct: 414 DKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLPGL 473

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           Q+EL+  V+   K P ILVL +G  ID+++A+    + AII + YPG  GG A+A+ +FG
Sbjct: 474 QEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAIFG 530

Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY- 624
             +P GKLP+T+Y  +   NLP  T+ +M          RTYR+    V+YPFG+G+ Y 
Sbjct: 531 EYSPNGKLPVTFY--QGTENLPEFTDYSM--------AHRTYRYTNENVLYPFGYGLHYG 580

Query: 625 -TNF 627
            TN+
Sbjct: 581 ETNY 584


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 267/742 (35%), Positives = 388/742 (52%), Gaps = 44/742 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +L   +R   L+  L++ EK+  L++ A  VPRL I  YEWWSE LHGV++  PGTK
Sbjct: 65  CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVAS-SPGTK 123

Query: 103 FG--GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW------- 153
           F   G+F  ATSFPQ I   S+F+  L +AIG VVS EARA  N G +GL  +       
Sbjct: 124 FAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYVSSISRH 183

Query: 154 --------------SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
                         SPN+N F+DPRWGRGQETPGEDP     Y A+ + GL+G D  + K
Sbjct: 184 IEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGLEGGDPSK-K 242

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           + A+CKH+ A D +N+ GVDR  F+A ++ QD+ + +  PF+ C ++ KV S MCSYN +
Sbjct: 243 LIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNAI 302

Query: 260 NGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG 316
           NG P CA+P +L+  +R  W  NG   Y+ +DCD V +     H+      AAA A++AG
Sbjct: 303 NGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKAG 362

Query: 317 LDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
            DL+C  F G    + A  + L+SE +++ +L    T  + +G FD     QP   L   
Sbjct: 363 TDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDSA-RGQPLRSLSWD 421

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI-RHRTVAVIGPNSDVTVTMIGNYAGI 434
           DV T + Q+LA +A  +G VLLKN G  LPLS   R +  A+IGP  + T  M GNY G 
Sbjct: 422 DVNTKEAQKLAYQAVIEGAVLLKNDG-ILPLSAAWREKKYALIGPWINATTQMQGNYFGP 480

Query: 435 ACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
           A    +  Q    +           +   DD  F  A+D++  A   +   G+D ++EAE
Sbjct: 481 APYLISLYQAAKEFGLDFTYSLGSRINSTDDS-FKQALDSAHAAALIVFAGGVDNTLEAE 539

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR  L  P  Q +L+  VS   K P I++   GG +D      +  I A++W GYPGQ
Sbjct: 540 TRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSINALLWGGYPGQ 598

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
           +GG A+ D+LFG + P G+L +T YP  Y  ++P T+M +RP       GRTY +Y G  
Sbjct: 599 SGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLGRTYMWYNGDA 658

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT---LG 671
           V P+G G+ YT F   +        +    +   +++ +S   +  T      LT   + 
Sbjct: 659 VVPYGFGLHYTTFDAKLKARQASALI----KTEEVSSLLSNDYVSGTLVWQQILTKPVVS 714

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV-HVPAGAQQRVGINIHVCK 729
           V + V N G+    +  L+F    AG    P K L  + +  ++  G +    ++I + +
Sbjct: 715 VLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQPGDRSEREVSITIER 774

Query: 730 YLSVVDRSGTRRIPLGEHNIHI 751
            L  VD  G R +  G + + +
Sbjct: 775 -LVRVDELGNRVLHPGSYELFV 795


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 268/729 (36%), Positives = 393/729 (53%), Gaps = 36/729 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   LI   + +E V    +   A+PRLG+  Y+ W+EALHG+      T+
Sbjct: 61  CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANL-TE 119

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
           FG D+  ATSFP  I T ++ N TL   IG +VS + RA  NGG  GL  +SPN+N FR 
Sbjct: 120 FG-DYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRH 178

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           P WGRGQETPGED  L   Y   Y+ GLQG  D   LK+AA+ KHF  YD++NW    R 
Sbjct: 179 PVWGRGQETPGEDIQLCSVYGLEYITGLQGGLDPKELKLAATAKHFAGYDIENWGNHSRL 238

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
             +  +S  D    +   F   V + +V SVM SYN VNGVP  A+  +L+  +R  W  
Sbjct: 239 GNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLLRDTWNF 298

Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
             +GY+ SDCDSV   ++   + S+   AAA +I+AG D+DCG    L+   +  +G +S
Sbjct: 299 VEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNQSFTQGEIS 358

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             +I  A     +  + LG FDG+ S   Y  L   DV   D   ++ EAA +GIVLLKN
Sbjct: 359 RSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDVVATDAWNISYEAAVEGIVLLKN 416

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD 459
            G +LPLS   H +VA+IGP ++VT TM GNY G A   T PL  +      ++     +
Sbjct: 417 DG-TLPLSKDTH-SVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDVNYAFGTN 474

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
           ++      F AA+ A+R++D  I   G+D S+EAE +DR  +  PG Q +L+ ++S   K
Sbjct: 475 ISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIEQLSELGK 534

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +++ M GG +D +  K +  + +++W GYPGQ+GG AI DIL G   P G+L +T Y
Sbjct: 535 -PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLTVTQY 593

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           P EY    P T+M++RP  S   PG+TY +Y G  VY FGHG+ YT F  ++AN+     
Sbjct: 594 PAEYALQFPATDMSLRPKGSN--PGQTYMWYTGKPVYEFGHGLFYTTFETSLANS----- 646

Query: 640 VPLDGRHGSINATISG--KAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFST 693
                 HG+ N       K +  ++A  N +     +   ++V+N G+    +T + F  
Sbjct: 647 ------HGANNGASFDIVKLLSRSNAGYNVIEQVPFMNYTIEVENTGTVTSDYTAMAFVN 700

Query: 694 PPAG---HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
             AG   H  P+K LV F+++  +   A Q + I + +   ++  D  G R +  G++ +
Sbjct: 701 TKAGPSPH--PNKWLVGFDRLGGIEPHATQTMTIPVSLDN-VARTDEDGNRIVYPGKYEL 757

Query: 750 HIGGTKHSV 758
            +   + +V
Sbjct: 758 ALNNERSAV 766


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 265/738 (35%), Positives = 384/738 (52%), Gaps = 76/738 (10%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +T  F     P+  RVNDL+ +L+L EK+ LL   +  V RLGI  Y WW+EALHGV+  
Sbjct: 13  QTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARA 72

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
           G           AT FPQ I  A++FN  L +    V+S EARA YN   A        G
Sbjct: 73  GV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMG 122

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           LT+WSPN+NIFRDPRWGRGQET GEDP L+     ++V+GLQG+D   LK +A  KHF  
Sbjct: 123 LTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGNDPRYLKASACAKHFAV 182

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           +     NG  R  FNA V ++D+ +T+   F   V  G V SVMC+YN+VN  P C+   
Sbjct: 183 HSGPE-NG--RHTFNAIVDEKDLRETYLYAFHALVDAG-VESVMCAYNRVNDQPCCSGNF 238

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
           +L   +R EW+  G++V+DC ++   +       +  E AA AI+AG++LDC   L    
Sbjct: 239 LLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVNLDCSNVLQKDV 298

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV++ LL+E DI+++L + L  Q++LG +D +P++ P+   G   V    H  LA   
Sbjct: 299 EKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVANTAHATLARAM 357

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
           A+Q +VLLKN    LPL   ++  + V+G NS     ++GNY G++    + ++GI    
Sbjct: 358 AQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAVSFVEGITNAV 417

Query: 449 ---ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL---------DQSIEAEAL 496
               R  + QG       D   FG  I A+  AD T+ V+GL         D  + A+  
Sbjct: 418 DAGTRVEYDQGSD---YNDTTHFG-GIWAAGNADITVAVIGLTPVYEGEEGDAFLAAKGG 473

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           D+  + LP      +  +  A+K P I V+ +G  +D++    +P   AI+ A YPG+ G
Sbjct: 474 DKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAI--EPYADAILLAWYPGEQG 531

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+ADILFG  +P G+LP+T+Y Q +         AM+        GRTYR++ G V Y
Sbjct: 532 GNALADILFGKVSPAGRLPVTFY-QSFADVPAYDNYAMK--------GRTYRYFNGKVQY 582

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG+G+SYT+F +     P  +    D                         ++   + V
Sbjct: 583 PFGYGLSYTSFAYEWQQMPANIRTAKD-------------------------SVSFSIKV 617

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           KN GS DG   + V+   PA    P K+L AF++VHV AG ++ V + I           
Sbjct: 618 KNTGSMDGDEVVQVYVEYPAVERMPLKELKAFKRVHVKAGGEETVQLTIPASDLQKWDLA 677

Query: 737 SGTRRIPLGEHNIHIGGT 754
           + + ++  G +NI  GG 
Sbjct: 678 TSSWKLYPGSYNIFAGGN 695


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 270/747 (36%), Positives = 393/747 (52%), Gaps = 94/747 (12%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R  DL+ R++L+EK   L   A  V RLGI  Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+    E I  V++ E RA YN            G+T+W
Sbjct: 64  --------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPNVNIFRDPRWGRG ET GEDP L+ +   ++V+GLQG DG  LK AA  KHF  +   
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKTAACAKHFAVH--- 171

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +    DR HF+A VS++D+ +T+   F   V E KV SVM +YN+ NG P      +LK 
Sbjct: 172 SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTLLKD 231

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R  W  +G++VSDC ++  +++    T TP E+ A A+++G DL+CG    L    A+
Sbjct: 232 ILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLILLAL 290

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           + G ++E DI+ A +  +T +MRLGMFD   E    PY  L      + +H +L+LEAA+
Sbjct: 291 KEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPY-ELND----SVEHNKLSLEAAK 345

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
           + +VLLKN G  LPL   + + +AVIGPN+D ++ +  NY+G      T L GI +    
Sbjct: 346 KSMVLLKNDG-LLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRKRVSE 404

Query: 449 -ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
             R  +  G        +D+A  DD+L   A+  + ++D  +L +GLD S+E E      
Sbjct: 405 DTRVWYSVGSHLFMNREEDLAQPDDRL-KEAVSVAERSDVVVLCLGLDASVEGEQNDQGT 463

Query: 497 -------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
                  D+A L LP  Q+ L++ V +A+  PTI+ L+SG  + +  A +  + AAI+  
Sbjct: 464 VILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQC 520

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
            YPG  GG A A+++FG  +P G+LP+T+Y       LP       P        RTY+F
Sbjct: 521 WYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFADYSMENRTYKF 571

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
            KG  +YPFG G+SYTNF ++    P  V                           N   
Sbjct: 572 MKGEALYPFGFGLSYTNFEYSNIVCPQNVN--------------------------NGEN 605

Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
           L V VDV+N GS D    + V+     A    P   L  F+++H+ +G ++ V   I   
Sbjct: 606 LSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVTFEID-S 664

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
             +++VD +G R I  GE  +++GG++
Sbjct: 665 NAMTIVDEAGKRYIENGEFTLYVGGSQ 691


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 260/733 (35%), Positives = 390/733 (53%), Gaps = 83/733 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R  +L+ +++++EK   ++  A A+ RLGI  Y WW+EALHGV+  G           A
Sbjct: 7   KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------A 56

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I   ++F+  L E +  V S E RA YN  T         GLTYW+PNVNIFRD
Sbjct: 57  TVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRD 116

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+G+   +YVRGLQG D D  K AA  KHF  +   +    +R H
Sbjct: 117 PRWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNPKSAACAKHFAVH---SGPEAERHH 173

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+AKV+ QD+ DT+   F+  V + KV +VM +YN+VNG P C    +LK  +RG+W   
Sbjct: 174 FDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDILRGDWGFE 233

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G++VSDC ++  +++    T    E+AA A+  G DL+CG         A +  L++E  
Sbjct: 234 GHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYE-KLLYAYKANLVTEET 292

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           I  ++   + +++RLG      S   Y  +  + V   +H+ELA+EAA++ +VLLKN G 
Sbjct: 293 ITESVERLIELRLRLGTLPERRSK--YDDIPYEVVECKEHKELAIEAAKRSMVLLKNDGL 350

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC- 457
            LPL     +T+ VIGPNS+  + ++GNY GI+  Y T L+GI +Y     R  H  G  
Sbjct: 351 -LPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRVFHSDGTP 409

Query: 458 --KD---VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLL 503
             KD   V       F  A+  +  +D  +L MGLD +IE E         + D+ GL L
Sbjct: 410 LWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSGDKKGLKL 469

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG QQEL+ K++   K P +L++++G  +D+++A  +  + AI+   YPG  GG AIA +
Sbjct: 470 PGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHCWYPGARGGKAIAQV 526

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  +P GKLP+T+Y  +   +LP       P +     GRTYR++KG  +YPFG+G+S
Sbjct: 527 LFGEDSPSGKLPLTFYKSD--ADLP-------PFEDYSMEGRTYRYFKGTPLYPFGYGLS 577

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y++  ++ A         +D   G+I    +                 V+V VKN G   
Sbjct: 578 YSDIQYSNAG--------IDKTEGAIGDKFT-----------------VKVTVKNAGDYK 612

Query: 684 GAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
              T+ V+     A     +  L    KV +  G  + V + +   +  +++D  G   +
Sbjct: 613 AHETVQVYVKDVEASTRVANCSLRKIAKVELLPGESKEVSLELS-ARDFAIIDEKGHCIV 671

Query: 743 PLGEHNIHIGGTK 755
             G+  + +GG +
Sbjct: 672 EPGKFKVFVGGQQ 684


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 273/726 (37%), Positives = 400/726 (55%), Gaps = 47/726 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +  I  R   ++  ++L EKV  + S A+   RLG+  Y+W +EALHGV+    G +
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAG-STGVQ 197

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    G +F  ATSFP  I  +++F+  L +++   +S EARA  N G AGL +W+PN+N
Sbjct: 198 FQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTPNIN 257

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
            FRDPRWGRG ETPGED      Y  S + GLQG  D D  +  ++CKHF AYD++N   
Sbjct: 258 PFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQGGIDPDFFRTISTCKHFAAYDIEN--- 314

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R   N   ++QD+ D +   F  CV + KV S+MC+YN VNGVP CAD  +L+  +R 
Sbjct: 315 -GRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSYLLQSVLRD 373

Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            +       Y+VSDCD+V   YD  H+ +   +AAA ++ AG DLDCG    +   S VQ
Sbjct: 374 GYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQAAAMSLNAGTDLDCGSSYNVLNAS-VQ 432

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
            G+ +E  ++ +L+   +  +++G FD +P+   Y  LG  +V T   + LA +AA  G+
Sbjct: 433 AGMTTEATLDKSLIRLYSALIKVGWFD-QPAK--YSSLGWGNVNTTQTRALAHDAATGGM 489

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
            LLKN G +LPLS    + VAVIGP  + T  + GNYAG A     PL    +  R +  
Sbjct: 490 TLLKNDG-TLPLSPTL-QNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQKWRNVKY 547

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
                +   D   F AAI A+  +D  + + G+D S+E E  DR  +  PG Q  L+S++
Sbjct: 548 AQGTAINSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQLSLISQL 607

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           +   K P ++V   GG ID +   ++ ++ +I+WAGYPGQ GG A+ D+L G + P G+L
Sbjct: 608 ANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGANPPAGRL 666

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+T YP  Y+ N  + +M +RPS S   PGRTY +Y G  V PFG+G+ YTNF  +  + 
Sbjct: 667 PITQYPANYVNNNNIQDMNLRPSGS--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQST 724

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG-VQVDVKNVGSK-----DGAHTL 688
            T          G+  ATI   A     +  +R T   + V+VKN G K     D    L
Sbjct: 725 KT---------SGTDVATIVNNA----GSNKDRATFATLVVNVKNTGGKANLASDYVGLL 771

Query: 689 LVFSTP--PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
            + ST   PA H  P+KQL A+ +V  V  GA Q++ + +++   L+  D +G R +  G
Sbjct: 772 FLKSTNAGPAPH--PNKQLAAYGRVKKVGVGATQQLTLTVNLGS-LARADTNGDRWVYPG 828

Query: 746 EHNIHI 751
            + + +
Sbjct: 829 AYTLTL 834


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 239/605 (39%), Positives = 335/605 (55%), Gaps = 12/605 (1%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           ++T  T P C  SL    R + L+   +L+E      + A  VPRLG+  YE W+EALHG
Sbjct: 51  NSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHG 110

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYW 153
           +S+    T   G +  ATSFP  I + +S N TL   IG ++S + RA  N G  GL  +
Sbjct: 111 ISHGHFATN--GTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRYGLDSY 168

Query: 154 SPNVNIFRDPRWGRGQETPGEDP-VLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
           +PN+N FR P WGRGQETPGED   LS  YA  Y+ G+QG     + K+ A  KHF  YD
Sbjct: 169 APNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGGKAPAVPKLVAVPKHFAGYD 228

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
           ++NWN   R   +  +++QD+   +   FR  +   K   +MCSYN VNGVP+C++   L
Sbjct: 229 IENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPSCSNSFFL 288

Query: 272 KRTIRGEWRL-NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           +   R  W   NG++ SDCD+V   Y+   + +    A AD++RAG D+DCG     +  
Sbjct: 289 QTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDIDCGTSYPFYLV 348

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
            A   GL+S  DI  AL    +  +  G FDG  +S  Y +LG  DV T D   ++ EAA
Sbjct: 349 PAFNAGLVSRNDIELALTRYYSGLVMQGYFDG--NSSLYRNLGWNDVLTTDAWNISYEAA 406

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
            +GI LLKN G +LPLS    R+VA+IGP ++ T+ + GNY   A    +PLQ       
Sbjct: 407 VEGITLLKNDG-TLPLSK-STRSVALIGPWANATLQLQGNYYAAAPYLISPLQAFRASGM 464

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
           T++      ++  +   F  AI  ++Q+D  I   G+D SIEAE LDR  +  PG Q +L
Sbjct: 465 TVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITWPGNQLDL 524

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           + ++S   K P +++ M GG +D +  KN+ ++ A++W GYPGQ+GG A+ DI+ G   P
Sbjct: 525 IYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDIIMGNRAP 583

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
            G+L  T YP  Y T+     M M P       G+TY +Y G  VYPFGHG+ YTNF  T
Sbjct: 584 AGRLVTTQYPASYATSFNQLNMNMAPVNGSL--GQTYMWYTGTPVYPFGHGLFYTNFTTT 641

Query: 631 VANAP 635
               P
Sbjct: 642 STMGP 646


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/733 (36%), Positives = 399/733 (54%), Gaps = 29/733 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S    +R   LI   +L+E +    +    VPRLG+  Y+ W+EALHG+      TK
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            GG F  ATSFP  I T ++ N TL   I  ++S +ARA  N G  GL  ++PNVN FR 
Sbjct: 123 -GGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNGFRS 181

Query: 163 PRWGRGQETPGEDPV-LSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   LS  Y   Y+ G+QG  D + LKVAA+ KHF  YDL+NWN   R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEHLKVAATVKHFAGYDLENWNNQSR 241

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S+MC+YN VNGVP+CA+   L+  +R  W 
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCANSFFLQTLLRESWG 301

Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+V   ++   + S    AAA ++RAG D+DCG     H   +   G +
Sbjct: 302 FPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  +I  ++       +RLG FD       Y  LG KDV   D   ++ EAA +GIVLLK
Sbjct: 362 SRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS  + R++A+IGP ++ T  M GNY G A    +PL+   +    ++ +   
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKKAGYHVNFELGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           ++A      F  AI A++++DA I + G+D +IE E  DR  +  PG Q +L+ ++S   
Sbjct: 477 EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++ ++ +++W GYPGQ+GG A+ DIL G   P G+L  T 
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLVTTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY+   P  +M +RP   K  PG+TY +Y G  VY FG G+ YT F  T+A+ P  +
Sbjct: 596 YPAEYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYEFGSGLFYTTFKETLASHPKSL 654

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
                    + ++ +S      T+++        + ++KN G  +  +T ++F    +  
Sbjct: 655 KF-------NTSSILSAPHPGYTYSE-QIPVFTFEANIKNSGKTESPYTAMLFVRTSNAG 706

Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           PA +  P+K LV F+++  +  G   ++ I I V   L+ VD  G R +  G++ + +  
Sbjct: 707 PAPY--PNKWLVGFDRLADIKPGHSSKLSIPIPVSA-LARVDSHGNRIVYPGKYELAL-N 762

Query: 754 TKHSVSLHAATLG 766
           T  SV L    +G
Sbjct: 763 TDESVKLEFELVG 775


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 265/746 (35%), Positives = 394/746 (52%), Gaps = 92/746 (12%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R  DL+ R++L+EK   L   A  V RLGI  Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+    E I  V++ E RA YN  +         G+T+W
Sbjct: 64  --------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPNVNIFRDPRWGRG ET GEDP L+ +   ++V+GLQG DG  LK AA  KHF  +   
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKSAACAKHFAVH--- 171

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +    DR HFNA  S++D+ +T+   F   V E KV SVM +YN+ NG P      +LK 
Sbjct: 172 SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTLLKD 231

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R +W  +G++VSDC ++  +++    T TP E+ A A++ G DL+CG  + L    A+
Sbjct: 232 ILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGN-MYLLILLAL 290

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQ 392
           + G ++E DI+ A +  +T +M+LGMFD +          P +V  + +H +L+LEAAR+
Sbjct: 291 KEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKI----PYEVNDSIEHNKLSLEAARK 346

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRY 448
            +VLLKN G  LPL   + + +AVIGPN+D ++ +  NY+G      T L G    +   
Sbjct: 347 SMVLLKNNGL-LPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVRSRVSED 405

Query: 449 ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----- 496
            R  +  G        +D+A  DD+L   A+  + ++D  +L +GLD S+E E       
Sbjct: 406 TRVWYSLGSHLFMNREEDLAQPDDRL-KEAVSMAERSDVVVLCLGLDASVEGEQNDQGTV 464

Query: 497 ------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
                 D+A L LP  Q+ L++ V +A+  PTI+ L+SG  + +  A +  + AAI+   
Sbjct: 465 ILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQCW 521

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           YPG  GG A A+++FG  +P G+LP+T+Y       LP       P +      RTY+F 
Sbjct: 522 YPGSKGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFEDYSMENRTYKFM 572

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
           KG  +YPFG G+SYTNF ++    P  V                           N  +L
Sbjct: 573 KGEALYPFGFGLSYTNFEYSNIVCPQAVN--------------------------NGESL 606

Query: 671 GVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V VDV+N GS D    + V+     A    P+  L  F+++ + +G ++ V   I   +
Sbjct: 607 SVSVDVQNAGSVDSDEVVQVYIKDMEASVRVPNHSLCGFKRIFLKSGEKKTVTFEID-SR 665

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTK 755
            +++VD  G R I  G+  +++GG +
Sbjct: 666 AMTIVDEEGKRYIENGDFTLYVGGAQ 691


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 264/713 (37%), Positives = 381/713 (53%), Gaps = 32/713 (4%)

Query: 14  LLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
           LLS +S+ ++A + PF   P   +    +L  C  +L  PQR   L+  ++ +EK++ L+
Sbjct: 10  LLSCTSALVSAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLV 69

Query: 71  SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATL 127
           S +   PR+G+  Y WWSEALHGV+   PGT+F    G F  +TSFP  +  A++F+  L
Sbjct: 70  SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGDGPFNSSTSFPMPLLMAATFDDEL 128

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
            E +G V+  E RA  N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS +
Sbjct: 129 IEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMI 188

Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
           RGLQG   +R +V A+CKH+ A D ++WNG  R  F+AKV+ QD+ + +  PF+ C  + 
Sbjct: 189 RGLQGPLPER-RVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDS 247

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTST 304
           KV S+MCSYN VNGVP CA+  +++  +R  W       YI SDC++V   +   H+  T
Sbjct: 248 KVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHYAKT 307

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             E  A A  AG D  C          A  +GLL +  ++ AL       +R+G FDG  
Sbjct: 308 NAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNH 367

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN-QGPSLPLSHIRHRTVAVIGPNSDV 423
           S   Y  LG KDV +P  QE+AL+ A +GIVLLKN Q   L L       +A+IG  ++ 
Sbjct: 368 SE--YASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSKLAMIGFWAND 425

Query: 424 TVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADAT 481
             T+ G Y+G      +P+         +   G      + ++D    AA++A++ A+  
Sbjct: 426 PKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYI 485

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           +   GLD S   E  DR  +  P  Q +L+  ++   K P ++V M G  +D        
Sbjct: 486 LYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQLDNTPLLATK 543

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
            + +I+WA +PGQ GGTA+  IL G  +P G+LP+T YP  Y   +PMT+M +RPS   R
Sbjct: 544 TVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSD--R 601

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA-PTVVAVPLDGRHGSINATISGKAIKV 660
            PGRTYR+Y    V PFG G+ YT F   +A   P +    L  R G  NA        +
Sbjct: 602 LPGRTYRWYPT-AVQPFGFGLHYTTFQAKIAAPLPRLAIQDLLSRCGGDNANAYPDTCAL 660

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
              K         V+V N G++   + +L F    AG    P K LV++ ++ 
Sbjct: 661 PPLK---------VEVTNSGNRSSDYVVLAFLAGDAGPRPYPIKTLVSYTRLR 704


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 264/722 (36%), Positives = 395/722 (54%), Gaps = 39/722 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +L + +R   ++  ++L EKV  + S A+   RLG+  Y+W +EALHGV+    G +
Sbjct: 144 CDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 202

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    G +F  ATSFP  I  +++F+  L + +   +S EARA  N G AGL +W+PN+N
Sbjct: 203 FQSPLGANFSAATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDFWTPNIN 262

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNG 217
            FRDPRWGRG ETPGED      Y  + + GLQG  + D  ++ A+CKHF AYD++N   
Sbjct: 263 PFRDPRWGRGMETPGEDAFRIQGYVLALISGLQGGINPDFFRIIATCKHFAAYDIEN--- 319

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R   N   ++QD+ D +   F  CV + KV SVMC+YN V+G+P CA   +L+  +R 
Sbjct: 320 -GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACASEYLLQDVLRD 378

Query: 278 EWRLN---GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            +       Y+VSDCD+V   +D  H+ S   EAAA ++ AG DLDCG    +   S V+
Sbjct: 379 GFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTEAAALSLNAGTDLDCGSSYNVLNAS-VE 437

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
             L SE  +N +LV   +  +++G FD +PS   Y  L   +V T  +Q LA +AA  G+
Sbjct: 438 AALTSEAALNQSLVRLYSALIKVGYFD-QPSE--YKSLSWANVNTTQNQALAHDAATGGM 494

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIH 453
            LLKN G +LPLS      VA+IGP  + T  M GNYAG A     PL     ++    +
Sbjct: 495 TLLKNDG-TLPLSRTLSN-VAIIGPWVNATTQMQGNYAGTAPFLVNPLDVFQQKWGNVKY 552

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
            QG   +   D   F AA+ A+  +D  + + G+D ++E E  DR  ++ PG Q +L+S+
Sbjct: 553 AQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWPGNQLDLISQ 611

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           ++   K P ++V   GG ID +   ++P + +I+WAGYPGQ GG A+ D+L G + P G+
Sbjct: 612 LANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVLTGANPPAGR 670

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF---VHT 630
           LP+T YP  YI N  + +M +RPS     PGRTY +Y G  V PFG+G+ YTNF     +
Sbjct: 671 LPITQYPASYINNNNIQDMNLRPSNG--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQS 728

Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV 690
           +  A T VA  ++     I+ ++    +   H    +  L          + D    + +
Sbjct: 729 INTAGTDVATIVNNAGAVIDTSVFATLVVSVHNTGGKANL----------ASDYVGLVFL 778

Query: 691 FSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            ST       P+KQL A+ +   V  GA Q++ + I++   L+  D +G R I  G++ +
Sbjct: 779 SSTNAGPSPYPNKQLAAYGRAKSVGVGATQQLTLKINLGS-LARADTNGDRWIYPGDYKL 837

Query: 750 HI 751
            +
Sbjct: 838 TL 839


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 270/716 (37%), Positives = 387/716 (54%), Gaps = 38/716 (5%)

Query: 14  LLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
           LLS SS  ++A + PF   P   +    +L  C V+L  PQR   L+  ++ +EK++ L+
Sbjct: 10  LLSCSSVLVSAIDLPFQTYPDCVNGPLASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLV 69

Query: 71  SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATL 127
           S +   PR+G+  Y WWSEALHGV+   PGT+F    G F  +TSFP  +  A++F+  L
Sbjct: 70  SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFWSGDGPFNASTSFPMPLLMAATFDDEL 128

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
            E +G V+  E RA  N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS +
Sbjct: 129 IEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMI 188

Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
           RGLQG   +R +V A+CKH+ A D ++WNG  R  FNAKV+ QD+ + +  PF+ C  + 
Sbjct: 189 RGLQGPARER-RVVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDS 247

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHFTST 304
           KV S+MCSYN VNGVP CA+  +++  +R  W       YI SDC++V       H+  T
Sbjct: 248 KVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHYAET 307

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             E  A A  AG+D  C          A  +GLL +  ++ AL       +R+G FDG  
Sbjct: 308 NAEGTALAFEAGIDSSCEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNH 367

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR---HRTVAVIGPNS 421
           S   Y  LG KDV +P  QE+AL+AA +GIVLLKN   +LPL  +R      +A+IG  +
Sbjct: 368 SE--YASLGWKDVNSPKSQEVALQAAVEGIVLLKNDK-TLPLD-LRTDPKSKLAMIGFWA 423

Query: 422 DVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
           +   T+ G Y+G      +P+   Q +G ++ T         + ++D    AA++A++ A
Sbjct: 424 NDPKTLSGGYSGKPAFEHSPVYAAQAMG-FSVTTAGGPVLQNSTSNDTWTQAALEAAKDA 482

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           +  +   G D S   E  DR  +  P  Q +L++ +S   K P ++V M G  +D     
Sbjct: 483 NYILYFGGQDTSAAGETKDRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLL 540

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
               + AI+WA + GQ GGTA+  IL G  NP G+LP+T YP  Y   +PMT+M +RPS 
Sbjct: 541 AAKAVNAILWANWLGQDGGTAVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSD 600

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA-NAPTVVAVPLDGRHGSINATISGKA 657
             + PGRTYR+Y    V PFG G+ YT F   +A   P +    L  R G  NA      
Sbjct: 601 --KLPGRTYRWYPT-AVQPFGFGLHYTTFQTKIAVPLPRLAIQDLLSRCGGDNANAYPDT 657

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
             +   K         V+V N G++   + +L F     G    P K LV++ ++ 
Sbjct: 658 CALPPLK---------VEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLR 704


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 265/747 (35%), Positives = 395/747 (52%), Gaps = 94/747 (12%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R  DL+ +++L+EK   L   A  V RLGI  Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+    E I  V++ E RA YN            G+T+W
Sbjct: 64  --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPNVNIFRDPRWGRG ET GEDP L+ +   ++V+GLQG DG  LK AA  KH+  +   
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKTAACAKHYAVH--- 171

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +    DR  F+A VS++D+ +T+   F   V E KV S+M +YN+ NG P      +LK 
Sbjct: 172 SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGSKTLLKD 231

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R  W  +G++VSDC ++  +++    T TP E+ A A+++G DL+CG    L    A+
Sbjct: 232 ILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLILLAL 290

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           + GL++E DI+ A +  +T +M+LGMFD   E  + PY  L      + +H +++LEAA+
Sbjct: 291 KEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPY-ELND----SAEHNKISLEAAK 345

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
           + +VLLKN G  LPL   + + VAVIGPN+D ++ +  NY+G      T ++GI +    
Sbjct: 346 KSMVLLKNDGL-LPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRKRVSE 404

Query: 449 -ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---- 496
             R  +  G        +D+A  DD+L   A+ A+ ++D  +L +GLD S+E E      
Sbjct: 405 NTRVWYAMGSHLFLNRDEDLAQPDDRL-KEAVSAAERSDVVVLCLGLDASVEGEQNDQGT 463

Query: 497 -------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
                  D+A L LP  Q+ L++ V +A+  PTI+ L+SG  + +  A +  + AAI+  
Sbjct: 464 VILDAGGDKADLNLPESQRNLLNAV-LATGKPTIVALLSGSALSIGDAAD--KAAAIVQC 520

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
            YPG  GG A A+++FG  +P G+LP+T+Y       LP       P        RTY+F
Sbjct: 521 WYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKS--TEELP-------PFADYSMENRTYKF 571

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
            KG  +YPFG G+SYT+F ++    P  V                           N   
Sbjct: 572 MKGDALYPFGFGLSYTSFEYSNMVCPQTVN--------------------------NGEN 605

Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
           L V VDV+N GS D    + V+     A    P   L  F+++H+ +G ++ V   +   
Sbjct: 606 LSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEKKTVTFEV-AS 664

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
             +S+VD +G R I  GE  ++ GG++
Sbjct: 665 NAMSIVDEAGKRHIENGEFTLYAGGSQ 691


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 264/749 (35%), Positives = 392/749 (52%), Gaps = 77/749 (10%)

Query: 38   RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
            +  PF    L I QRV+DL+ RL+L+EKV+ +++ A A+ RLGI  Y WW+E LHGV   
Sbjct: 710  KRYPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR- 768

Query: 98   GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
               TK+       T FPQ I  A+S+N  L + +   ++DE RA+YN             
Sbjct: 769  ---TKYH-----VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHA 820

Query: 150  LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
            LTYW+PN+NIFRDPRWGRGQET GEDP L+ K   ++V GLQG D   LK +A  KH+  
Sbjct: 821  LTYWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGDDPRYLKASACAKHYAV 880

Query: 210  YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
            +     N   R  FN+ VS  D+ DT+   FR  V++  V+ VMC+YN   G P C +  
Sbjct: 881  HSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDL 937

Query: 270  ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
            +++  +R +W   GY+ SDC ++   ++          AAADA+  G DLDCG    L  
Sbjct: 938  LMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLAL 997

Query: 330  ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALE 388
              AV+ G+++E  ++ ++    T++ RLG+FD  P+ Q  Y H+    +    HQ+LA +
Sbjct: 998  VKAVKNGIITEKQLDVSVKRLFTIRFRLGLFD--PAEQVDYAHIPISVLECKKHQDLAKQ 1055

Query: 389  AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR- 447
             AR+ +VLLKN    LPL   + + V V+GPN+D    ++GNY G      TPLQ I   
Sbjct: 1056 LARESMVLLKND-RLLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRER 1114

Query: 448  ---YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL-------- 496
                A  ++  G   +    +      ++ ++ ADA I + G+   +E E +        
Sbjct: 1115 LKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGFD 1174

Query: 497  --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
              DR  + LP  Q +L+ K  +A + PT+ V+M+G  + + +      + AI+ A Y GQ
Sbjct: 1175 GGDRTSIALPTVQTQLM-KALVAGRIPTVFVMMTGSALAIPWEAK--HVPAILNAWYGGQ 1231

Query: 555  AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
             GG AIAD+LFG  NP GKLP+T+Y ++  ++LP  E       S    GRTYR++KG  
Sbjct: 1232 YGGEAIADVLFGDYNPSGKLPVTFYAKD--SDLPDFE-------SYDMQGRTYRYFKGKA 1282

Query: 615  VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            +YPFG+G+SYT+F ++    PT                 + K I VT            V
Sbjct: 1283 LYPFGYGLSYTDFRYSSLKMPTA-------------CNTTDKEIPVT------------V 1317

Query: 675  DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
             VKN G  DG   + L  S P      P   L  F+++++ AG  +++  ++   + LS 
Sbjct: 1318 TVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAGEAKQITFSLS-SEDLSC 1376

Query: 734  VDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
            VD +G R++  G   I +GG     +L A
Sbjct: 1377 VDENGIRKVLPGTVKIQVGGCSPVATLTA 1405


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/724 (36%), Positives = 394/724 (54%), Gaps = 27/724 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   +L+E V    +    VPRLG+  Y+ WSE+LHGV      ++
Sbjct: 63  CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYRANWASE 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GD+  ATSFPQ I T ++ N TL   IG ++S +ARA  N G  GL  ++PN+N FR 
Sbjct: 123 --GDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+G  R
Sbjct: 181 PVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLVATAKHYAGYDIENWDGHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRETFG 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    A+AD+IRAG D+DCG     H  +A   G +
Sbjct: 301 FVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGTSYQYHFTNAFDEGEI 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  DI   ++   T  +RLG FDG  +S  Y  L   DV T D   ++ EAA +G VLLK
Sbjct: 361 SRQDIERGVIRLYTNLVRLGYFDG--NSSQYRDLTWSDVQTTDAWNISHEAAVEGTVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPL+    R+VA+IGP ++ T  M GNY G A   T+PL  +      +H     
Sbjct: 419 NDG-TLPLAD-SIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAALEASDLDVHYAFGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A+R+ADA I   G+D +IE EALDR  +  PG Q +L++++S   
Sbjct: 477 NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWPGNQLDLINQLSALG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++  ++A++W GYPGQ+GGTA+ DI+ G   P G+L  T 
Sbjct: 537 K-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDIIRGVRAPAGRLVTTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP  Y T  P  +M +RP+ +   PG+TY +Y G  VY FGHG+ YT F    A+  T  
Sbjct: 596 YPAGYATQFPAIDMGLRPNGTN--PGQTYMWYTGTPVYEFGHGLFYTTFEAKRASTAT-- 651

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
                  H S N      A    +A    R  L     + N G     +T ++F+   AG
Sbjct: 652 ------NHSSFNIEDLLTAPHPGYAYPQLRPFLNFTAHITNTGRTTSDYTAMLFANTTAG 705

Query: 698 HWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
             APH  K LV F+++  +  GA Q +   I +   ++  D  G R +  G + + +   
Sbjct: 706 P-APHPNKWLVGFDRLGALEPGASQTMTFPITIDN-VARTDELGNRVLYPGRYELALNNE 763

Query: 755 KHSV 758
           +  V
Sbjct: 764 RSVV 767


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/704 (38%), Positives = 386/704 (54%), Gaps = 60/704 (8%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   LI   +L EKV    + +  VPR+G+  YEWW+EALHG++   PGT 
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGTT 92

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G ++  +TSFPQ I   ++F+  L   +   VS EARA  N    GL +W+PN+N 
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD--RLKVAASCKHFTAYDLDNWNG 217
           ++DPRWGRGQETPGEDP  +  Y  + + GLQG   D    K  A+CKHF  YDL+N +G
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKHFAGYDLENSDG 212

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ F+A +  QD+ D +  PF+ C  +  V SVMCSYN +NGVPTCAD  +L+  +R 
Sbjct: 213 AIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQTLLRE 272

Query: 278 EW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ 334
            W     + ++ SDCD+V   +D  ++T TPE++AADA+ AG DLDCG F   +  SA  
Sbjct: 273 HWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYLGSAYD 332

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           +GL     ++ +L       +RLG FD  PS QPY  L   +V TP  Q+LAL+AA  GI
Sbjct: 333 QGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQQLALQAAEDGI 391

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYART 451
           VLLKN G  LPLS      VA+IGP ++ T  M GNY G A    +PL   Q  G   + 
Sbjct: 392 VLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG--FKV 447

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
            + QG  D+   +   F AAI A++ AD  I V G+D SIEAE              E++
Sbjct: 448 TYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAE--------------EIL 492

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
           + +S     P I+  M G  ID +   ++  + A++WAGYPGQ GGTAI +IL G + P 
Sbjct: 493 ANLST----PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTAPA 547

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           G+LP+T YP  Y+  + MT+M ++PS+    PGRTY++Y G  V+ +G+G+ YT F   +
Sbjct: 548 GRLPITQYPSNYVNQVTMTDMNLQPSRFN--PGRTYKWYNGEPVFEYGYGLQYTTFDAKI 605

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT--LGVQVDVKNVGSKDGAHTLL 689
             +             S N T     +    +    LT  + + + V N G+    +  L
Sbjct: 606 TPS-------------SPNNTFEISELLANASNYKDLTPFVKIPITVSNTGTTTSDYVAL 652

Query: 690 VF---STPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCK 729
            F   +  PA H  P K LVA+ ++H +  GA     +++++  
Sbjct: 653 FFLSGTFGPAPH--PKKSLVAYTRLHDITGGANATAEVSLNLAS 694


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 258/729 (35%), Positives = 383/729 (52%), Gaps = 32/729 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+ +E V    +     PR+G+  Y+ W+EALHGV++      
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GDF  +TSFPQ I+T ++ N TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+   R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAAD+IRAG D+DCG     H + +     +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  D+   +       +R G FDG+ S  PY ++   DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418

Query: 399 NQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           N G  LPL +    +T+A+IGP ++ T  M+GNY G A    +PLQ        I     
Sbjct: 419 NDG-ILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTIG 477

Query: 458 KDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
            +     D      A+  +++AD  I   G+D ++E EA DR+ +  P  Q  L++K++ 
Sbjct: 478 TNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLAD 537

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I++ M GG +D +  KN+  + A+IW GYPGQ+GG A+ADI+ G   P  +L  
Sbjct: 538 LGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVT 596

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T YP EY    P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YTNF        T
Sbjct: 597 TQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF--------T 646

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFS 692
             A    G     +  I  + +   H     +     L   VDVKN G +   +T + F 
Sbjct: 647 ASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFV 705

Query: 693 TPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
              AG  APH  K LV F+++  V  G+ + + I + V   L+  D  G R +  G + +
Sbjct: 706 NTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYEV 763

Query: 750 HIGGTKHSV 758
            +   +  V
Sbjct: 764 ALNNEREVV 772


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/716 (37%), Positives = 381/716 (53%), Gaps = 50/716 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +    +R   L+  + L EK++ L++ +   PR+G+  YEWWSEALHGV+   PG  
Sbjct: 41  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK-SPGVT 99

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    G  F  ATSF   I  +++F+  L EA+   +S EARA  N G AGL +W+PN+N
Sbjct: 100 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 159

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
            ++DPRWGRG ETPGED +   KY  + +RGL+GSD    K+ A+CKH+ A DL+ WNGV
Sbjct: 160 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVANCKHYAANDLERWNGV 219

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV---------NGVPTCADPN 269
            R++F+A V+ QD+ + +   F+ C  +  V S MC+YN +         NG P CA   
Sbjct: 220 TRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDLSWNGTPVCASKY 279

Query: 270 ILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPF 324
           ++   +R  W     N +I SDC++V   ++  H++ T EEAA  A  AG D  C    +
Sbjct: 280 LMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAYTAGTDTVCEVSNY 339

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                + A  RGLL E  ++ AL       +R G FDG     PY ++   DV TP+ ++
Sbjct: 340 DKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG--PDAPYRNITWADVNTPEARK 397

Query: 385 LALEAARQGIVLLKNQGPSLPLS----HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           LA  +A +G+VL KN G  LP+       + +TVA+IG   D    M+G Y+GIA    T
Sbjct: 398 LAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGEQMLGTYSGIAPFRNT 456

Query: 441 PLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           PL         +   G        + D     A++A+ QAD  +   G+D S+EAE  DR
Sbjct: 457 PLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGIDLSVEAEDRDR 516

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L  P  Q +L+S +S   K PT++V +     D A   N   I+AIIWAGYPGQ GGT
Sbjct: 517 YSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLDNK-NISAIIWAGYPGQDGGT 574

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR-----YPGRTYRFYKGP 613
           A  DI+ G + P G+LP+T YP +Y   +PMT+M +RPS+  +      PGRTYR+Y   
Sbjct: 575 AAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASNPGRTYRWYD-E 633

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC---NRLTL 670
            V+PFG G+ +TNF  +VA + +      D   G      S K I     KC   + L +
Sbjct: 634 AVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESG----CKSEKHID----KCSFPSSLEV 685

Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINI 725
            V  D K+  S   A   +     P  +  P K LVA+ K+H +  G  ++V + +
Sbjct: 686 SVTNDGKSTTSSYAALAFVRGEYGPKPY--PLKTLVAYGKLHDIAPGQTKKVKLEL 739


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/716 (37%), Positives = 382/716 (53%), Gaps = 50/716 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +    +R   L+  + L EK++ L++ +   PR+G+  YEWWSEALHGV+   PG  
Sbjct: 96  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK-SPGVT 154

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    G  F  ATSF   I  +++F+  L EA+   +S EARA  N G AGL +W+PN+N
Sbjct: 155 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 214

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
            ++DPRWGRG ETPGED +   KY  + +RGL+GSD    K+ A+CKH+ A DL+ WNGV
Sbjct: 215 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVANCKHYAANDLERWNGV 274

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV---------NGVPTCADPN 269
            R++F+A V+ QD+ + +   F+ C  +  V S MC+YN +         NG P CA   
Sbjct: 275 TRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDLSWNGTPVCASKY 334

Query: 270 ILKRTIRGEW---RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPF 324
           ++   +R  W     N +I SDC++V   ++  H++ T EEAA  A  AG D  C    +
Sbjct: 335 LMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAYTAGTDTVCEVSNY 394

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                + A  RGLL E  ++ AL       +R G FDG  +  PY ++   DV TP+ ++
Sbjct: 395 DKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA--PYRNITWADVNTPEARK 452

Query: 385 LALEAARQGIVLLKNQGPSLPLS----HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           LA  +A +G+VL KN G  LP+       + +TVA+IG   D    M+G Y+GIA    T
Sbjct: 453 LAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGEQMLGTYSGIAPFRNT 511

Query: 441 PLQGIGRYARTIHQQG--CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           PL         +   G        + D     A++A+ QAD  +   G+D S+EAE  DR
Sbjct: 512 PLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGIDLSVEAEDRDR 571

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L  P  Q +L+S +S   K PT++V +     D A   N   I+AIIWAGYPGQ GGT
Sbjct: 572 YSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLDNK-NISAIIWAGYPGQDGGT 629

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR-----YPGRTYRFYKGP 613
           A  DI+ G + P G+LP+T YP +Y   +PMT+M +RPS+  +      PGRTYR+Y   
Sbjct: 630 AAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASNPGRTYRWYD-E 688

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC---NRLTL 670
            V+PFG G+ +TNF  +VA + +      D   G      S K I     KC   + L +
Sbjct: 689 AVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESG----CKSEKHID----KCSFPSSLEV 740

Query: 671 GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVH-VPAGAQQRVGINI 725
            V  D K+  S   A   +     P  +  P K LVA+ K+H +  G  ++V + +
Sbjct: 741 SVTNDGKSTTSSYAALAFVRGEYGPKPY--PLKTLVAYGKLHDIAPGQTKKVKLEL 794


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 264/706 (37%), Positives = 387/706 (54%), Gaps = 27/706 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   +L+E V    +    VPRLG+  Y+ WSE+LHGV      ++
Sbjct: 81  CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYRANWASE 140

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GD+  ATSFPQ I T ++ N TL   IG ++S +ARA  N G  GL  ++PN+N FR 
Sbjct: 141 --GDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSFRH 198

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+G  R
Sbjct: 199 PVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLVATAKHYAGYDIENWDGHSR 258

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 259 LGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRETFG 318

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    A+AD+IRAG D+DCG     H  +A   G +
Sbjct: 319 FVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGTSYQYHFTNAFDEGEI 378

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  DI   ++   T  +RLG FDG  +S  Y  L   DV T D   ++ EAA +G VLLK
Sbjct: 379 SRQDIERGVIRLYTNLVRLGYFDG--NSSQYRDLTWSDVQTTDAWNISHEAAVEGTVLLK 436

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPL+    R+VA+IGP ++ T  M GNY G A   T+PL  +      +H     
Sbjct: 437 NDG-TLPLAD-SIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAALEASDLDVHYAFGT 494

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A+R+ADA I   G+D +IE EALDR  +  PG Q +L++++S   
Sbjct: 495 NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWPGNQLDLINQLSALG 554

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++  ++A++W GYPGQ+GGTA+ DI+ G   P G+L  T 
Sbjct: 555 K-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDIIRGVRAPAGRLVTTQ 613

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP  Y T  P  +M +RP+ +   PG+TY +Y G  VY FGHG+ YT F    A+  T  
Sbjct: 614 YPAGYATQFPAIDMGLRPNGTN--PGQTYMWYTGTPVYEFGHGLFYTTFEAKRASTAT-- 669

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
                  H S N      A    +A    R  L     + N G     +T ++F+   AG
Sbjct: 670 ------NHSSFNIEDLLTAPHPGYAYPQLRPFLNFTAHITNTGRTTSDYTAMLFANTTAG 723

Query: 698 HWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
             APH  K LV F+++  +  GA Q +   I +   ++  D  G R
Sbjct: 724 P-APHPNKWLVGFDRLGALEPGASQTMTFPITIDN-VARTDELGNR 767


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 263/745 (35%), Positives = 393/745 (52%), Gaps = 89/745 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    L   +RV+DL+ RL+L+EKV  +++   A+ RL I  Y WW+E LHG+      
Sbjct: 25  PFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR---- 80

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
           T++       T FPQ I  A++++A L + +   +SDE RA+YN  +A        GLTY
Sbjct: 81  TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLTY 135

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PNVNIFRDPRWGRGQET GEDP L+G    S+V GLQG D   LK AA  KH+  +  
Sbjct: 136 WTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDDSQYLKAAACAKHYAVH-- 193

Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              +G +  R  FN  V+  D+ DT+   FR  V++ KVA VMC+YN  +G P C +  +
Sbjct: 194 ---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCCGNNLL 250

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
           ++  +R +W   GY+ SDC ++  +Y  +H  + P+   AAADA+ +G D+DCG      
Sbjct: 251 MQEILRDKWGFTGYVTSDCGAIDDFY--RHHKTHPDAKYAAADAVYSGTDIDCGNEAYKA 308

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELAL 387
              AV+ GL++E  I+ +L     ++ RLGMFD  P+        P  V     H++LAL
Sbjct: 309 LVDAVKTGLITEEQIDISLKRLFEIRFRLGMFD--PAEDVKFSKIPLSVLESQPHKDLAL 366

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG- 446
           +  R+ IVLLKN+   LPLS  + + VAVIGPN+D  V+++GNY G      TP + I  
Sbjct: 367 KITRESIVLLKNENNFLPLSK-KLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKAIKN 425

Query: 447 --RYARTIHQQGCKDVACADDQLFGAAIDASRQA--DATILVMGLDQSIEAEAL------ 496
             +    I+++G   V  +++     A  A R    D  I   G+   +E E +      
Sbjct: 426 KLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVKIEG 485

Query: 497 ----DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
               DR  + LP  Q EL+  +  A + PT+ V+M+G  I   +   +  + AI+ A Y 
Sbjct: 486 FTGGDRTSIKLPKIQTELMQALK-AERIPTVFVMMTGSAIAAEWESQN--VPAILNAWYG 542

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           GQ  GTAIAD+LFG  NP GKLP+T+Y ++  ++LP          S     RTYR++ G
Sbjct: 543 GQDAGTAIADVLFGDYNPSGKLPVTFYTKD--SDLPAF-------NSYEMKNRTYRYFDG 593

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            V+YPFG+G+SYT F ++    P            SI A   G+ ++V+           
Sbjct: 594 QVLYPFGYGLSYTKFEYSPIQMP-----------ASIKA---GENMEVS----------- 628

Query: 673 QVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            + VKN G  DG   + ++            P   L +FE++ + AG  + V   +   +
Sbjct: 629 -ITVKNTGKTDGEEVVQLYISHDNNGTNRQLPLYALKSFERISLKAGESKSVTFKLS-PR 686

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGT 754
            +++ D  G  ++  G+  ++IGGT
Sbjct: 687 EMALADEDGVLKMTKGKSKLYIGGT 711


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 266/733 (36%), Positives = 400/733 (54%), Gaps = 29/733 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S    +R   LI   +L+E +    +    VPRLG+  Y+ W+EALHG+      TK
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            GG F  ATSFP  I T ++ N TL   I  ++S +ARA  N G  GL  ++PNVN FR 
Sbjct: 123 -GGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNGFRS 181

Query: 163 PRWGRGQETPGEDPV-LSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   LS  Y   Y+ G+QG  D ++LKVAA+ KHF  YDL+NWN   R
Sbjct: 182 PLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEQLKVAATVKHFAGYDLENWNNQSR 241

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S+MCSYN VNGVP+CA+   L+  +R  W 
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCANSFFLQTLLRESWG 301

Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+V   ++   + S    AAA ++RAG D+DCG     H   +   G +
Sbjct: 302 FPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I  ++       +RLG FD       Y  LG KDV   D   ++ EAA +GIVLLK
Sbjct: 362 TRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS  + R++A+IGP ++ T  M GNY G A    +PL+   +    ++ +   
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKKAGYHVNFELGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           ++A      F  AI A++++DA + + G+D +IE E  DR  +  PG Q +L+ ++S   
Sbjct: 477 EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++ ++ +++W GYPGQ+GG A+ DIL G   P G+L  T 
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLITTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY+   P  +M +RP   K  PG+TY +Y G  VY FG G+ YT F  T+A+ P  +
Sbjct: 596 YPAEYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYEFGSGLFYTTFKETLASHPKCL 654

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
                    + ++ +S      T+++        + ++KN G  +  +T ++F    +  
Sbjct: 655 KF-------NTSSILSAPHPGYTYSE-QIPVFTFEANIKNSGKTESPYTAMLFVRTSNAG 706

Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           PA +  P+K LV F+++  +  G   ++ I I V   L+ VD  G R +  G++ + +  
Sbjct: 707 PAPY--PNKWLVGFDRLADIKPGHSSKLSIPIPVSA-LARVDSYGNRIVYPGKYELAL-N 762

Query: 754 TKHSVSLHAATLG 766
           T  SV L    +G
Sbjct: 763 TDESVKLEFELVG 775


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 259/717 (36%), Positives = 387/717 (53%), Gaps = 25/717 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S+   +R   LI   +L+E +    + A  VPRLG+  Y+ WSE LHG+       K
Sbjct: 64  CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRAN-WAK 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            G ++  ATSFP  I + ++ N TL   I  +++ +ARA  N G  GL  ++PN+N FR 
Sbjct: 123 SGEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   LS  YA  Y+ GLQG  D + LK+ A+ KHF  YDL+NWN   R
Sbjct: 183 PLWGRGQETPGEDAGFLSSSYAYEYITGLQGGVDPEHLKIVATAKHFAGYDLENWNNNSR 242

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S MCSYN VNGVP+C+   +L+  +R  W 
Sbjct: 243 LGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLLRENWD 302

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+    ++   +      AAAD++RAG D+DCG     +   +   G +
Sbjct: 303 FPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDIDCGQTYPWYLNQSFIEGSV 362

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I  +L+   +  ++LG FDG  S   Y  LG  DV   D   ++ EAA +GIVLLK
Sbjct: 363 TRGEIERSLIRLYSNLVKLGYFDGNQSE--YRQLGWNDVVATDAWNISYEAAVEGIVLLK 420

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G  LPLS  + ++VAVIGP ++ T  + GNY G A    TPLQ        ++     
Sbjct: 421 NDG-VLPLSE-KLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAGYKVNYAFGT 478

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           ++       F AA+ A++++D  I + G+D +IEAE  DR  +  PG Q +L+ ++S   
Sbjct: 479 NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQLSQTG 538

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++  + A++W GYPGQ+GG AI DIL G   P G+L  T 
Sbjct: 539 K-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGKRAPAGRLVTTQ 597

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P T+M +RP   K  PG+TY +Y G  VY FG+ + YT F  T        
Sbjct: 598 YPAEYATQFPATDMNLRP-DGKSNPGQTYIWYTGKPVYEFGYALFYTTFKET-------- 648

Query: 639 AVPLDGRHGSINATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPA 696
           A  L      I+  I S ++    +++     + V   +KN G     +T ++F+ T  A
Sbjct: 649 AEKLASSSFDISDIIASPRSSSYAYSELVPF-VNVTATIKNTGKTASPYTAMLFANTTNA 707

Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           G    P+K LV ++++  +  G    + I + +   +S VD +G R +  G++ + +
Sbjct: 708 GPTPYPNKWLVGYDRLPSIEPGKSTELVIPVPIGA-ISRVDENGNRIVYPGDYQLAL 763


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 252/723 (34%), Positives = 387/723 (53%), Gaps = 25/723 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   L+   +L+E +    + +  VPRLG+  Y+ WSEALHG++       
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLARANFTDN 119

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G +  ATSFP  I +A++FN TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 120 --GAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTFRH 177

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  + + LK+AA+ KHF  YD++NW+   R
Sbjct: 178 PVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLAATAKHFAGYDIENWDNHSR 237

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              +  +++QD+ + +   F +   +  V S MCSYN VNGVP+C++   L+  +R  + 
Sbjct: 238 LGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLRDTFS 297

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAADAI AG D+DCG     H   ++  G +
Sbjct: 298 FVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESITTGAV 357

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +  DI    +      + LG FDG  SS  PY  LG  DV   D   ++ EAA +GIVLL
Sbjct: 358 ARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEGIVLL 417

Query: 398 KNQGPSLPL---SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           KN G +LPL   S  +++++A+IGP ++ T  + GNY G A    +P+        T+H 
Sbjct: 418 KNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGYTVHY 476

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
               +++      F AA+ A+R AD  + + G+D +IEAEA DR+ +  PG Q EL+S++
Sbjct: 477 APGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLELISQL 536

Query: 515 SM--ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
           +   +   P ++  M GG +D +  K++ ++ A++W GYPGQ+GG A+ DIL G   P G
Sbjct: 537 AAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSGGLALRDILTGARAPAG 596

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
           +L  T YP  Y  +    +M +RP+++ + PG+TY +Y G  VY FGHG+ YT F  + A
Sbjct: 597 RLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFYTTFNASSA 656

Query: 633 NAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
            A             + N T ++  A   T     R        + N G +D  +T LV+
Sbjct: 657 QAAKT--------KYTFNITDLTSAAHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVY 708

Query: 692 S-TPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHN 748
           + T  AG    P+K LV F+++   A       +N+ V    L+ VD +G   +  G + 
Sbjct: 709 ANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVAVDRLARVDEAGNTVLFPGRYE 768

Query: 749 IHI 751
           + +
Sbjct: 769 VAL 771


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 271/779 (34%), Positives = 401/779 (51%), Gaps = 89/779 (11%)

Query: 13  ILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG 72
           IL  + S  L A+E F            PF    LP+ QRV DL+ RL+L+EKVK +++ 
Sbjct: 10  ILACSMSLHLLAQEKF------------PFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNS 57

Query: 73  AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIG 132
              V RLGI  Y WW+E LHG+      TK+       T FPQ I  A+++N  L + + 
Sbjct: 58  TPPVERLGIPAYNWWNECLHGIGR----TKYH-----VTVFPQAIGMAAAWNDALIKEVA 108

Query: 133 RVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAA 184
             ++DE RA+YN             LTYW+PN+NIFRDPRWGRGQET GEDP L+ +   
Sbjct: 109 SSIADEGRAIYNDTQRKEDYSQYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGE 168

Query: 185 SYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           ++V+GLQG +   LK +A  KH+  +     N   R  FN+ VS  D+ DT+   FR  V
Sbjct: 169 AFVQGLQGDNPRYLKASACAKHYAVHSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLV 225

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           ++ KV+ VMC+YN   G P C +  +++  +R +W   GY+ SDC ++   ++       
Sbjct: 226 VDAKVSGVMCAYNAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPD 285

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
              AAADA+  G DLDCG    L    AV+ G+++E  ++ ++    T++ RLG+FD   
Sbjct: 286 AATAAADAVFHGTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFD-PV 344

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
               Y  +    +    HQ+LA + AR+ +VLLKN    LPL   + + V V+GPN+D  
Sbjct: 345 ELVDYARIPISILECRKHQDLAKQLARESMVLLKND-QLLPLQKNKLKKVVVMGPNADSR 403

Query: 425 VTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADA 480
            +++GNY G      TPLQ I    G +    + +G   V           ++ ++ ADA
Sbjct: 404 ESLLGNYNGNPSRMLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADA 463

Query: 481 TILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
            I + G+   +E E +          DR  + LP  Q +++ K  +A   PT+ V+M+G 
Sbjct: 464 VIFIGGISPRLEGEEMPVSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGS 522

Query: 531 PIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
            + + + A+N P   AI+ A Y GQ GG AIAD+LFG  NP GKLP+T+Y ++  ++LP 
Sbjct: 523 ALAIPWEAQNVP---AILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD--SDLPD 577

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
            E       S    GRTYR++ G  +YPFG+G+SYT+F ++    P V            
Sbjct: 578 FE-------SYDMQGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKVCRT--------- 621

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAF 708
               + K I+VT            V VKN G  +G   + L  S P      P   L  F
Sbjct: 622 ----TDKEIEVT------------VTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGF 665

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           +++ + AG  QRV  ++   + LS VD +G R++  G   I +GG+    +L A   GV
Sbjct: 666 KRIQLKAGEAQRVTFSLS-SEDLSCVDENGIRKVWAGTVKIQVGGSSPVATLAAPFKGV 723


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 263/724 (36%), Positives = 392/724 (54%), Gaps = 42/724 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  ++  EK+  L++ +  V RLG+  Y+WW+EALHGV++   G  
Sbjct: 39  CDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH-NRGIT 97

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
           +GG+F  AT FPQ ITT+++F+  L E IG ++S EARA  N G A L +W+PNVN FRD
Sbjct: 98  WGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPNVNPFRD 157

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ETPGED   + K+A ++V+G+QG  G   +V A+CKH+ AYDL+N     RF+
Sbjct: 158 PRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-GPTHRVIATCKHYAAYDLENSGSTTRFN 216

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+AKVS QD+ + +  PF+ C  + KV S+MCSYN VN +P CA+P ++   +R  W   
Sbjct: 217 FDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACANPYLMDTILRKHWNWT 276

Query: 283 G---YIVSDCDSVGVYY-----DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SA 332
               YIVSDCD+  VYY         +  +   A   ++ AG D  C    G   +  SA
Sbjct: 277 DEHQYIVSDCDA--VYYLGNANGGHRYKPSYAAAIGASLEAGCDNMCWATGGTAPDPASA 334

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
              G  S+  ++ A++  +   +  G FDG P    Y +L   DV T   Q+ AL+AA  
Sbjct: 335 FNSGQFSQTTLDTAILRQMQGLVLAGYFDG-PGGM-YRNLSVADVNTQTAQDTALKAAEG 392

Query: 393 GIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
           GIVLLKN G  LPLS +  +  VA+IG  ++    M+G Y+G       P+        T
Sbjct: 393 GIVLLKNDG-ILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPFNHDPVTAARSMGIT 451

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
           ++           D    AA++A+++++A +   G+D ++E E+ DR  +  P  Q  L+
Sbjct: 452 VNYVNGPLTQPNGDT--SAALNAAQKSNAVVFFGGIDNTVEKESQDRTSIEWPSGQLALI 509

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
            +  +A  G  ++V+  G  +D     + P + AI+WAGYPGQ GGTA+  I+ G ++P 
Sbjct: 510 RR--LAETGKPVIVVRLGTHVDDTPLLSIPNVRAILWAGYPGQDGGTAVVKIITGLASPA 567

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           G+LP T YP  Y +  P T MA+RPS S  YPGRTYR+Y    V+PFGHG+ YTNF  +V
Sbjct: 568 GRLPATVYPSSYTSQAPFTNMALRPSSS--YPGRTYRWYSN-AVFPFGHGLHYTNFSVSV 624

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
            + P   A         I   ++     V +         V ++V N G++   +  L F
Sbjct: 625 RDFPASFA---------IADLLASCGDSVAYLDLCPFP-SVSLNVTNTGTRVSDYVALGF 674

Query: 692 ST---PPAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
            +    P+ H  P K L  +++V ++  G  Q   ++  + + L  VD  G R +  G +
Sbjct: 675 LSGDFGPSPH--PIKTLATYKRVFNIEPGETQVAELDWKL-ESLVRVDEKGNRVLYPGTY 731

Query: 748 NIHI 751
            + +
Sbjct: 732 TLLV 735


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 258/730 (35%), Positives = 384/730 (52%), Gaps = 33/730 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+ +E V    +     PR+G+  Y+ W+EALHGV++      
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GDF  +TSFPQ I+T ++ N TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+   R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAAD+IRAG D+DCG     H + +     +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  D+   ++      +R G FDG+ S  PY ++   DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418

Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           N G  LPL  +    +T+A+IGP ++ T  M+GNY G A    +PLQ        I    
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477

Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
             +     D      A+  +++AD  I   G+D ++E EA DR+ +  P  Q  L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I++ M GG +D +  KN+  + A+IW GYPGQ+GG A+ADI+ G   P  +L 
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
            T YP EY    P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YTNF        
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
           T  A    G     +  I  + +   H     +     L   VDVKN G +   +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705

Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
               AG  APH  K LV F+++  V  G+ + + I + V   L+  D  G R +  G + 
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 764 VALNNEREVV 773


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 258/730 (35%), Positives = 384/730 (52%), Gaps = 33/730 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+ +E V    +     PR+G+  Y+ W+EALHGV++      
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GDF  +TSFPQ I+T ++ N TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+   R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAAD+IRAG D+DCG     H + +     +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  D+   ++      +R G FDG+ S  PY ++   DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418

Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           N G  LPL  +    +T+A+IGP ++ T  M+GNY G A    +PLQ        I    
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477

Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
             +     D      A+  +++AD  I   G+D ++E EA DR+ +  P  Q  L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I++ M GG +D +  KN+  + A+IW GYPGQ+GG A+ADI+ G   P  +L 
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
            T YP EY    P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YTNF        
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
           T  A    G     +  I  + +   H     +     L   VDVKN G +   +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705

Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
               AG  APH  K LV F+++  V  G+ + + I + V   L+  D  G R +  G + 
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 764 VALNNEREVV 773


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 273/784 (34%), Positives = 400/784 (51%), Gaps = 104/784 (13%)

Query: 4   TIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQ 63
           ++   FL LIL              +  P++ T +  PF   +L + +RV DL+GR++L+
Sbjct: 8   SLGMLFLTLILF------------LSFKPQNHTYK-YPFNNPALSMDERVADLVGRMTLE 54

Query: 64  EKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           EKV  +++ A A+ RLG+  Y WW+E LHGV+     T F       T +PQ I  A+++
Sbjct: 55  EKVSQMLNSAPAIERLGVPAYNWWNECLHGVAR----TPFK-----VTVYPQAIAMAATW 105

Query: 124 NATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGED 175
           + T    +G   ++E RA+YN            GLTYW+PN+NIFRDPRWGRGQET GED
Sbjct: 106 DKTSMHVMGDYTAEEGRAVYNESIKNDKHDIYLGLTYWTPNINIFRDPRWGRGQETYGED 165

Query: 176 PVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY----DLDNWNGVDRFHFNAKVSKQD 231
           P L+G+  +++V+GLQG D   LK A   KH+  +    DL       R  FN  +S  D
Sbjct: 166 PFLTGEMGSAFVKGLQGDDPRYLKAAGCAKHYAVHSGPEDL-------RHKFNTDISDYD 218

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           + DT+   FR  V++ KV  VMC+YN   G P C    ++   +  +W+  GY+ SDC  
Sbjct: 219 LWDTYLPAFRKLVVDAKVTGVMCAYNAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGG 278

Query: 292 VGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
           +  +Y      + P  E AAADA+  G D++CG         AV+ G LSE  I+ +L  
Sbjct: 279 IDDFYRENTHQTQPDAESAAADAVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKR 338

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
             +V+ +LGMFD   + + Y  +G   +  P H   AL+ A Q IVLLKN+G  LPLS  
Sbjct: 339 LFSVRFKLGMFDPADAVK-YNQIGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSK- 396

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGCKDVACADD 465
             + +AV+GPN+D  V+++GNY G      T LQGI          I+ +    VA +  
Sbjct: 397 NLKKIAVLGPNADNAVSVLGNYNGTPSRIVTALQGIKNKLPAGTEVIYDKAVDYVADSAA 456

Query: 466 QLFGAAIDAS-RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
           +   AA+ A  + ADA I + G+   +E E +          DR+ +LLPG Q EL+ K 
Sbjct: 457 RYNYAAMAAKVKDADAIIYIGGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KA 515

Query: 515 SMASKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
             A+  P + V+M+G  I   + A+N P   AI+ A Y GQA GTAIAD+LFG  NP G+
Sbjct: 516 LKATGKPVVFVMMTGSAIATPWEAENLP---AIVNAWYGGQAAGTAIADVLFGDYNPAGR 572

Query: 574 LPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
           LP+T+Y  +   +LP  T+ +M          RTYR++KG  +Y FG+G+SY+ F +   
Sbjct: 573 LPVTFYGSD--KDLPSFTDYSMD--------NRTYRYFKGKPLYAFGYGLSYSKFEYAPL 622

Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH-TLLVF 691
           +AP  +               +G+A            L V V V N    DG   T L  
Sbjct: 623 DAPLTLK--------------AGEA------------LTVHVKVTNKSKMDGEEVTELYL 656

Query: 692 STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           S          + L  FE+  + AG  + +   +     LS+ D +G      G+  I +
Sbjct: 657 SHIGIKQKTAIRALKGFERTLIKAGETKDITFKLSSAD-LSITDLNGNLVKASGKIAISV 715

Query: 752 GGTK 755
           GG++
Sbjct: 716 GGSQ 719


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 258/730 (35%), Positives = 383/730 (52%), Gaps = 33/730 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+ +E V    +     PR+G+  Y+ W+EALHGV++      
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GDF  +TSFPQ I+T ++ N TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 122 -AGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+   R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAAD+IRAG D+DCG     H + +     +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  D+   +       +R G FDG+ S  PY ++   DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418

Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           N G  LPL  +    +T+A+IGP ++ T  M+GNY G A    +PLQ        I    
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477

Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
             +     D      A+  +++AD  I   G+D ++E EA DR+ +  P  Q  L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I++ M GG +D +  KN+  + A+IW GYPGQ+GG A+ADI+ G   P  +L 
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
            T YP EY    P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YTNF        
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
           T  A    G     +  I  + +   H     +     L   VDVKN G +   +T + F
Sbjct: 647 TASASAGSGTKNRTSFNID-EVLGRPHPGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705

Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
               AG  APH  K LV F+++  V  G+ + + I + V   L+  D  G R +  G + 
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 764 VALNNEREVV 773


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 267/766 (34%), Positives = 412/766 (53%), Gaps = 85/766 (11%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           C        T PF    L   +RV+DL+ RL+L+EK+  +++ A A+ RLGI  Y+WW+E
Sbjct: 17  CAHTGKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNE 76

Query: 90  ALHGVSNVGPGTKFGGDFP-GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
            LHGV          G  P   T FPQ I  A++++  L++ +   ++DE RA+Y+   +
Sbjct: 77  CLHGV----------GRTPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAIS 126

Query: 149 --------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
                   GLTYW+PN+NIFRDPRWGRGQET GEDP L+G    ++V GLQG D   LK 
Sbjct: 127 KGVHEIYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDDPKYLKA 186

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           +A  KH+  +   +   + R  FN +VS  D+ DT+   FR  V++ KV+SVMC+YN + 
Sbjct: 187 SACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAYNALA 243

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDL 319
           G P C +  +++  +R +W+  GY+ SDC ++  +   +H T +    A+ADA+  G DL
Sbjct: 244 GQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDF--LKHKTHADAAHASADAVLHGTDL 301

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           +CG  + +    AV++GL++E  I+ ++      + RLG+FD  P+ +      P  V  
Sbjct: 302 ECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFD--PADRVKYADTPLSVLE 359

Query: 380 PD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
            D H+ LAL+ +R+ +VLLKN    LPL     + +AVIGPN+D +  ++GNY G     
Sbjct: 360 CDEHKALALKMSRESVVLLKNDN-VLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPSKV 417

Query: 439 TTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
            TPL+ I    G+  + I+ +    V  +D++   A I+  +  D  I V G+   +E E
Sbjct: 418 ITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRLEGE 477

Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
            L          DR  + LP  Q EL+ K+  A   P I V+M+G  + + +   +  I 
Sbjct: 478 ELPISVDGFRGGDRTTIALPEVQTELMKKMKEAGL-PVIFVMMTGSALGIEWESQN--IP 534

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+ A Y GQ  G AIAD+LFG  NP GKLP+T+Y  +  ++LP       P  +     
Sbjct: 535 AILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSD--SDLP-------PFGAFSMAN 585

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           RTYR++KG  +YPFG G+SYT F ++V   P VV+       G +     G+ IKV+   
Sbjct: 586 RTYRYFKGEALYPFGFGLSYTMFDYSV---PQVVS------GGKV-----GEPIKVS--- 628

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
                    V VKN+G K+G   + ++ +      AP   L  F++V++ AG ++ +   
Sbjct: 629 ---------VKVKNIGKKNGDEVVQLYLSHEGVEKAPITALKGFKRVYLKAGEEKTLSFE 679

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           I   + +S+ D +G   +  G+  I+ GG   S +  A + G++KS
Sbjct: 680 IS-PRDMSLPDDNGIITVFPGKKTIYAGGM--SPTSEAKSKGLVKS 722


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/723 (34%), Positives = 384/723 (53%), Gaps = 24/723 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   L+   +L+E +    + +  VPRLG+  Y+ WSEALHG+      T 
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGRAN-FTD 118

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            G    G  SFP  I +A++FN TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 119 NGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTFRH 178

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  + + LK+AA+ KHF  YD++NW+   R
Sbjct: 179 PVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLAATAKHFAGYDIENWDNHSR 238

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              +  +++QD+ + +   F +   +  V S MCSYN VNGVP+C++   L+  +R  + 
Sbjct: 239 LGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLRDTFS 298

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAADAI AG D+DCG     H   ++  G +
Sbjct: 299 FVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESITTGAV 358

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +  DI    +      + LG FDG  SS  PY  LG  DV   D   ++ EAA +GIVLL
Sbjct: 359 ARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEGIVLL 418

Query: 398 KNQGPSLPL---SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           KN G +LPL   S  +++++A+IGP ++ T  + GNY G A    +P+        T+H 
Sbjct: 419 KNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGYTVHY 477

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
               +++      F AA+ A+R AD  + + G+D +IEAEA DR+ +  PG Q EL+S++
Sbjct: 478 APGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLELISQL 537

Query: 515 SM--ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
           +   +   P ++  M GG +D +  K + ++ A++W GYPGQ+GG A+ DIL G   P G
Sbjct: 538 AAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSGGLALRDILTGARAPAG 597

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
           +L  T YP  Y  +    +M +RP+++ + PG+TY +Y G  VY FGHG+ YT F  + A
Sbjct: 598 RLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVYAFGHGLFYTTFNASSA 657

Query: 633 NAPTVVAVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
            A             + N T ++  A   T     R        + N G +D  +T LV+
Sbjct: 658 QAAKT--------KYTFNITDLTSAAHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVY 709

Query: 692 S-TPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHN 748
           + T  AG    P+K LV F+++   A       +N+ V    L+ VD +G   +  G + 
Sbjct: 710 ANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPVAVDRLARVDEAGNTVLFPGRYE 769

Query: 749 IHI 751
           + +
Sbjct: 770 VAL 772


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 259/729 (35%), Positives = 383/729 (52%), Gaps = 36/729 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   L    +L+E V    +   AVPRLG+  Y+ WSEALHG+        
Sbjct: 62  CDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLDRAN--LT 119

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             GD+  ATSFP  I   ++ N TL   IG ++S + RA  NGG  GL  ++PN+N FR 
Sbjct: 120 ESGDYSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPNINSFRH 179

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRF 221
           P WGRGQETPGED  L   Y   Y+ G+QG    R LK+AA+ KHF  YDL+NW    R 
Sbjct: 180 PVWGRGQETPGEDVQLCSIYGVEYITGIQGGLNPRDLKLAATAKHFAGYDLENWGNHSRL 239

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
             N  +S  D+   +   F   V + +V SVM SYN VNGVP+ A+  +L+  +R  W  
Sbjct: 240 GNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNGVPSSANSFLLQTLLRETWNF 299

Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
             +GY+ SDCD+V   ++   + S+   AAA +I+AG D+DCG    L+   ++    +S
Sbjct: 300 VEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNESLSHDEIS 359

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             +I  A+    +  + LG FDG+ S   Y HL   DV   D   ++ EAA +GIVLLKN
Sbjct: 360 RSEIERAVTRFYSTLVSLGYFDGDNSK--YRHLHWPDVVATDAWNISYEAAVEGIVLLKN 417

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKD 459
            G +LPLS+   R+VA+IGP ++VT T+ GNY G A   T PL  +      ++     +
Sbjct: 418 DG-TLPLSN-NTRSVALIGPWANVTTTLQGNYYGAAPYLTGPLAALQASNLDVNYAFGTN 475

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
           ++      F AA+ A+ +++  I   G+D ++EAE +DR  +  PG Q +L+ ++S   K
Sbjct: 476 ISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRESITWPGNQLQLIEQLSKLGK 535

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +++ M GG +D +  K +  + +++W GYPGQ+GG AI DIL G   P G+L +T Y
Sbjct: 536 -PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLTVTQY 594

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           P EY    P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F            
Sbjct: 595 PAEYALQFPATDMSLRPKGNN--PGQTYMWYTGKPVYEFGHGLFYTTF-----------K 641

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLT------LGVQVDVKNVGSKDGAHTLLVFST 693
           V L   HG+ N T       ++              +   V+V N G+    +T + F  
Sbjct: 642 VSLAHFHGAENGTSFDIVQLLSRPNAGYSVVEQIPFINYTVEVMNTGNVTSDYTAMAFVN 701

Query: 694 PPAG---HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
             AG   H  P+K LV F+++  +     Q + I I +   ++  D  G R +  G++ +
Sbjct: 702 TKAGPSPH--PNKWLVGFDRLGGISPRTTQTMTIPITLDN-VARTDERGNRIVYPGKYEL 758

Query: 750 HIGGTKHSV 758
            +   + +V
Sbjct: 759 TLNNERSAV 767


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 271/713 (38%), Positives = 395/713 (55%), Gaps = 65/713 (9%)

Query: 65  KVKLLISGAAA----VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           K  ++++G A+    V RLG+  Y+WW+EALHGV++   G  +GG F  AT FPQ IT++
Sbjct: 33  KSNVVLTGPASNSPGVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSS 91

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           ++F+  L E IG ++S EARA  N G A L +W+PNVN FRDPRWGRG ETPGED   + 
Sbjct: 92  AAFDDHLIERIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNK 151

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           K+A ++V+G+QG++    +V A+CKH+ AYDL+N     RF+F+AKVS QD+ + +  PF
Sbjct: 152 KWAEAFVQGMQGTESTH-RVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPF 210

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY- 296
           + C  + KV S+MCSYN VNGVP CA P ++   +R  W     N Y+VSDCD+  VYY 
Sbjct: 211 QQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDA--VYYL 268

Query: 297 ----DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNALVNT 350
                   + S+   A   ++ AG D  C    G   +  SA      ++  ++ A++  
Sbjct: 269 GNANGGHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQ 328

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
           +   ++ G FDG P+S  Y +L   DV T   ++ AL+AA +GIVLLKN    LPL+   
Sbjct: 329 MQGLVKAGYFDG-PNSL-YRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGG 385

Query: 411 HRT-VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQLF 468
             T VA+IG  ++    M+G Y+G       P+        T+ +  G      AD    
Sbjct: 386 SNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGPLTQTNADTS-- 443

Query: 469 GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
            AA++A++++   I   G+D ++E E+ DR  +  P  Q  ++ +++   K P I+V M 
Sbjct: 444 -AAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRM- 500

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           G  +D     + P + AI+WAGYPGQ GGTA+ +++ G ++P G+LP+T YP  Y    P
Sbjct: 501 GTHVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAP 560

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
            T MA+RPS S  YPGRTYR+YK P V+PFGHG+ YTNF  +VA        PLD     
Sbjct: 561 YTNMALRPSSS--YPGRTYRWYKDP-VFPFGHGLHYTNF--SVA--------PLD----- 602

Query: 649 INATISGKAIKVTHAKCNRLTL-------GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
             AT S   I    A C  +T         V V V N GS+   + +L F    AG + P
Sbjct: 603 FPATFS---IADLLASCKGVTYLELCPFPSVSVSVTNTGSRASDYVVLGFL---AGDFGP 656

Query: 702 H----KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
                K L  +++V  V  G  Q   ++  + + L+ VD  G R +  G + +
Sbjct: 657 TPRPIKSLATYKRVFDVQPGKTQSAELDWKL-ESLARVDGKGNRVLYPGTYTL 708


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 256/732 (34%), Positives = 387/732 (52%), Gaps = 90/732 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++  +L+G+++L+EK + L   ++AV RL +  Y WW+E LHGV+  G           A
Sbjct: 14  EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------A 63

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+ F+  L   I +V+S E RA YN  +         G+T+WSPNVNIFRD
Sbjct: 64  TVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRD 123

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+ +   ++V+GLQG +G  LK AA  KHF  +      G+ R  
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQG-EGKYLKAAACAKHFAVHS--GPEGL-RHE 179

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+A VSK+D+ +T+   F  CV EG V +VM +YN+ NG P C    +L+  +RG+W   
Sbjct: 180 FDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDILRGKWNFK 239

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G++VSDC ++  ++     TST  E+AA A++ G DL+CG    L    A + GL++E D
Sbjct: 240 GHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGNVY-LQLLLAYKEGLVTEED 298

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           I  A    +  ++RLGMFD E     Y  +  +     +H EL+L+AAR  +VLLKN G 
Sbjct: 299 ITTAAERLMATRIRLGMFDEECE---YNKIPYELNDCKEHNELSLKAARNSMVLLKNNGI 355

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC- 457
            LPL+    +++AVIGPN+D  + + GNY+G A  Y T L+GI    G   R  + +GC 
Sbjct: 356 -LPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRVYYSEGCH 414

Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
                 +++A  +D+L   AI  + ++D  IL +GLD +IE E         A D+A L 
Sbjct: 415 LFRDRVEELAEPNDRL-KEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAGDKASLN 473

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPGRQQEL+ K+ + +  P ILV+ +G  +   F   + + +AI+ A YPG  GG A+AD
Sbjct: 474 LPGRQQELLEKI-IETGTPVILVIGAGSAL--TFNNAEDKCSAILDAWYPGSRGGRAVAD 530

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           ++FG  +P GKLP+T+Y      +LP   + +M+         RTYR+     +YPFG+G
Sbjct: 531 LIFGKCSPSGKLPITFYRN--TKDLPEFIDYSMKD--------RTYRYMSCESLYPFGYG 580

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           ++Y+                          T+    + V   K +   + V V + N G+
Sbjct: 581 LTYS--------------------------TVKLSELHVPDVKSDFEDVEVSVKITNTGN 614

Query: 682 KDGAHTLLVFSTPPAGHWAPHKQ-LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
            D    +  +       +A     L  F++V +  G  +   + I    +  VV+  G R
Sbjct: 615 FDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGESKIAKMKIKKSSF-EVVNDDGER 673

Query: 741 RIPLGEHNIHIG 752
            +      + +G
Sbjct: 674 ILDSKRFKLFVG 685


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 256/720 (35%), Positives = 376/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  S     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDL--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D D  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDSNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A++ AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A   P I++ M GG +D +  KN+  ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T        R   +N       +  TH     +T    L    ++
Sbjct: 653 GLFYTTFAESSSNTTT--------REIKLNIQ---DILSQTHEDLASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           KN G  +  +T +VF+       AP+  K LV ++++  V  G  + + + I V  +  V
Sbjct: 702 KNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPIEVGSFARV 761


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 253/720 (35%), Positives = 380/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  +     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D +  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A++ AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A K P I++ M GG +D +  KN+ +++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH +   +T    L    ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSRTHEELASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           +N G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   + +E V    + +  VPRLG+  Y+ WSEALHG+       +
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G++  ATSFP  I T S+ N TL   I  +++ + RA  N G  GL  ++PN+N FR 
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
             WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YDL+NW+G  R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              +  +++Q++ + +   F +   + KV SVMCSYN VNGVP+CA+   L+  +R  + 
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+ SDCDS    ++   F +    AAAD+IRAG D+DCG     +   A     +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I   ++   +  +RLG FDG  S   Y  L   DV T D   ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPL+    R+VA+IGP  +VT  + GNY G A    +PL         ++     
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A++++D  I   G+D ++EAEA+DR  +  PG Q +L+ ++S   
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P I++ M GG +D +  K++  + ++IW GYPGQ+GG A+ DI+ G   P G+L +T 
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F H         
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644

Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
           ++P  G+   S N            A   ++  L   V + N G     +T ++F+   A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704

Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
           G    P+K LV F+++  +     Q + I + +   ++  D +G R +  G++ + +   
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763

Query: 755 KHSV 758
           +  V
Sbjct: 764 RSVV 767


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 257/730 (35%), Positives = 382/730 (52%), Gaps = 33/730 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+  L+ +E V    +     PR+G+  Y+ W+EALHGV++      
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAHADFSD- 121

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G F  +TSFPQ I+T ++ N TL   I  ++S + RA  N G  GL  +SPN+N FR 
Sbjct: 122 -AGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTFRH 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YD++NW+   R
Sbjct: 181 PVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLIATAKHYAGYDIENWDNHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              + ++++QD+ + +   F +   + KV SVMCSYN VNGVP+C++   L+  +R  + 
Sbjct: 241 LGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFD 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+  DC +V   ++   + +    AAAD+IRAG D+DCG     H + +     +
Sbjct: 301 FVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  D+   +       +R G FDG+ S  PY ++   DV + + Q L+ EAA Q IVLLK
Sbjct: 361 SRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTNAQNLSYEAAAQSIVLLK 418

Query: 399 NQGPSLPL--SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           N G  LPL  +    +T+A+IGP ++ T  M+GNY G A    +PLQ        I    
Sbjct: 419 NDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLISPLQAFQDSEYKITYTI 477

Query: 457 CKDVACADDQLF-GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVS 515
             +     D      A+  +++AD  I   G+D ++E EA DR+ +  P  Q  L++K++
Sbjct: 478 GTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLA 537

Query: 516 MASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLP 575
              K P I++ M GG +D +  KN+  + A+IW GYPGQ+GG A+ADI+ G   P  +L 
Sbjct: 538 DLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLV 596

Query: 576 MTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
            T YP EY    P  +M +RP+ S   PG+TY +Y G  VY FGHG+ YTNF        
Sbjct: 597 TTQYPAEYAEVFPAIDMNLRPNGSN--PGQTYMWYTGTPVYEFGHGLFYTNF-------- 646

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVF 691
           T  A    G     +  I  + +   H     +     L   VDVKN G +   +T + F
Sbjct: 647 TASASASSGTKNRTSFNID-EVLGRPHLGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAF 705

Query: 692 STPPAGHWAPH--KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
               AG  APH  K LV F+++  V  G+ + + I + V   L+  D  G R +  G + 
Sbjct: 706 VNTTAGP-APHPNKWLVGFDRLSAVEPGSAKTMVIPVTV-DSLARTDEEGNRVLYPGRYE 763

Query: 749 IHIGGTKHSV 758
           + +   +  V
Sbjct: 764 VALNNEREVV 773


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 256/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  S     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D D  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A+R AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A   P I++ M GG +D +  KN+  + A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH +   +T    L    ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           KN G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 761


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   + +E V    + +  VPRLG+  Y+ WSEALHG+       +
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G++  ATSFP  I T S+ N TL   I  +++ + RA  N G  GL  ++PN+N FR 
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
             WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YDL+NW+G  R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              +  +++Q++ + +   F +   + KV SVMCSYN VNGVP+CA+   L+  +R  + 
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+ SDCDS    ++   F +    AAAD+IRAG D+DCG     +   A     +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I   ++   +  +RLG FDG  S   Y  L   DV T D   ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPL+    R+VA+IGP  +VT  + GNY G A    +PL         ++     
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A++++D  I   G+D ++EAEA+DR  +  PG Q +L+ ++S   
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P I++ M GG +D +  K++  + ++IW GYPGQ+GG A+ DI+ G   P G+L +T 
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F H         
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644

Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
           ++P  G+   S N            A   ++  L   V + N G     +T ++F+   A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704

Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
           G    P+K LV F+++  +     Q + I + +   ++  D +G R +  G++ + +   
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763

Query: 755 KHSV 758
           +  V
Sbjct: 764 RSVV 767


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/724 (34%), Positives = 387/724 (53%), Gaps = 27/724 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S     R   L+   + +E V    + +  VPRLG+  Y+ WSEALHG+       +
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRANFTDE 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G++  ATSFP  I T S+ N TL   I  +++ + RA  N G  GL  ++PN+N FR 
Sbjct: 123 --GEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAFRS 180

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
             WGRGQETPGED   L+  YA  Y+ G+QG  D + LK+ A+ KH+  YDL+NW+G  R
Sbjct: 181 AMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLVATAKHYAGYDLENWDGHSR 240

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              +  +++Q++ + +   F +   + KV SVMCSYN VNGVP+CA+   L+  +R  + 
Sbjct: 241 LGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANSFFLQTLLRDTFG 300

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
              +GY+ SDCDS    ++   F +    AAAD+IRAG D+DCG     +   A     +
Sbjct: 301 FVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGTTYQYYFGEAFDEQEV 360

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  +I   ++   +  +RLG FDG  S   Y  L   DV T D   ++ EAA +GIVLLK
Sbjct: 361 TRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPL+    R+VA+IGP  +VT  + GNY G A    +PL         ++     
Sbjct: 419 NDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQNSDFDVNYAFGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           +++      F  A+ A++++D  I   G+D ++EAEA+DR  +  PG Q +L+ ++S   
Sbjct: 477 NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGNQLQLIDQLSQLG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P I++ M GG +D +  K++  + ++IW GYPGQ+GG A+ DI+ G   P G+L +T 
Sbjct: 537 K-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY T  P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F H         
Sbjct: 596 YPAEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF-H--------A 644

Query: 639 AVPLDGR-HGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
           ++P  G+   S N            A   ++  L   V + N G     +T ++F+   A
Sbjct: 645 SLPGTGKDKTSFNIQDLLTQPHPGFANVEQMPLLNFTVTITNTGKVASDYTAMLFANTTA 704

Query: 697 GHWA-PHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
           G    P+K LV F+++  +     Q + I + +   ++  D +G R +  G++ + +   
Sbjct: 705 GPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDS-VARTDEAGNRVLYPGKYELALNNE 763

Query: 755 KHSV 758
           +  V
Sbjct: 764 RSVV 767


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 260/732 (35%), Positives = 385/732 (52%), Gaps = 98/732 (13%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           +L+ +++LQE+ + L   + A+  L +  Y WW+E LHGV+  G           AT FP
Sbjct: 18  ELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67

Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWG 166
           Q I  A+ F+      I  ++S E RA YN        G   GLTYWSPNVNIFRDPRWG
Sbjct: 68  QAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPRWG 127

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
           RG ET GEDP L+ +   ++++GLQG +G  LK+AA  KHF  +     +G +  R  FN
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQG-EGKYLKLAACAKHFAVH-----SGPEGLRHEFN 181

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           A V K+D+ +T+   F  CV E  V SVM +YN+ NG P C    +LK  +RG+W   G+
Sbjct: 182 AVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGFKGH 241

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++  ++     TST  E+ A AI  G DL+CG  + L+   A + GL++E  I 
Sbjct: 242 VVSDCWALADFHLHHMITSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLVTEEQIT 300

Query: 345 NALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
            A    +T + +LGMFD   E +  PY      + C  +H E+AL A+R+ +VLLKN G 
Sbjct: 301 TAAERLMTTRFKLGMFDEDCEYNRIPY----EVNDC-KEHNEIALIASRKSMVLLKNDG- 354

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC- 457
           +LPL     +++AVIGPN++  + + GNY+G A  YTT L+GI    G   R  + +GC 
Sbjct: 355 TLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYSEGCH 414

Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
                 +D+A  DD+L   AI  + ++D  IL +GLD +IE E         A D+  L 
Sbjct: 415 LFKDKVEDLAGPDDRL-SEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDKESLN 473

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPGRQQ L+ KV    K P I+VL +G  +   F   + + AAI+ A YPG  GGTA+AD
Sbjct: 474 LPGRQQNLLEKVLEVGK-PVIVVLGAGSAL--TFNGAEEKCAAILNAWYPGSHGGTAVAD 530

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           ILFG  +P GKLP+T+Y      NLP  T+ +M+        GRTYR+ +   +YPFG+G
Sbjct: 531 ILFGKCSPSGKLPVTFYKD--TANLPEFTDYSMK--------GRTYRYLEHESLYPFGYG 580

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           ++Y+                           +    ++V   K +  +  + +D++N G+
Sbjct: 581 LTYSK--------------------------VELSNLQVPFVKADFESFDISIDIRNTGN 614

Query: 682 KDGAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
                 +  +       +A  +  L  F++V +  G  + V I +   +    V+  G R
Sbjct: 615 YGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELSK-RSFEAVNNDGER 673

Query: 741 RIPLGEHNIHIG 752
            +      + +G
Sbjct: 674 LLDSKSFKLFVG 685


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 262/735 (35%), Positives = 385/735 (52%), Gaps = 92/735 (12%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
             +R   L+  ++L+EKV   +  A A+ RLG+  Y +W+EALHGV+  G          
Sbjct: 13  FEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAGV--------- 63

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIF 160
            AT FPQ I  A+SF+  L   +   +S EARA +N            GLT+WSPNVNIF
Sbjct: 64  -ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFWSPNVNIF 122

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD- 219
           RDPRWGRG ET GEDP LSG+   S++RG+QG D   +KVAA  KHF  +     +G + 
Sbjct: 123 RDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDDERYMKVAACAKHFAVH-----SGPED 177

Query: 220 -RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  FNA VS+QD+ +T+   F  CV E  V +VM +YN+ NG   C    +L   +RGE
Sbjct: 178 QRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCGSKKLLVDILRGE 237

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
           W   G++ SDC ++  +++    T   EE  A A+ +G DL+CG  L +H   AV+ GL+
Sbjct: 238 WGFRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LYVHLLQAVRDGLV 296

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  I+ A+    T +M+LG+FD      PY  +G   V T  +++L  EA+R+ + LLK
Sbjct: 297 EESVIDRAVTRLFTTRMKLGLFD-RSEEVPYNGIGYDRVDTEANRKLNREASRRTVCLLK 355

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQ 454
           N    LPL   + RT+ V+GPN+D    ++GNY G A  Y T L GI   A    R ++ 
Sbjct: 356 NADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIRELAGDDVRVVYS 415

Query: 455 QGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDR 498
           +GC       + +   +D++   A   +  +D  I VMGLD  +E E         + D+
Sbjct: 416 EGCHLFRDRVQGLGQPNDRI-AEARAVAELSDVVIAVMGLDPGLEGEEGDQGNEFASGDK 474

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L LPG Q E++ K  + S  P +LVL+ G  + + +A+    + AI+ A YPG  GG 
Sbjct: 475 PNLELPGLQGEVL-KALVESGKPVVLVLLGGSALAIPWAEE--HVPAILDAWYPGAQGGR 531

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           A+AD+LFG + P GKLP+T+Y       LP  T+ +M+         RTYR+ K P +YP
Sbjct: 532 AVADVLFGRACPEGKLPVTFYRTS--EELPAFTDYSMK--------NRTYRYMKQPALYP 581

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT++  T   A            GS++  +  +A+                 ++
Sbjct: 582 FGYGLSYTSWELTNTTA-----------EGSVDDGVVCRAV-----------------LR 613

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           N G+  GA T+ V+   P     P+ QL    K+ +  G    V I++   +   V +  
Sbjct: 614 NTGAMAGAQTVQVYVKAPLAT-GPNAQLKGLRKIRLQPGESAEVAISLDK-EAFGVYNEK 671

Query: 738 GTRRIPLGEHNIHIG 752
           G R +  GE+ I+IG
Sbjct: 672 GLRVLLPGEYKIYIG 686


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 258/730 (35%), Positives = 386/730 (52%), Gaps = 94/730 (12%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           +L+ +++LQEK + L   + A+  L +  Y WW+E LHGV+  G           AT FP
Sbjct: 18  ELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67

Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWG 166
           Q I  A+ F+      +  +++ E RA YN        G   GLTYWSPN+NIFRDPRWG
Sbjct: 68  QAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPRWG 127

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
           RG ET GEDP L+ +   ++++GLQG +G  LK+AA  KHF  +     +G +  R  FN
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQG-EGKYLKLAACAKHFAVH-----SGPEGLRHEFN 181

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           A V+K+D+ +T+   F  CV E  V SVM +YN+ NG P C    +LK  +RG+W   G+
Sbjct: 182 AVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGFKGH 241

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++  ++     TST  E+ A AI  G DL+CG    L+   A + GL++E  I 
Sbjct: 242 VVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEEQIT 300

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A    +T + +LGMFD E     Y  +  +   + +H E+AL A+R+ +VLLKN G +L
Sbjct: 301 TAAERLMTTRFKLGMFDEECE---YNKIPYEVNDSREHNEVALIASRKSMVLLKNNG-TL 356

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC--- 457
           PL     +++AVIGPN++  + + GNY+G A  YTT L+GI    G   R  + +GC   
Sbjct: 357 PLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYSEGCHLF 416

Query: 458 ----KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLP 504
               +D+A  DD+L   AI  + ++D  +L +GLD +IE E         A D+  L LP
Sbjct: 417 KDKVEDLARPDDRL-SEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDKENLNLP 475

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQQ L+ KV    K P I+VL +G  + +  A  + + AAI+ A YPG  GGTA+ADIL
Sbjct: 476 GRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGA--EEKCAAILNAWYPGSHGGTAVADIL 532

Query: 565 FGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           FG  +P GKLP+T+Y       LP  T+ +M+        GRTYR+     +YPFG+G++
Sbjct: 533 FGKCSPSGKLPVTFYKD--TAKLPDFTDYSMK--------GRTYRYLGHESLYPFGYGLT 582

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y+                          T+    ++V   K    +  + +++KN G  D
Sbjct: 583 YS--------------------------TVELSNLQVPSVKQGFGSFDISIEIKNTGEYD 616

Query: 684 GAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
               +  +       +A  +  L  F++V +  G  + V I ++  K   VV+  G R +
Sbjct: 617 IEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLNK-KSFEVVNDDGERLL 675

Query: 743 PLGEHNIHIG 752
              +  + +G
Sbjct: 676 DSKKFKLFVG 685


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/720 (35%), Positives = 378/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  +     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D +  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A++ AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A K P I++ M GG +D +  KN+  ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH     +T    L    ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEDLASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           +N G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/720 (35%), Positives = 378/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  +     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D +  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A++ AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A K P I++ M GG +D +  KN+  ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH     +T    L    ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEDLASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           +N G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 RNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/737 (35%), Positives = 392/737 (53%), Gaps = 91/737 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  +L+ +++L+EKV   +  A ++PRL IK Y +W+EALHGV+  G           A
Sbjct: 12  QRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV----------A 61

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L E I   +S E R  +N            GLT+WSPNVNIFRD
Sbjct: 62  TVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPNVNIFRD 121

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP LSG     +V G+QG D   LK AA  KHF  +     +G +  R
Sbjct: 122 PRWGRGHETFGEDPFLSGTLGGRFVDGIQGHDETYLKAAACAKHFAVH-----SGPEDIR 176

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA+VS+QD+ +T+   F+  V E KV +VM +YN+ NG P C    +L+  +RGEW 
Sbjct: 177 HSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGSKTLLEDILRGEWE 236

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G++ SDC ++  +++    TS   E+ A A+  G DL+CG  L ++   AV+ GL+ E
Sbjct: 237 FVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LYVNLLQAVRDGLVEE 295

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             I+ AL+   T +M+LG+FD E  S P+  +    V T   +EL ++A+++ +VLLKN+
Sbjct: 296 ETIDTALIRLFTTRMKLGLFDKE-ESIPFNTITYDQVDTKSSKELNIKASKKCVVLLKNE 354

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
              LPL+  +  +V VIGPN++    ++GNY G A  Y T L+GI +      R    +G
Sbjct: 355 DNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQVVPEDVRVYFSEG 414

Query: 457 C-------KDVACADDQLFGAAIDA-SRQADATILVMGLDQSIEAE---------ALDRA 499
           C        +++  +D++  A + A    +D  I  +GLD  +E E         + D+ 
Sbjct: 415 CHLFKNKLSNLSQENDRI--AEVRAVCEHSDVVIACLGLDPGLEGEEGDQGNQFASGDKK 472

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LPG Q++++  +    K P IL+L+SG  + V +A  D  I AI+   YPG  GG A
Sbjct: 473 TLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAILQGWYPGAQGGRA 529

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           IA+++FG  NP GKLP+T+Y       LP  T+ AM+         RTYR+ K   +YPF
Sbjct: 530 IAELIFGDGNPEGKLPVTFY--RTTEELPEFTDYAMK--------NRTYRYMKNEALYPF 579

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G+G+SYT F HT+                 +N    GK    ++ +C        V VKN
Sbjct: 580 GYGLSYTTFEHTLL---------------YVNTDTLGKG---SNVEC-------MVRVKN 614

Query: 679 VGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
            G  +G+ T   +     G  AP+ QL   +KV +  G ++ + I +   +   + +  G
Sbjct: 615 TGDYEGSVTTQAY-VKYVGEDAPNCQLKGLKKVSLLPGEEKDIMIELDD-RAFGLYNEEG 672

Query: 739 TRRIPLGEHNIHIGGTK 755
              +  GE+ +++  ++
Sbjct: 673 EFILNQGEYELYLSDSQ 689


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 255/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  S     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 31  DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 90

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 91  LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 148

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D D  LK+AA+ KH+  
Sbjct: 149 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 208

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN V+GVP CAD  
Sbjct: 209 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADSY 268

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 269 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 328

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 329 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 388

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 389 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 448

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A+R AD  I   G+D ++EAEALDR  +
Sbjct: 449 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 508

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A   P I++ M GG +D +  KN+  + A++W GYPGQ+GG A+
Sbjct: 509 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 568

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 569 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 626

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH +   +T    L    ++
Sbjct: 627 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 675

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           KN G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 676 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 735


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 255/720 (35%), Positives = 377/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  S     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P WGRGQETPGED  L+  YA  Y+ G+QG D D  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN V+GVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++      F AA+ A+R AD  I   G+D ++EAEALDR  +
Sbjct: 475 RAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A   P I++ M GG +D +  KN+  + A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T   V L+ +            +  TH +   +T    L    ++
Sbjct: 653 GLFYTTFAESSSNT-TTKEVKLNIQ----------DILSQTHEELASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           KN G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 KNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 761


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 259/735 (35%), Positives = 379/735 (51%), Gaps = 88/735 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R   L+ +++L+EK+  ++S A A+PRLGI  Y WW+E +HGV   G           A
Sbjct: 13  ERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT----------A 62

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+SF+  L   +G  V  EAR  YN   +        GLT W+PNVNIFRD
Sbjct: 63  TVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRD 122

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+ +    +V G+QG D D L+ AA  KHF  +         R +
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVRFVEGMQGDDPDYLRAAACAKHFAVHSGPE---DQRHY 179

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+AKVS+QD+ +T+   FR  V E  V +VM +YN+ NG P C    +L   +RG+W   
Sbjct: 180 FDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLLVDILRGKWNFQ 239

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G++ SDC ++  +++    TS P ++ A A+  G DL+CG     + E AV  G + E  
Sbjct: 240 GHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGDLYA-YLEEAVAEGKVKEET 298

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           I+ +LV   T +M+LGMFD E    PY  +G   V + + Q L LE A + +VLLKN+  
Sbjct: 299 IDRSLVRLFTTRMKLGMFDAEEKV-PYNKIGYDAVDSREMQALNLEVAEKILVLLKNENH 357

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGCK 458
           +LPL   +   VAV+GPN+D    ++GNY G A  Y T L GI  Y     +  + +GC 
Sbjct: 358 TLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGEDVQVRYSEGCH 417

Query: 459 DVA------CADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLL 503
             A         ++L         + D  I  +GLD  +E E         + D+  L L
Sbjct: 418 LYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQFASGDKQSLSL 477

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG  QE V K  + S  P ++V++SG  + +  A+     AA++ A YPG  GG A+A  
Sbjct: 478 PG-NQESVLKACIESGKPVVVVVLSGSALALGTAQEGA--AAVLQAWYPGAQGGRAVARA 534

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           LFG  NP GKLP+T+Y  +   +LP  T+ AM+        GRTYR+ +   +YPFG+G+
Sbjct: 535 LFGECNPQGKLPVTFYHSD--EDLPAFTDYAMK--------GRTYRYMEKEPLYPFGYGL 584

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SY++F    A A      P DG                         + V+V V N G  
Sbjct: 585 SYSHFTFRDAKADAAQIGP-DG-------------------------VDVRVTVVNDGQY 618

Query: 683 DGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
            G  T+ V+  +  P     P+ QL A  KV +  G ++ V +++  C + ++ +  G  
Sbjct: 619 RGRETVEVYVKAERPG---TPNAQLKALAKVDLMPGEEKCVTLHLPQCAF-ALCNEEGIS 674

Query: 741 RIPLGEHNIHIGGTK 755
            +  GE+ + +GG++
Sbjct: 675 EVLPGEYTVWLGGSQ 689


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 259/733 (35%), Positives = 398/733 (54%), Gaps = 29/733 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  S    +R   LI   +L+E +    +    VPRLG+  Y+ W+EALHG+      TK
Sbjct: 63  CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            GG F  ATSFP  I + ++ N TL   I  ++S +ARA  N G  GL  ++PN+N FR 
Sbjct: 123 -GGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNINGFRS 181

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED  VL+  Y   Y+ G+QG  D + LK+AA+ KHF  YDL+NWN   R
Sbjct: 182 PLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIAATAKHFAGYDLENWNNQSR 241

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S MC+YN VNGVP+CA+   L+  +R  W 
Sbjct: 242 LGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFFLQTLLRESWG 301

Query: 281 LN--GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD+V   ++   + S    AAA ++RAG D+DCG     H   +   G +
Sbjct: 302 FPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESFVAGEV 361

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  +I  ++       +RLG FD       Y  LG KDV   D   ++ EAA +GIVLLK
Sbjct: 362 SRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 418

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS  + R++A+IGP ++ T  M GNY G A    +PL+   +    ++ +   
Sbjct: 419 NDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAGYQVNFELGT 476

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           + A      F  AI A++++DA I   G+D ++E E  DR  +  PG Q +L+ ++S   
Sbjct: 477 ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQLDLIKQLSELG 536

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++ ++ +++W GYPGQ+GG A+ DIL G   P G+L  T 
Sbjct: 537 K-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGRLVSTQ 595

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP +Y+   P  +M +RP   K  PG+TY +Y G  VY FG G+ YT F  T++ +    
Sbjct: 596 YPADYVHQFPQNDMNLRPD-GKSNPGQTYIWYTGKPVYQFGDGIFYTTFKETLSGSS--- 651

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
                G   ++++ ++      T+++   + L    +++N G  D  ++ ++F    +  
Sbjct: 652 ----KGLKFNVSSVLAAPHPGYTYSEQTPV-LTFTANIENSGKTDSPYSAMLFVRTANAG 706

Query: 695 PAGHWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           PA +  P+K LV F+++  +  G   ++ I I V   L+ VD  G R +  G++ + +  
Sbjct: 707 PAPY--PNKWLVGFDRLATIKPGHSSKLSIPIPVSA-LARVDSLGNRIVYPGKYELAL-N 762

Query: 754 TKHSVSLHAATLG 766
           T  S+ L    +G
Sbjct: 763 TDESIKLEFELVG 775


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 257/736 (34%), Positives = 389/736 (52%), Gaps = 89/736 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++   L+ +++L+EKV   ++ A A+ RLGIK Y WW+E LHGV+  G           A
Sbjct: 23  EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------A 72

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGT---AGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L E +G  VS EARA Y+     G T    GLT W+PN+NIFRD
Sbjct: 73  TIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRD 132

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +    Y+RGLQGS    LK AA  KHF  +     +G +  R
Sbjct: 133 PRWGRGHETYGEDPWLTSRLGIRYIRGLQGSHEKYLKTAACVKHFAVH-----SGPEELR 187

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+VS++D+ +T+   F  CV +G V +VM +YN+VNGVP C +  +L+  +R EW 
Sbjct: 188 HSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGNEYLLETILRKEWG 247

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +G++VSDC ++  +++    T +P E+ + A+  G DL+CG     +   AV+ G + E
Sbjct: 248 FHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLF-TYLIQAVKEGKVKE 306

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ A++   T +++LG         PY  +   +V +P  ++L   AA + +VLLKN 
Sbjct: 307 ERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLNRSAAGKSVVLLKNT 366

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQG 456
              LP+   R++T+ VIGPN+D    ++GNY G A  Y T L+GI       AR ++ +G
Sbjct: 367 EGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIREAAEPEARVLYSEG 426

Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           C         +   +D+L        R++D  I  MGLD ++E E           D+  
Sbjct: 427 CHLYKSNVSGLGARNDRL-SEVKGICRESDIVIACMGLDSTLEGEQGDTGNIYAGGDKPD 485

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG QQ+++ + +  S  P +LVL++G  + V +A  D  + AI+ A YPG  GG  +
Sbjct: 486 LMLPGLQQKIL-ETAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAILTAWYPGAEGGRGV 542

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           AD+LFGT NP G+LP+T+Y       LP  T  +M         GRTYRF K   +YPFG
Sbjct: 543 ADVLFGTVNPEGRLPVTFYRT--TEELPDFTNYSME--------GRTYRFMKQKALYPFG 592

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
            G+SYT F                          S   ++V+        + V++ V N 
Sbjct: 593 FGLSYTEF--------------------------SCSGLEVSERDSVDNGVEVKLCVANC 626

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           G + G  T+ V+      H   + QL A  KV +  G ++ + I++    + +V D +G 
Sbjct: 627 GERWGRETIQVYVGRKEDH-DRNPQLKAAVKVGLEPGEEKTISIHLPADAF-AVYDENGK 684

Query: 740 RRIPLGEHNIHIGGTK 755
           R I    + I  GG++
Sbjct: 685 RYIDACTYQIFAGGSQ 700


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 251/695 (36%), Positives = 369/695 (53%), Gaps = 32/695 (4%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           LI   +L E +    +    V RLG+  Y+ WSEALHG+          G +  ATSFPQ
Sbjct: 66  LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDRANFSDS--GSYNWATSFPQ 123

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGED 175
            I T ++ N TL   I  ++S + RA  N G  GL  ++PN+N FR P WGRGQETPGED
Sbjct: 124 PILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQETPGED 183

Query: 176 PVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
             L+  YA  Y+ G+QG D D  LK+AA+ KH+  YD++NW+   R   +  +++QD+ +
Sbjct: 184 VSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSE 243

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVSDCDSV 292
            +   F +   + KV SVMC+YN VNGVP CAD   L+  +R  +    +GY+ SDCD+ 
Sbjct: 244 YYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAA 303

Query: 293 GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
              Y+   + S+   AAA+AI AG D+DCG     H   ++  G LS  DI   ++   T
Sbjct: 304 YNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYT 363

Query: 353 VQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
             ++ G FD   +  + PY  L   DV   D   ++ +AA QGIVLLKN    LPL+   
Sbjct: 364 TLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKA 423

Query: 411 H----RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQ 466
           +     TVA+IGP ++ T  ++GNY G A    +P          ++      ++     
Sbjct: 424 YPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTS 483

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV-SMASKGPTILV 525
            F AA+ A+R AD  I   G+D ++EAEALDR  +  PG Q +L+ K+ S A   P I++
Sbjct: 484 GFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVL 543

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
            M GG +D +  KN+  + A++W GYPGQ+GG A+ DI+ G  NP G+L  T YP  Y  
Sbjct: 544 QMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAE 603

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
             P T+M +RP      PG+TY++Y G  VY FGHG+ YT F  + +N  T   V L+ +
Sbjct: 604 EFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTFAESSSNT-TTKEVKLNIQ 660

Query: 646 HGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
                       +  TH +   +T    L    ++KN G  +  +T +VF+       AP
Sbjct: 661 ----------DILSQTHEELASITQLPVLNFTANIKNTGKLESDYTAMVFANTSDAGPAP 710

Query: 702 H--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           +  K LV ++++  V  G  + + + + V  +  V
Sbjct: 711 YPVKWLVGWDRLGDVKVGETRELRVPVEVGSFARV 745


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 263/725 (36%), Positives = 392/725 (54%), Gaps = 101/725 (13%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++  +L+ +++L+EK   L   + A+ RLGI  Y WW+EALHGV+  G           A
Sbjct: 9   KKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT----------A 58

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+ F+      I   ++ EARA YN  +         GLT WSPN+NIFRD
Sbjct: 59  TVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRD 118

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY----DLDNWNGV 218
           PRWGRG ET GEDP LSGK   ++++GLQG D D +  AA  KHF AY    DL      
Sbjct: 119 PRWGRGHETYGEDPFLSGKLGVAFIKGLQG-DKDVMMTAACVKHFAAYSGPEDL------ 171

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  FNA+V+K+D+ +T+   F  CV + KV +VM  YN+ NG P C    +L+  +R +
Sbjct: 172 -RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYTLLRDILREK 230

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
           W   G++VSDC ++  ++     T TPEE+ A AI AG DL+CG    L    A+Q GL+
Sbjct: 231 WGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGNMY-LMLLIALQEGLI 289

Query: 339 SEIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           +E  I  A V   T + +LG+F+G E  + PY     + V   +H+E+A+EAAR+  VLL
Sbjct: 290 TEEHITRAAVRIFTTRFKLGLFEGSEFDNIPY-----EVVECSEHKEMAIEAARKSAVLL 344

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
           KN G  LP++    +T+ VIGPN++  + + GNY G +  Y T L+GI    G   R ++
Sbjct: 345 KNDG-ILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGDEVRVLY 403

Query: 454 QQGCKDV-------ACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
             GC+ V       A A+D+L   A+  +  +D  +L +GLD++IE E         + D
Sbjct: 404 SNGCELVKDRTEVLAYANDRL-AEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNNGGSGD 462

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           +  L LP  Q+ L+ K+ +A+  PT+L LM+G  I++++A        + W  YPG  GG
Sbjct: 463 KKDLDLPEVQKSLLEKI-VATGKPTVLCLMAGSAINLSYAHEHCNGILLTW--YPGARGG 519

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
            A+ADILFG ++P GKLP+T+Y    + NL P+T+ +M+         RTYR+ +   +Y
Sbjct: 520 KAVADILFGNASPSGKLPVTFYRS--LDNLPPITDYSMK--------NRTYRYIEEAPLY 569

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVD 675
           PFG+G++Y +               ++ +H  I  T+   K I +T    NR ++ V+ +
Sbjct: 570 PFGYGLTYGD---------------VELKHVEIKGTVEIEKDIYITVTLQNRGSVAVE-E 613

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           V     KD      V +T           L AF +V + A  +++V + I     L VV+
Sbjct: 614 VVQAYIKDEQSMYAVTNT----------SLCAFMRVGLGANEEKQVSMRIPF-DSLKVVN 662

Query: 736 RSGTR 740
             G +
Sbjct: 663 LDGEK 667


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 256/725 (35%), Positives = 382/725 (52%), Gaps = 28/725 (3%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D K+   ++   C V+    +R   L+  +  QEK++ L+S +  V RLG+  Y WW EA
Sbjct: 23  DCKNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEA 82

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHGV+   PG  F G +  ATSFP  +  +++F+  L   I  V+ +EARA  NGG A +
Sbjct: 83  LHGVAG-APGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPV 141

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
            +W+P++N FRDPRWGRG ETPGED +    Y  S + GL+G    R K+ A+CKH+  Y
Sbjct: 142 DFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR-KIIATCKHYVGY 200

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
           D++NWNG DR HF+AK++ QD+ + F  PF+ C  + KV S MCSYN VNGVPTCAD  +
Sbjct: 201 DVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYV 260

Query: 271 LKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
           L+  +R  W     N YI SDC++V        + +T +EA A A   G+DL C      
Sbjct: 261 LEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLSCEYSGTS 320

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
               A  +GLL+   I+ AL       +  G FDG  ++  Y HLG +D+ TP+ Q+L L
Sbjct: 321 DIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAAT--YAHLGVQDINTPEAQKLVL 378

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           + A +G+ LLKN   +LPLS      VA++G  ++ T  + G Y+G A    TP+    +
Sbjct: 379 QVAAEGLTLLKNDD-TLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHTPVYAGNK 437

Query: 448 YA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
                 +        + A D     A++A++++D  +   GLD S  AE  DR  +  P 
Sbjct: 438 LGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDISWPS 497

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            Q +L++K  +A+ G  ++V+  G  +D         + ++IWA +PGQ GGTA+  ++ 
Sbjct: 498 AQIDLITK--LAALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQVIT 555

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G     G+LP+T YP EY T L M +M MRP  +   PGRTYR+Y    V PFG G+ YT
Sbjct: 556 GEHAIAGRLPITQYPAEY-TQLSMLDMNMRPGGNN--PGRTYRWYN-ESVQPFGFGLHYT 611

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
            F     ++  +          ++N     K+    H     +   ++V V N G++   
Sbjct: 612 KFAAKFGSSSGL----------TVNIQDIMKSCTKDHPDLCDVP-PIEVAVTNEGNRTSD 660

Query: 686 HTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
              L F     G    P K LV++ ++   +G+Q ++         LS VD+SG      
Sbjct: 661 FIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVDQSGNLVAYP 720

Query: 745 GEHNI 749
           GE+ +
Sbjct: 721 GEYTL 725


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 265/733 (36%), Positives = 386/733 (52%), Gaps = 99/733 (13%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           +L+ +++LQEK + L   + AV RL I  Y WW+E LHGV+  G           AT FP
Sbjct: 31  ELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATVFP 80

Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWG 166
           Q I  A+ F+      +  +++ E RA YN  +         GLTYWSPNVNIFRDPRWG
Sbjct: 81  QAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPRWG 140

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
           RG ET GEDP L+ +   ++V+GLQG DG  LK++A  KHF  +     +G +  R  FN
Sbjct: 141 RGHETYGEDPYLTSRLGVAFVKGLQG-DGKYLKLSACAKHFAVH-----SGPESLRHEFN 194

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           A VS++D+ +T+   F  CV E  V SVM +YN+ NG P C    +LK  +RG+W   G+
Sbjct: 195 AVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILRGKWGFKGH 254

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++  ++     TST  E+ A AI  G DL+CG    L+   A + GL++E  I 
Sbjct: 255 VVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEEQIT 313

Query: 345 NALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
            A    +T + +LGMFD   E +  PY      + C  +H +++LEA+R+ +VLLKN G 
Sbjct: 314 TAAERLMTTRFKLGMFDEDCEYNQIPY----EVNDC-KEHNQVSLEASRKSMVLLKNNGI 368

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQGC- 457
            LPL   + + VAVIGPN++  + + GNY+G A  YTT L GI        R  + +GC 
Sbjct: 369 -LPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYSEGCH 427

Query: 458 ------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
                 +D+A  DD+L   A+  + +AD  IL +GLD +IE E         A D+  L 
Sbjct: 428 LYKEKVEDLARRDDRL-AEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDKLDLN 486

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQEL+ KV    K P ++VL +G  + +  A  + R AAI+ A YPG  GGTA AD
Sbjct: 487 LPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGA--EERCAAILNAWYPGSHGGTAAAD 543

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGH 620
           ILFG  +P GKLP+T+Y       LP  T+ AM+        GRTYR+  +   +YPFG+
Sbjct: 544 ILFGKCSPSGKLPVTFYKD--TDKLPEFTDYAMK--------GRTYRYMDESNCLYPFGY 593

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G++Y+    +    P V     DG                         + + V+++N G
Sbjct: 594 GLTYSTVELSNLQVPAVRG-EFDG-------------------------IDISVEIENTG 627

Query: 681 SKDGAHTLLVFSTPPAGHWAP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           S D    +  +       +A  +  L  F++V +  G  + V + ++  +    VD +G 
Sbjct: 628 SYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMKLNR-RAFEAVDDAGE 686

Query: 740 RRIPLGEHNIHIG 752
           R +   +  + +G
Sbjct: 687 RILDSKKFKLFVG 699


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 235/600 (39%), Positives = 339/600 (56%), Gaps = 62/600 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ R++L EK+  +   A+A+PRL I  Y WW+EALHGV+  G           AT FPQ
Sbjct: 17  IVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATVFPQ 66

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGR 167
            I  A+ F++ + E IG V+S E RA YN            GLT+WSPNVNIFRDPRWGR
Sbjct: 67  AIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPRWGR 126

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
           GQET GEDP L+ + A +++RG+QG DG  LK AA  KHF  +     +G +  R  F+A
Sbjct: 127 GQETYGEDPYLTARLAVAFIRGIQG-DGKYLKAAACAKHFAVH-----SGPEALRHEFDA 180

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
           +VS++D+ +T+   F+  V E +V  VM +YN+VNGVP CA   +L   +R EW   G++
Sbjct: 181 RVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILRSEWGFEGHV 240

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSD +++   +   H+ +      A A++AG +L C   +  H  S+V  GL+SE +I  
Sbjct: 241 VSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIARHLRSSVDEGLISEDEITE 299

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           A+    T ++ +GM        PY  +G ++  TP+H +LA+EAA +  VLLKN G  LP
Sbjct: 300 AVERLFTTRIMMGMM---ADDCPYDSIGYEENDTPEHHQLAVEAASRSFVLLKNDGL-LP 355

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC---- 457
           L   +  ++AVIGPN++    + GNY G A  Y T L+GI    G   R  + +GC    
Sbjct: 356 LEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVWYSEGCHLYK 415

Query: 458 ---KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
                ++  +D+L   A+ A++ AD  +L +GLD ++E E         + D+  L LPG
Sbjct: 416 NFHSSLSGRNDRL-AEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGDKPNLSLPG 474

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           RQQ L+  +    K P IL+L SG  + +   +ND  + AI+   YPG  GG A+AD+LF
Sbjct: 475 RQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIWYPGAMGGKAVADVLF 533

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G   P GKLP+T+Y       LP  E            GRTYR+ KG  +YPFG+G++Y+
Sbjct: 534 GRRAPAGKLPVTFYAS--ADELPAFE-------DYSMAGRTYRYMKGNALYPFGYGLTYS 584


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 254/720 (35%), Positives = 376/720 (52%), Gaps = 32/720 (4%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           D ++   R+   C  S     R   LI   +L E +    +    V RLG+  Y+ WSEA
Sbjct: 57  DCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEA 116

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGL 150
           LHG+          G +  ATSFPQ I T ++ N TL   I  ++S + RA  N G  GL
Sbjct: 117 LHGLDRANFSDS--GSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGL 174

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTA 209
             ++PN+N FR P  GRGQETPGED  L+  YA  Y+ G+QG D D  LK+AA+ KH+  
Sbjct: 175 DVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAATAKHYAG 234

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           YD++NW+   R   +  +++QD+ + +   F +   + KV SVMC+YN VNGVP CAD  
Sbjct: 235 YDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSY 294

Query: 270 ILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            L+  +R  +    +GY+ SDCD+    Y+   + S+   AAA+AI AG D+DCG     
Sbjct: 295 FLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQW 354

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS--SQPYGHLGPKDVCTPDHQEL 385
           H   ++  G LS  DI   ++   T  ++ G FD   +  + PY  L   DV   D   +
Sbjct: 355 HLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNI 414

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRH----RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           + +AA QGIVLLKN    LPL+   +     TVA+IGP ++ T  ++GNY G A    +P
Sbjct: 415 SYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISP 474

Query: 442 LQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
                     ++      ++  +   F AA+ A++ AD  I   G+D ++EAEALDR  +
Sbjct: 475 RVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESI 534

Query: 502 LLPGRQQELVSKV-SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
             PG Q +L+ K+ S A   P I++ M GG +D +  KN+  ++A++W GYPGQ+GG A+
Sbjct: 535 AWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFAL 594

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            DI+ G  NP G+L  T YP  Y    P T+M +RP      PG+TY++Y G  VY FGH
Sbjct: 595 RDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGH 652

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDV 676
           G+ YT F  + +N  T        R   +N       +  TH     +T    L    ++
Sbjct: 653 GLFYTTFAESSSNTTT--------REIKLNIQ---DILSQTHEDLASITQLPVLNFTANI 701

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPH--KQLVAFEKV-HVPAGAQQRVGINIHVCKYLSV 733
           KN G  +  +T +VF+       AP+  K LV ++++  V  G  + + + + V  +  V
Sbjct: 702 KNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGEVKVGETRELRVPVEVGSFARV 761


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 258/745 (34%), Positives = 392/745 (52%), Gaps = 96/745 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++ ++L+ +++L+EK   L   A A+PRLG+  Y WW+EALHGV+  G           A
Sbjct: 9   EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------A 58

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
           TSFPQ I  A++F+  L + +G  V+ E RA YN  +         GLT+WSPNVNIFRD
Sbjct: 59  TSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRD 118

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           PRWGRG ET GEDP L+ +   +YV GLQGS D D +K AA  KHF  +     +G +  
Sbjct: 119 PRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQDDDFMKTAACAKHFAVH-----SGPESV 173

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+A+ SK+D+ +T+   F  CV E  V +VM +YN+ NG P C  P +++  +R EW
Sbjct: 174 RHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQNILREEW 233

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
              G+ VSDC ++  ++     T TPEE+AA A+++G D++CG    LH   A Q+GL++
Sbjct: 234 DFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCG-VTYLHLLKAYQQGLVT 292

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
           E +I  A     T +  LG FD       Y  +  + V   +H ELA + A++ +VLLKN
Sbjct: 293 EEEITQAAERLFTTRFLLGCFD----KNEYDDIPYEVVECKEHLELAQKMAKESMVLLKN 348

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
            G  LPL+    +T+ VIGPN+D    ++GNY G +  Y T L+GI    G   R  + +
Sbjct: 349 DG-ILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGEDVRVYYSE 407

Query: 456 GC---KD----VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           GC   KD    +    D++   A+  +  +D  +L +GLD+++E E         + D+ 
Sbjct: 408 GCHIYKDRVEGLGWKQDRI-SEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSYASGDKK 466

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q+EL+  V+   K P +L +MSG  ID+ FA     + AI+   YPG  GG A
Sbjct: 467 DLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAE--HVNAILQVWYPGARGGKA 523

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPF 618
            A+ILFG  +P GKLP+T+Y    +   P  E  +M+        GRTYR+ +   +YPF
Sbjct: 524 AAEILFGACSPSGKLPVTFYKD--LEGFPAFEDYSMK--------GRTYRYLEKEPLYPF 573

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G+G++Y         A  +     +G+  +I A +       T          +QV +K+
Sbjct: 574 GYGLTYG---QVCVKAAELTGAVEEGKELTIKAMVENSGKYDTDDV-------IQVYIKD 623

Query: 679 VGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           + SK+                 P+  L AF++V +  G +  + + +   + L  VD  G
Sbjct: 624 LDSKNA---------------VPNHSLCAFKRVSLKKGEKAEILLKVPY-EALMAVDEEG 667

Query: 739 TRRIPLGEHNIHIGGTK---HSVSL 760
            + +      + +G ++    S+SL
Sbjct: 668 KKYVDSSHFVLSVGTSQPDDRSISL 692


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 255/732 (34%), Positives = 389/732 (53%), Gaps = 28/732 (3%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +    +R   LI   +L+E +    +    VPRLG+  Y+ W+EALHG+      TK
Sbjct: 64  CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 123

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            GG+F   TSFP  I + ++ N TL   I  ++S +ARA  N G  GL  ++PN+N FR 
Sbjct: 124 -GGEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNINGFRS 182

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           P WGRGQETPGED  VL+  Y   Y+ G+QG  D + LK+AA+ KHF  YDL+N+N   R
Sbjct: 183 PLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIAATAKHFAGYDLENYNNQSR 242

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A +++QD+ + +   F       K  S MC+YN VNGVP+C++   L+  +R  W 
Sbjct: 243 LGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFFLQTLLRESWG 302

Query: 281 L--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
               GY+ SDCD++   ++  ++ ++   AAAD+++AG D+DCG     H   +   G +
Sbjct: 303 FPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDIDCGQTYPWHLNESFVAGTV 362

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  +I  ++       +RLG FD       Y  LG KDV   D   ++ EAA +GIVLLK
Sbjct: 363 SRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEGIVLLK 419

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G +LPLS  + R++A+IGP  + T  + GNY G A    +PLQ   +    ++ +   
Sbjct: 420 NDG-TLPLSK-KVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAGYEVNYELGT 477

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
            +       F  AI A++++DA I + G+D +IE E  DR  +  PG Q +L+ ++S   
Sbjct: 478 GINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSEVG 537

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K++ ++ +++W GYPGQ+GG A+ DIL G   P G+L  T 
Sbjct: 538 K-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKRAPAGRLVSTQ 596

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY+      +M +RP   K+ PG+TY +Y G  VY FG G+ YT F  T+    T+ 
Sbjct: 597 YPAEYVHQFAQNDMNLRPD-GKKNPGQTYIWYTGKPVYQFGDGLFYTTFKETLGKQSTL- 654

Query: 639 AVPLDGRHGSINAT-ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPA 696
                      NA+ I G                   +++N G     ++ + F  T  A
Sbjct: 655 ---------KFNASQILGAGHPGYTYSEQTPVFTFTANIQNSGKTASPYSAMAFVRTSNA 705

Query: 697 G-HWAPHKQLVAFEKV-HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
           G    P+K LV F+++  +  G    + I I +   LS VD +G + +  G++ + +  T
Sbjct: 706 GPKPYPNKWLVGFDRLATIKPGHSSTLSIPIPLNA-LSRVDSNGNKIVYPGKYEL-VLNT 763

Query: 755 KHSVSLHAATLG 766
             SV L    +G
Sbjct: 764 DESVKLEFELVG 775


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 262/750 (34%), Positives = 383/750 (51%), Gaps = 86/750 (11%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T   PF    LP+  RV D+I RL+L+EKV+L+   A AVPRLGI  Y WW+EALHGV
Sbjct: 21  AQTYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGV 80

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA- 148
           +               T FPQ I  A++F+    + +G + S E RA++N     G T  
Sbjct: 81  ARTKEK---------VTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGE 131

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLTYW+PN+NIFRDPRWGRGQET GEDP L+ K  ++ V GL+G++ + LK  A  K
Sbjct: 132 IYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNNPEYLKSVACAK 191

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+  +     N   R  ++A+VS  D+ DT+   FR  V + KV  VMC+YN+  G P C
Sbjct: 192 HYAVHSGPEHN---RHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEGTPCC 248

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
               +L+  +R +W+ +GY+ SDC +V  +       S   EA ADA+  G DL+CG  L
Sbjct: 249 GHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTEAVADAVLNGTDLECGN-L 307

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
               +  V++GL+SE DIN +L     +Q +LGM+D  P+ + PY  +G + +    H++
Sbjct: 308 YQKLQQGVEKGLISEKDINVSLARLFEIQFKLGMYD--PADRVPYASIGREVIECDAHKK 365

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
            A E A++ +VLLKN    LPL+  + + +A+IGPN D   T++ NY G      TP + 
Sbjct: 366 HAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITPYKS 425

Query: 445 I-GRYARTIHQQGCKDVACADDQL----FGAAIDASRQADATILVMGLDQSIE------- 492
           +  R+  +I       V           F      +++AD  I V G+    E       
Sbjct: 426 LQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAGDAG 485

Query: 493 ------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
                   + DR  + LP  Q EL+ ++    + P ILV MSG  +  +F        AI
Sbjct: 486 AAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSVM--SFDWESRNADAI 542

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           + A Y GQA G AI D+LFG  NP G++P+T Y         M +  +   +      RT
Sbjct: 543 LQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTY---------MNDEDLPDFEDYSMANRT 593

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
           YR++KG V YPFG+G+SYT F +  + NA TV                +G++I+VT    
Sbjct: 594 YRYFKGDVRYPFGYGLSYTTFGYAPLQNASTVK---------------TGESIQVT---- 634

Query: 666 NRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGI 723
                     V N G + G   + L  S P  G+   P + L  F+++H+  G  ++V  
Sbjct: 635 --------TTVTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKGFKRIHLDTGESRQVTF 686

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            +   + LS+VD  G +    G   ++IGG
Sbjct: 687 TLS-PEELSLVDEKGNQVEKEGTVELYIGG 715


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 260/723 (35%), Positives = 380/723 (52%), Gaps = 37/723 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN---VGP 99
           C  SL    R+  LI +++++EK   L+ GA  +PR+G+  YEWW+EALHGV+    V  
Sbjct: 146 CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRIGLPPYEWWNEALHGVAGSRGVSF 205

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
            +  G DF  ATSFP  I   ++F+  L   +  ++  EARA  N   +G  +W+PN+N 
Sbjct: 206 DSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASIIGKEARAFANYAHSGYDFWTPNMNT 265

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNG 217
           F DPRWGRG E P ED   + +Y AS V GLQG     D  ++ A+CKHF  YD++    
Sbjct: 266 FLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQGGKEKTDHKQIIATCKHFAVYDVE---- 321

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +R   N + + QD+ + +   F+ CV +  V S+MCSYN V GVP CA    L+  +R 
Sbjct: 322 TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACASEYFLQDVLRD 381

Query: 278 EWRLNG---YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAV 333
           +W  N    Y+ SDC++V   +   +FT T   AAA A+ AG D +CG  +L L+T  A 
Sbjct: 382 QWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTDTNCGTSYLQLNTSVAN 441

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
                +++DI  +L         +G FDG+P    Y  L   DV TP  Q  A  AA +G
Sbjct: 442 NWTTEAQMDI--SLTRLYNALFTVGYFDGQPE---YDGLSFADVSTPFAQATAYRAASEG 496

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           I LLKN G  LPL    + +VA+IGP ++ T  M G Y GIA    +PL         I 
Sbjct: 497 ITLLKNDG-LLPLKK-SYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAAQAQWGHIS 554

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                 +   +   F +A+ A+R AD  I   G+D SIE E+ DR  +  PG Q +LV +
Sbjct: 555 FTNGTAINSTNTTGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWPGNQLDLVQQ 614

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P ++V   GG +D +    +  + +++WAGYPGQ GG+A+ D+L G  +P G+
Sbjct: 615 LSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVLVGKQSPAGR 673

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           L +T YP +YI  + + +  +RPS S   PGRTY++Y    V PFG+G+ YT F    A 
Sbjct: 674 LTITQYPADYINQISLFDPNLRPSDSS--PGRTYKWYNKEPVLPFGYGLHYTTFEFDWAK 731

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK--NVGSKDGAHTLLVF 691
           AP             I + +   A   T  K N  +   ++ +K  N GS    +  LVF
Sbjct: 732 APQASY--------DIASLVDSTASYTTSPKKNDASPWTELSIKVHNSGSLGSDYVGLVF 783

Query: 692 -STPPAGHWA-PHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
             TP AG    P+K L ++ ++H + AGA   +  ++ +   L+  D  G   I  G++ 
Sbjct: 784 LRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLSLGA-LARGDEHGDLIIYPGDYE 842

Query: 749 IHI 751
           + I
Sbjct: 843 VQI 845


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 258/735 (35%), Positives = 384/735 (52%), Gaps = 95/735 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++  +L+ +++LQE+ + L   A A+  L I  Y WW+E LHGV+  G           A
Sbjct: 14  EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------A 63

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+ F+  L E I  +++ E RA YN  +         GLT+WSPNVNIFRD
Sbjct: 64  TVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRD 123

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+ +   ++V+GLQG D   LK+AA  KHF  +      G+ R  
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQG-DEKYLKIAACAKHFAVHS--GPEGL-RHE 179

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           FNA VSK+D+ +T+   F  CV E  V +VM +YN+ N  P C    +LK  +RG+W+  
Sbjct: 180 FNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDILRGKWQFK 239

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G++VSDC ++  ++     TST  E+AA AI+ G DL+CG    L    A + GL++E D
Sbjct: 240 GHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGNVY-LQMLLAYKEGLVTEED 298

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQ--PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           I  A    +  ++RLGMFD E      PY     K     +H E++L A+R+ IV+L+N 
Sbjct: 299 ITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCK-----EHHEVSLMASRKSIVMLRNN 353

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRYARTIHQQ 455
           G  LPL   + +++ +IGPN+D  + + GNY G A  Y T L+GI         R  + +
Sbjct: 354 GL-LPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIRIFYSE 412

Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           GC       +D+A  DD++   A+  +  +D  IL +GLD SIE E         A D+ 
Sbjct: 413 GCHLYKDRVQDLAEPDDRM-AEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGAGDKL 471

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LPG+QQEL+ KV +A+  P I+VL +G  +       +   AAI+ A YPG  GG A
Sbjct: 472 NLNLPGKQQELLEKV-IATGKPVIVVLGAGSAL--TLQGQEENCAAILNAWYPGSFGGRA 528

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           IAD++FG  +P GKLP+T+Y       LP  T+ +M+         RTYR+ K   +YPF
Sbjct: 529 IADLIFGKCSPSGKLPVTFYKT--TEELPEFTDYSMK--------NRTYRYMKNESLYPF 578

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G G++Y+  V     + + ++   +G                         + V + + N
Sbjct: 579 GFGLTYSK-VQLSDLSVSDISKDFEG-------------------------VEVSIKISN 612

Query: 679 VGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           VG+ D    L  +       +A  +  L AF++V +  G  + V + I+  +   VV+  
Sbjct: 613 VGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVKMTINK-RAFEVVNDE 671

Query: 738 GTRRIPLGEHNIHIG 752
           G R +   +  + +G
Sbjct: 672 GDRILDSKKFKLFVG 686


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 260/733 (35%), Positives = 391/733 (53%), Gaps = 104/733 (14%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++   L+ +++L EK   L   AA V RLG+  Y +W+EALHGV+  G           A
Sbjct: 13  KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------A 62

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+ F+    + +G +++ E RA YN  +A        GLT+WSPNVNIFRD
Sbjct: 63  TMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRD 122

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +    +V G+QG DG  +K AA  KH+  +     +G +  R
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVKFVEGIQG-DGPVMKAAACAKHYAVH-----SGPESLR 176

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+ S +D+ +T+   F   V E  V +VM +YN+ NG P CA   +++  +RG+W+
Sbjct: 177 HEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDVLRGKWK 236

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G+  SDC ++  +++    TSTP ++AA A+ AG DL+CG    LH   A Q GL++E
Sbjct: 237 FEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTY-LHMMGAYQDGLVTE 295

Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             I  + V  LT +  LG+FDG E    PY  +  K     +H + AL+ AR+  VLLKN
Sbjct: 296 EKITESAVRLLTTRYLLGLFDGSEYDKIPYSVVECK-----EHIDEALKMARKSCVLLKN 350

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
            G  LP+   +  T+ VIGPN+D    +IGNY G +  Y T L+GI    G   R ++ Q
Sbjct: 351 DG-VLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRILYSQ 409

Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           GC       +++A   D++  A I A   +D  IL +GL++++E E         + D+ 
Sbjct: 410 GCDLYKDKVENLAWDQDRISEAVITA-ENSDVVILCVGLNETLEGEEGDTGNSDASGDKV 468

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q+EL+ KV+   K PTI+VLM+G  ID+ +A+++     I+ A YPG  GG A
Sbjct: 469 DLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGILLAWYPGARGGRA 525

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           IAD+LFG  +P GKLP+T+Y    +  +P  T+ +M+         RTYR+ +   +YPF
Sbjct: 526 IADLLFGKESPSGKLPITFYKD--LEGMPEFTDYSMK--------NRTYRYMEKEALYPF 575

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G+G++Y++   T A     V+   D                          + ++  VKN
Sbjct: 576 GYGLTYSDTCVTEAEVVGEVSAESD--------------------------IVLKATVKN 609

Query: 679 VGSKDGAHTLLVF----STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
            G+ D    + V+     +P A     +  L  F++V + AG ++ V   I   K +++V
Sbjct: 610 NGTVDTDEVVQVYIKDLDSPLA---VRNYSLCGFKRVSLKAGEEKSVEFTIS-NKAMNIV 665

Query: 735 DRSGTRRIPLGEH 747
           D  G R I  G+H
Sbjct: 666 DEDGNRYIA-GKH 677


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 259/751 (34%), Positives = 376/751 (50%), Gaps = 87/751 (11%)

Query: 42  FCQVSLPIPQ--------RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           FCQ  +PI +        RV DLI RL+L EKV LL   + AVPRL I  Y WW+E LHG
Sbjct: 23  FCQ-QIPIYRNPDKKLSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHG 81

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA----- 148
           V+  G           AT FPQ I  A++F+  L + +  VVS EARA YN  TA     
Sbjct: 82  VARAGE----------ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHL 131

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLT+WSPN+NIFRDPRWGRGQET GEDP L+ K   +YV GLQG+D   LK +A+ K
Sbjct: 132 QYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPLHLKTSATAK 191

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           HF A+        +R +F+A V ++D+ DT+   F+  V +G V S+M +YN+VNGVP  
Sbjct: 192 HFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLV-DGGVESIMTAYNRVNGVPNS 247

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
            +  ++   +  EW   G++V+DC ++   Y T        E AA AI+AG+DLDC    
Sbjct: 248 INKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSIF 307

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                +A+   LL+E  ++ AL   L+ Q +LG FD  PSS P+   G   +    H  L
Sbjct: 308 QTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVML 366

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           A + A++ +VLLKN    LPL    + ++ V+GPN+     ++ +Y G++      ++GI
Sbjct: 367 ARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGI 426

Query: 446 ----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA------ 495
                +  R  +  G       D   FG  I  +  AD T+ V+GL   +E EA      
Sbjct: 427 TAAVDKGTRVEYDLGAD---YRDTTHFG-GIWGAGNADVTVAVIGLTPVLEGEAGDAFLS 482

Query: 496 ---LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
               D+  L LP      +  +  + K P I V+ SG  +D+A     P   A+I A YP
Sbjct: 483 QTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIA--PYADAVILAWYP 540

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYK 611
           G+ GG A+ADILFG  +P G LP+T+Y    + +LP     +M+        GRTYR++ 
Sbjct: 541 GEQGGNALADILFGKISPSGHLPLTFYNS--VNDLPAYNNYSMK--------GRTYRYFA 590

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
           G V YPFG G+SYT F +     P                       K +++  +  T+ 
Sbjct: 591 GAVQYPFGFGLSYTTFNYQWQQQP-----------------------KTSYSAKD--TIQ 625

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           + V VKN G+      +  +   P  +  P K+L  F+++ +  G+     I+I V +  
Sbjct: 626 LSVVVKNTGNISADEVVQAYIGYPTLNRMPLKELKGFKRITLNKGSTSLASISIPVTELQ 685

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
                     +  G + +++G       L A
Sbjct: 686 KWNSSKHQFELYPGNYTVYLGSDSEDKKLSA 716


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 256/749 (34%), Positives = 391/749 (52%), Gaps = 84/749 (11%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
            L    R   L+  +++ EK+  L     A+PRL +  Y WW+EALHG++  G       
Sbjct: 34  ELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK------ 87

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNV 157
               AT FPQ I   ++F+  L + +   +SDEARA Y         G  AGLT+W+PNV
Sbjct: 88  ----ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTFWTPNV 143

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP+L+ +   ++V+GLQG D   LK A   KHF  +     +G
Sbjct: 144 NIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDDPKYLKSAGVAKHFAVH-----SG 198

Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
            +  R  F+ + SK+D+ +T+   F   V + KVA VMC+YN V G P+CA   +L   +
Sbjct: 199 PESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQPSCASEFLLGEML 258

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           + +W+ NGY+VSDC ++  ++     T    E+AA A+RAG+DL+CG       ++A + 
Sbjct: 259 KKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAAFEE 318

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGI 394
           GL+++  I+  L N L ++ RLG+FD  PS   P+  +G + + + +H ELA + A + I
Sbjct: 319 GLITQSLIDQRLKNLLMIRFRLGLFD--PSELNPHNAIGQEVIHSLEHIELARKVAAKSI 376

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYART 451
           VLLKN+   LPLS    +   V GP +  +  ++GNY GI+    T L+GI        +
Sbjct: 377 VLLKNEKQVLPLSK-DIKVPYVTGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLGSS 435

Query: 452 IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---------DRAGLL 502
           ++ +       ++      A + ++ ADA I V+G+   +E E +         DR  + 
Sbjct: 436 LNYRAGALPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVAIT 495

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LP  Q + V +++   KGP ILV+ +G P+D+  ++ DP   AI+W  YPG+ GG A+AD
Sbjct: 496 LPQNQVDYVKQLAENKKGPLILVVAAGSPVDI--SELDPLADAILWIWYPGEQGGNAVAD 553

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           ++FG +NP G LP+T+   + I +LP       P       GRTY+F K   +YPFG G+
Sbjct: 554 VIFGDTNPSGHLPLTFV--KTIDDLP-------PFDDYTMTGRTYKFLKKLPLYPFGFGL 604

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SYT F                           GK      A      + + V+V+N  + 
Sbjct: 605 SYTQFKF-------------------------GKLSLSKRAPQEGENINISVEVENSTAL 639

Query: 683 DGAHTLLVFSTP--PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           DG   + V+ +P  P  + A    L AF++VH+ A  ++ +   I   K L  V+ +G  
Sbjct: 640 DGETVVQVYLSPQVPLKNEAI-TNLKAFKRVHIGAYEKRLIEFTIE-GKNLYRVNDAGEN 697

Query: 741 RIPLGEHNIHIGG---TKHSVSLHAATLG 766
             P G + + +G    +K S+ L AA  G
Sbjct: 698 VWPSGAYTLAVGDSLPSKRSIELGAAPHG 726


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 221/490 (45%), Positives = 299/490 (61%), Gaps = 15/490 (3%)

Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
           Y+ SDCD+V    D  H+T +PE+  A +I+AG+D++CG +  +H  +AVQ+G L+E DI
Sbjct: 16  YVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKDI 75

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQP-YGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           + ALVN   V+MRLG FDG+P S   YGHLG  DVC+P H+ LALEAA+ GIVLLKN   
Sbjct: 76  DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 135

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDV 460
           +LPL      ++AVIGPN+D    + GNY G  C  TTPLQGI  Y   R     GC   
Sbjct: 136 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 195

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
           ACA      AA  AS  +D  +L MGL Q  E E LDR  LLLPG QQ L++ V+ A++ 
Sbjct: 196 ACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARR 254

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P ILVL++GGP+DV FAK++P+I AI+ AGYPGQAGG AIA +LFG  NP G+LP+TWYP
Sbjct: 255 PVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYP 314

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--------A 632
           +E+ T +PMT+M MR   +  YPGR+YRFY+G  VY FG+G+SY+ F   +        A
Sbjct: 315 EEF-TKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNA 373

Query: 633 NAPTVVAVPLDGRHGSINATISGKAIK-VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
              +++A  +  R G     +S   +K +   +C+RL     V+V+N G  DG H++L++
Sbjct: 374 GNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMY 433

Query: 692 ST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
              P      P +QL+ F   HV  G +  V   +  C++ S V   G R I  G H + 
Sbjct: 434 LRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLM 493

Query: 751 IGGTKHSVSL 760
           +G  +   SL
Sbjct: 494 VGDEELETSL 503


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 265/721 (36%), Positives = 376/721 (52%), Gaps = 95/721 (13%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R   L+ +++++EK   L   A A+ RLGI  Y WW+EALHGV+  G           A
Sbjct: 8   KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------A 57

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I   ++F+  L   I  V++ E RA YN   A        GLT+WSPNVNIFRD
Sbjct: 58  TVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRD 117

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +   ++V+GLQG DG+ +K AA  KHF  +     +G +  R
Sbjct: 118 PRWGRGHETYGEDPCLTSRLGVAFVKGLQG-DGETMKAAACAKHFAVH-----SGPEAVR 171

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA+ S +D+E+T+   F   V E  V +VM +YN+ NG   CA P +L++ +R +W 
Sbjct: 172 HEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCASP-VLQKILREDWG 230

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G+ VSDC ++  +++    T+T +E+AA AI +G DL+CG    LH   A + GL+SE
Sbjct: 231 FEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTY-LHILHAYRDGLVSE 289

Query: 341 IDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             I  A V   T +  LG+FDG E    PY  +  K     +H  LA +AA +  VLLKN
Sbjct: 290 ETITEAAVRLFTTRFLLGLFDGSEYDDIPYTVVESK-----EHLALAEKAALESAVLLKN 344

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
            G  LPL   R RTV VIGPN+D    + GNY G A  Y T  QG+  Y     R +   
Sbjct: 345 NGI-LPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRVLTSV 403

Query: 456 GC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           GC       + +A A D+L  A I  +  +D  IL +GLD+++E E         + D+ 
Sbjct: 404 GCALSEDRTEKLALAGDRLAEAQI-VAENSDVVILCLGLDETLEGEEGDTGNSYASGDKE 462

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            LLLP  Q++L+  V+   K P +L +MSG  +D+++A       AI+   YPG  GG+A
Sbjct: 463 TLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAE--HFDAILQLWYPGSQGGSA 519

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
            A +LFG  +P GKLP+T+Y  E +  LP  E            GRTYR+   P  YPFG
Sbjct: 520 AAKLLFGEVSPSGKLPVTFY--ETLEELPAFE-------DYSMKGRTYRYMGHPAQYPFG 570

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
            G++Y +   T AN                   I G + +        LTL V  +    
Sbjct: 571 FGLTYGDVRVTDAN-------------------IRGASAE------GDLTLAVTAENAGN 605

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
              D    + V  T  A +  P+  L AF ++H+ AG ++ + + +   +  +VVD +G 
Sbjct: 606 AVTDEVLQIYVKCTDSA-NAVPNPALAAFGRIHLEAGEKKTIEMTVP-ARAFTVVDEAGV 663

Query: 740 R 740
           R
Sbjct: 664 R 664


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 249/728 (34%), Positives = 393/728 (53%), Gaps = 75/728 (10%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RVNDLI R++L+EK+  + +  AA+PRLGI  Y++WSEALHGV+  G         
Sbjct: 14  PLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-------- 65

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN------GGT---AGLTYWSPNVN 158
             AT FPQ I  A++++  L E +   ++ EARA ++      G T    GLT WSPN+N
Sbjct: 66  --ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTMWSPNIN 123

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           IFRDPRWGRGQET GEDP L+G+  A++VRGLQG D   LK AA  KH+T +   +    
Sbjct: 124 IFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQGKDPHYLKTAACAKHYTVH---SGPEK 180

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
           +R  FNA V+++++ DT+   F+  V E KV +VM +YN+  G P C  P +LK  +R +
Sbjct: 181 ERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSPYLLKEILRNQ 240

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGL 337
           W   G++VSDC ++  ++     T    E+AA  I+ G D+ C   +   +   A+ RGL
Sbjct: 241 WGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYENLTEALNRGL 300

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E DI++AL NTL  + +LG+FD +    PY H+    V    H++LA E A +  VLL
Sbjct: 301 ITEEDIDHALRNTLRTRFKLGLFDPQ-EKVPYAHISMSVVGCEAHRKLAYETAVKSAVLL 359

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG-IGRYARTIHQQG 456
           KN    LP+     +++ ++GPN+     ++GNY G++   TT ++G +GR    +  + 
Sbjct: 360 KNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVGRLPEGVRMEF 418

Query: 457 CKDVACADDQLFGA--AIDASRQADATILVMGLD---QSIEAEAL-----DRAGLLLPGR 506
                  D +      ++ ++   D  I  MGL    +  E EA+     DR  + LP  
Sbjct: 419 MPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNGDREDIALPKA 478

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           QQE +    +A+ G  I+++++GG   +A    +  + AI+W GYPGQ GG AIAD++FG
Sbjct: 479 QQEYIR--DLAATGAKIVLVLTGGSA-IALNGIEDLVEAILWVGYPGQEGGRAIADLIFG 535

Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTN 626
             +P GKLP+T+         P++   + P +      RTYR+     ++PFG G+SYT 
Sbjct: 536 DHSPSGKLPITF---------PVSTDQLPPFREYSMKERTYRYMTSSPLFPFGFGLSYTQ 586

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +            L   H  ++A   G+A++ T             ++ NVG  +G  
Sbjct: 587 FEYK----------NLQLEHPVLSA---GEALRGT------------FELANVGEYEGEE 621

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
            + V+ S   A    P ++L++F++V +  G   ++   I   + + ++D  G + +  G
Sbjct: 622 VVQVYLSDLEASTIVPLQKLISFQRVRLKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPG 680

Query: 746 EHNIHIGG 753
           +  + IGG
Sbjct: 681 KFKLTIGG 688


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 266/719 (36%), Positives = 379/719 (52%), Gaps = 91/719 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +RV DL+ +++L+EKV  L   A AV RLGI  Y WW+EALHGV+  G           A
Sbjct: 14  ERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG----------AA 63

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ I  A+ F+  L E IG V + E RA Y+        G   G+T+WSPN+NIFRD
Sbjct: 64  TVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFWSPNINIFRD 123

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+G+   +Y++G+QG +G RLK AA  KHF A+         R  
Sbjct: 124 PRWGRGHETYGEDPCLTGRMGTAYIKGMQG-NGKRLKAAACVKHFAAHSGPEKG---RHS 179

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           FN+ VSK+D+ +T+   F  CV E  V  VM  YN++NG   C   +++   +R +W  +
Sbjct: 180 FNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSHHLITEILREKWGFD 239

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           GY VSDC ++  ++     T TP+E+AA A+++G DL+CG    LH  SA  +GL+S  D
Sbjct: 240 GYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGAVY-LHVMSAYNQGLVSAED 298

Query: 343 INNALVNTLTVQMRLGMFD--GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           I+ A+ + +  +MRLGMFD   E    PY      + C  +H  LAL+AA + +VLLKN 
Sbjct: 299 IDRAVTHLMMTRMRLGMFDQHTEFDEIPY----EINDCA-EHHGLALKAAEESMVLLKND 353

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQG 456
           G  LPL     +TVAVIGPN D    + GNY G A    T L+GI    G+  R    +G
Sbjct: 354 GI-LPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRAVLGKETRIFCSEG 412

Query: 457 -------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
                   +++A ADD+L   A+  + ++D   L +GL+ ++E E           D+A 
Sbjct: 413 SHLYRDNVENLAEADDRL-KEAVSMAVRSDVVFLCLGLNGTLEGEEGDANNSYAGADKAD 471

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L LP  Q  L+  V   +  P IL+L +G  + + +A         IW  YPGQ GG A 
Sbjct: 472 LNLPESQMRLLKAVC-GTGTPVILLLAAGSAMAINYAAEHCSAILHIW--YPGQMGGLAA 528

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           A +L G + P G+LP+T+Y  +    LP  T+ +M+        GRTYR+ +   +YPFG
Sbjct: 529 ARLLTGEAVPSGRLPVTFY--QTTEELPEFTDYSMK--------GRTYRYMEREALYPFG 578

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G+SY +F ++   A    A P DG+        S K    + A+C+ +    +V V+  
Sbjct: 579 YGLSYGDFEYSNFKAEQTEAGP-DGQ-----VRFSVKITNRSKAECDEI---AEVYVRIA 629

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
            S+                 AP   L  F ++H+ AG    V   + V K   VV+  G
Sbjct: 630 DSELA---------------APGGSLADFRRIHMKAGESVTVPFTLPV-KAFMVVNEEG 672


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 258/714 (36%), Positives = 372/714 (52%), Gaps = 92/714 (12%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           S P+  R +DLI R+SL EKV  L + A  +PRLG+  Y++W+EA HG++N G       
Sbjct: 209 SKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI------ 262

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT----------AGLTYWSP 155
               AT FPQ I  A+++N  L    G V+  E RA +N              GLTYW+P
Sbjct: 263 ----ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYWAP 318

Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
           N+N+FRDPRWGRGQET GEDP L+ +    +V+G+QG D   +   A  KH+  +     
Sbjct: 319 NINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDDPRYMLAMACAKHYAVHSGPER 378

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
               R  FNA++ ++D+ DT+   F   V EGKVA VM +YN VNGVP  A+  +L   +
Sbjct: 379 T---RHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNGVPASANSFLLTELL 435

Query: 276 RGEWRLNGYIVSDCDSV-GVYYDTQ-HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
           R  W   GY+ SDCD++  +Y + Q H+  T EEAAA A++AG +L CG         AV
Sbjct: 436 RKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNLCCGGDYNALVR-AV 494

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQ 392
           Q+GL++E D++ AL +TL  + RLG+FD  P+ Q P+     KD   P H ++ALE ARQ
Sbjct: 495 QQGLVTEKDLDGALYHTLWTRFRLGLFD--PAEQVPFSGYTLKDNDLPAHSQVALELARQ 552

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
            IVLLKN G +LPL   + + +AVIGPN+     + GNY G A    + L  I    G  
Sbjct: 553 AIVLLKNDG-TLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRSISILDDIRNLVGSE 611

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQ---------------ADATILVMGLDQSIEA 493
            +  H  G           +    + + +               ADA I V G+  + E 
Sbjct: 612 IKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEADAIIYVGGITPAQEG 671

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E+ DR  + LP  Q++L+  +    K P ++V  SG  +  A    D  + AI+ A YPG
Sbjct: 672 ESFDRESIELPSEQEDLIRALHATGK-PVVMVNCSGSAM--ALTWQDENLPAIVQAWYPG 728

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
           Q GG A+A++LFG +NP G LP+T+Y      +LP  ++ +M+         RTYR++ G
Sbjct: 729 QEGGRAVAEVLFGETNPSGHLPITFY--RSTADLPDFSDYSMK--------NRTYRYFTG 778

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             +Y FGHG+SY+ F +  AN                        ++V  A    LT  V
Sbjct: 779 RPLYAFGHGLSYSTFEY--AN------------------------LRVAPAANGALT--V 810

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINI 725
            +D+ N G +DG   + +++TPPA       + L  F + HV AG  + V + +
Sbjct: 811 TLDLTNSGKRDGDDVVQLYATPPASSQPQELRALCGFRRTHVKAGETRTVTVTV 864


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 257/761 (33%), Positives = 379/761 (49%), Gaps = 88/761 (11%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A    LPF    L    RVNDL+ RL+L+EKVK +++ A A+PRLGI  Y+WWSE LHGV
Sbjct: 21  ADKSQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGV 80

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
           +     T         T +PQ I  A++++      +    + E RA++N  T       
Sbjct: 81  ARTPYHT---------TVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGD 131

Query: 148 --AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLTYW+PN+NIFRDPRWGRGQET GEDP L+     ++VRGLQG D   LK AA  K
Sbjct: 132 RYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGEDPKYLKAAACAK 191

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H   Y + +     R  F+  VS  D+ +T+   F+  V   KVA VMC+YN     P C
Sbjct: 192 H---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVMCAYNAFRKKPCC 248

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
               ++   +R +W   GY+ SDC ++  +++        E AA DA+  G D++CG   
Sbjct: 249 GSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDAVTNGTDVECGNRA 308

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
            L    AV+ G ++E +I+ ++     ++MRLGMFD  P S   Y    P  + +  H+ 
Sbjct: 309 YLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD--PVSMVSYAQTSPAVLESAPHKA 366

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
            AL+ A++ IVLLKN+   LPLS    + +AV+GPN+D ++ ++GNY G      T L G
Sbjct: 367 QALKMAQESIVLLKNENHLLPLSK-SIKKIAVVGPNADNSIAVLGNYNGTPSKIVTALDG 425

Query: 445 I----GRYARTIHQQGCKDVACADDQL------FGAAIDASRQADATILVMGLDQSIEAE 494
           I    G     +++   K V   +  L      F A     + ADA I V G+   +E E
Sbjct: 426 IKAKLGTNGSVVYE---KAVNFTNAMLPEGKTDFAALTSRVKDADAIIFVGGISPQLEGE 482

Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
            +          DR  +LLP  Q E + K   A+  P + V+M+G  + + + + +  I 
Sbjct: 483 EMKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVMMTGSALAIPWEQEN--IP 539

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+ A Y GQA GTAIAD+LFG  NP G+LP+T+Y  +   +LP  +         R   
Sbjct: 540 AIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD--ADLPAFD-------DYRMEN 590

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           RTYR++ G  +YPFG+G+SYT F +     PT V                          
Sbjct: 591 RTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVK------------------------- 625

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
            N++ + V + + N G+K G   + L  S        P K L  F++V +  G  + +  
Sbjct: 626 -NKVRIPVSIQLTNTGAKGGEEVVQLYISYQGQPIKKPLKALKGFQRVWLNRGQTKTIKF 684

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            +     L++   +G    P G+  I +GG +  V+  A +
Sbjct: 685 -LLTPDALAIAGENGKLLNPKGKLRISVGGGQPDVNTPATS 724


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 237/601 (39%), Positives = 333/601 (55%), Gaps = 18/601 (2%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +L  P+R   L+  L+++EK++ L+S A   PR+G+  Y WWSEALHGV+   PGT 
Sbjct: 43  CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVA-YAPGTY 101

Query: 103 F-GGD--FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F  GD  F  +TS+P  +  A++F+  L E IG  +  EARA  N G AGL YW+PNVN 
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGV 218
           F+DPRWGRG ETPGED +   +YA    RGL G   G++ +V ++CKH+   D ++WNG 
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQRRVISTCKHYAGNDFEDWNGT 221

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  F+AK++ QD+ + + +PF+ C  + KV S+MC+YN VNGVP+CA+  +L+  +R  
Sbjct: 222 SRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEYLLQNILREH 281

Query: 279 WRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           W     N Y+ SDC++V        +  T     A    AG+D  C          A  +
Sbjct: 282 WNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQ 341

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GLL E  ++ AL+      +R G FDG  +   Y  LG KDV + + Q LAL+AA +GIV
Sbjct: 342 GLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNSAEAQSLALQAAVEGIV 399

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIH 453
           LLKN G +LPL       VA+IG  +D    + G Y+G A    TP     +     T+ 
Sbjct: 400 LLKNNG-TLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQLGLDITLA 458

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                    A D    AA++A+  AD  +   GLD S   E LDR  L  P  Q  L+ K
Sbjct: 459 SGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDLEWPEAQLMLIKK 518

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
           +S   K P ++ L+     D    + D  +++I+WA +PGQ GG AI  ++ G  +P G+
Sbjct: 519 LSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGVAIMKLITGEKSPAGR 576

Query: 574 LPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           LP+T YP  Y   +PMT M +RP  + +YPGRTYR+Y  P+   FG G+ YT F   V  
Sbjct: 577 LPVTQYPSNYTDLIPMTSMDLRP--TSQYPGRTYRWYDKPIKR-FGFGLHYTTFKAEVGG 633

Query: 634 A 634
           A
Sbjct: 634 A 634


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 252/672 (37%), Positives = 356/672 (52%), Gaps = 53/672 (7%)

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           Y WWSEAL+              F  ATSFP  IT  ++F+  L  AI  V+S EARA  
Sbjct: 1   YNWWSEALN--------------FSSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAA 202
           N    GL +++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG  G   LK+AA
Sbjct: 47  NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVGPTNLKIAA 106

Query: 203 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            CKH+ AYDL+N  GV RF F+AKV+ QD+ + +   F+ C+ + KVAS+MCSYN VNG+
Sbjct: 107 DCKHWAAYDLENL-GVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSYNAVNGI 165

Query: 263 PTCADPNILKRTIRGEWRL--NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           P+CA+  +L+   R  W L    +I  DC +VG  +   H+T  P    A A+ AG D+D
Sbjct: 166 PSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALNAGTDID 225

Query: 321 CGPFLGLHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           C      ++++   A+ R L+SE  +  A+       +RL                  DV
Sbjct: 226 CDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW---------------DDV 270

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
            T   Q+LA +AA +GIVLLKN G  LPL+    + VAV+GP ++ T  M  NY GIA  
Sbjct: 271 NTEPAQQLAYQAAVEGIVLLKNDG-ILPLAS-SVKKVAVVGPMANATTQMQSNYNGIAPF 328

Query: 438 YTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD 497
             +P Q        +       +  +D   F AAI A+  AD    V G+D +IE E  D
Sbjct: 329 LVSPQQAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTIEREDRD 388

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           R  +   G Q  LV +++   K P I++ M GG +D +  +++  + A+IW GYPGQ+GG
Sbjct: 389 RPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIWGGYPGQSGG 447

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           TA+ D++ G   P G+LP+T YP  Y+   PMT+M +RPS S   PGRTY++Y G  ++ 
Sbjct: 448 TALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSSN--PGRTYKWYTGAPIFE 505

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG G+ YT F    A+     +V         +   S K   V H     L     V V 
Sbjct: 506 FGFGLHYTTFDAEWASGGDSFSVQ--------DLVSSAKNSGVAHVDLGVLDT-FNVTVT 556

Query: 678 NVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVD 735
           N G+    +  L+FS   AG   AP+K+LV++ +V  +  GA     + + +   ++  D
Sbjct: 557 NSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRVKGIEPGASSAASLKVTLGA-VARTD 615

Query: 736 RSGTRRIPLGEH 747
             G R +  GE+
Sbjct: 616 EQGNRVLYPGEY 627


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/459 (44%), Positives = 293/459 (63%), Gaps = 19/459 (4%)

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           +D++CG +   + +SAV +G L E DI+ AL+N  +VQ+RLG+FDG+  +  +  LGP+D
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           VCT +H++LALEAARQGIVLLKN+   LPL+     ++A+IGP ++   ++ G+Y G +C
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 437 GYTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
              +   G+  Y  RT +  GC +V+C  D  F  AI  ++ AD  I+V G+D S E E 
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DR  LLLPG+Q  LVS V+ ASK P ILVL  GGP+DV+FAK D RIA+I+W GYPG+A
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           G  A+ADI+FG  NPGG+LPMTWYP+ + TN+PM +M MR + ++ YPGRTYRFY G  V
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESF-TNVPMNDMNMRANPNRGYPGRTYRFYTGERV 299

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI-----KVTH------AK 664
           Y FG G+SYTN+ +   +AP+ +++      GS+ AT   + +     ++ +      + 
Sbjct: 300 YGFGEGLSYTNYAYKFLSAPSKLSLS-----GSLTATSRKRILHQRGDRLDYIFIDEISS 354

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
           CN L   VQ+ V NVG  DG+H +++FS  P      P KQLV FE+++  +       I
Sbjct: 355 CNSLRFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSI 414

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
            +  CK+LS+ +  G R +P+G H + +G  +H V++ A
Sbjct: 415 LLDPCKHLSIANGQGKRIMPVGSHVLLLGDLQHFVTIEA 453


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 257/748 (34%), Positives = 385/748 (51%), Gaps = 32/748 (4%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
            LGL+L  AS   + +      D  +   ++   C V+    +R   L+  +  QEK+  
Sbjct: 3   LLGLVLFGASL--VPSAYAIGPDCVNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLDN 60

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
           L+S +  V RLG+  Y WW EALHGV+   PG  F G +  ATSFP  +  +++F+  L 
Sbjct: 61  LVSKSKGVARLGLPAYNWWGEALHGVAG-APGINFTGPYRTATSFPMPLLMSAAFDDDLI 119

Query: 129 EAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
             I  V+ +EARA  NGG A + +W+P++N FRDPRWGRG ETPGED +    Y  S + 
Sbjct: 120 HQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLS 179

Query: 189 GLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
           GL+G    R K+ A+CKH+  YD+++WNG DR  F+AK++ QD+ + F  PF+ C  + K
Sbjct: 180 GLEGDKAQR-KIIATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSK 238

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTP 305
           V S MCSYN VNGVPTCAD  +L+  +R  W     N YI SDC++V        + +T 
Sbjct: 239 VGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATL 298

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
           +EA A A   G+DL C          A  +GLL+   I+ AL       +  G FDG  +
Sbjct: 299 QEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAA 358

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
           +  Y +LG +D+ TP+ Q+L L+ A +G+ LLKN   +LPLS      VA++G  ++ + 
Sbjct: 359 T--YANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSKVAMVGFWANDSS 415

Query: 426 TMIGNYAGIACGYTTPLQGIGRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
            + G Y+G A     P+    +      +        + A D     A+DA++++D  + 
Sbjct: 416 KLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKALDAAKKSDTILY 475

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
             GLD S  AE  DR  +  P  Q +L++K  +A+ G  ++V+  G  +D     N   +
Sbjct: 476 FGGLDPSAAAEGSDRTDISWPSAQIDLITK--LAALGKPLVVIALGDMVDHMPILNMKGV 533

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            ++IWA +PGQ GGTA+  ++ G     G+LP+T YP +Y T L M +M +RP  +   P
Sbjct: 534 NSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAKY-TQLSMLDMNLRPGGNN--P 590

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYR+Y    V PFG G+ YT F                G + S+   I       T  
Sbjct: 591 GRTYRWYN-ESVQPFGFGLHYTKFAAKF------------GSNSSLTVNIQDIMKSCTKD 637

Query: 664 KCNRLTL-GVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRV 721
             +   +  ++V V N G++      L F     G    P K LV++ ++   +G+Q + 
Sbjct: 638 HPDLCDVPPIEVAVTNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKT 697

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNI 749
                    LS VD+SG      GE+ +
Sbjct: 698 ASLALTLGTLSRVDQSGNLVAYPGEYTL 725


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 241/599 (40%), Positives = 339/599 (56%), Gaps = 65/599 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ +++L+E+   +   A AVPRLGI  Y WW E LHGV+  G           AT FPQ
Sbjct: 13  LVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------ATMFPQ 62

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
            I  A+ F+  L E I  VVS E RA YN            GLT+WSPNVNIFRDPRWGR
Sbjct: 63  AIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDPRWGR 122

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNA 225
           G ET GEDP L+ +   ++VRGLQG DG+ LK+AA  KHF  +     +G +  R  F A
Sbjct: 123 GHETYGEDPYLTSRLGTAFVRGLQG-DGEHLKIAACAKHFAVH-----SGPEALRHEFWA 176

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
             SK+D+ +T+   F  CV E  V SVM +YN  +G P CA+  +++  +RG+W   G+ 
Sbjct: 177 DTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILRGQWGFEGHF 236

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSDC ++  ++     T T  E+AA A++ G DL+CG    L    A + GLL +  +  
Sbjct: 237 VSDCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTY-LQVLKACEEGLLDDACVTE 295

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           A+V   T +  LGM  GE +   Y  +  + V   +H+ELA+EAAR+ +VLLKN G  LP
Sbjct: 296 AVVRLFTTRYLLGM--GEETE--YDDIPYEVVECKEHRELAVEAARRSMVLLKNDG-LLP 350

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC---K 458
           L   +  T+AVIGPN+D    +IGNY G +  YTT L+GI    G   R ++ +GC   K
Sbjct: 351 LHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRVLYAEGCHLFK 410

Query: 459 D----VACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
           D    +A A D+L  A I  ++ +D  +L +GLD+++E E         + D+  LLLP 
Sbjct: 411 DRVEHLAVAGDRLSEARI-VAKHSDVVVLCVGLDETLEGEEGDTGNSHASGDKKDLLLPE 469

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            Q+ L+ ++    K P ++  MSG  ID++ A+   +  A+I   YPG  GG A+AD+LF
Sbjct: 470 SQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAVIQVWYPGAEGGRALADLLF 526

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           G ++P GKLP+T+Y    + NLP       P +     GRTYR+     +YPFG G++Y
Sbjct: 527 GKASPSGKLPVTFYKD--LENLP-------PFEDYSMDGRTYRYLTAEPLYPFGFGLTY 576


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 252/737 (34%), Positives = 385/737 (52%), Gaps = 99/737 (13%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++   L+ R++L+EK   L   A A+ RLGI  Y WW+E LHGV+  G           A
Sbjct: 8   KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------A 57

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A++F+      +  +V+ E RA YN  +         GLT+WSPNVNIFRD
Sbjct: 58  TVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRD 117

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRG ET GEDP L+ +   S+V+ LQG +GD +K AA  KHF  +     +G +  R
Sbjct: 118 PRWGRGHETYGEDPYLTKELGVSFVKALQG-NGDTMKAAACAKHFAVH-----SGPEALR 171

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+ S +D+E+T+   F   V E KV +VM +YN+ NG P C  P  L++ +RGEW+
Sbjct: 172 HEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSPT-LQKKLRGEWK 230

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G+ VSDC ++  +++    T T  E+AA AI  G DL+CG    LH   A ++GL++E
Sbjct: 231 FQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGNTY-LHIMKAYEKGLVTE 289

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             I  A V   T +  LG+FDG      Y +L   +V +P H + A +AA +  VLLKN 
Sbjct: 290 ETITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEVESPRHLDAAEKAAEKSFVLLKNN 345

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
           G  LPL   + +T+ +IGPN+D    +IGNY G A  Y T  +GI  Y     R +  +G
Sbjct: 346 G-ILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRILTSRG 404

Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           C       + +A   D++  A +  +  +D  IL MGLD+++E E         + D+  
Sbjct: 405 CDLFRDRTEHLAFTRDRIAEAKV-VAENSDVVILCMGLDETLEGEEGDTGNSYVSGDKED 463

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           + LPG Q+EL+  ++   K P +  L++G  +D+ +A        ++W  YPG  GG A 
Sbjct: 464 IELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQGGKAA 520

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           A +LFG  +P GKLP+T+Y  E +  LP  T+ +M+        GRTYR+ +    +PFG
Sbjct: 521 AKVLFGEISPSGKLPVTFY--ESLEELPDFTDYSMK--------GRTYRYMERKAQFPFG 570

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G++Y+                           +    +K    K N     V+V+V+N 
Sbjct: 571 YGLTYS------------------------KVAVDKAEVKTCGQKIN-----VEVEVQNN 601

Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G+ D    + ++     + +  P+  L  F+++ + AG  +++ I I   K  +VVD +G
Sbjct: 602 GAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEIPIWE-KAFTVVDETG 660

Query: 739 TRRIPLGEHNIHIGGTK 755
            R     +  I+ G ++
Sbjct: 661 KRMEEGKKFEIYAGCSQ 677


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 239/613 (38%), Positives = 347/613 (56%), Gaps = 60/613 (9%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ +++L+EKV  +   A A+ RLGI  Y WW+E LHGV   G           A
Sbjct: 6   QRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT----------A 55

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ I  A+SF+ +L E + + +SDEARA YN        G   GLT+WSPN+N+FRD
Sbjct: 56  TVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLFRD 115

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           PRWGRG ET GEDP+L+G+   +++RGLQ G D    K+ A+ KHF A+         R 
Sbjct: 116 PRWGRGHETYGEDPLLTGRMGTAFIRGLQEGEDSQYRKLDATVKHFAAHSGPE---AGRH 172

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            FNA+VS +D+ D++   FR C+   K A+VM +YN++NG P CA    LK  +  EW+ 
Sbjct: 173 SFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYLKGVLYEEWKF 232

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TESAVQRGLLSE 340
           +GY+VSDC ++    +  H T   +E+AA A+  G  L+CG     H  ++AV+ GL+SE
Sbjct: 233 DGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGK--AYHWVKAAVEDGLISE 290

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             +  A+      + RLGMFD   S   Y  +    +    H+EL  + A++ IVLLKN 
Sbjct: 291 DTVTCAVERLFEARFRLGMFD---SDCVYDSIPMNVIECRKHRELNRKMAQESIVLLKNN 347

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGC- 457
           G  LPL+    +T+AVIGPN+D    ++GNY G    +TT L+GI   AR    + +G  
Sbjct: 348 G-ILPLN--PEKTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEVYYARGSV 404

Query: 458 ---KDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPG 505
              K+     ++    AI  ++ AD  +L +GL   +E         A++ DR  + LP 
Sbjct: 405 LVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDRKDISLPD 464

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            QQ+L+  + + ++ P +LV +SGG +D+  A  D R AAI+   YPG  GG A+ADILF
Sbjct: 465 IQQQLLCAI-LDTEKPVVLVNVSGGCVDLRQA--DERCAAILQCFYPGAEGGNALADILF 521

Query: 566 GTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           G  +P G+LP+T+Y    + +L P T+ +M+        GRTYRF+ G  +YPFGHG++Y
Sbjct: 522 GRVSPSGRLPVTFY--RTVEDLPPFTDYSMK--------GRTYRFFDGKPLYPFGHGLTY 571

Query: 625 TNFVHTVANAPTV 637
            +      +  TV
Sbjct: 572 ADIKEQWTDPYTV 584


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 258/726 (35%), Positives = 381/726 (52%), Gaps = 32/726 (4%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +     R   L   ++L+E +    +   A+PRLG+  Y+ W+EALHG+  +   T+
Sbjct: 62  CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLY-LANFTE 120

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
             G F  +TSFP  I T ++ N TL   I ++++ + RA  N G  GL  +SPN+N FR 
Sbjct: 121 -SGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNINAFRH 179

Query: 163 PRWGRGQETPGEDP-VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           P WGRGQETPGED   L   YA  Y+ GLQG +    K+ A+ KH+  YD++NW    RF
Sbjct: 180 PVWGRGQETPGEDANCLCSAYAYEYITGLQG-NATNPKIIATAKHYAGYDIENWRQRSRF 238

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
             +  +++QD+ + F   F + V + +V SVM SYN VNGVP+ A+  +L+  +R  W  
Sbjct: 239 GNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANTFLLQTLVRDSWGF 298

Query: 282 --NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
             +GY+ SDCD+V   ++   + +    A+A ++RAG D+DCG  +L    ES  Q G +
Sbjct: 299 IQDGYMASDCDAVYNVFNPHGYAANLSSASAMSLRAGTDIDCGISYLTTLNESLTQ-GQI 357

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           S  +I  A+    +  +  G FDG  +  PY  L   DV   +   +A EAA  G+VLLK
Sbjct: 358 SRSEIERAVTRFYSNLVSAGYFDGPDA--PYRDLSWSDVVRTNRWNVAYEAAVAGVVLLK 415

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCK 458
           N G  LPLS    R VA+IGP ++ T  M GNY G+A   T+PL  +      ++     
Sbjct: 416 NDG-VLPLSKSVQR-VALIGPWANATEQMQGNYHGVAPYLTSPLAAVQASGLEVNYAFGT 473

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
           ++       F AA+ A+ ++D  I   G+D ++EAE LDRA +  PG Q EL+ ++    
Sbjct: 474 NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGNQLELIHRLGELG 533

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P +++ M GG +D +  K   ++ A++W GYPGQAGG A+ DIL G   P G+L  T 
Sbjct: 534 K-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTGQRAPAGRLTTTQ 592

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           YP EY    P T+M++RP      PG+TY +Y G  VY FGHG+ YT F        T +
Sbjct: 593 YPAEYALQFPATDMSLRPRGDN--PGQTYMWYTGEPVYAFGHGLFYTTFA-------TAL 643

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLT----LGVQVDVKNVGSKDGAHTLLVFSTP 694
           A P      S +    G  +   HA  N +     L   V V N G     +T + F+  
Sbjct: 644 AGPGQEPERSFDI---GALLARPHAGYNLVEQLPFLNFTVKVTNTGEVISDYTAMAFANT 700

Query: 695 PAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLGEHNIHIG 752
            AG    P+K LV F+++  P   +    +++ V    L+  D  G R I  G + + + 
Sbjct: 701 TAGPRPHPNKWLVGFDRIG-PLDPRVSARMSVPVSLDSLARTDAQGNRVIYPGPYELALN 759

Query: 753 GTKHSV 758
             +  V
Sbjct: 760 NERSPV 765


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 200/473 (42%), Positives = 305/473 (64%), Gaps = 9/473 (1%)

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCD+V + YD Q +  +PE+A AD ++AG+D++CG +L  HT+SA+Q+  +SE DI+ AL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
           +N  +V++RLG+F+G+P+  PYG++ P +VC+P HQ LAL+AAR GIVLLKN    LP S
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART-IHQQGCKDVACADDQ 466
                ++AVIGPN+ V  T++GNYAG  C   TPL  +  Y +  ++ QGC  VAC++  
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAA 400

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           +   A+  ++ AD  +L+MGLDQ+ E E  DR  L LPG+QQEL++ V+ A+K P +LVL
Sbjct: 401 I-DQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVL 459

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           + GGP+D++FA N+ +I +IIWAGYPG+AGG AI++I+FG  NPGG+LP+TWYPQ ++ N
Sbjct: 460 ICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFV-N 518

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPLDGR 645
           + MT+M MR +    YPGRTY+FYKGP VY FGHG+SY+ + +     A T + +     
Sbjct: 519 IQMTDMRMRSATG--YPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKA 576

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW---APH 702
             + ++       ++    C+     V V+V+N G   G H +L+F+    G        
Sbjct: 577 QTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAE 636

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           KQLV F+ + +  G +  +   I +C++LS  +  G   +  G++ + +G ++
Sbjct: 637 KQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSE 689



 Score =  249 bits (637), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 112/205 (54%), Positives = 144/205 (70%), Gaps = 8/205 (3%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           +A  P +CDP + TT+   FC+  LPI +R  DL+ RL++ EK+  L++ A  +PRLG+ 
Sbjct: 19  SAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVP 78

Query: 83  GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
            YEWWSEALHGV+  GPG +F G    ATSFPQVI TA+SF++  W  I +V+  EAR +
Sbjct: 79  AYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGV 138

Query: 143 YNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD----- 196
           YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++G YA +YVRGLQG   D     
Sbjct: 139 YNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTL 198

Query: 197 --RLKVAASCKHFTAYDLDNWNGVD 219
              L+ +A CKHFTAYDLD W   D
Sbjct: 199 SNHLQASACCKHFTAYDLDRWKDCD 223


>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
          Length = 231

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 181/232 (78%), Positives = 208/232 (89%), Gaps = 3/232 (1%)

Query: 183 AASYVRGLQGSDG--DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           AASYVRGLQ   G   RLKVAA CKH+TAYDLDNW G+DRFHFNA+VSKQD+EDTF+VPF
Sbjct: 1   AASYVRGLQQPYGRDSRLKVAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTFNVPF 60

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           R CV+EGKVASVMCSYNQVNG+PTCADPN+L+ TIRGEWRLNGYIV+DCDS+GV YDTQH
Sbjct: 61  RECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLYDTQH 120

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
           +TSTPEE+AADAI+AG+DLDCGPFL +HT+ A+ R +LSE+ ++ AL NTL VQMRLGMF
Sbjct: 121 YTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMRLGMF 180

Query: 361 DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
           DGEPS+QP+GHLGP+DVCT  HQ+LALEAARQGIVL+KNQG SLPLS  RHR
Sbjct: 181 DGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKNQG-SLPLSTARHR 231


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 258/710 (36%), Positives = 379/710 (53%), Gaps = 44/710 (6%)

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG-------TKFGGDFPGATS 112
           L+L EK   L++ A  V RLG+  YEW +EALHG++ V PG       T+    F  +T 
Sbjct: 12  LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71

Query: 113 FPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETP 172
           FP  I   ++F+  L   I   VS EARA  N   AGL YW+PN+N +RDPRWGRGQETP
Sbjct: 72  FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131

Query: 173 GEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
           GEDP    +YA +YV GL+G  G  + KV ++CKHF  YD+++ +GV R  +NA +S QD
Sbjct: 132 GEDPYHVAQYAYNYVVGLKGGVGPAKSKVVSTCKHFAGYDIEDSDGVVRGSYNAIISTQD 191

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSD 288
           + + +   FR C  + K  +VMCSYN VNG P+CA+  +L   +R  W       ++  D
Sbjct: 192 LAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGSSAHWVTGD 251

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
           C +V   ++  H   +  +  A AI  G DLDCG     +  SAVQ    +E  ++ AL 
Sbjct: 252 CGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTTEAQLDQALS 311

Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
              +  + LG FD  P  Q Y  LG  DV TP  Q+LA  A  +GI +L    P  P+  
Sbjct: 312 RLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQKLAYTALVEGINIL----PIRPMG- 365

Query: 409 IRHRTVAVIGP-NSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKDVACAD 464
              +TV  +GP  ++ +V+M GNY G+A   T P+      A      + QG + V   D
Sbjct: 366 ---QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQGLQYVLSND 422

Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
              F AA+ A+++AD  + + G+D+ +EAEA DR  +  PG Q  L+ +  +A+  P ++
Sbjct: 423 TSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQ--LAAVKPVVV 480

Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           V + GG +D +    +  +  ++W GYPGQ  G+ + DIL G S P G+LP+T YP  YI
Sbjct: 481 VQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPVTQYPANYI 540

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
           T +PMT+ ++RPS S   PGRTYR+Y G V+ PFG G+ YT F  +     +       G
Sbjct: 541 TQVPMTDQSLRPSSSN--PGRTYRWYNGSVI-PFGTGIHYTKFNISWKTGGS-------G 590

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF--STPPAGHWAPH 702
           R     A           A+ +      Q++V+NVGS    +  L+F  S+       P 
Sbjct: 591 RGTYDTADFINAEDPKDLAEFDVF----QINVENVGSTTSDYVALLFVKSSDSGPQPYPL 646

Query: 703 KQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           K LV++ + H    G   ++ + ++V + ++  D SG   +  G + + I
Sbjct: 647 KTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLYPGAYTLEI 695


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 227/522 (43%), Positives = 309/522 (59%), Gaps = 30/522 (5%)

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGL 317
           +NGVP CAD  +L  T+R +W+L+GYIVSDCDSV V   D +    T  EA A A++AGL
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60

Query: 318 DLDCGPFL-GLHT------ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
           DLDCG F  G+H         AV++G L E  ++NAL N     MRLG FDG P  +   
Sbjct: 61  DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE--- 117

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT--MI 428
            LG  DVCT +H+ELA +AARQG+VLLKN    LPLS  +  +VA+ G    +  T  M+
Sbjct: 118 SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVML 177

Query: 429 GNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
           G+Y G  C   TP  G+ +   +     C   +C        A  A++  DATI+V GL+
Sbjct: 178 GDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC------DTAAAAAKTVDATIVVAGLN 231

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
            S+E E+ DR  LLLP  Q   ++ V+ AS  P +LV+MS G +DV+FA+++P+I A++W
Sbjct: 232 MSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVW 291

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
           AGYPG+ GGTAIAD+LFG  NPGG+LP+TWY  EY++ +PMT MA+RP     YPGRTY+
Sbjct: 292 AGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYK 351

Query: 609 FYKGP-VVYPFGHGMSYTNFVHTVANAPTVVAVPLDG----RHGSINATISG----KAIK 659
           FY G  V+YPFGHG+SYTNF +  A A   V V +      +  +  A +S      A+ 
Sbjct: 352 FYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVN 411

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQ 718
           V    C    +   V V N G +DG H + +++ PPA    AP KQLVAF +V V AGA 
Sbjct: 412 VASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAA 470

Query: 719 QRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             V   ++VCK  ++V+ +    +P G   + +G    S+S 
Sbjct: 471 VEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSF 512


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 232/611 (37%), Positives = 336/611 (54%), Gaps = 31/611 (5%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
           LPFC +SL    R  DL  RL+L E  + L +        A  VPRLG++ Y + +E LH
Sbjct: 42  LPFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLH 101

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
           G+ +         ++P AT +PQV   A++ NA+L   +  ++  E RA+ N        
Sbjct: 102 GIRDANVV-----NYP-ATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEI 155

Query: 145 -GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAA 202
            G    L+ + P +NI RD RWGR QE+  EDP L+G YA ++V GL+  +  + L+ A 
Sbjct: 156 FGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRNSSKYLQAAT 215

Query: 203 SCKHFTAYDLDNWNG-VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
           SCKH  AY  + +N  + R  FNA + + DI DT+   FR CV  G V  +MCSYN VNG
Sbjct: 216 SCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQQIMCSYNSVNG 275

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
           +P CA  ++    +R  W   G IVSDCD+V   Y+T ++T TPE+A   A++ G DLDC
Sbjct: 276 IPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPEDAVTVALQGGCDLDC 335

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G F   H  SAVQ+ L +   +  ++   L ++  LG FD + +S PY  LG + + TP 
Sbjct: 336 GDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPD-TSVPYRQLGREAIDTPF 394

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TT 440
            ++ +L A+R+ +VLL+N+   LP++      VA+IGP  ++T  M+G        + TT
Sbjct: 395 ARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIMMGGKLDYTPSFITT 454

Query: 441 PLQGIGRYART--IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
             QG      T      GC   A     L   A+  + QAD  +L +GL   IE E  DR
Sbjct: 455 YFQGFQAIGITHLTSSPGCNITAPLPGAL-DKAVQIATQADLVVLTLGLSSDIEHEGGDR 513

Query: 499 AGLLLPGRQQELVSKVSMA-SKGPTILVLMSGGPIDVAFAKND-PRIAAIIWAGYPGQAG 556
             L LP  QQ+L   +S A      ++VL++GGP+ V   K    R   II A Y GQ+ 
Sbjct: 514 ETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTIIEAFYGGQSA 573

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           GTA+A+ +FG +NP G LP T +      ++P T+M +RP  +  +PGRT+RF+  PV++
Sbjct: 574 GTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATGFPGRTHRFFDAPVMW 633

Query: 617 PFGHGMSYTNF 627
           PFGHG+SY+ F
Sbjct: 634 PFGHGLSYSTF 644


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 260/768 (33%), Positives = 384/768 (50%), Gaps = 90/768 (11%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A    LPF    LP+  RV DL+ RL+L+EKV L+   + A+PRLGI  Y+WW+EALHGV
Sbjct: 19  AQVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA- 148
           +               T FPQ I  A++F+    + +G + S E RA++N     G T  
Sbjct: 79  ART---------LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGT 129

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLTYW+PN+NIFRDPRWGRGQET GEDP L+ K  A+ VRGL+G D   LK  A  K
Sbjct: 130 RYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAK 189

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H+  +    +N   R  F+A+ S  D+ DT+   FR  V + KV  VMC+YN++NG P C
Sbjct: 190 HYAVHSGPEYN---RHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCC 246

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE---AAADAIRAGLDLDCG 322
            +  +L   +R +W  +GY+ SDC ++    D   F  T  E   A +DA+ AG DL+CG
Sbjct: 247 GNDPLLVDILRNQWHFDGYVTSDCWALK---DFAEFHKTHPEHTIAMSDALLAGTDLECG 303

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPD 381
               L  E  V++GL SE DIN +L    T+  ++GMFD  P+ + PY  +G + +    
Sbjct: 304 NLYHLLAE-GVKKGLHSERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEA 360

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           H++ A   A++ IVLL+N+   LPL   + +++A+IGPN+D   T + NY G      TP
Sbjct: 361 HKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTP 420

Query: 442 LQGIGRY----ARTIHQQGCKDV-ACADDQLFGAAIDASRQADATILVMGLDQSIE---- 492
              + R      +  +  G   V    D   F      + Q+D  + V G+    E    
Sbjct: 421 YMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAG 480

Query: 493 ---------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
                      + DR  + LP  Q EL+ K+    + P I+V MSG  +  +F       
Sbjct: 481 DAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSVM--SFEWESQNA 537

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            A++ A Y GQA G AI D+LFG  NP G++P+T Y  +   +LP       P ++    
Sbjct: 538 DALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD--NDLP-------PFENYSML 588

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYR++KG   YPFG+G+SYT F ++                  ++ T +G   +VT  
Sbjct: 589 GRTYRYFKGEPRYPFGYGLSYTTFAYSDVQC--------------VDETHTGDTARVT-- 632

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRV 721
                     V V N G  DG   + ++   P       P   L  F+++H+  G    V
Sbjct: 633 ----------VTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSV 682

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
              +   + L++ +  G      G+  + +GG + + +   ++L  IK
Sbjct: 683 SFTL-TPEELALTETDGNLVEKNGQVTLFVGGGQPNYAAGVSSLLTIK 729


>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
          Length = 285

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 180/278 (64%), Positives = 222/278 (79%), Gaps = 1/278 (0%)

Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
           MGLDQSIEAE  DR GLLLPG QQELVS+V+  ++GP ILVLMSGGPIDV+FAKNDP+I+
Sbjct: 1   MGLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKIS 60

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+W GYPGQAGGTAIAD++FGT+NPGG+LPMTWYPQ Y+  +PMT M MRP+ +  YPG
Sbjct: 61  AILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPG 120

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHA 663
           RTYRFYKGPVV+PFGHG+SYT F H++A AP  V+VP        N+T+S  KA++V+HA
Sbjct: 121 RTYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKAVRVSHA 180

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
            C+ + +G  VDVKN GS DG +TLLVFS PP G W+  KQLV+F K +VPAG++QRV +
Sbjct: 181 NCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSATKQLVSFHKTYVPAGSKQRVKV 240

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
            +HVCK+LSVVD  G RRIP+GEH + IG  KHS+S+ 
Sbjct: 241 GVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQ 278


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 256/726 (35%), Positives = 391/726 (53%), Gaps = 93/726 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R  +L+ +++++EK   L   A A+ RLGI  Y WW+EALHGV+  G           A
Sbjct: 8   KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------A 57

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +G V+++EARA YN  +         GLT+W+PNVNIFRD
Sbjct: 58  TMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRD 117

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+ + A  +V+ +QG DG+ +K AA  KHF  +        +R  
Sbjct: 118 PRWGRGHETYGEDPFLTSRLAVPFVKAMQG-DGEYMKAAACAKHFAVHSGPEG---ERHF 173

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+AK SK+D+E+T+   F   V E +V +VM +YN+ NG P CA+  ++  T+RG+W   
Sbjct: 174 FDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDTLRGKWGFQ 233

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G+ VSDC ++  +++    TS+PEE+A  A+  G DL+CG        + V+ GL+ E  
Sbjct: 234 GHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTYQ-SIMNGVRAGLIDEKL 292

Query: 343 INNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
           I  +     T +  LGMFD  E    PY     + V   +H  +A  AAR+ +VLLKN G
Sbjct: 293 ITESCERLFTTRFLLGMFDKTEFDEIPY-----EKVECKEHLAVAKRAARESVVLLKNDG 347

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
             LPL+    +T+ V+GPN++  +++IGNY G +  Y T L+GI    G   R ++ +GC
Sbjct: 348 -LLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVLYSEGC 406

Query: 458 KDV------ACADDQL---FGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
            D+        AD  L      A   +  +D  ++V+GLD+++E E         + D+ 
Sbjct: 407 -DIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFASGDKI 465

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q++L++ V    K PTI++ M+G  ID++ A+++    A++ A YPG  GG  
Sbjct: 466 NLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGARGGAD 522

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           +ADILFG  +P GKLP+T+Y      +LP   + +M+         RTY+++ G  +YPF
Sbjct: 523 VADILFGDVSPSGKLPVTFYKS--ADDLPDFKDYSMK--------NRTYKYFTGTPLYPF 572

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG--VQVDV 676
           G+G++Y +          V     D         +SG  I VT     +L     VQ+ +
Sbjct: 573 GYGLTYGDCYVKPDYDFNVKYADAD--------KVSGAEITVTVVNDGKLDTDEVVQLYI 624

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           K++ S         F+T        +  LV F++VHVPAG + RV + +   K  + V+ 
Sbjct: 625 KDMDS--------YFAT-------TNPSLVGFKRVHVPAGGETRVTLTVSE-KAFTSVNE 668

Query: 737 SGTRRI 742
            G R +
Sbjct: 669 EGERAV 674


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 254/705 (36%), Positives = 369/705 (52%), Gaps = 59/705 (8%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  SL   +R   L+  L++ EK+  L+S A    R+G+  Y WWSEALHGV+   PGT+
Sbjct: 44  CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVA-YAPGTQ 102

Query: 103 FG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F    GDF  +TSFP  +  A+SF+ +L E IG V+  E+RA  NG  +GL YW+PNVN 
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDYWTPNVNP 162

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGV 218
           F+DPRWGRG ETPGED +   +YAAS ++GL+G   ++  +V ++CKH+ A D ++WNG 
Sbjct: 163 FKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPEKERRVVSTCKHYAANDFEDWNGT 222

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  F+A++S QD+ + + +PF+ C  + +V S+MC+YN VNGVP+CA+  +L   +R  
Sbjct: 223 SRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVPSCANSYLLDTVLRKH 282

Query: 279 WRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           W     N Y+ SDC++V        +  T  E  A    AG D  C          A  +
Sbjct: 283 WGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDTSCEYTPSSDIRGAYAQ 342

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GLL E  ++ AL+      +R+G FDG  S+  +  +   DV  P  Q+L+L++A +GIV
Sbjct: 343 GLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNAPAAQDLSLQSAVEGIV 400

Query: 396 LLKNQGP-SLPLSHIRHRT-----------VAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LKN G   LPL                  +A+IG  +D    + G Y+G A    TP  
Sbjct: 401 MLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLRGGYSGTAAYLRTP-- 458

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ-----------ADATILVMGLDQSIE 492
                A    Q G   V      L GAA  A+             AD  +   GLD++  
Sbjct: 459 -----AYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYIVYFGGLDETAA 513

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
            E  DR  +  PG Q  LV +  +A+ G  ++V+  G  +D      +  + A++WA +P
Sbjct: 514 GENKDRWDVEWPGAQLALVKR--LAALGKPLVVVQMGDQLDGTPLLANAGVGAVLWASWP 571

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           GQ GG A+  +L G ++P G+LP+T YP  Y   +PMTEMA+RPS S   PGRTYR+Y  
Sbjct: 572 GQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGSRPGRTYRWYST 631

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH-AKCNRLTLG 671
           PV+ PFG G+ YTNF   V   P + A         +  +   +A +  H  +C    L 
Sbjct: 632 PVL-PFGFGLHYTNFTPAVTVPPALAAA------SGVTTSSLLEACRDPHPERCALPPL- 683

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH----KQLVAFEKVH 712
            +V V N G +   +  L F +   G + P     K L A+ ++ 
Sbjct: 684 -RVAVANTGRRASDYVALAFVS---GDYGPRPRPIKTLAAYARLR 724


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 250/744 (33%), Positives = 393/744 (52%), Gaps = 89/744 (11%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F    + + +R   ++ +L+++EK+  L++ A AV RL I  Y+WW+E LHGV+  G   
Sbjct: 17  FRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECLHGVARAGR-- 74

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FPQ I  A++++ TL   +G  +S EARA YN        G   GLT+W
Sbjct: 75  --------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKHGYRGQYKGLTFW 126

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PNVNIFRDPRWGRGQET GEDP L+ +   S+V+GLQG+    LKVAA  KH+  +   
Sbjct: 127 TPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNHPKYLKVAALAKHYAVH--- 183

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             NG +  R  F+AKVS +D+ +T+   F   V E  V  VM +YN+ NG P CA P ++
Sbjct: 184 --NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVMGAYNRTNGDPCCAHPYLM 241

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
           +  +R +W  +GY VSDC ++  +Y       TPEEAAA A+ AG +L+CG       +S
Sbjct: 242 QEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMALNAGCNLNCGDTYASLLKS 301

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
            +++GL +E +I+ ++      ++RLG+F  E  + PY  +    + + +HQ+LALEAAR
Sbjct: 302 -LEKGLTTEEEIDRSVKQLFKTRLRLGLFAPE-GAVPYDTISTDVIRSKEHQKLALEAAR 359

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYA- 449
           + +VLLKN+  +LP++    + V V GP +     ++ NY G++   TT L+GI G+ + 
Sbjct: 360 KSVVLLKNEANTLPVAR-DVKKVYVTGPTATHVQALLANYYGVSEDMTTILEGIVGKVSP 418

Query: 450 --RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDR 498
                ++QG   +  A+         A+  AD T+  +G+ Q IE E           DR
Sbjct: 419 QTSVQYRQGAL-LYEANRNTMDWFSGAAASADVTVACLGISQLIEGEEGEAIASEHRGDR 477

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA----AIIWAGYPGQ 554
               LP  Q + + ++  ++K   ++V+ SG  I +      P I     A+++  YPG+
Sbjct: 478 ERTRLPQNQIDFLKRIRASAK-KLVVVITSGSAISL------PEIYDMADALLYVWYPGE 530

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            GG A+AD+LFG + P G+LP+T      + +LP       P ++    GRTYR+ +   
Sbjct: 531 QGGKAVADVLFGDAVPSGRLPVTVVKS--VDDLP-------PYENYDMKGRTYRYMEVSP 581

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            +PFG G+SYT+F ++                   N T+     KV   +  RL+     
Sbjct: 582 QFPFGFGLSYTDFTYS-------------------NLTLESN--KVKSGESVRLSF---- 616

Query: 675 DVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
           D+ N G  D    +  + T   A    P + L+ F++V + AG   ++   +     + +
Sbjct: 617 DLTNEGEYDADEVVQFYITDVEASVNVPKQSLIGFKRVGLAAGESTKIEFTV-TPDMMKI 675

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHS 757
           VD +G + +  GE  I+IGG+ +S
Sbjct: 676 VDNNGEKILESGEFKIYIGGSSYS 699


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 255/733 (34%), Positives = 377/733 (51%), Gaps = 80/733 (10%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F   +L I  R   L+  L+L+EK+ LL      V RL I  Y WW+EALHGV+  G   
Sbjct: 29  FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A++F+ TL   I   +S EAR+ YN   +        G+T+W
Sbjct: 87  --------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDL 212
           +PN+NIFRDPRWGRGQET GEDP L+     ++V+GLQGS+ +R LK AA  KHF  +  
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSEPERRLKTAAGAKHFAVHSG 198

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
                 DR HFNA V ++D+ +T+   F+  V  G V ++MC+YN+VNG P C    +L+
Sbjct: 199 PE---ADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNRVNGEPCCTGKTLLQ 254

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
             +R EW   G +V+DC ++   +       T  E AA A++AG++LDC   L    + A
Sbjct: 255 DILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGVNLDCANILQEDVQDA 314

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           +++ LL+   +++AL+ TL  Q++LG +D +PS  PY H G   V    H  LA EAA +
Sbjct: 315 IEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSHSPYRHYGIDSVNNSYHISLAKEAAEK 373

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
            +VLLKN G  LPL      ++ V+G N+     + GNY G++    T ++G+    G  
Sbjct: 374 SMVLLKNDG-ILPLKKDTISSIMVVGENAASISALTGNYHGLSGNMVTFVEGLVKAGGPG 432

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
               +  GC   + AD   FG  I A+   D TI V+GL   +E E           D+ 
Sbjct: 433 MSVQYDYGC---SFADTSHFG-GIWAAGFTDVTIAVIGLSPLLEGEHGDAFLSNWGGDKK 488

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L +P   +  + K+  +   P I V+  G  +D++    +P   AII+A YPG+ GGTA
Sbjct: 489 DLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAI--EPYADAIIYAWYPGEQGGTA 546

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           +AD++FG  +P G+LP+T+Y    I +LP       P        RTYR+++G V+YPFG
Sbjct: 547 LADLIFGEVSPSGRLPITFYKD--IKDLP-------PYHDYNMTNRTYRYFQGDVLYPFG 597

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G+SYT+F +   + P+      D                          + V + V N 
Sbjct: 598 YGLSYTSFHYEWLSKPSTKVSEDD-------------------------IISVNIAVTNT 632

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           G+ D    + V+   P     P ++L  F ++H+ AG  Q   I I V K L   D    
Sbjct: 633 GTMDADEVIQVYIVYPDIERMPLRELKGFSRIHIKAGQTQNTDIQIPV-KNLKKWDSKNN 691

Query: 740 R-RIPLGEHNIHI 751
           R ++  G++ I +
Sbjct: 692 RWKLYKGKYKIQV 704


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 267/767 (34%), Positives = 384/767 (50%), Gaps = 89/767 (11%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------------SGAAAVPRLGIK 82
           C  +   P R   L+  L++ EK+  L+                    S +    R+G+ 
Sbjct: 36  CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95

Query: 83  GYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
            Y WW+EALHGV+   PG  F   G +F  ATSF   IT A++F+  L   +   +S EA
Sbjct: 96  AYAWWNEALHGVA-ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154

Query: 140 RAMYNGGTAGLTYWSPNVNIFRDPRWGRG------------------QETPGEDPVLSGK 181
           RA  N   AGL YW+PN+N ++DPRWGRG                  Q+TPGEDPV    
Sbjct: 155 RAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIKG 214

Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
           Y  + + GL+G D  R KV A+CKHF AYDL+ W G  R+ FNA V+ QD+ + +  PF+
Sbjct: 215 YVQALLEGLEGRDKIR-KVIATCKHFAAYDLERWQGALRYRFNAVVTSQDLSEYYLQPFQ 273

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDT 298
            C  + KV S MCSYN +NG P CA   ++   +R  W     N YI SDC+++  +   
Sbjct: 274 QCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNNYITSDCNAIQDFLPN 333

Query: 299 QH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQ 354
            H F+ TP +AAADA  AG D  C  P     T+   A  + LLSE  I+ AL       
Sbjct: 334 FHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLSEEIIDRALRRLYEGL 393

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
           +R G  D   S  PY  +    V TP  Q LAL++A  GIVLLKN G  LPL  + ++T+
Sbjct: 394 IRAGYLDSA-SPHPYTKISWSQVNTPKAQALALQSATDGIVLLKNNG-LLPLD-LTNKTI 450

Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG---CKDVACADDQLFGAA 471
           A+IG  ++ T  M+G Y+GI   Y  P+    +   T H       +     +D     A
Sbjct: 451 ALIGHWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGPVNQSSPSTNDTWTSPA 510

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + A+ ++D  + + G D SI AE  DR  +  P  Q  L++  S+A  G   +V   G  
Sbjct: 511 LSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLT--SLAQMGKPTIVARLGDQ 568

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           +D     ++P I++I+W GYPGQ+GGTA+ +I+ G S+P  +LP+T YP+ Y + +P+T 
Sbjct: 569 VDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLPVTVYPETYTSLIPLTA 628

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
           M++RP+ ++  PGRTYR+Y  PV+ PFGHG+ YT F                 + G   +
Sbjct: 629 MSLRPTSAR--PGRTYRWYPSPVL-PFGHGLHYTTFT---------------AKFGVFES 670

Query: 652 TISGKAIKVTHAKCNRLTL----GVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH----K 703
                A  V++     L L     V V V N G     +  LVF     G + P     K
Sbjct: 671 LTINIAELVSNCNERYLDLCRFPQVSVWVSNTGELKSDYVALVFVR---GEYGPEPYPIK 727

Query: 704 QLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
            LV ++++  +  G      + + V   L+ VD  G R +  G++  
Sbjct: 728 TLVGYKRIRDIEPGTTGAAPVGV-VVGDLARVDLGGNRVLFPGKYEF 773


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 245/739 (33%), Positives = 391/739 (52%), Gaps = 85/739 (11%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F    L + +R++DLI RL+L+EK + +++ + A+ RLGI  Y+WW+EALHG+   G   
Sbjct: 34  FYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSGV-- 91

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
                   AT FPQ I   ++F+  L   +   +SDEARA +N            GLT+W
Sbjct: 92  --------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFW 143

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PNVNIFRDPRWGRGQET GEDP L+ K   ++V+GLQG +   LK AA+ KH+  +   
Sbjct: 144 TPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQGDNDKYLKTAAAAKHYAVH--- 200

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R  FNA VS++D+ +T+   F+  V +  V ++MC+YN  NG P CA+  ++
Sbjct: 201 --SGPEKLRHEFNADVSEKDLWETYLPAFKTLV-DANVETIMCAYNSTNGEPCCANNRLI 257

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
              +R +W  NG++VSDC ++  +        +PE AAA A+  G++L+CG       + 
Sbjct: 258 NDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTYNFLAK- 316

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           AV+ GL+SE  ++  L   L  + +LG+FD E  S PY  +G + + + +H+ LA E AR
Sbjct: 317 AVEDGLVSEELVDKRLHKLLETRFKLGLFDPE-ESNPYNKIGVEVMNSDEHRALARETAR 375

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---- 447
           + IVLLKN G  LPL +   +   + GPN+     ++GNY G+     T L+GI +    
Sbjct: 376 KSIVLLKNDG-VLPLKNNLSKYF-ITGPNATNIEVLLGNYHGVNPDMVTVLEGIAKAIKP 433

Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDR 498
            ++  ++ G +     ++    A+ +A   +DAT +VMG+   +E E           DR
Sbjct: 434 ESQLQYRMGTRLNLPNENPQDWASPNAG-NSDATFVVMGISGLLEGEEGESIASPTFGDR 492

Query: 499 AGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
               LP  Q + + KVS A++  P + ++  G P+++           ++W  YPG+ GG
Sbjct: 493 MDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGSPMNLTEVHKLADAVLLVW--YPGEEGG 550

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            A+ADI+FG ++P G+LP+T+         PMT   +   +     GRTY++     +YP
Sbjct: 551 NAVADIIFGKNSPSGRLPITF---------PMTIEDLPAYEDYTMEGRTYKYMDVVPMYP 601

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG--KAIKVTHAKCNRLTLGVQVD 675
           FG+G+SYT+F ++            +     I+ T +G  +A +V           VQV 
Sbjct: 602 FGYGLSYTDFEYSEIKLSKDKIKKKESVEARISVTNTGDFEADEV-----------VQVY 650

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           +K+V                A    P+ +LVAF+ +H+  G  + +   I   + LS +D
Sbjct: 651 LKDV---------------KASSRVPNFELVAFKNIHLKRGESKELTFEI-TPEMLSFID 694

Query: 736 RSGTRRIPLGEHNIHIGGT 754
            +G  ++  G   I+IGG+
Sbjct: 695 DNGKEKLEKGAFEIYIGGS 713


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 254/738 (34%), Positives = 381/738 (51%), Gaps = 87/738 (11%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
                 +L+ ++ L EK   L   A A+ RLGI  Y WW+EALHGV+  G          
Sbjct: 4   FKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV--------- 54

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIF 160
            AT FPQ I  A+ F+      I  +++ E RA YN  +         G+T+W+PN+NIF
Sbjct: 55  -ATVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIF 113

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGRG ET GEDP L+ +   ++++GLQG +  D LK AA  KHF  +        D
Sbjct: 114 RDPRWGRGHETYGEDPYLTARLGVAFIKGLQGDENEDYLKAAACAKHFAVHSGPE---ED 170

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R HF+A VSK+D+ +T+   F   V E  V  VM +YN+VNG P C    +L   ++ +W
Sbjct: 171 RHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDILKKDW 230

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
             +GYIVSDC ++  ++     T T  E+AA AI  G +L+CG    LH   A Q GL+ 
Sbjct: 231 GFDGYIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGNTY-LHMLEAHQEGLVK 289

Query: 340 EIDINNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           E  I  A    + ++M+LG+FD   + +  PY      + C   H+E+ALEA+R+ +V+L
Sbjct: 290 EEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYA----VNDCKV-HREVALEASRRSMVML 344

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIH 453
           KN G  LPL+  + +++ +IGP ++    + GNY G A  YTT ++GI  Y     R  +
Sbjct: 345 KNDGI-LPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDDVRVYY 403

Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
            +GC        ++A  +D+   A I  + Q+D  +L +GLD +IE E           D
Sbjct: 404 SEGCHLFANGMSNLAWENDREAEALI-VAEQSDVVVLCLGLDSTIEGEQGDTGNAFAGGD 462

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           +  L L GRQQ+L+ KV    K P ILVL +G  + + +A  D    AI    YPG  GG
Sbjct: 463 KLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAIFQTWYPGAQGG 519

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVY 616
            A+A +LFG  +P GKLP+T+Y       LP  E  +M+         RTYR+     +Y
Sbjct: 520 KALAQLLFGEYSPSGKLPVTFYKT--TEELPAFEDYSMKD--------RTYRYMPNEALY 569

Query: 617 PFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           PFG+G+SY +  V +V          LDG  G           ++T+    +    V+V+
Sbjct: 570 PFGYGLSYADIKVQSVK--------VLDGAKGE----------EITNFSAGQTKYKVKVE 611

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N  + D    + ++       +A P+  L +F+ V + AG  + V +N+   K  +V+
Sbjct: 612 LENKSNVDSYDVVQIYIKDMESQYAVPNFSLCSFKSVFLKAGESKEVTLNVGE-KAFTVI 670

Query: 735 DRSGTRRIPLGEHNIHIG 752
           +  G R +   +  + IG
Sbjct: 671 NEEGKRIVDSKKFKLFIG 688


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 256/735 (34%), Positives = 384/735 (52%), Gaps = 94/735 (12%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
           + +   ++I R+++ EK+  L++ + A+  LGIK Y+WWSE LHGV   G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
            AT FPQ I   ++F+  L   IG  V+ E RA +N           AGLT+WSPNVNIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RDPRWGRG ET GEDP+LSG    +YVRGLQG D   LK  A  KH+  +     +G + 
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194

Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
               A +  S++D+ +T+   F+M V +G+V +VM +YN+V G P      +L   +R  
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
           W  NG+IVSDCD++  +Y    +  TPEEA A AI+AGL+++CG     HT  A+Q    
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
           +GLL+E D++ AL   +  +++LG+   EP S  PY      ++C+P H  LAL AA + 
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
           +VLLKN G  LPL     RT+ V GP +     ++GNY G++  Y+T LQGI  R +   
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425

Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
                       ++L     A++ +  A+  I+VMG + ++E E           DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAI 560
            LP  Q   + +V     G  ++VL  G PID+   +   ++A A++ A YPGQ GG A+
Sbjct: 486 GLPASQMNYLRRVKARKGGRIVVVLTGGSPIDL---REISKLADAVVMAWYPGQEGGEAL 542

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
            D+LFG  N  G+LP+T +P + + +LP   + +M         GRTY++  G V+YPFG
Sbjct: 543 GDLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFG 592

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G+SY    +T                   +A + G+  K          L V+V + N 
Sbjct: 593 YGLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNN 626

Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G +        + +TP AG  +P   LV F +V +P  +  +    I V + L  +   G
Sbjct: 627 GDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTIQSDG 685

Query: 739 TRRIPLGEHNIHIGG 753
           + ++  G + + IGG
Sbjct: 686 SSKLLKGNYTLTIGG 700


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 248/741 (33%), Positives = 386/741 (52%), Gaps = 82/741 (11%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P   RV DLI +L+L+EKV  ++  + +VPRL +  Y WW+EALHGV+  G         
Sbjct: 30  PTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-------- 81

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNI 159
             AT FPQ I   ++F+  L +     +SDEARAMYN            GLT+W+PN+NI
Sbjct: 82  --ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFWTPNINI 139

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           FRDPRWGRGQET GEDP L+ +   ++++GLQG D + LKVAA  KHF  +     +G +
Sbjct: 140 FRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDDPEHLKVAACAKHFAVH-----SGPE 194

Query: 220 RFH--FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
           R    FNA  S +D+ +T+   F+  V   +V +VMC+YN+ N    C    +L + +R 
Sbjct: 195 RLRHSFNAIASPKDLRETYLPAFKALV-NARVEAVMCAYNRTNSEVCCGSNLLLDQILRD 253

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   G++VSDC ++  +Y          EA A A++ G+DL+CG       E AV+RGL
Sbjct: 254 EWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLNCGDEYPALIE-AVKRGL 312

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           ++E +I+ AL   L  + +LG+FD + +S PY ++    + + DH+ LA E A + IVLL
Sbjct: 313 ITEKEIDKALATLLKTRFKLGLFDPKQNS-PYNNIPVSVINSTDHRALAKEVALKSIVLL 371

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGC 457
           KN+   LPL +   +   + GPN+     ++GNY G+    +T L+GI    +   Q   
Sbjct: 372 KNE-KCLPLKNNLSKYY-ITGPNAASVDALMGNYYGVNPHMSTILEGIAGAIQPGSQMQY 429

Query: 458 KDVACAD---DQLFGAAIDASRQADATILVM---GLDQSIEAEAL------DRAGLLLPG 505
           K     D   +         ++ +D T +VM   GL +  E EA+      DR    LP 
Sbjct: 430 KPGILLDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDYNLPK 489

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            Q + + K+   +K   + ++  G P++++         A++ A YPG+ GG A+ADILF
Sbjct: 490 NQIDFLRKIRKGNKNKVVAIITGGSPMNLSEVHE--LADAVLLAWYPGEEGGNAVADILF 547

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G  +P G+LP+T +P+ +    P  + +M+        GRTYR+     +Y FG+G+SY+
Sbjct: 548 GKVSPSGRLPVT-FPKSFAQLPPYEDYSMK--------GRTYRYMTAEPMYTFGYGLSYS 598

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
            + ++                   + T+S K IK        +T+  +  V N G  +G 
Sbjct: 599 TYTYS-------------------SLTLSEKQIK------KNMTIIAETMVTNTGKMEGE 633

Query: 686 HTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
             + ++ T P     P   L  F++V++ AG  ++V   I     +  VD +G+  +  G
Sbjct: 634 EVVQLYITVPQTEKNPQYSLKGFKRVNLKAGESRKVQFQI-TPDLMKSVDANGSEVLLSG 692

Query: 746 EHNIHIGG---TKHSVSLHAA 763
            + + IGG   +K S+SL AA
Sbjct: 693 SYVVRIGGASPSKRSLSLGAA 713


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 246/656 (37%), Positives = 356/656 (54%), Gaps = 53/656 (8%)

Query: 18  SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
           +S+G AA  P AC   D   +  PFC  SL   +RV DL+ RL L EKV LL + A+   
Sbjct: 14  ASAGHAASIPRAC--SDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKG 71

Query: 76  -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
            +  +G+  Y W +  +HGV +   GT        ATSFP  +   + F+      + +V
Sbjct: 72  NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPQAVFDMAQV 124

Query: 135 VSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           +  E RA++  G           GL  WSPN+NI RDPRWGR  ETP EDP+++ KY  +
Sbjct: 125 IGWELRALWLEGARENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 184

Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           Y RGLQ G D   L+   + KH+ AY  ++++G+DR  FNA+VS+ D  DT+   F   V
Sbjct: 185 YTRGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASV 244

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           +EGK   VMCSYN VNG+P CA+  +  + +R     +GYI SD  ++   Y  +H+T +
Sbjct: 245 VEGKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKS 304

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             EA   AI +G D++ G          V  G L E  +++A+  TL ++  LG+FD   
Sbjct: 305 LCEAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 363

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
             QPY H+ P +V   + ++L+LE  R+ IVLL+N G  LPL   + + +AVIGP++   
Sbjct: 364 DDQPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLR--KGKKLAVIGPHAKAK 421

Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGCKDVACADDQLFGAAID 473
             ++GNY G  C           TPL+ I      + T++ +G   +       F AA  
Sbjct: 422 RALLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKG-SGINDTSTADFDAAEA 480

Query: 474 ASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI- 532
           A+R ADA +L +G+D SIE EA DR  + +P  Q +L+ +V  A K PT++VL +GG + 
Sbjct: 481 AARGADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVG 539

Query: 533 -DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
            +      D     +  A YPG  G  A++DILFG + P GKLP+T YP  YI ++ M  
Sbjct: 540 AEELILHTD----GVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYINSVDMKS 595

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV----ANAPTVVAVPLD 643
           M+M      +YPGR+YR+YK   V+PFG G+SYT F   +     + P V+   LD
Sbjct: 596 MSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTKFTLALDGEMPDDPIVITRDLD 646


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 238/615 (38%), Positives = 336/615 (54%), Gaps = 38/615 (6%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +LP  +R   L+  L+ +EK++ L+S A   PR+G+  Y WWSEALHGV++  PGT+
Sbjct: 29  CDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEALHGVAHA-PGTQ 87

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F    GDF  +TSFP  +  A++F+  L EA+G V+  EARA  N G +GL YW+PNVN 
Sbjct: 88  FRDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEARAFGNAGWSGLDYWTPNVNP 147

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-----------GSDGDRLKVAASCKHFT 208
           FRDPRWGRG ETPGED V   +YAAS +RGL+           GS G+  +V ++CKH+ 
Sbjct: 148 FRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFGSGGEPPRVISTCKHYA 207

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
             D ++WNG  R  F+A +S QD+ + +  PF+ C  + +V SVMC+YN VNGVP+CA+ 
Sbjct: 208 GNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNGVPSCANS 267

Query: 269 NILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
            ++   +RG W     + Y+ SDC++V       H+  T  E       AG+D  C    
Sbjct: 268 YLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYADTNAEGTGLCFEAGMDTSCEYEG 327

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 A   G L+   ++ AL       +R+G FDG  S  P+  LG  DV  P+ QEL
Sbjct: 328 SSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPES--PHASLGWADVNRPEAQEL 385

Query: 386 ALEAARQGIVLLKNQGPSLPL---------SHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           AL AA +GIVLLKN   +LPL         +    R VA+IG  +D    + G Y+G   
Sbjct: 386 ALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFGGYSGAPP 445

Query: 437 GYTTPLQGIGRYARTIHQQGCK----DVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
              +P     +    +   G      D    +D     A++A+  AD  +   GLD S  
Sbjct: 446 FARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFGGLDTSAA 505

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
            E  DR  +  P  Q  L+S+++   K P ++V M     D    + D  + A++WA +P
Sbjct: 506 GETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPLFELD-GVGAVLWANWP 563

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           GQ GGTA+  +L G  +P G+LP+T YP  Y   +P+T+M +RPS +   PGRTYR+Y  
Sbjct: 564 GQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMTLRPSATN--PGRTYRWYPT 621

Query: 613 PVVYPFGHGMSYTNF 627
           P V PFG G+ YT F
Sbjct: 622 P-VRPFGFGLHYTTF 635


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/756 (34%), Positives = 384/756 (50%), Gaps = 49/756 (6%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
           F   L++L AS +  A       D  +   +    C V+    +R   L+  +   EK+ 
Sbjct: 3   FLSALVVLGASLANAA----IGPDCANGPLKDNKICDVTAAPAERAAALVEAMQTNEKLD 58

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
            L+ G   V RLG+  Y WW EALHGV+   PG  F G +  ATSFP  +  +++F+  L
Sbjct: 59  NLMRG---VTRLGLPKYNWWGEALHGVAG-APGINFTGAYKTATSFPMPLLMSAAFDDDL 114

Query: 128 WEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYV 187
              I  ++ +EARA  NGG A + +W+P++N FRDPRWGRG ETPGED V    Y    +
Sbjct: 115 IFKIANIIGNEARAFGNGGVAPVDFWTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLL 174

Query: 188 RGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
            GL+G    R K+ A+CKH+  YD++ W G+DR  FNAK++ QD+ + +  PF+ C  + 
Sbjct: 175 AGLEGDKPQR-KIIATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDS 233

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTST 304
           KV S MCSYN VNGVPTCAD  +L+  +R  W     N YI SDC++V        +  T
Sbjct: 234 KVGSFMCSYNAVNGVPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKT 293

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             E    A  AG+D  C          A  +  LS   I+ AL       +R G FDG  
Sbjct: 294 NAEGTGLAFTAGMDNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAA 353

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
           ++  Y +LG KD+ TP+ Q+L+L+ A +G+VLLKN   +LPLS      VA++G  ++ T
Sbjct: 354 AT--YANLGVKDINTPEAQQLSLQVASEGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDT 410

Query: 425 VTMIGNYAGIACGYTTPL---QGIGR----YARTIHQQGCKDVACADDQLFGAAIDASRQ 477
             + G Y+G A    +P+   Q +G      +  I QQ     +   D     A+ A+ +
Sbjct: 411 SKLSGIYSGPAPYLRSPVWAGQKLGLDMAIASGPILQQSN---SSTRDNWTTNALAAAEK 467

Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
           +D  +   GLD S  AE  DR  +  P  Q +L+ K  +A+ G  ++VL+ G  +D +  
Sbjct: 468 SDYILYFGGLDPSAAAEGFDRNSIAWPTAQVDLIKK--LAAIGKPLVVLVLGDLMDNSPL 525

Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
                + ++IWA +PGQ GG+A+  ++ G     G+LP+T YP  Y T L M +M MRPS
Sbjct: 526 LELDGVNSVIWANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANY-TELSMLDMNMRPS 584

Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
            S   PGRTYR++ G  V PFG G+ YT F    A             + +I   IS   
Sbjct: 585 SSS--PGRTYRWFNG-AVQPFGTGLHYTTFDAKFA------------ANSTIEYDISNIT 629

Query: 658 IKVTHAKCNRLTL-GVQVDVKNVGSKDGAHTLLVF---STPPAGHWAPHKQLVAFEKVHV 713
            + T+   +  ++  + V V N G++      L F      PA +  P K L+++ +V  
Sbjct: 630 KECTNQYPDTCSVPSIPVAVTNSGNRTSDFIALAFIKGENGPAPY--PLKTLISYTRVRD 687

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
             G Q +          L+ VD+ G   +  GE+ +
Sbjct: 688 VKGGQTKSAEMQLTLGNLARVDQMGNTVLYPGEYTV 723


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 256/734 (34%), Positives = 381/734 (51%), Gaps = 92/734 (12%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
           + +   ++I R+++ EK+  L++ + A+  LGIK Y+WWSE LHGV   G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
            AT FPQ I   ++F+  L   IG  V+ E RA +N           AGLT+WSPNVNIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RDPRWGRG ET GEDP+LSG    +YVRGLQG D   LK  A  KH+  +     +G + 
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194

Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
               A +  S++D+ +T+   F+M V +G+V +VM +YN+V G P      +L   +R  
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
           W  NG+IVSDCD++  +Y    +  TPEEA A AI+AGL+++CG     HT  A+Q    
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
           +GLL+E D++ AL   +  +++LG+   EP S  PY      ++C+P H  LAL AA + 
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
           +VLLKN G  LPL     RT+ V GP +     ++GNY G++  Y+T LQGI  R +   
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425

Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
                       ++L     A++ +  A+  I+VMG + ++E E           DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
            LP  Q   + +V     G  ++VL  G PID+   K      A++ A YPGQ GG A+ 
Sbjct: 486 GLPASQLNYLRRVKARKGGRIVVVLTGGSPIDL--RKISKLADAVVMAWYPGQEGGEALG 543

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           D+LFG  N  G+LP+T +P + + +LP   + +M         GRTY++  G V+YPFG+
Sbjct: 544 DLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFGY 593

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SY    +T                   +A + G+  K          L V+V + N G
Sbjct: 594 GLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNNG 627

Query: 681 SKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
            +        + +TP AG  +P   LV F +V +P  +  +    I V + L  +   G+
Sbjct: 628 DRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTIQSDGS 686

Query: 740 RRIPLGEHNIHIGG 753
            ++  G + + IGG
Sbjct: 687 SKLLKGNYTLTIGG 700


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 247/705 (35%), Positives = 374/705 (53%), Gaps = 98/705 (13%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++   L+  ++L+E+   L   + A+ RLG+  Y WW+EALHGV+  G           A
Sbjct: 8   KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------A 57

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
           TSFPQ I  A++F+  L + +  V+++E RA YN  +         GLT+WSPNVNIFRD
Sbjct: 58  TSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRD 117

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+ +   ++V+GLQG +G  LK AA  KHF  +        DR H
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG--LKTAACAKHFAVHSGPE---ADRHH 172

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+A+VS++D+ +T+   F   V E +V SVM +YN+ NG P C  P ++K  +R +W   
Sbjct: 173 FDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDILREKWGFQ 232

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G+ VSDC ++  +++    TST +E+AA A+++G DL+CG    LH   A Q GL++E +
Sbjct: 233 GHYVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGNTY-LHILMAYQNGLVTEEE 291

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           I  A     T +  LG+FDG      Y  +  + V +  H  +A EA  + IVLLKN G 
Sbjct: 292 ITTAAERLFTTRYLLGLFDG----STYDAIPYEVVESKPHLSVADEATAKSIVLLKNNG- 346

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQG-- 456
            LPL+    +T+ VIGPN++    +IGNY G +  Y T L+G    +G   R ++ +G  
Sbjct: 347 LLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRILYSEGSH 406

Query: 457 -----CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLL 502
                 + +A   D+L  A I  ++ +D  I+ +GLD+++E E         + D+  L 
Sbjct: 407 LYADRVEPLAYQRDRLSEAKI-VAKHSDVVIVCVGLDETLEGEEGDTGNAYASGDKRDLA 465

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LP  QQELV  ++   K P IL L +G  ID+ +A  D    A++ A YPG  GG  IA 
Sbjct: 466 LPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQAWYPGARGGQVIAK 522

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
            L G   P GKLP+T+Y    ++ LP  E            GRTYR+ +   +YPFG+G+
Sbjct: 523 ALLGEIVPSGKLPVTFYRD--LSGLPAFE-------DYSMQGRTYRYMQEEALYPFGYGL 573

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV-THAKCN-RLTLGVQVDVKNVG 680
           +Y                   G+     A+    +++V  H + + +L   VQ+ +KN+ 
Sbjct: 574 TY-------------------GKCRIEEASYDQGSLRVLVHNEVDFKLEEVVQLYIKNLD 614

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           S+        F+        P+  L  F++V + AG  + + IN+
Sbjct: 615 SE--------FAV-------PNHSLCGFKRVSLEAGETKEIQINV 644


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 262/774 (33%), Positives = 396/774 (51%), Gaps = 84/774 (10%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           L L+S +S        FA  P      + PF  VSLPI  R+NDLI RL++++ +   ++
Sbjct: 11  LTLISTASFSQDNVLRFA--PSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVA 68

Query: 72  G----AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
                   + RLGIK  E+ +E L GV               AT FPQ +  A+SF+  L
Sbjct: 69  RYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGLAASFSRDL 117

Query: 128 WEAIGRVVSDEARAMYNG-------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
            + +   VS E RA YN        G  G+T +SP +NI R P WGR QET GEDP LSG
Sbjct: 118 MQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSG 177

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           + A+ YV GLQG D   L+V+A CKHF A+   +   V +F F+AK+ ++D++ TF   F
Sbjct: 178 ELASQYVSGLQGDDPRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAF 237

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           + C+   K  +VMCS+N +NGVP+CA+  +L   +R +W   G++VSD  +V   +   H
Sbjct: 238 KKCIA-AKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHH 296

Query: 301 FTSTPEEAAADAIRAGLDLD-CGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRL 357
           + S+ E AA +AI++G +++  G F   + +   A+   L+++ ++   +      +  L
Sbjct: 297 YNSSFETAAVEAIKSGCNMELVGKFDPSYWQLTKALNEHLITKDELMENVRPVFLTRFLL 356

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G FD  P+  P+  +    V + +HQ LALEAA +  VLLKN    LPL     +TVAV+
Sbjct: 357 GEFD-PPALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVV 415

Query: 418 GPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIH-QQGCKDVACADDQL--FGAAI 472
           GP S+ T  +IG+Y+         TPL GI + A  +    GC +  C D +     AA+
Sbjct: 416 GPMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAAAV 475

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGP 531
           D    A    + +G    +EAE  DR+ ++LPG Q +L+      + G P +L+L +GGP
Sbjct: 476 DG---AQVVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGP 532

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF---GTSNPGGKLPMTWYPQEYITNLP 588
           +DV FA+    I +I+   +P    G AI  +L    G S+P G+LP+TW    Y+  +P
Sbjct: 533 LDVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW--PAYLNQVP 590

Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
            +T+  M+        GRTYR+Y    +YPFG+G+SYT F ++      +   PL+    
Sbjct: 591 NITDYTMK--------GRTYRYYTEDPLYPFGYGLSYTQFKYS-----DLKVTPLE---- 633

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLV-----FSTPPAGHWAPH 702
                  G+ I+            V+V V N+G  D     ++      S P      P 
Sbjct: 634 ----VTKGQEIR------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPR 677

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT-RRIPLGEHNIHIGGTK 755
            QLVAF+++H+ +G  + V + I     L V     T   I  GE  ++IGG +
Sbjct: 678 WQLVAFDRIHIASGKSETVELTIE-ASLLEVWQNPETGFDILEGEMTLYIGGQQ 730


>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
          Length = 401

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 188/427 (44%), Positives = 274/427 (64%), Gaps = 34/427 (7%)

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           +G   E D++ +L N   V  ++G FDG PS   Y  L  KD+CT +H ELA +AARQGI
Sbjct: 2   QGKAREEDVDTSLRNLYIVLTQVGFFDGIPS---YESLDKKDLCTKEHIELAADAARQGI 58

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           VLLKN   +LPL   + + +A+IGP+++ T+ M+GNYAG+ C Y++PL G   Y +  ++
Sbjct: 59  VLLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYE 118

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
            GC +V C +      A++AS+ ADATIL++GLD+++E E LDR  LLLPG Q EL+ +V
Sbjct: 119 MGCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQV 178

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
            +ASKGP ILV+MSG  +D++F+K D R+ AI+WAGYPG+ GG AIAD+++G  NPGG+L
Sbjct: 179 IVASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRL 238

Query: 575 PMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           P+TW+  +Y++ LPMT M++RP  +  YPGRTY+F+ G VVYPFGHG+SYT F +T    
Sbjct: 239 PLTWHQNDYLSMLPMTSMSLRPVNN--YPGRTYKFFNGSVVYPFGHGLSYTKFNYT---- 292

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
                                  ++ ++  C +    + ++VKN+G+K G   +LV+S P
Sbjct: 293 -----------------------LRSSNMSC-KDHFELDIEVKNIGAKHGNEVVLVYSKP 328

Query: 695 PAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           P G    H KQ++ F++V VPAG  Q V    +VCK L +V  +  + +P GEH I IG 
Sbjct: 329 PTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKIIIGD 388

Query: 754 TKHSVSL 760
           +  S+ +
Sbjct: 389 SPTSLPI 395


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 266/773 (34%), Positives = 399/773 (51%), Gaps = 77/773 (9%)

Query: 18  SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
           S+  + A  P AC   +   +   FC  SL   +RV DL+ RL L EKV LL + A+   
Sbjct: 15  STPTVDASIPRACSSPEH--QQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKG 72

Query: 76  -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
            +  +G+  Y W +  +HGV +   GT        ATSFP  +   + F+      + +V
Sbjct: 73  NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQV 125

Query: 135 VSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           V  E RA++          G   GL  WSPN+NI RDPRWGR  ETP EDP+++ KY  +
Sbjct: 126 VGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 185

Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           Y +GLQ G D   L+   + KH+ AY  ++++G+DR  FNA VS+ D  DT+   F   V
Sbjct: 186 YTKGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASV 245

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           + GK   VMCSYN VNG+P CA+  +  + +R     +GYI SD  ++   Y  +H+T T
Sbjct: 246 VHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKT 305

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             EA   AI +G D++ G          V  G L E  +++A+  TL ++  LG+FD   
Sbjct: 306 LCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 364

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
             QPY H+ P +V T + ++L+L+ +R+ IVLL+N G  LPL+  + + +AVIGP++   
Sbjct: 365 DDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAK 422

Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGC--KDVACADDQLFGAA 471
             ++GNY G  C           TPL+ I      + T++ +G    D + A    F  A
Sbjct: 423 RALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAG---FDEA 479

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
             A+R+A+  +L +G+D SIE EA DR  + +P  Q +L+ +V  A K PT++VL +GG 
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538

Query: 532 I--DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
           +  +      D     ++ A YPG  G  A++DILFG + P GKLP+T YP  Y+T++ M
Sbjct: 539 VGAEELILHTD----GVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVTSVDM 594

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
             M+M      +YPGR+YR+YK   V+PFG G+SYT F            + LD   G  
Sbjct: 595 KSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFT-----------MALDSSSG-- 636

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP----PAGHWA-PHKQ 704
             T   + I VT     +L   V V + N G+  G   +  F  P      G+ A  ++Q
Sbjct: 637 -VTDPSEPIVVTR----QLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQ 691

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
           L  + +V +     +++   I     L++VD SG +    G + + I    H 
Sbjct: 692 LFDYRRVSLRPTQYRKLKFRIQQST-LAMVDDSGNQASFPGFYEVIITNGVHE 743


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 258/745 (34%), Positives = 383/745 (51%), Gaps = 96/745 (12%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           +   L+ ++ L EK  +L   + A+ RLG+  Y WWSEALHGV+  G           AT
Sbjct: 8   KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
            FPQ I  A+ F+      I  +++ EARA YN            G+T W+PN+NIFRDP
Sbjct: 58  VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDNWNGVDRFH 222
           RWGRG ET GEDP L+ +   +++ GLQG +      AA+C KHF  +        +R H
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHYWKAAACAKHFAVHSGPE---EERHH 174

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+A VSK+D+ +T+   F   V +GKVA +M +YN+VNG P C    +L+  ++ EW  +
Sbjct: 175 FDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDILKEEWGFD 234

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           GY+VSDC ++  ++     T T  E+AA AI  G  L+CG    LH   A + GL++E  
Sbjct: 235 GYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGNTY-LHMLQAYKEGLVTEET 293

Query: 343 INNALVNTLTVQMRLGMFDG--EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           I  +    + ++M+LG+FD   E +  PY      + C   H+++AL+ AR+ +VLLKN 
Sbjct: 294 ITKSAQKLMAIRMKLGLFDKNCEYNKIPY----EVNDCKV-HRDIALDVARRSMVLLKNN 348

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQG 456
           G  LPL+  + + + VIGP ++    + GNY G A  YTT L+GI  Y    AR  + +G
Sbjct: 349 GI-LPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVYYAEG 407

Query: 457 C-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           C         ++  +D+L  A I  + Q+D  IL +GLD SIE E         A D++ 
Sbjct: 408 CHLFKNSISGLSWENDRLSEALI-VAEQSDVVILCLGLDASIEGEQGDTGNAFAAGDKSD 466

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L L GRQQ L+ +V    K PTIL+L SG  + +  A+      AI+   YPGQ+GG A+
Sbjct: 467 LNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAILETWYPGQSGGKAL 523

Query: 561 ADILFGTSNPGGKLPMTWYP-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           A +LFG  +P GKLP+T+Y   E + +     MA          GRTYR+ K   +YPFG
Sbjct: 524 AQLLFGEYSPSGKLPITFYKTTEELPDFRDYSMA----------GRTYRYMKNEALYPFG 573

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G++Y             V V         +A I  + I+      N +   +Q+ V N 
Sbjct: 574 YGLNYAK-----------VEVK--------DAVIKERNIE------NEIIYEIQLQVTNQ 608

Query: 680 GSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
                   + V+       WA P+  L A++ +++ A  + ++ + I    +  +VD  G
Sbjct: 609 SEVCTYDVVQVYIKDMESRWAVPNYSLCAYKSIYLAAYDEPQITLQIKQSAF-EIVDEEG 667

Query: 739 TRRIPLGEHNIHIGGT---KHSVSL 760
            R I      + IG +   K SV L
Sbjct: 668 KRYIDSHHFKLFIGTSGPDKRSVEL 692


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 257/764 (33%), Positives = 382/764 (50%), Gaps = 67/764 (8%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +    QR + +I  +++ EK+  L+  +    RLG+  YEWWSEALHGV+   PG  
Sbjct: 43  CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVAG-SPGVN 101

Query: 103 F--GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F   G++  ATSFP  IT +S+F+    + I  V+S EARA  N    GL Y++PN+N F
Sbjct: 102 FTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYFTPNINPF 161

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL--------KVAASCKHFTAYDL 212
           +DPRWGRG ETPGEDP+    Y  + + GL+G+D            K+ A+CKHF  YDL
Sbjct: 162 KDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYKKMIATCKHFAGYDL 221

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
           ++W+G  R+ ++A+++ QD+ + +  PF+ C  +  VAS+MCSYN VN VP CA+  + +
Sbjct: 222 EDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPACANSYLQE 281

Query: 273 RTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLH 328
             +R  W     N YI SDC+++   Y   +++     AA  ++  G+D  C     G+ 
Sbjct: 282 TILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGLSLSNGMDTACIVANTGVM 341

Query: 329 TE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           T+   +   G ++E  I  AL+      +  G FD   SS PY  +G   V TP  Q LA
Sbjct: 342 TDVNGSYYGGYVTEATITTALIRQYEALVIAGYFD-PASSNPYRSIGWSSVNTPAAQTLA 400

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
            +AA +G  LLKN G  LP        VA+IG  ++ T  M G Y+G A    +PL    
Sbjct: 401 RQAATEGTTLLKNTG-LLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAAS 459

Query: 447 RYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
           +   +  +  G  +           A  A++ AD  +   G+D S+EAEA+DR  +  PG
Sbjct: 460 QLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWPG 519

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
            QQ L+++  +A+ G  ++VL  G  +D     ++  I+A++W GYPGQ GG A  DIL 
Sbjct: 520 AQQALIAQ--LAALGKPMIVLQMGSMLDATPILSNNNISALVWVGYPGQDGGVAAFDILT 577

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G   P G+LP+T YP +Y+  +PMT M++RP      PGRTY++Y   V+ PF +G+ YT
Sbjct: 578 GAVAPAGRLPVTMYPADYVNQVPMTNMSLRPGPGN--PGRTYKWYNNAVL-PFAYGLHYT 634

Query: 626 NFVHTV--------------ANA------------------------PTVVAVPLDGRHG 647
            F  T                NA                        P+ VA    G   
Sbjct: 635 TFKATFNGGPPGPGSPWSPPWNAPWSAKVRRGWGWGNWGPPNWGWTQPSQVAPGNGGLSS 694

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLV 706
           S N      +    H         V + V+N G        LVFS   AG    P+K L 
Sbjct: 695 SYNIQSLLSSCTAAHPDLCAFP-SVAISVQNAGQTTSDFVALVFSNTTAGPAPYPYKSLA 753

Query: 707 AFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
           ++ ++H V AG      +N+ +   L+  D  G + +  G +N+
Sbjct: 754 SYTRLHSVAAGQTVTASLNMTL-GVLARRDDQGNQILYPGTYNL 796


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 272/735 (37%), Positives = 380/735 (51%), Gaps = 105/735 (14%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           ++  LI  +SL+EK+ L+I  A  +PRLGI  Y WW+EALHGV+N G           AT
Sbjct: 11  QIETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------AT 60

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
            FPQ I   ++F+  L   +   +S EARA +N            GLT+W+PN+NIFRDP
Sbjct: 61  VFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDP 120

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RF 221
           RWGRGQET GEDPVL+ +   +YVRGLQGSD   L+ AA  KHF  +     +G +  R 
Sbjct: 121 RWGRGQETYGEDPVLTSRLGTAYVRGLQGSDPYYLRAAACAKHFAVH-----SGPEGLRH 175

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            FNA+VS++D+E+T+   F+  V  G V SVM +YN+VNG P C    +LK+ +R EW+ 
Sbjct: 176 TFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLKQKLREEWQF 234

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
            G++VSDC ++  ++     T+   E+ A A+R+G DL+CG       E AV +G ++E 
Sbjct: 235 QGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYNYLAE-AVLKGYVTED 293

Query: 342 DINNALVNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
           DIN A+V  L    +LG+   DG     PY  +    +    H  LALEAA + IVLLKN
Sbjct: 294 DINRAVVRLLITLDKLGLIHDDG-----PYQGITIHQIDWKKHDSLALEAAEKSIVLLKN 348

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQ 455
            G  LPL   +   + V GPN+  +  ++GNYAG++    T L+ I    G      +++
Sbjct: 349 NG-VLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEITVTYKK 407

Query: 456 GCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGR 506
           GC  +A         A   ++ AD TI VMG D S+E E  D            L L   
Sbjct: 408 GCP-LAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDLNLNDE 466

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILF 565
           Q   + K+  + K P I+VLM G PI    +     IA AI+ A YPGQAGGTA+++I+F
Sbjct: 467 QLSYLHKLKESGK-PLIVVLMGGAPI---CSPELHEIADAILVAWYPGQAGGTAVSNIVF 522

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
           G +NP GKLP+T +P+  +  LP  E       +    GRTYR+     +YPFG G+SYT
Sbjct: 523 GKTNPSGKLPVT-FPKS-VRQLPEFE-------NYSMQGRTYRYMTEEPLYPFGFGLSYT 573

Query: 626 --NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
              F H              GR  S                  +  L V  ++ N G+ D
Sbjct: 574 KMEFKHVT------------GRWKS----------------PEKDELIVSTELYNQGTID 605

Query: 684 GAHTLLVFSTPPAGHW------APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           G   + ++      HW       P+  L+ F++V V AGA       I + K L  +D S
Sbjct: 606 GEEVVQLYY-----HWKDAPFAVPNWSLIDFKRVLVAAGASCICEFKIPLEK-LQCIDPS 659

Query: 738 GTRRIPLGEHNIHIG 752
           G   IP G    ++G
Sbjct: 660 GKGVIPTGTLQFYVG 674


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/450 (42%), Positives = 288/450 (64%), Gaps = 9/450 (2%)

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           +D++CG +L  +T+SAV++  +SE +I+ AL N  +++MRLG+F+G P+  PYG +    
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           VC+ +HQ +ALEAAR GIVLLKN    LPLS  +  ++A+IGPN+D +  ++GNYAG  C
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 437 GYTTPLQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
              TP QG+  Y +T  +  GC  VAC+   +   AI  +++AD  +LVMGLDQ+ E E 
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAI-DQAIKIAKEADQVVLVMGLDQTQEREE 179

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DR  L+LPG+QQEL+  V+ A+K P +LVL+ GGP+D++FAK D  I  I+WAGYPG+A
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+A+I+FG  NPGG+LP+TWYPQ++ T +PMT+M MRP  S  YPGRTYRFYKG  V
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDF-TKVPMTDMRMRPQPSSGYPGRTYRFYKGKKV 298

Query: 616 YPFGHGMSYTNFVH---TVANAPTVVAVPLDGRHGSINATISGKAI-KVTHAKCNRLTLG 671
           + FG+G+SY+N+ +   +V      +   +D +  + ++ I  K I ++    C R    
Sbjct: 299 FEFGYGLSYSNYSYELVSVTQNKISLRSSIDQKAEN-SSPIGYKTISEIEEELCERSKFS 357

Query: 672 VQVDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V V VKN G   G H +L+F+     G   P K+L+AF+ V + AG    +   ++ C++
Sbjct: 358 VTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNPCEH 417

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           LS  +  G   +  G   + +G  ++ +++
Sbjct: 418 LSRANEDGLMVMEEGSQYLLVGDKEYPINI 447


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 252/748 (33%), Positives = 380/748 (50%), Gaps = 80/748 (10%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R++ L+  ++L+EK+  L SG+  V RL I  Y WW+EALHGV+  G   
Sbjct: 26  WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   +T FPQ I  A++F+  L + +   +SDEARA +N        G  AGLT+W
Sbjct: 84  --------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PNVNIFRDPRWGRGQET GEDP L+ +   ++V+GLQG+    LK AA  KHF  +   
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNHPKYLKSAACAKHFAVH--- 192

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R HFNA  SK+D+ +T+   F   V +  V  VM +YN V GVP  +   +L
Sbjct: 193 --SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGVPAGSSEFLL 250

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
           K T+R  W  +GYIVSDC ++G  +       T  EAAA A++AG++L+CG       E 
Sbjct: 251 KETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCGYVYNGALEK 310

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           AVQ+GL+SE  I+  L   L  + +LG FD +  + PY  +    + + DH  LA + A+
Sbjct: 311 AVQQGLVSEELIDTRLKQLLKTRFKLGFFDPK-EANPYNAIPTSVIHSDDHIALARKTAQ 369

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART 451
           + IVLLKN+  +LPL     +   V GP +  +  ++ NY G+     + L+GI      
Sbjct: 370 KSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTTNLVSVLEGIADKVSL 428

Query: 452 IHQQGCKDVACADDQLFGA---AIDASRQADATILVMGLDQSIEAEAL---------DRA 499
                 +  A   ++       A + ++ ADA I V+GL    E E +         D+ 
Sbjct: 429 GTSLNYRMGALPFNKNLNPKNWAPNVAKTADAVIAVVGLSADFEGEEVDAIASPNKGDKK 488

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q + V +++   KGP ILV+ SG    VA  +      AI+   YPG+ GG A
Sbjct: 489 DLKLPQNQIDYVKEMAAKKKGPLILVVASGSA--VALGELYDLADAIVLMWYPGEQGGNA 546

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           +AD+LFG  +P G LP+T +P+      P  + +M+        GRTY++ +   ++PFG
Sbjct: 547 VADVLFGDVSPSGHLPVT-FPKSVAQLPPFEDYSMQ--------GRTYKYMEEEPLFPFG 597

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
            G+SYT+F  +                   N  IS + IK       + +  V   V N 
Sbjct: 598 FGLSYTDFKFS-------------------NVQISEEKIK------KKDSFTVSCSVANN 632

Query: 680 GSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G  DG   + ++  P  +    P  QL+ F+++ +     + V  N+   K L  V++ G
Sbjct: 633 GKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKNTSKTVSFNLE-AKDLFQVNKEG 691

Query: 739 TRRIPLGEHNIHIGG---TKHSVSLHAA 763
            +    G++ + +     +K S  L AA
Sbjct: 692 KKTWIKGKYKLVVANALPSKRSQDLGAA 719


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 265/773 (34%), Positives = 398/773 (51%), Gaps = 77/773 (9%)

Query: 18  SSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-- 75
           S+  + A  P AC   +   +   FC  SL   +RV DL+ RL L EKV LL + A+   
Sbjct: 15  STPTVDASIPRACSSPEH--QQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKG 72

Query: 76  -VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRV 134
            +  +G+  Y W +  +HGV +   GT        ATSFP  +   + F+      + +V
Sbjct: 73  NMSSIGLPEYNWGANCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQV 125

Query: 135 VSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           V  E RA++          G   GL  WSPN+NI RDPRWGR  ETP EDP+++ KY  +
Sbjct: 126 VGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVA 185

Query: 186 YVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
           Y +GLQ G D   L+   + KH+ AY  ++++G+DR  FNA VS+ D  DT+   F   V
Sbjct: 186 YTKGLQEGKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASV 245

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
           + GK   VMCSYN VNG+P CA+  +  + +R     +GYI SD  ++   Y  +H+T T
Sbjct: 246 VHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKT 305

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             EA   AI +G D++ G          V  G L E  +++A+  TL ++  LG+FD   
Sbjct: 306 LCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PI 364

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
             QPY H+ P +V T + ++L+L+ +R+ IVLL+N G  LPL+  + + +AVIGP++   
Sbjct: 365 DDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAK 422

Query: 425 VTMIGNYAGIACG--------YTTPLQGI---GRYARTIHQQGC--KDVACADDQLFGAA 471
             ++GNY G  C           TPL+ I      + T++ +G    D +      F  A
Sbjct: 423 RALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTGG---FDEA 479

Query: 472 IDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
             A+R+A+  +L +G+D SIE EA DR  + +P  Q +L+ +V  A K PT++VL +GG 
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538

Query: 532 I--DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
           +  +      D     ++ A YPG  G  A++DILFG + P GKLP+T YP  Y+T++ M
Sbjct: 539 VGAEELILHTD----GVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVTSVDM 594

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
             M+M      +YPGR+YR+YK   V+PFG G+SYT F            + LD   G  
Sbjct: 595 KSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFT-----------MALDSSSG-- 636

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP----PAGHWA-PHKQ 704
             T   + I VT     +L   V V + N G+  G   +  F  P      G+ A  ++Q
Sbjct: 637 -VTDPSEPIVVTR----QLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQ 691

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
           L  + +V +     +++   I     L++VD SG +    G + + I    H 
Sbjct: 692 LFDYRRVSLRPTQYRKLKFRIQQST-LAMVDDSGNQASFPGFYEVIITNGVHE 743


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 248/727 (34%), Positives = 376/727 (51%), Gaps = 79/727 (10%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
           + Q+   LI  ++L EK+  +++    +PRLGIK Y+WW+E LHGV   G          
Sbjct: 9   MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR--------- 59

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
            AT FPQ I   ++FN  L   IG  ++ E RA YN            GLT+WSPN+NIF
Sbjct: 60  -ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RDPRWGRG ET GEDP L+G    +YV+G+QG+D   LKVAA  KH+  +         R
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGNDPFYLKVAACGKHYAVHSGPE---ATR 175

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
              N   +K+D+ +T+   F+M V +G V ++M +YN+V G        +L   +R +W 
Sbjct: 176 HEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLTDVLRKQWG 235

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G+IVSDCD+V   +       T  EA A AI+AGL+++CG       ++  Q+ LL+E
Sbjct: 236 FRGHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTFEAMKQAVAQK-LLTE 294

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
            +I+ AL+  +  +++LG+ + +    PY  +   ++C+P+H  LA +AA + +VLLKN 
Sbjct: 295 QEIDRALLPLMMTRLKLGILEYDAEC-PYNEVKETEICSPEHIALARKAATESMVLLKNN 353

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART---IHQQGC 457
           G  LPL    H T+ + GP +  +  ++GNY GI+  Y T LQGI     +   ++ +  
Sbjct: 354 G-ILPLDKNLH-TLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSGTAVNFRPA 411

Query: 458 KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGLLLPGRQQ 508
              +         A+D +  A+ TI+VMG + ++E E           DR  + LP  Q 
Sbjct: 412 FGESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDRVSMRLPASQM 471

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGT 567
           + +  +  A K   ++VL  G PIDV   +   R+A A++ A YPGQ GG A+AD+LFG 
Sbjct: 472 KFLRDLK-ARKNGIVVVLTGGSPIDV---REISRLADAVVMAWYPGQEGGYALADLLFGD 527

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            N  G+LP+T+         P +  A+ P +     GRTY++    + YPFG+G+SYT  
Sbjct: 528 ENFSGRLPVTF---------PESTDALPPFEDYAMKGRTYKYQTAHIQYPFGYGLSYTTV 578

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
            +  A   T   +P  GR  +++A +                       KN G+K     
Sbjct: 579 TYAHAKVET---MPQKGRGMTVSAVL-----------------------KNTGNKAVDEV 612

Query: 688 LLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
             V+ S P AG  A    LVAF+++ +  G QQ V  +I   + L+ V   GT ++  G 
Sbjct: 613 AQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPFDRLLT-VQEDGTAQLLKGN 671

Query: 747 HNIHIGG 753
           + I +GG
Sbjct: 672 YTITVGG 678


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 256/735 (34%), Positives = 383/735 (52%), Gaps = 94/735 (12%)

Query: 49  IPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP 108
           + +   ++I R+++ EK+  L++ + A+  LGIK Y+WWSE LHGV   G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIF 160
            AT FPQ I   ++F+  L   IG  V+ E RA +N           AGLT+WSPNVNIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           RD RWGRG ET GEDP+LSG    +YVRGLQG D   LK  A  KH+  +     +G + 
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHYAVH-----SGPEG 194

Query: 221 FHFNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
               A +  S++D+ +T+   F+M V +G+V +VM +YN+V G P      +L   +R  
Sbjct: 195 TRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKS 254

Query: 279 WRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQ---- 334
           W  NG+IVSDCD++  +Y    +  TPEEA A AI+AGL+++CG     HT  A+Q    
Sbjct: 255 WGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECG-----HTFKAMQGALD 309

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQG 393
           +GLL+E D++ AL   +  +++LG+   EP S  PY      ++C+P H  LAL AA + 
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGIL--EPDSACPYNSYDESEICSPAHTALALRAADEA 367

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI 452
           +VLLKN G  LPL     RT+ V GP +     ++GNY G++  Y+T LQGI  R +   
Sbjct: 368 MVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGT 425

Query: 453 HQQGCKDVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEA---------LDRAGL 501
                       ++L     A++ +  A+  I+VMG + ++E E           DR G+
Sbjct: 426 SVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGI 485

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAI 560
            LP  Q   + +V     G  ++VL  G PID+   +   ++A A++ A YPGQ GG A+
Sbjct: 486 GLPASQLNYLRRVKARKGGRIVVVLTGGSPIDL---REISKLADAVVMAWYPGQEGGEAL 542

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
            D+LFG  N  G+LP+T +P + + +LP   + +M         GRTY++  G V+YPFG
Sbjct: 543 GDLLFGDKNFSGRLPIT-FPAD-VDSLPAFDDYSMN--------GRTYKYMSGNVMYPFG 592

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +G+SY    +T                   +A + G+  K          L V+V + N 
Sbjct: 593 YGLSYGRVTYT-------------------DARVVGRIKKGE-------PLAVEVVLTNN 626

Query: 680 GSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G +        + +TP AG  +P   LV F +V +P  +  +    I V + L  V   G
Sbjct: 627 GDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPERLMTVQSDG 685

Query: 739 TRRIPLGEHNIHIGG 753
           + ++  G + + IGG
Sbjct: 686 SSKLLKGNYTLTIGG 700


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 260/716 (36%), Positives = 377/716 (52%), Gaps = 72/716 (10%)

Query: 13  ILLSASSSGL----AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           +LL+ +S+ L    AA  P AC  K    + LPFC  SL    RV DL+ RL LQEK  L
Sbjct: 6   LLLAIASASLVTPTAATIPRACVGK--ANQELPFCNTSLSTADRVEDLLSRLPLQEKATL 63

Query: 69  LISGAAAVPR-----LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSF 123
           L   A A PR     +G+  Y W +  +HGV      +  G + P  TSFP  +   + F
Sbjct: 64  LT--ARASPRGNMSSIGLPEYNWGANCVHGVQ-----STCGTNCP--TSFPNPVNLGAIF 114

Query: 124 NATLWEAIGRVVSDEARAM--------YNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGE 174
           +  +   + +V+  E RA+        Y GG   GL  WSPN+NI RDPRWGR  ETP E
Sbjct: 115 DPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSE 174

Query: 175 DPVLSGKYAASYVRGLQGSDGDR-----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           DP+++ KY  +Y RGLQ  +G R     L+   + KH+ AY  +N+ GV+R  F+A VS 
Sbjct: 175 DPLVNSKYGVAYTRGLQ--EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSP 232

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
            D  DT+   FR  V++G    VMCSYN VNG+P CA+  +++  +RG    +GY+ SD 
Sbjct: 233 YDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDS 292

Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
            +V    D  H+  +  EAA  AI AG D++ G       ++ V    L E  +++AL +
Sbjct: 293 GAVEAISDMHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRH 352

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
           TL ++  LG+FD     QPY ++ P +V T   + L+L A R+ +V+L+N    LPL   
Sbjct: 353 TLKLRFELGLFD-PIDDQPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQ-- 409

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIAC--------GYTTPLQGI---GRYARTIHQQGCK 458
           +   +AV+GP++     ++GNY G  C           TPL  I      + T   +GC 
Sbjct: 410 KGVKLAVLGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG 469

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMAS 518
            ++      F  A+ A+++ADA +L +G+D+SIE E  DR  + LP  Q +L+ +V    
Sbjct: 470 -ISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVG 528

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           + PT++VL++GG I     +   R  A++ A YPG  G  A+AD+LFG +NP GKLP+T 
Sbjct: 529 R-PTVVVLINGGVIGA--EEIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTM 585

Query: 579 YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
           Y  +Y+  + M  M M       +PGRTYR++KG  V+PFG G+SYT F  +V +     
Sbjct: 586 YRSDYVDQVEMKSMDM-----TAHPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG---- 636

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
               +    S NA  SG  +  T        + + V VKN G   G   +L F  P
Sbjct: 637 ---TNSSSHSNNAAFSGGEVSDTA------NVTISVVVKNDGEVAGDEVVLAFFRP 683


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 247/729 (33%), Positives = 379/729 (51%), Gaps = 81/729 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR N L+  +++ EK+  L     A+ RL +  Y WW+EALHG++  G           A
Sbjct: 38  QRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK----------A 87

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +SDEARA Y         G  AGLT+W+PNVNIFRD
Sbjct: 88  TIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVNIFRD 147

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRGQET GEDP L+ +   ++V+GLQG D   LK A   KHF  +     +G +  R
Sbjct: 148 PRWGRGQETYGEDPFLTAQMGTAFVKGLQGDDPKYLKSAGVAKHFAVH-----SGPESLR 202

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
            HF+ + S++D+ +T+   F   V + KVA VMC+YN VNG P CA   +L   ++ +W 
Sbjct: 203 HHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEPACASAQLLDGILKKQWG 262

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            +GYIVSDC ++  +      T +  E+AA A+++G++L+CG       ++A+++ L+  
Sbjct: 263 FHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCGSTYEHFLKAALEQNLVPL 322

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             I+  L   L ++ +LG FD  P+   PY  + P  + +P+H  L+ + AR+ IVLLKN
Sbjct: 323 ELIDQRLTQLLMIRFQLGFFD--PAGLNPYNEVTPDVIHSPEHINLSRDVARKSIVLLKN 380

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYARTIHQQG 456
               LPLS    +   V GP +  +  +IGNY GI+    + L+GI        +++ + 
Sbjct: 381 DNHVLPLSK-DIKVPYVTGPFAASSDMLIGNYYGISDSLVSVLEGIAGKVSLGSSLNYRS 439

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL---------DRAGLLLPGRQ 507
                  +      A   ++ ADA I V+G+   +E E +         DR  + LP  Q
Sbjct: 440 GSLPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDAIASADRGDRVAITLPQNQ 499

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
            + V +++   KGP ILV+ +G P+D+  +  +P   AI+W  YPG+ GG A+AD+LFG 
Sbjct: 500 VDYVKQLAAHKKGPLILVVAAGSPVDI--SDLEPLADAILWIWYPGEQGGNAVADVLFGD 557

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           +NP G LP+T+   + I +LP       P       GRTY+F +   +YPFG G SYT F
Sbjct: 558 TNPSGHLPLTFV--KSIDDLP-------PFDDYAMTGRTYKFLEKAPLYPFGFGRSYTEF 608

Query: 628 VHTVANAPTVVAVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
                                 + T+S GKAI     +   LTL V+V+  N G   G  
Sbjct: 609 SFN-------------------DLTVSQGKAI-----EGEALTLSVEVE--NRGDIAGET 642

Query: 687 TLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
            +  + +P A  +      L +F+++H+     + V + I   K L  V+ +G    P G
Sbjct: 643 VVQAYLSPIARMNNEAISSLKSFKRIHLAPKETRWVELTIQ-GKDLYQVNNAGETVWPQG 701

Query: 746 EHNIHIGGT 754
            +++ +G +
Sbjct: 702 RYSLAVGDS 710


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 255/751 (33%), Positives = 378/751 (50%), Gaps = 90/751 (11%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           RV DL+ RL+L+EKV L+   + A+PRLGI  Y+WW+EALHGV+               T
Sbjct: 2   RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVART---------LEKVT 52

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA----GLTYWSPNVNIFRD 162
            FPQ I  A++F+    + +G + S E RA++N     G T     GLTYW+PN+NIFRD
Sbjct: 53  VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRGQET GEDP L+ K  A+ VRGL+G D   LK  A  KH+  +    +N   R  
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAKHYAVHSGPEYN---RHS 169

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+A+ S  D+ DT+   FR  V + KV  VMC+YN++NG P C +  +L   +R +W  +
Sbjct: 170 FDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFD 229

Query: 283 GYIVSDCDSVGVYYDTQHFTST-PEE--AAADAIRAGLDLDCGPFLGLHTESAVQRGLLS 339
           GY+ SDC ++    D   F  T PE   A +DA+ AG DL+CG    L  E  V++GL S
Sbjct: 230 GYVTSDCWALK---DFAEFHKTHPEHTIAMSDALLAGTDLECGNLYHLLAE-GVKKGLHS 285

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           E DIN +L    T+  ++GMFD  P+ + PY  +G + +    H++ A   A++ IVLL+
Sbjct: 286 ERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHKQHAERMAKESIVLLE 343

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQ 454
           N+   LPL   + +++A+IGPN+D   T + NY G      TP   + R      +  + 
Sbjct: 344 NKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYL 403

Query: 455 QGCKDV-ACADDQLFGAAIDASRQADATILVMGLDQSIE-------------AEALDRAG 500
            G   V    D   F      + Q+D  + V G+    E               + DR  
Sbjct: 404 PGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTT 463

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           + LP  Q EL+ K+    + P I+V MSG  +  +F        A++ A Y GQA G AI
Sbjct: 464 MQLPLVQIELLKKLKKTGR-PLIIVNMSGSVM--SFEWESQNADALLQAWYGGQAAGDAI 520

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
            D+LFG  NP G++P+T Y  +   +LP       P ++    GRTYR++KG   YPFG+
Sbjct: 521 VDVLFGHCNPAGRMPLTTYKSD--NDLP-------PFENYSMLGRTYRYFKGEPRYPFGY 571

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT F ++                  ++ T +G   +VT            V V N G
Sbjct: 572 GLSYTTFAYSDVQC--------------VDETHTGDTARVT------------VTVSNTG 605

Query: 681 SKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
             DG   + ++   P       P   L  F+++H+  G    V   +   + L++ +  G
Sbjct: 606 DCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTSVSFTL-TPEELALTETDG 664

Query: 739 TRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
                 G+  + +GG + + +   ++L  IK
Sbjct: 665 NLVEKNGQVTLFVGGGQPNYAAGVSSLLTIK 695


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 256/727 (35%), Positives = 379/727 (52%), Gaps = 105/727 (14%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           SL   +R  DL  RL+++E+   L   A  +PRLGI  Y WW+E LHGV+  G       
Sbjct: 13  SLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT------ 66

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
               AT FPQ I  A++F+  L   IG + + EARA +             GLT W+PN+
Sbjct: 67  ----ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLWAPNI 122

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+FRDPRWGRG ET GEDP L+ +   ++V+G+QG +G  LK AA  KHF  +     +G
Sbjct: 123 NLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQG-EGKVLKAAACAKHFAVH-----SG 176

Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
            +  R  F+A+VS +D+E+++   F   V E KV  VM +YN+VNG P+CA P ++ +  
Sbjct: 177 PEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCASPMLMDKL- 235

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
             +W   GY VSDC ++  ++     T    E+AA A+R G DL+CG    L+  +A++ 
Sbjct: 236 -HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTY-LYVLAALEE 293

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GL+   DI  A +  L  ++RLG+FD EP    +       + +P H+ ++L  A + +V
Sbjct: 294 GLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVSLSCAEKSMV 350

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
           LLKN G  LPL   +   +AVIGPN+D    + GNY G A  Y T L+GI      R  +
Sbjct: 351 LLKNDG-ILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQDAFPGRVHY 409

Query: 454 QQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
            QGC        ++A ADD+ +  A+ A+  +D  IL +GLD ++E E         + D
Sbjct: 410 AQGCHLYKDRTSNLAMADDR-YAEALAAAEASDVVILCLGLDATLEGEEGDTGNEFSSGD 468

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           +A L LP  Q +L+ K+    K P ILVL +G  ++   + N     A++ A YPGQ GG
Sbjct: 469 KADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALNPEISCN-----AVLQAWYPGQCGG 522

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
            A+A ILFG  +P GKLP+T+Y  E    LP  T+ +M+         RTYR+ +  V+Y
Sbjct: 523 QALAHILFGKVSPSGKLPVTFY--ETAEQLPDFTDYSMQ--------NRTYRYARNNVLY 572

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG+G++Y            +V   L   +G    T++ + I+ T             DV
Sbjct: 573 PFGYGLTYGK----------IVCTELSYENGCARMTVTNQGIRFTE------------DV 610

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
             +  KD        ++P    WA P+  L  F ++ +  G  +R+ I +    + S VD
Sbjct: 611 VQLYIKD--------NSP----WAVPNHSLCGFARIGLEPGETRRLEIPVPDSAFES-VD 657

Query: 736 RSGTRRI 742
             G R +
Sbjct: 658 EQGVRAV 664


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 265/762 (34%), Positives = 387/762 (50%), Gaps = 93/762 (12%)

Query: 11  GLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
           GL+ L  SS   A   PF            PFC  SL +  RV DL+ RL L EKV++L 
Sbjct: 14  GLLKLILSSDAHACEYPFD---------YFPFCNSSLSLDLRVEDLLQRLQLDEKVRMLT 64

Query: 71  SGAA---AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
           + A+   ++PRLG+  Y W +  +HGV +   GT        ATSFP  +   + F+   
Sbjct: 65  ARASTHGSIPRLGVPEYNWGANCVHGVQSTC-GTH------CATSFPNPVNLGAIFDPNE 117

Query: 128 WEAIGRVVSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
              + +V+  E RA+           G   GL  WSPN+NI RDPRWGR  ETP EDP +
Sbjct: 118 IYKMAQVIGKELRALRLEGARENYARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYV 177

Query: 179 SGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFD 237
           + KY  +Y +GLQ G D   L+   + KH+ AY  +N+ G DR  F+A VS  D  DT+ 
Sbjct: 178 NAKYGVAYTKGLQEGQDSRFLQAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYF 237

Query: 238 VPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD 297
             F   V++GK   +MCSYN +NG+PTCA+   L + +R +   +GYI SD  ++   +D
Sbjct: 238 PAFEASVVDGKAKGIMCSYNSLNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFD 296

Query: 298 TQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
              +T T  EA   A+ +G+D+  G  +     + A      + ID   A+  TL ++ +
Sbjct: 297 GHKYTKTLCEATKIAMESGVDICSGNAYWNCLKQLANSTNFSASID--EAIRRTLKLRFQ 354

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH-IRHRTVA 415
           LG+FD     QP  H GP+DV T    +L+L+ AR+ IVLL+N G +LPL   +R   +A
Sbjct: 355 LGLFDA-IGDQP--HFGPEDVRTAKSLQLSLDLARKSIVLLQNHGNTLPLRLGLR---IA 408

Query: 416 VIGPNSDVTVTMIGNYAGIACG--------YTTPLQGI----GRYARTIHQQGC--KDVA 461
           VIGP+S     ++GNY G  C           +PL+ I    GR   T H  GC   D +
Sbjct: 409 VIGPHSMTRRGIMGNYYGQLCHGDYDEVRCIQSPLEAIQSVNGR-NNTHHVNGCGINDTS 467

Query: 462 CADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGP 521
            A+   F  A+ A R AD  +L +G+D SIE E+ DR  + +P  Q EL+  + +A K P
Sbjct: 468 TAE---FDDALQAVRTADVAVLFLGIDISIERESKDRDNIDVPHIQLELLKAIRVAGK-P 523

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T++VL +GG + +   K      +++ A YPG  G  AIA+ILFG+ NP GKLP+T Y  
Sbjct: 524 TVVVLFNGGILGI--EKLILYADSVLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRS 581

Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
            +I ++ M  M+M       YPGR+YR+Y    VY FG G+SYT F              
Sbjct: 582 NFINDVDMKSMSM-----TLYPGRSYRYYTEVPVYSFGWGLSYTTF-------------- 622

Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH--- 698
                 SI +  S     + H    +  +  ++ + N G   G   L  F  P   H   
Sbjct: 623 ------SIQSIDSHDTRAMNHVLTAQPKM-YRILITNNGKYYGEEVLFAFFRPLDIHATG 675

Query: 699 --WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
              +  +QL  + +V +  G  + V +++   + L++ DR+G
Sbjct: 676 PVESLQQQLFNYTRVRLDPGDMREVPLHVKD-ENLALHDRNG 716


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 268/756 (35%), Positives = 392/756 (51%), Gaps = 75/756 (9%)

Query: 27  PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPR-----LGI 81
           P AC  K  T + LPFC  SL    RV DL+ RL LQEK  LL   A A PR     +G+
Sbjct: 23  PRACVGK--TNQELPFCNTSLSTADRVEDLLSRLPLQEKATLLT--ARASPRGNMSSIGL 78

Query: 82  KGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA 141
             Y W +  +HGV      +  G + P  TSFP  +   + F+  +   + +V+  E RA
Sbjct: 79  PEYNWGANCVHGVQ-----STCGTNCP--TSFPNPVNLGAIFDPQVVFDMAQVIGWELRA 131

Query: 142 M--------YNGGT-AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           +        Y GG   GL  WSPN+NI RDPRWGR  ETP EDP+++ KY  +Y RGLQ 
Sbjct: 132 LWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ- 190

Query: 193 SDGDR-----LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEG 247
            +G R     L+   + KH+ AY  +N+ GV+R  F+A VS  D  DT+   FR  V++G
Sbjct: 191 -EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDG 249

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE 307
               VMCSYN VNG+P CA+  +++  +RG    +GY+ SD  +V    D  H+  +  E
Sbjct: 250 NAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCE 309

Query: 308 AAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
           AA  AI AG D++ G       ++ V    L E  +++AL +TL ++  LG+FD     Q
Sbjct: 310 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFD-PIDDQ 368

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
           PY ++ P +V T   + L+L A R+ +V+L+N    LPL   +   +AV+GP++     +
Sbjct: 369 PYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGL 426

Query: 428 IGNYAGIAC--------GYTTPLQGI---GRYARTIHQQGCKDVACADDQLFGAAIDASR 476
           +GNY G  C           TPL  I      + T   +GC  ++      F  A+ A++
Sbjct: 427 LGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAK 485

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
           +ADA +L +G+D+SIE E  DR  + LP  Q +L+ +V    + PT++VL++GG I    
Sbjct: 486 EADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLINGGVIGA-- 542

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
            +   R  A++ A YPG  G  A+AD+LFG +NP GKLP+T Y  +Y+  + M  M M  
Sbjct: 543 EEIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDM-- 600

Query: 597 SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGK 656
                +PGRTYR++KG  V+PFG G+SYT F  +V +         +    S NA  SG 
Sbjct: 601 ---TAHPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG-------TNSSSHSNNAAFSGG 650

Query: 657 AIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQ--LVAFE---K 710
            +  T        + + V VKN G   G   L    +      A P ++  LV+F    +
Sbjct: 651 EVSDTA------NVTISVVVKNDGEVAGDEVLGPLDSTEVSTLALPDEEGNLVSFPGSYE 704

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGE 746
           V V  G ++R+  ++ V     V+ R   +  PL E
Sbjct: 705 VIVSNGVKERLRFSVEVAGG-EVILRDQVQPFPLSE 739


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 258/724 (35%), Positives = 366/724 (50%), Gaps = 103/724 (14%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           SL   +R  DL  RL+L+E+   L   A AV RL I  Y WWSE LHGV+  G       
Sbjct: 8   SLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT------ 61

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNV 157
               AT FPQ I  A+ F+      +G ++ DEARA YN  +A        GL  WSPNV
Sbjct: 62  ----ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLWSPNV 117

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NIFRDPRWGRGQET GEDP L+ +   ++ +GLQG +G+ LK AA  KH   +     +G
Sbjct: 118 NIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQG-EGEVLKTAACAKHLAVH-----SG 171

Query: 218 VD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
            +  R  F+A  S +D+E+T+   F   V E KV  VM +YN+VNG P CA   ++ +  
Sbjct: 172 PEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACASKFLMGKL- 230

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
             EW  +GY VSDC ++  ++     T T  E+AA A++ G DL+CG    LH   A   
Sbjct: 231 -DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGNTY-LHLLHAYNE 288

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GL+++ DI  A  + +  ++RLGMFD E     Y  L    V   +++  A + + + +V
Sbjct: 289 GLINDEDIKKACTHLMRTRVRLGMFDDETE---YDKLDYSIVANEENKAYARKCSERSMV 345

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
           +LKN G  LPL   + +T+ VIGPN+D    + GNY G A  Y T L+GI      R ++
Sbjct: 346 MLKNNGI-LPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQDAFGGRVLY 404

Query: 454 QQG-------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALD 497
            +G       C  +A ADD+L  A I  +  +D  +L +GLD +IE E         + D
Sbjct: 405 SEGSHLYKDRCMGLAVADDRLSEAEI-VTEHSDVVVLCVGLDATIEGEEGDTGNEFSSGD 463

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           +  L LP  Q++LV  V M    P I+V  +G  I+V     +    A+I A YPGQ GG
Sbjct: 464 KNDLRLPEAQRKLVETV-MRKGKPVIIVTAAGSAINV-----EADCDALIHAWYPGQFGG 517

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           TA+ADILFG  +P GKLP+T+Y     T LP  T+ +M+        GRTYR+ +  ++Y
Sbjct: 518 TALADILFGKISPSGKLPVTFYTD--TTKLPEFTDYSMK--------GRTYRYTQDNILY 567

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG+G++Y+          T V+                  +K  + K +       V V
Sbjct: 568 PFGYGLTYSK---------TEVS-----------------DLKFENGKAS-------VKV 594

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
            N G  D    +  +       + P   L  F +V +  G    V + +    +   VD 
Sbjct: 595 TNTGDFDTEDVVQFYIKGEGSDYVPFYSLCGFRRVFLKKGESTVVEVTLGDSAF-EAVDE 653

Query: 737 SGTR 740
           +G R
Sbjct: 654 NGRR 657


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/718 (34%), Positives = 374/718 (52%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S PI  RV  LI +++L EK   L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E  V SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN+   
Sbjct: 340 QALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKD 459
           LPLS  + ++VAV+GP +D     +G Y+G      T L+G+    G+  +  +  G   
Sbjct: 397 LPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDLMGKRGKVNYLNG--- 451

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
           +  + D +    + A +  D  ++ +G D+ +  E  D   + LP  Q++L+  +   + 
Sbjct: 452 IGASRDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  SG P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHSGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYSFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/699 (36%), Positives = 348/699 (49%), Gaps = 75/699 (10%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
            PF   SLP  +RV+DL+GRL+L+E V  +  G       A  + RLGI  Y W +E LH
Sbjct: 25  FPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIGPYSWNTECLH 84

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
           G    GP          ATSFPQ    A++F+A L E I    + E RA YN        
Sbjct: 85  GDVAAGP----------ATSFPQAFGLAATFDAVLIEQIANATAYEVRAKYNNYAKHKEY 134

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           G   GL+ +SP +NI R P WGR QET GEDP LSG  AASYV GLQG+    +   A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQGNHPRYVTANAGC 194

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF AY         R  F+AKVS +D+  TF   F  C+  G   S+MCSYN +NGVP 
Sbjct: 195 KHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-THSLMCSYNSINGVPA 253

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           CA+  +L   +R EW   GY++SD  +V   YD  H+T    + A   + +GL+L+    
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313

Query: 325 LG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           L     + T  AV++G ++   +   +      +MRLG FD  P   PY  L    + + 
Sbjct: 314 LEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT- 439
           +HQEL+L+AA +  VLLKN+   LPL   +   +AV+GP +D    + G+Y+     YT 
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPLADNVDALYGDYSATPNNYTV 431

Query: 440 TPLQGIGRYA-RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           TP  G+ R A  T +  GC +  C      G    A   AD  ++ +G    IE+E  DR
Sbjct: 432 TPRNGLARLAGNTSYASGCDNPKCRKYDS-GQVKSAVSGADMVVVCVGTGTDIESEGNDR 490

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
             L LPG+Q  L+         P IL+L + GP+DV++A  +P +  I+   +P QA G 
Sbjct: 491 HELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAVENPAVQTIVACFFPAQATGD 550

Query: 559 AIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           A+  +   T   SNP G+LPMTW P+      PMT+  M+        GRTYR+     +
Sbjct: 551 ALYRMFMNTSPESNPAGRLPMTW-PRSMEQVPPMTDYTMK--------GRTYRYSDADPL 601

Query: 616 YPFGHGMSYTNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
           +PFG G+SYT F  +  + +PTV+                          C+ +T  + +
Sbjct: 602 FPFGFGLSYTLFKYYNTSASPTVI------------------------KSCDTVT--IPL 635

Query: 675 DVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVH 712
            V NVG   G   + V+ S   A    P  QLV F +V 
Sbjct: 636 TVTNVGDFPGDEVMQVYISWSNASVTVPKLQLVGFRRVR 674


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/714 (35%), Positives = 372/714 (52%), Gaps = 76/714 (10%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           PI  RV  LI +++L EK   L+S + ++PRL +  Y +W+E LHGV+  G         
Sbjct: 57  PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGR 167
              T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPRWGR
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGR 166

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
            +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  ++++
Sbjct: 167 NEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSSSQI 222

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             + + + +   +  CV E  V SVM +YN  NGVP      +L   +R EW  +G++VS
Sbjct: 223 PTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGFVVS 282

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+ AL
Sbjct: 283 DCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATIDQAL 342

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              LT + +LG FD      PY H   K +      ELA EAA + +VLLKN+   LPLS
Sbjct: 343 TRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-LLPLS 400

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACA 463
             + ++VAV+GP +D     +G Y+G      T L+G+    G+  +  +  G   +  +
Sbjct: 401 KEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLNG---IGAS 455

Query: 464 DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI 523
            D +    + A +  D  ++ +G D+ +  E  D   + LP  Q++L+  +   +    +
Sbjct: 456 RDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP-RIV 510

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           LV  SG P+   +A  D  I AI+ A YPGQ  G A+AD+LFG  NP GKLPMT Y  E 
Sbjct: 511 LVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIYRAE- 567

Query: 584 ITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
              LP + +  M       + GRTYR+ K   +Y FGHG+SYT+F               
Sbjct: 568 -DQLPDILDFDM-------WKGRTYRYMKEDPLYGFGHGLSYTSF-------------GF 606

Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGH 698
           DG  GS +   SG             TL   V++ N G   G   + V+    +TP   +
Sbjct: 607 DGIQGS-DTLKSGT------------TLQCSVELSNTGKWTGEEVVQVYVSRENTPVYTY 653

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             P K+LVAF+KV +  G ++RV  NI   + LSV + +G  R+  G++ + IG
Sbjct: 654 --PLKKLVAFKKVKLAPGEKKRVEFNIP-PRELSVWE-NGNWRMLTGKYTLFIG 703


>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
          Length = 238

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 185/238 (77%), Positives = 205/238 (86%)

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           ++ G ++EIDIN AL NT+TVQMRLGMFDGEPS+Q YG+LG  DVC P   ELALEAARQ
Sbjct: 1   MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
           GIVLL+N+G SLPLS IRHRTVAVIGPNSDVT TMIGNYAG+ACGYTTPLQGI RY RTI
Sbjct: 61  GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
           HQ GC DV C  +QL GAA  A+RQADAT+LV+GLDQSIEAE  DR  LLLPG QQELVS
Sbjct: 121 HQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELVS 180

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           +V+ AS+GPTILV+MSGGPIDV FAKNDPRI AIIW GYPGQAGGTAIAD+LFGT+NP
Sbjct: 181 RVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/717 (34%), Positives = 375/717 (52%), Gaps = 76/717 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S PI  RV  LI +++L EK   L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E  V SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            AL   LT + +LG FD      PY H   K +      ELA EAA + +VLLKN+   L
Sbjct: 340 QALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNEN-LL 397

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDV 460
           PLS  + ++VAV+GP +D     +G Y+G      T L+G+    G+  +  +  G   +
Sbjct: 398 PLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLNG---I 452

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
             + D +    + A +  D  ++ +G D+ +  E  D   + LP  Q++L+  +   +  
Sbjct: 453 GASRDSI----VAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP- 507

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG P+   +A  D  I AI+ A YPGQ  G A+AD+LFG  NP GKLPMT Y 
Sbjct: 508 RIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIYR 565

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
            E    LP + +  M       + GRTYR+ K   +Y FGHG+SYT+F            
Sbjct: 566 AE--DQLPDILDFDM-------WKGRTYRYMKEDPLYGFGHGLSYTSF------------ 604

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPP 695
              DG  GS +   SG  ++ +            V++ N G   G   + V+    +TP 
Sbjct: 605 -GFDGIQGS-DTLKSGARLQCS------------VELSNTGKWTGEEVVQVYVSRENTPV 650

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             +  P K+LVAF+KV +  G ++RV  NI   + LSV + +G  R+  G++ + IG
Sbjct: 651 YTY--PLKKLVAFKKVKLAPGEKKRVEFNIP-PRELSVWE-NGNWRMLTGKYTLFIG 703


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/739 (34%), Positives = 378/739 (51%), Gaps = 94/739 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           ++   L+  +SL+E    L+  A A+PRLG+  Y WW+EALHG +  G           A
Sbjct: 8   EQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------A 57

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+ F+    + I  V+S E RA YN  +A        GLT WSPNVNIFRD
Sbjct: 58  TVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRD 117

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRGQET GEDP L+ +   S+++GLQG DG  LK AA  KHF  +     +G +  R
Sbjct: 118 PRWGRGQETYGEDPYLASQLGVSFIQGLQG-DGPYLKTAACVKHFAVH-----SGPEPLR 171

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA VS++D+ +T+   F  CV EG+V +VM +Y+ VNG P C  P ++   +R +W 
Sbjct: 172 HDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDILRNDWG 231

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLS 339
             G  +SDC ++  ++     T    ++ A A+ AG DL+CG  +L L  E A Q+GL+ 
Sbjct: 232 FEGMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCGCEYLSL--EKAYQQGLID 289

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
              I  A +  +T +  LG+F  + +   Y ++G +   T +H+++A +A+   +VLLKN
Sbjct: 290 RKTITQACIRVMTTRFALGLFSEDCT---YSNIGYEQNDTEEHRKVAFKASCNSLVLLKN 346

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQ 455
            G  LPL       +A+IGPN+D    + GNY G +  YTT L+G    +G   +  + Q
Sbjct: 347 DG-MLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVKYSQ 405

Query: 456 G-------CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRA 499
           G        + +A  +D++   AI  +  +D  IL +G D+++E E         A D+ 
Sbjct: 406 GSAIQKEKLERLAEPNDRI-AEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGDKQ 464

Query: 500 GLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
            L LP  Q+ L+  V+   K P +LVL+SGG ID    +  P + A++   YPGQ GG A
Sbjct: 465 DLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIERF-PNVKALLQGWYPGQEGGLA 522

Query: 560 IADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFG 619
           IA  + G +NP G LP+T+Y  E  T LP            R  GRTYR+ +  V+YPFG
Sbjct: 523 IAHTILGLNNPSGHLPVTFYRSE--TVLP-------DFCDYRMEGRTYRYVQEKVLYPFG 573

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
            G+SYT F  +  N  T       G+    N  +S                     V N 
Sbjct: 574 FGLSYTTF--SYGNLST-------GKQADGNLELS-------------------FIVSNS 605

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
           G+++G   + ++       + P+  L  F  + +  G  + V   I + +  S +D  G 
Sbjct: 606 GNREGREVVQIYCHSDHPFFPPNPVLCGFTSLVLQPGEHKTVTQTI-LAEAFSAIDPEGK 664

Query: 740 RRIPLGEHNIHIGGTKHSV 758
           R    G  ++++G  + ++
Sbjct: 665 RIALKGWFDLYVGNHQKAL 683


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 249/718 (34%), Positives = 374/718 (52%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE+ I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEVAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ K+   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 259/763 (33%), Positives = 384/763 (50%), Gaps = 103/763 (13%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A +   P+   SL   +R  DL+ RL+L+EK+ L+ + ++AV RLGIK YEWW+EALHGV
Sbjct: 18  ACSNQAPYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGV 77

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
           +  G           AT +PQ I   +SFN TL   +   +SDEAR  Y           
Sbjct: 78  ARNGL----------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKR 127

Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-K 205
             GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +   S V GLQG    +     +C K
Sbjct: 128 YTGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYNKTHACAK 187

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           H+  +    WN   R  FNA+ ++ +D+ +T+   F+  V++G V  VMC+YN+  G P 
Sbjct: 188 HYAVHSGPEWN---RHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDPC 244

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSV-GVYYDTQHFT-STPEEAAADAIRAGLDLDCG 322
           C    +L   +R EW   G +VSDC ++   Y+  +H T     +A+A A+ +G DL+CG
Sbjct: 245 CGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLECG 304

Query: 323 -PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
             + GL   SAV+ GL++E  I+ +L   +  +  LG  D    + P+  L    +    
Sbjct: 305 RSYTGLI--SAVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLSCHA 359

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           HQ+LAL+ AR+ + LL+N    LPL   +  TVA+IGPN++ +V    NY G      T 
Sbjct: 360 HQQLALQMARESMTLLQNHKNILPLD--KEMTVALIGPNANDSVMQWANYNGFPVHTITL 417

Query: 442 LQGIGRY----------ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
           L+G+ +Y           + I  Q    V    + +  A I+ + +AD  I   G+  S+
Sbjct: 418 LEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDI-QAVINQAAKADVIIYAGGISASL 476

Query: 492 EAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           E E +          DR  + LP  Q++LV K   A+  P + V  SG  + +   + + 
Sbjct: 477 EGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMGL---QPES 532

Query: 542 RIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           +I  AI+ A YPGQAGGTAIA++LFG  NP G+LP+T+Y ++    LP  E         
Sbjct: 533 QICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD--NQLPDFE-------DY 583

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
              GRTYR+     +YPFGHG+SYT F ++    P +        +G +        +KV
Sbjct: 584 NMQGRTYRYLNYEPLYPFGHGLSYTTFSYST---PFI-------ENGKLK-------VKV 626

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
           T++        +Q+ +K     DG                P K L  F+++H+PAG    
Sbjct: 627 TNSGNYNGDEVIQLYIKRYDDPDG----------------PLKTLRGFQRIHIPAGQTSE 670

Query: 721 VGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHA 762
           V   +      +  D+      PL G + I +GGT     L +
Sbjct: 671 VSFPL-TSDTFTWWDKDSNTVHPLQGRYKILVGGTSEESLLKS 712


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 249/727 (34%), Positives = 378/727 (51%), Gaps = 86/727 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R+  L+ ++S++EK  L++  A  +PRLGI  Y WW+EALHGV+N G           A
Sbjct: 5   ERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------A 54

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +S EARA +N            GLT+WSPN+NI+RD
Sbjct: 55  TVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRD 114

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRGQET GEDP L+ K   S+V+GLQG     ++VAA  KH+  +     +G +  R
Sbjct: 115 PRWGRGQETYGEDPFLTSKIGVSFVKGLQGDHPYYMRVAACAKHYAVH-----SGPEGLR 169

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+VS++D+ +T+   F   V  G V +VM +YN+VNG P C    +L   +R  W 
Sbjct: 170 HVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEILRKRWG 228

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G++VSDC ++  ++     T  P E+ A A+ AG DL+CG     H   AV+ G++SE
Sbjct: 229 FKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKAGVVSE 287

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ ++   L+   RLG+F  +    PY  L   D+    H+ LA EAA + +VLLKN 
Sbjct: 288 ELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSDIDWEAHRALAREAAEKSVVLLKNN 344

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQQG 456
           G  LP    + R + V GPN+   V ++GNYAG++    T L+GI  YA       ++ G
Sbjct: 345 G-ILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYKIG 403

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGRQ 507
           C       + +  A+   +R AD T+ VMG D ++E E  D          + L LP  Q
Sbjct: 404 CPLQGNKINPIDWAS-GVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDLPREQ 462

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
            E + ++    K P ++VL+SG P  V   + +    AI++A YPG+ GG AIA +LFG 
Sbjct: 463 IEYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWYPGEEGGNAIARVLFGE 519

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            +P G+LP+T +P+      P T+ +M         GRTYR+ +   +YPFG G+SY   
Sbjct: 520 ISPSGRLPIT-FPRGVDQLPPFTDYSME--------GRTYRYMREEPLYPFGFGLSY--- 567

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQVDVKNVGSKDGAH 686
                                  AT S + ++ + ++ + R TL +  +V+N  S     
Sbjct: 568 -----------------------ATFSYRGLQSSASRWDKRETLELVCEVENTSSIPADE 604

Query: 687 TL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
            + L      A    P   L  F +V + AG +++V   +   + LS +D  G + +P G
Sbjct: 605 VVQLYVRWEDAPFRVPLWSLKGFTRVSLGAGERKQVRF-VLSPEELSFIDEEGRKVLPEG 663

Query: 746 EHNIHIG 752
             + H+G
Sbjct: 664 RLHFHVG 670


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 249/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ K+   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ ++   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQSDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 258/760 (33%), Positives = 383/760 (50%), Gaps = 85/760 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    LP  +R+  LI  ++L+EK   L++G  A+ RLG+  Y++W+EALHGV+  G  
Sbjct: 25  PWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTY 152
                    AT FPQ I  A++F+  L      V+SDEARA +N  +        +GLT+
Sbjct: 84  ---------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGNRSKYSGLTF 134

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRGQET GEDP L+ +   + V GLQG     LK AA+ KHF  +  
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHFAVH-- 192

Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              +G +  R  F+A  S++D+ +T+   F   V E  V +VM +YN+VNG P      +
Sbjct: 193 ---SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPAGGSDFL 249

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           L   +R +W  +G+IVSDC  +  +++    T+   E+AA AI  G DL+CG       +
Sbjct: 250 LNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESAALAINTGTDLNCGSVYTALPD 309

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
            AV+ GL+ E  I+  L   L  + +LG FD +    PY  +    V +  H ++A E A
Sbjct: 310 -AVEAGLVDEKTIDTRLHKVLATKFKLGFFDPK-DDNPYNSISADVVNSDAHADVAYEMA 367

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---R 447
            + IVLL+N+   LPL     R V V GP +  +  ++GNY G++   T  L GI     
Sbjct: 368 VKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYYGLSGKTTNILDGITANVS 426

Query: 448 YARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD---QSIEAEAL------D 497
              TI ++QG        + +     +A +  D  I VMGL    +  E EA+      D
Sbjct: 427 VGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGD 486

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA----AIIWAGYPG 553
           R  L LP  Q E + K+   +  P I+VL +G P++V        IA    AI++A YPG
Sbjct: 487 RLSLDLPEHQIEFLRKLRKDNDKPVIVVLTAGTPVNVT------EIAQLADAIVFAWYPG 540

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           Q GG A+ADILFG  +P G+LP+T+         P +E  + P       GRTYR+    
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMTEE 591

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
            +YPFG G+SY               V  D      N T+ G A  ++     + TL V 
Sbjct: 592 PMYPFGFGLSY-------------ATVKFD------NITL-GNAEALSSTDGQKGTLDVS 631

Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V+V N G+++    + L   TP AG   P + L  F+++ +  G   +V   +   K L 
Sbjct: 632 VNVTNTGTRELEEVVQLYLKTPNAGIDQPIQSLKGFQRIKLAPGQTGQVSFTVS-KKQLY 690

Query: 733 VVDRSGTRRIPLGEHNIHIGGT---KHSVSLHAATLGVIK 769
            ++  G   +  G++++ +G     K S +L AA   + K
Sbjct: 691 SINAKGKPVLLEGDYHVIVGNASPGKRSETLGAAKPQIAK 730


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 249/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ K+   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 262/756 (34%), Positives = 390/756 (51%), Gaps = 107/756 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   +L   +R   L   ++ +E+   L   A AV RLGI  Y WW+E +HG++  G   
Sbjct: 4   YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+  L +    V S+EARA YN  +         GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRG ET GEDP L+ K   + VRGLQG DG  +K AA  KHF  +   
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG-DGKVIKAAACAKHFAVH--- 169

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R  F+AK + +D+E+T+   F   V E KV SVM +YN+VNG P CA  N L
Sbjct: 170 --SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACA-SNFL 226

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
              ++ EW  +GY VSDC ++  +++    T+   E+ A A++AG D++CG  +  L   
Sbjct: 227 MDKLK-EWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTYQNLLV- 284

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
            A+++G +++ DI  A V+ +  ++RLGMFD       Y  +    V   +H+ ++LE A
Sbjct: 285 -ALEKGAVTKEDIRTACVHLMRTRIRLGMFD---KKTEYDDIPYDKVACKEHKAISLECA 340

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY- 448
            + +V+L+N G  LP+   +++T+AVIGPN+D    + GNY G++  YTT L GI  R+ 
Sbjct: 341 EKSLVMLENNGI-LPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFD 399

Query: 449 ARTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE------- 494
            R I  +GC        ++A A D+ +  A+ A++ AD TIL +GLD +IE E       
Sbjct: 400 GRVIFAEGCHLYKDRVSNLAQAGDR-YAEAVAAAKFADMTILCLGLDATIEGEEGDTGNE 458

Query: 495 --ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
             + D+ GL LP  Q+ELV K+ MA   P + V+ +G  I+      + +  A+I A YP
Sbjct: 459 FSSGDKNGLTLPPPQRELVKKI-MAVGKPVVTVVCAGSAINT-----ESKPDALIHAFYP 512

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYK 611
           G  GG A+A++LFG  +P GKLP+T+Y  E    LP  T+ +M+        GRTYR+  
Sbjct: 513 GAEGGKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTT 562

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
             V+YPFG+G++Y         +  V  V        + A  SGKA +            
Sbjct: 563 ENVLYPFGYGLTY--------GSVKVTKVEYKDGKAVVTAENSGKATE------------ 602

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
              DV  +  KD +            H  P+  L  F+++ +  G      I +   K  
Sbjct: 603 ---DVIQLYIKDYSE-----------HAVPNVSLCGFKRIKLNEGESAVFEIEVPE-KAF 647

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           + VD +G R++  G       GT    +L     G+
Sbjct: 648 TAVDDNGVRKV-FGSRFTLFAGTSQPDALSEKLTGI 682


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 248/718 (34%), Positives = 372/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++ + K+   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKFLKKIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHILAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ ++   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 250/719 (34%), Positives = 374/719 (52%), Gaps = 80/719 (11%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ ++   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQVN- 506

Query: 520 GPTI-LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
            P I LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT 
Sbjct: 507 -PRIALVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTI 563

Query: 579 YPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           Y  E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F          
Sbjct: 564 YKTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF---------- 604

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----ST 693
                D   G  N T+   AI           L   V++ N G   G   + V+    +T
Sbjct: 605 ---EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENT 648

Query: 694 PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           P   +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 649 PVYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 248/732 (33%), Positives = 375/732 (51%), Gaps = 103/732 (14%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           R+ ++I +++L+EKV  L   A AV   GI  Y WW+E LHGV+  G           AT
Sbjct: 6   RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDP 163
            FPQ I  A++F+     ++   +SDE RA YN            GLT+W+PNVNIFRDP
Sbjct: 56  VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH- 222
           RWGRGQET GEDP L+G+   ++++GLQG D + LKVAA  KH+  +     +G ++   
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGDDTEHLKVAACAKHYAVH-----SGPEKLRH 170

Query: 223 -FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            F+A VSK+D+ +T+   F++ V  G V +VM +YN+  G P      +LK  +RG W  
Sbjct: 171 TFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEILRGRWGF 229

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG---PFLGLHTESAVQRGLL 338
            G++ SDC ++  +++    T +PEE+AA A+ AG DL+CG   P+L +    + ++GL+
Sbjct: 230 KGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGCTYPYLTV----SHKKGLV 285

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           ++  I+ AL   L  + +LG+FD  P   PY +LG   V    H+ LALEAA++ IVLLK
Sbjct: 286 TDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHRNLALEAAQKSIVLLK 344

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI------ 452
           N    LPL     R + ++GP +   +T++ NY G++    T L+G+    +T       
Sbjct: 345 NDSNILPLDD-SARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTAISFE 403

Query: 453 HQQGCKDVACADDQL----FGAA-IDAS------RQADATILVMGLDQSIEAEA------ 495
           ++QG   +    + L    FG+  +DA        + D  I V GLD S+E E       
Sbjct: 404 YRQGS--LMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSIAS 461

Query: 496 ---LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
               DR  + LP  Q   + ++  A K   +L+L  G PI  AF ++     A+++A YP
Sbjct: 462 DANGDRDTIELPSWQLNFLRRIRKAGK-KVVLILTGGSPI--AFPED--LADAVLFAWYP 516

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           G+ GG A+ADILFG  +P GKLP+T +PQ      P  + A++        GRTYR+ K 
Sbjct: 517 GEQGGNAVADILFGDVSPSGKLPIT-FPQSTAQLPPYDDYALK--------GRTYRYMKE 567

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             +YPFG G+SYT+F               D    S +   +G ++K             
Sbjct: 568 TPLYPFGFGLSYTSF-------------RFDSVELSSSKISAGNSVK------------A 602

Query: 673 QVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           +V V N G +D    + L  +        P   L  F ++ + AG    V I +    + 
Sbjct: 603 KVQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSASVEIELPASAFE 662

Query: 732 SVVDRSGTRRIP 743
           ++     +  IP
Sbjct: 663 TINAEGASVLIP 674


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 199/454 (43%), Positives = 273/454 (60%), Gaps = 10/454 (2%)

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           +D++CG ++  H  SA+Q+G ++E DIN AL N   V+MRLG+F+G+P    YG +GP  
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPS--LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
           VCT +HQ+LALEAA+ GIVLLKN G +  LPLS     ++AVIG N++  + + GNY G 
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 435 ACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
            C   TPLQ +  Y + T    GC   AC +      A+ A+  AD+ +L MGLDQ  E 
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQER 179

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E +DR  L LPG+QQ L+  V+ A+K P ILVL+ GGP+DV+FAK +P+I AI+WAGYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           +AGG AIA +LFG  NPGG+LP+TWYPQ++ T +PMT+M MR   +  YPGRTYRFY+GP
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDF-TRVPMTDMRMRADPATGYPGRTYRFYRGP 298

Query: 614 VVYPFGHGMSYTNFVHTVANA--PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
            V+ FG+G+SY+ + H  A    PT     L     +     S     +    C+RL   
Sbjct: 299 TVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEAIGSETCDRLKFP 358

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
             V V+N G  DG H++LVF   P    G   P  QL+ F+ +H+ A     V   +  C
Sbjct: 359 AVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFEVSPC 418

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           K+ S     G + I  G H + +G  +  +S  A
Sbjct: 419 KHFSRATEDGRKVIDQGSHFVMVGEDEFEMSFMA 452


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 241/653 (36%), Positives = 342/653 (52%), Gaps = 75/653 (11%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C V+   P+R   L+ +L++ EK+  L+  A    R+G+  Y WWSE LHGV+   PG  
Sbjct: 37  CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G  F  ATSF   I   +SF+  L   +G  +S EARA  N G  GL YW+PNVN 
Sbjct: 96  FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           ++DPRWGRG ETPGEDP+    Y  + + GL+G++  R KV A+CKH+ AYDL+ W+G+ 
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR-KVIATCKHYAAYDLERWHGLT 214

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV------------------NG 261
           R+ F A V+ QD+ + +  PF+ C  + KV S+MCSYN +                    
Sbjct: 215 RYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKPDEIINLTTA 274

Query: 262 VPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY-DTQHFTSTPEEAAADAIRAGL 317
            P CA+   L   +R  W     N YI SDC+++  +  D  +F+ TP EAAA A +AG 
Sbjct: 275 QPACAN-TYLMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGT 333

Query: 318 DLDCGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFD-----------GEP 364
           D  C       T+   A  + LL E  I+ AL       +R G  D           G  
Sbjct: 334 DTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSAVAGGDGGSF 393

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
           SS  Y  L   DV TP  QELAL +A +GIVLLKN G  LPL     + VA+IG  ++ T
Sbjct: 394 SSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLD-FSGKKVALIGHWANAT 452

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACAD-DQLFGAAIDASRQADATIL 483
            TM G Y+GI   Y  PL    +   ++       V  +D D     A+ A+  AD  + 
Sbjct: 453 GTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPALAAAEGADVVLY 512

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
             G D ++ +E LDR  +  P  Q +L+S+  +A  G  ++V+  G  +D +   N+  +
Sbjct: 513 FGGTDTTVASEDLDRESIAWPEAQMKLLSE--LAGLGKPVVVIQLGDQVDDSSLLNNGNV 570

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP------- 596
           ++I+W GYPGQ+GGTA+ D+L G   P G+LP+T YP+ Y+  +P+TEMA+RP       
Sbjct: 571 SSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSN 630

Query: 597 ----------------------SQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
                                 +++   PGRTY++Y  PV+ PFG+G+ YT F
Sbjct: 631 LEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFGYGLHYTTF 682


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 198/481 (41%), Positives = 295/481 (61%), Gaps = 15/481 (3%)

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCD+V V +  Q +  TPE+A ADA+++G+      +L  +T+SAV++  ++  +I+ AL
Sbjct: 229 DCDAVNVLHVEQKYAKTPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRAL 283

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
            N  + +MRLG+F+G+P+ Q Y  +GP  VC+ +HQ LALEAA  GIVLLKN    LPLS
Sbjct: 284 HNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLS 343

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCKDVACADDQ 466
                ++AVIGPN+  +  ++GNY G AC   T L+G+  Y  +  +++GC +V+C    
Sbjct: 344 KSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSAA 403

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
                ++ ++  D  ILVMGLDQS E E LDR  L+LPG+Q  L++ V+ A+K P +LVL
Sbjct: 404 K-KKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVL 462

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP---GGKLPMTWYPQEY 583
           + G P+DV FAKN+ +I +I+WAGYPGQAG TA+A I+FG  NP   GG+LPMTWYPQ++
Sbjct: 463 LGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQDF 522

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN-APTVVAVPL 642
            T +PMT+M MRP  S   PGRTYRFY+G  V+ FG+G+SY+++ +T A+ A   + V  
Sbjct: 523 -TKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKD 581

Query: 643 DGRHGSINATISGKAI--KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHW 699
                  N+   G  +   +   +C  +   V V VKN G   G H +L+F+     G  
Sbjct: 582 SSNQQPENSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKG 641

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P K+LV F+ V + AG +  +   +  C++LS  +  G   +  G   + +G  +H V+
Sbjct: 642 RPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVT 701

Query: 760 L 760
           +
Sbjct: 702 I 702



 Score =  235 bits (600), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 115/221 (52%), Positives = 150/221 (67%), Gaps = 15/221 (6%)

Query: 23  AAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK 82
           A++ P++CD  D +T+T  FC+ +LPI +R  DL+ RL+ +EK   L+  + A+PRLGI 
Sbjct: 25  ASQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIP 84

Query: 83  GYEWWSEALHGV---SNVGPGTKF-GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
            YEWWSE LHG+   + V  G  F       ATSFPQVI TA+SF+A +W  IG+V   E
Sbjct: 85  AYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQV-GKE 143

Query: 139 ARAMYNGG-TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--- 194
           ARA+YN G   GL +W+PNVNIFRDPRWGRGQETPGEDP++ GKY AS+VRG+QG     
Sbjct: 144 ARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEG 203

Query: 195 ----GDRLKVAASCKHFTAYDLDNW--NGVDRFHFNAKVSK 229
               GD L+ +A CKH+TA+DLDNW  + V+  H   K +K
Sbjct: 204 ESTLGDHLQASACCKHYTAHDLDNWDCDAVNVLHVEQKYAK 244


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 248/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ ++   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 252/727 (34%), Positives = 372/727 (51%), Gaps = 86/727 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R+  L+ R+S++EK  L++  A  VPRLGI  Y WW+EALHGV+N G           A
Sbjct: 5   ERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------A 54

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +S EARA +N            GLT+WSPN+NI+RD
Sbjct: 55  TVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRD 114

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRGQET GEDP L+ K   ++V+GLQG     L+VAA  KH+  +     +G +  R
Sbjct: 115 PRWGRGQETYGEDPFLTSKIGVAFVKGLQGDHPYYLRVAACAKHYAVH-----SGPEGLR 169

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+A+VS++D+ +T+   F   V  G V +VM +YN+VNG P C    +L+  +R +W 
Sbjct: 170 HVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEILRKKWG 228

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G++VSDC ++  ++     T  P E+ A A+ AG DL+CG     H   AV+ G +SE
Sbjct: 229 FKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKAGAVSE 287

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ ++   L+   RLG+F  +    PY  L   D+    H+ LA EAA + +VLLKN 
Sbjct: 288 ELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLADIDWEAHRALAREAAEKSVVLLKNN 344

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA----RTIHQQG 456
           G  LPL   + R + V GPN+   V ++GNYAG++    T L+GI  YA       ++ G
Sbjct: 345 G-ILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYKIG 403

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR---------AGLLLPGRQ 507
           C       + +  A+   +R AD T+ VMG D ++E E  D          + L L   Q
Sbjct: 404 CPLQGNKINPIDWAS-GVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNLSREQ 462

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT 567
            + + ++    K P ++VL+SG P  V   + +    AI++A YPG+ GG AIA +LFG 
Sbjct: 463 IDYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWYPGEEGGNAIARVLFGE 519

Query: 568 SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            +P G+LP+T +P+      P T+ +M         GRTYR+ K   +YPFG G+SY  F
Sbjct: 520 VSPSGRLPIT-FPKGVDQLPPFTDYSME--------GRTYRYMKEEPLYPFGFGLSYATF 570

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
            +     P   A   D R                       TL V  +V+N  S      
Sbjct: 571 SY---RDPKSSASRWDKRE----------------------TLEVVCEVENTSSIPADEV 605

Query: 688 L-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC-KYLSVVDRSGTRRIPLG 745
           + L      A    P   L  F +V +  G  +R+ +   +  + LS +D  G + +P G
Sbjct: 606 VQLYVRWEDAPFRVPLWSLKGFTRVSL--GTGERIQVRFVLSPEDLSFIDEKGRKVLPEG 663

Query: 746 EHNIHIG 752
               H+G
Sbjct: 664 RLRFHVG 670


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 250/718 (34%), Positives = 368/718 (51%), Gaps = 56/718 (7%)

Query: 12  LILLSASSSGLAARE-PFACDPK--DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           L  LS S++ + A + PF   P   +    +L  C  +L  PQR   L+  ++ +EK++ 
Sbjct: 15  LFSLSCSAALVYAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQN 74

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNA 125
           L+S +   PR+G+  Y WWSEALHGV+   PGT+F    G F  +TSFP  +  A++F+ 
Sbjct: 75  LVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFPMPLLMAATFDD 133

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            L E +G V+  E RA  N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS
Sbjct: 134 ELIERVGEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAAS 193

Query: 186 YVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
            +RGL+G   +R  ++ A+CKH+ A D ++WNG  R  FNAKV+ QD+ + +  PF+ C 
Sbjct: 194 MIRGLEGPVRERERRIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCA 253

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG---YIVSDCDSVGVYYDTQHF 301
            + KV S+MCSYN VNGVP CA+  +++  +R  W       YI SDC++V       H+
Sbjct: 254 RDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISANHHY 313

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
             T  E  A A  AG+D  C          A  +GLL +  ++ AL       +++G FD
Sbjct: 314 AKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQVGYFD 373

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH---RTVAVIG 418
           G  S   Y  LG   V  P  QE+AL+AA +GIVLLKN   +LPL   ++     +A+IG
Sbjct: 374 GNRSE--YASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKLAMIG 430

Query: 419 PNSDVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQQGCKDVACADDQLFGAAIDAS 475
             ++   T+ G Y+G      +P+   Q +G    T      ++ + + D    AA+ A+
Sbjct: 431 FWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPVLQN-STSKDTWTQAALAAA 489

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           + A+  +   G D S   E  DR  +  P  Q +L++ +S   K P ++V M G  +D  
Sbjct: 490 KDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM-GDQLDNT 547

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
                  I +I+WA +P                 P G+LP+T Y   Y   +PMT+M +R
Sbjct: 548 PLLASKAINSILWANWP----------------VPAGRLPVTQYHANYTAAVPMTDMTLR 591

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           PS   + PGRTYR+Y  P V PFG G+ YT F   +   P      L  R G+      G
Sbjct: 592 PSD--KLPGRTYRWYPTP-VQPFGFGLHYTTFKTKIVRLPRFAIKDLLSRCGNAYPDTCG 648

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH 712
                           ++V+V N G +   + +L F     G    P K LV++ ++ 
Sbjct: 649 LP-------------PLKVEVTNTGKRSSDYVVLAFLKGDVGPKPYPIKTLVSYTRLR 693


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  371 bits (952), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 256/741 (34%), Positives = 386/741 (52%), Gaps = 112/741 (15%)

Query: 47  LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
           L   +R   L   LS +E+ + L   A A+ + G+  Y WW+E LHGV+  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
              AT FPQ I  A++F+  +   +G VVS EARAMYN            GLT W+PN+N
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           IFRDPRWGRG ET GEDP L+ +   ++V+G+QG +   L+ AA  KHF  +     +G 
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-KYLRAAACAKHFAVH-----SGP 172

Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           +  R  F+A+VS++D+E+T+   F+  V EG+V  VM +YN+VNG P+CA   ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            EW  +GY VSDC ++  ++     T T  ++AA A++AG D++CG    LH  +A++ G
Sbjct: 232 -EWGFDGYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
           L+++ DI  A ++ L  ++RLG  D       +  L P D+   D ++ L+LEAA + +V
Sbjct: 290 LITKQDIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
           LL N G  LPL   R  ++AVIGPN+D    ++GNY G      T L+GI      R  +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQDAFDGRVYY 403

Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE-------ALD 497
            +GC+      D+  G A+   R          AD T++ +GLD ++E E       + D
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEGDTENKSGD 460

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           +  L LP  Q+ L+ K+    K P I+VL +G  ++     N     A+I A YPGQ GG
Sbjct: 461 KPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQYGG 514

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KGPV 614
            A+A+ILFG  +P GKLP+T+Y    +  LP  T+ +M+         RTYRF   +  V
Sbjct: 515 KALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDESNV 564

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
           +YPFG+G++Y++F                   G I+   +  A+ VT             
Sbjct: 565 LYPFGYGLTYSHF-----------------ECGDISYKDNTLAVNVT------------- 594

Query: 675 DVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
              N GS+     L V+     G    +  L AFE+V +  G  + + INI    +   V
Sbjct: 595 ---NTGSRSAEDVLQVYIRSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-ETV 648

Query: 735 DRSGTRRIPLGEHNIHIGGTK 755
           D +G R +  G + ++ G T+
Sbjct: 649 DDNGVRAVRSGRYTLYAGFTQ 669


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  371 bits (952), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 247/718 (34%), Positives = 373/718 (51%), Gaps = 78/718 (10%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S P+  RV  LI +++L EKV  L+S + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 54  LSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE----- 108

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDPR
Sbjct: 109 -----VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPR 163

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GE+P L+ +   ++V+GLQG     LK  A+ KHF A + +N    +RF  +
Sbjct: 164 WGRNEETYGEEPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFVANNEEN----NRFSSS 219

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E    SVM +YN  NGVP      +L   +R EW  +G+
Sbjct: 220 SQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGF 279

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  + +G DL+CG         AV++GL+SE  I+
Sbjct: 280 VVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAID 339

Query: 345 NALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
            AL   LT + +LG FD  P    PY H   K +      ELA EAA + +VLLKN    
Sbjct: 340 RALTRVLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-L 396

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCKD 459
           LPL+  + ++VAV+GP +D     +G Y+G      + L+G    IG+  +  +  G   
Sbjct: 397 LPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGT 454

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
            A +  Q+        + AD  ++ +G D+ +  E  D   + LP  Q++L+ ++   + 
Sbjct: 455 SADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQVNP 507

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              +LV  +G P+   +A  D  I AI+ A YPGQ  G A+A++LFG  NP GKLPMT Y
Sbjct: 508 -RIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             E    LP + +  M       + GRTYR+ KG  +Y FGHG+SYT+F           
Sbjct: 565 KTE--EQLPDILDFDM-------WKGRTYRYMKGEPLYGFGHGLSYTSF----------- 604

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STP 694
               D   G  N T+   AI           L   V++ N G   G   + V+    +TP
Sbjct: 605 --EFDNIQG--NDTLQPDAI-----------LQCSVELSNSGQLAGEEVVQVYVSRENTP 649

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +  P K+LVAF+KV + +G +++V   I   + LSV +  G  R+  G++ + IG
Sbjct: 650 VYTY--PLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWE-DGKWRMLSGKYTLFIG 703


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 255/743 (34%), Positives = 387/743 (52%), Gaps = 114/743 (15%)

Query: 47  LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
           L   +R   L   LS +E+ + L   A A+ + G+  Y WW+E LHGV+  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
              AT FPQ I  A++F+  +   +G V+S EARAMYN            GLT W+PN+N
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           IFRDPRWGRG ET GEDP L+ +   S+V+G+QG + + L+ AA  KHF  +     +G 
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE-EYLRAAACAKHFAVH-----SGP 172

Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           +  R  F+A+VS++D+E+T+   F+  V EG+V  VM +YN+VNG P+CA   ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            EW  +GY VSDC ++  ++ T   T T  ++AA A++AG D++CG    LH  +A++ G
Sbjct: 232 -EWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
           L+++ DI  A ++ L  ++RLG  D       +  L P D+   D ++ L+LEAA + +V
Sbjct: 290 LITKQDIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
           LL N G  LPL   R  ++AVIGPN+D    ++GNY G      T L+GI      R  +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYY 403

Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE---------A 495
            +GC+      D+  G A+   R          AD T++ +GLD ++E E         +
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEGDTGNEFAS 460

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            D+  L LP  Q+ L+  +    K P I+VL +G  ++     N     A+I A YPGQ 
Sbjct: 461 GDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQY 514

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KG 612
           GG A+A+ILFG  +P GKLP+T+Y    +  LP  T+ +M+         RTYRF   + 
Sbjct: 515 GGKALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDES 564

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            V+YPFG+G++Y++F                   G ++   +  A+ VT           
Sbjct: 565 NVLYPFGYGLTYSHF-----------------ECGDVSYKDNTLAVNVT----------- 596

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
                N GS+     L V+     G    +  L AFE+V +  G  + + INI    +  
Sbjct: 597 -----NTGSRSAEDVLQVYIKSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-E 648

Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
            VD +G R +  G + ++ G T+
Sbjct: 649 TVDDNGIRAVRSGRYTLYAGFTQ 671


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 249/736 (33%), Positives = 366/736 (49%), Gaps = 93/736 (12%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  +L+ R++L + +  L   A A+  LGI  Y WW+E LHG +  G           A
Sbjct: 5   QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------A 54

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRD 162
           T FPQ I  AS F+     A+  VVS E RA YN  T         GLT WSPNVNIFRD
Sbjct: 55  TVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRD 114

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--R 220
           PRWGRGQET GEDP L+ + A +++RGLQG +G  LK A+  KHF A+     +G +  R
Sbjct: 115 PRWGRGQETFGEDPYLTARLAVAFIRGLQG-EGPVLKTASCVKHFAAH-----SGPEPLR 168

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             FNA V K+D+E+T+   F   V E K  +VM +Y+ +N  P CA   +++ T+R  W 
Sbjct: 169 HGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLMEETLRLRWG 228

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLS 339
             G  +SDC ++  ++     T   EE+AA A++ G DL CG  +  L  E A Q+GL++
Sbjct: 229 FEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEYQSL--EKAFQKGLIT 286

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
              I  A +  +T + +LG FD       Y  LG + + + +H  LA EA+ + +VLLKN
Sbjct: 287 REQIKKAAIRVMTTRFKLGQFD---QGTAYDTLGLESLDSDEHAALAFEASCRSLVLLKN 343

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY----ARTIHQQ 455
               LPL       +AVIGPN+D    + GNY G +  Y T L+G+  Y     R ++ +
Sbjct: 344 DA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTRILYSE 402

Query: 456 GCK------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAG 500
           G        +    DD     A+  ++ +D  +L +GL++++E E         A D+  
Sbjct: 403 GSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWAGDKDD 462

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L LP  Q++L+  V+   K P I+VL+SGG +D    +    + A+I A YPGQ GG AI
Sbjct: 463 LRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEIEQY-ANVKALIQAWYPGQEGGKAI 520

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP-VVYPFG 619
           A +L+G   P GKLP+T+Y  E    LP       P        RTYR+   P V+YPFG
Sbjct: 521 AHLLYGALCPSGKLPVTFYKAE--AKLP-------PFTDYSLIRRTYRYCDDPDVLYPFG 571

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
            G+SY +F   ++ A        +     + AT+                      V+N 
Sbjct: 572 FGLSYASFSFCLSAAQ-------ETEQNGVAATVL---------------------VRNT 603

Query: 680 GSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGT 739
            + D    + ++         PH  L   + VH+ AG + ++   +   K  + V   G 
Sbjct: 604 SALDARTVVQLYLAMEGKDLPPHPVLCGMKSVHLKAGEETQITFILEE-KQFTAVQEDGN 662

Query: 740 RRIPLGEHNIHIGGTK 755
           R    G + ++ G ++
Sbjct: 663 RYAVRGGYTLYAGSSQ 678


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 244/753 (32%), Positives = 379/753 (50%), Gaps = 48/753 (6%)

Query: 10  LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
           LG  L+S S + +        D  +    T   C V+ P  +R   L+  +  QEK+  L
Sbjct: 10  LGASLISKSHAAIGP------DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNL 63

Query: 70  ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
           +S +  V RLG+  Y WW EALHGV+   PG KF   +  ATSFP  I  +++F+  L  
Sbjct: 64  VSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAFDDDLIF 122

Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
            I  ++ +EARA  NGG A + YW+P++N  RD RWGR  E+PGED      Y  + + G
Sbjct: 123 KIANIIGNEARAFGNGGVAPVDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAG 182

Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
           L+G    R K+ A+CKH+  YD++ W G DR +F+AK++ QD+ + +  PF+ C  + KV
Sbjct: 183 LEGDQAQR-KIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKV 241

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPE 306
            S MCSYN VNG+PTCAD  +L+  +R  W     N YI SDC++V    +   +  T  
Sbjct: 242 GSFMCSYNAVNGIPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLA 301

Query: 307 EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
           +  A A   G+DL C          A  +GLL+   I+ AL       +  G FDG  ++
Sbjct: 302 QGTALAFAKGMDLSCEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKAT 361

Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
             Y +L  KD+ TP+ ++L+L+   +G+V+LKN   +LPL   +   VA+IG  ++ +  
Sbjct: 362 --YANLSYKDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSK 418

Query: 427 MIGNYAG--------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQA 478
           + G Y+G        +  G    L     +   I      D    +      A+DA+ ++
Sbjct: 419 LQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTN------ALDAAEKS 472

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           D  +   G D ++  E  DR  +  P  Q +L++K  +A  G  ++V+  G   D +   
Sbjct: 473 DYILYFGGQDWTVAQEGYDRTTISFPQVQIDLLTK--LAKLGKPLVVITLGDMTDHSPLL 530

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
           +   + +IIWA +PGQ GG AI +++ G   P G+LP+T YP +Y+  L M +M +RP  
Sbjct: 531 SMEGVNSIIWANWPGQDGGPAILNVVSGAHAPAGRLPITEYPADYV-KLSMLDMNLRPHT 589

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
               PGRTYR++    V PFG G+ YT F  + A+         +G    I   + G   
Sbjct: 590 ES--PGRTYRWFN-ESVQPFGFGLHYTTFEASFASE--------EGLTYDIEEILDG-CT 637

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH-VPAG 716
           +     C    L  +V V N G++      L F     G    P K L+ + ++  +  G
Sbjct: 638 QQYKDLCEVAPL--EVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGG 695

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
           A++   + + + + L+ VD+SG   I  GE+ +
Sbjct: 696 AKKSASLPLTLGE-LARVDQSGNTVIYPGEYTL 727


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 257/746 (34%), Positives = 380/746 (50%), Gaps = 88/746 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    L +  RV DL+ RL+L EKV  +   AAA+PRLGI  Y +W+E LHGV+  G  
Sbjct: 22  PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG-- 79

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-----------G 149
                    AT FPQ+I  A++++  L   +   +SDEARA ++   A           G
Sbjct: 80  --------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFT 208
           LT+W+PN+N+FRDPRWGRGQET GEDP L+ + AA++VRGLQG   D  LK+AA  KH+ 
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDTHLKLAACAKHYA 191

Query: 209 AYD-LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            +   +N    +R  FNA+V+  D+ D++   F   V   +V SVM +YN+    P CA 
Sbjct: 192 VHSGPEN----ERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEPCCAS 247

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
             +L   +R  W   G++VSDC ++   ++T   T+ P E+AA A+  G DL CG    L
Sbjct: 248 QFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGTTFEL 307

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGH-LGPKDVCT-PDHQE 384
             E AVQRGL++E DI+ AL   L  + +LGMFD  + +  P+ +   P+ + T   H  
Sbjct: 308 LGE-AVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIVTCAAHTA 366

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LA EAA    VLL+N    LPL     R++ + GP +     ++GNY G+     T L G
Sbjct: 367 LACEAAVASCVLLQNHNHILPL-RPDVRSIYITGPLAATQDALLGNYYGLPPRAITLLDG 425

Query: 445 IG----RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEA----- 495
           +        R  ++ G        + L  A  D +   D TI  +GL   +E E      
Sbjct: 426 LAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIACLGLTALLEGEEGEAIA 484

Query: 496 ----LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
                DR  + LP  Q+  +   S+  +G  ++V++ GG   ++      ++ AI+WAGY
Sbjct: 485 SSLHGDRDDISLPPPQRLFLE--SLIQRGARVIVILFGGSA-LSLGPLADKVEAILWAGY 541

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQ GG A+ADIL G ++P G+LP+T+Y  E I +LP       P  +    GRT+R++ 
Sbjct: 542 PGQEGGRALADILLGRASPSGRLPITFY--ENINDLP-------PYANYSMRGRTHRWFD 592

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
           G   +PFG G++YT F                        T S   +   ++  N   L 
Sbjct: 593 GTPAWPFGFGLTYTRF------------------------TYSDLRVSDVYSPGNDSPLC 628

Query: 672 VQVDVKNVGSKDGAHTLLVFST---PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
             V + N G  + A  + ++ T    P     P + L  F +V +  G  +RV  +I   
Sbjct: 629 GSVLLTNTGDHEAAEIVQIYLTDFDAPGNGPVPRENLADFHRVTLAPGQSRRVEFSIPP- 687

Query: 729 KYLSVVDRSGTR-RIPLGEHNIHIGG 753
           +++ +VD +G R R PL    +H+GG
Sbjct: 688 EHILLVDTNGRRTRAPLA-FTVHVGG 712


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 257/762 (33%), Positives = 376/762 (49%), Gaps = 107/762 (14%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           T PF    L   QR +DL  RL+L+EK  L+ + +  VPRLGIK ++WW EALHG +  G
Sbjct: 23  TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGL 150
                      AT FPQ I  A+SF+  L   +  + S EARA YN          +  +
Sbjct: 83  L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG--DRLKVAASCKHFT 208
           + W+PNVNIFRDPRWGRGQET GEDP L+ +   + V GLQG  G     K  A  KHF 
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPHKYYKAFACAKHFA 192

Query: 209 AYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            +    WN   R   +   VS +D  +T+   F+  V  G V  VMC+YN ++G P C+D
Sbjct: 193 VHSGPEWN---RHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSIDGEPCCSD 249

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFL 325
             +L++ +R EW   G +VSDC ++   +        P+   A+A A++ G D+ CG   
Sbjct: 250 QRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASARAVKGGTDMSCGQTY 309

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
           G   E AV+ G ++E  I+ +L   +  +M+LG FD +  ++ +  +  KDV TP  +E+
Sbjct: 310 GSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDSITR-WNAISMKDVSTPASREV 367

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ + LL N   +LPLS  + + V V+GPN++ +V M GNY G      T L GI
Sbjct: 368 ALKMARETMTLLHNPMHALPLSK-QLKQVVVMGPNANDSVMMWGNYNGTPHHTVTILDGI 426

Query: 446 GRY---ARTIHQQGCKDV--------ACADDQLFGAAIDASRQADATILVMGLDQSIEAE 494
            R     R    +GC  V        A    QL     D        I V G+   +E E
Sbjct: 427 RRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNK----TVIFVGGISPQLEGE 482

Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
            L          DR  + LP  Q+E+++ +  A K   I+V  SG  I +      P + 
Sbjct: 483 QLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGK-QVIMVNCSGSAIGLV-----PEVT 536

Query: 545 ---AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSK 600
              AI+ A YPG+ GG A+AD+LFG  NP GKLP+T+Y  +  + LP   +  MR     
Sbjct: 537 HTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD--SQLPDYLDYNMR----- 589

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
               RTYR++KG  ++PFGHG+SYT+F                               K+
Sbjct: 590 ---NRTYRYFKGKPLFPFGHGLSYTSF-------------------------------KI 615

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
             AK     L   V VKN G +DG   + ++ +       P K L  F+++ + AG Q+ 
Sbjct: 616 GKAKMRNGKL--TVSVKNTGKRDGEEVVQLYISCLDDPNGPIKSLRGFKRMALQAGEQRT 673

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           V +N+    +    +++ T R+  G++ ++ G +     L +
Sbjct: 674 VTLNLPRKSFERFDEQTNTIRVVPGKYRVYYGTSSDEADLQS 715


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 268/787 (34%), Positives = 395/787 (50%), Gaps = 106/787 (13%)

Query: 10  LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL 69
           L LI L+     LAA+       K+  T+   +   +    +R   L+  L+L+EK+ L+
Sbjct: 16  LMLIFLTLLPKHLAAQ-------KNENTKNFEWFDTNKSFEKRAKALVASLTLEEKISLM 68

Query: 70  ISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWE 129
           +  +A + RL I  Y WW+E LHGV+  G           AT FPQ I  A++F+  L  
Sbjct: 69  VDQSAPIDRLNIPEYNWWNECLHGVARNGR----------ATVFPQAIGLAATFDQDLIF 118

Query: 130 AIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGK 181
            +   +S EARA +N        G  AGLT+W+PN+NIFRDPRWGRGQET GEDP L+ +
Sbjct: 119 KVADAISTEARAKFNASIAIGNRGKYAGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSQ 178

Query: 182 YAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVP 239
              ++V+GLQG+    LK AA  KH+  +     +G +  R  F+A  SK+D+ +T+   
Sbjct: 179 IGVNFVKGLQGNHPKYLKSAACAKHYAVH-----SGPEELRHEFDAIASKKDMAETYLPA 233

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           F   V E KV  VM +YN+VNG   CA P +L++ ++  W   GYIVSDC ++   +   
Sbjct: 234 FEALVKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFH 293

Query: 300 HFTSTPEEAAADAIRAGLDLDCG---PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
             T T EE+AA A+  GL+++CG   P L    + A+++GL SE  ++N L + L  + +
Sbjct: 294 KVTQTAEESAAAALNVGLNVNCGNVYPAL----DGAIKQGLTSEKQLDNVLQHQLLTRFK 349

Query: 357 LGMFDGEPS-SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
           LG FD  PS + PY  +    V +  H+ +ALEAA++ IVLLKN   +L       ++V 
Sbjct: 350 LGFFD--PSNNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNN-NLLPLKKDLKSVY 406

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART----IHQQGCKDVACADDQLFGAA 471
           V GPN+     ++GNY G+     T L GI           ++QG        + +  + 
Sbjct: 407 VAGPNAAREDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWST 466

Query: 472 IDASRQADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKG-P 521
            + SR AD  I+VMGL    +  E EA+      DR  + LP  Q + + K+   + G P
Sbjct: 467 GEISR-ADVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNP 525

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            +LVL  G PI  A  +    + AI++A YPG+ GG A+ADILFG   P GKLP+T +P+
Sbjct: 526 LVLVLTGGSPI--AMPEVYDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPIT-FPK 582

Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
                 P  + AM+        GRTY++      +PFG G+SYT+F +            
Sbjct: 583 SVDDLPPYNDYAMK--------GRTYKYMTKTPQFPFGFGLSYTSFKY------------ 622

Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWA 700
                           +KV   K +         + N G+ D      V+ S+P AG   
Sbjct: 623 --------------DNLKVYKEKAS-------FSITNNGNVDAEEVAQVYVSSPNAGKGD 661

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P   LV F +V + AGA ++V I      ++   D  G      G + IH+GG   S + 
Sbjct: 662 PLNTLVGFTRVSLKAGATKQVSIPFSKKAFVQ-FDSDGKEITRKGTYTIHVGGA--SPNT 718

Query: 761 HAATLGV 767
              TLG+
Sbjct: 719 ITTTLGI 725


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  369 bits (947), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 255/743 (34%), Positives = 388/743 (52%), Gaps = 114/743 (15%)

Query: 47  LPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGD 106
           L   +R   L   LS +E+ + L   A A+ + G+  Y WW+E LHGV+  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 107 FPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVN 158
              AT FPQ I  A++F+  +   +G V+S EARAMYN            GLT W+PN+N
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           IFRDPRWGRG ET GEDP L+ +   ++V+G+QG + + L+ AA  KHF  +     +G 
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-EYLRAAACAKHFAVH-----SGP 172

Query: 219 D--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           +  R  F+A+VS++D+E+T+   F+  V EG+V  VM +YN+VNG P+CA   ++ + +R
Sbjct: 173 ESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGK-LR 231

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
            EW  +GY VSDC ++  ++ T   T T  ++AA A++AG D++CG    LH  +A++ G
Sbjct: 232 -EWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTY-LHILAALEEG 289

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIV 395
           L+++ +I  A ++ L  ++RLG  D       +  L P D+   D ++ L+LEAA + +V
Sbjct: 290 LITKQNIRTACIHALRTRIRLGQLD----DNEFDDL-PFDIIACDGNKALSLEAAEKSMV 344

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIH 453
           LL N G  LPL   R  ++AVIGPN+D    ++GNY G      T L+GI      R  +
Sbjct: 345 LLHNDGI-LPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYY 403

Query: 454 QQGCKDVACADDQLFGAAIDASR---------QADATILVMGLDQSIEAE---------A 495
            +GC+      D+  G A+   R          AD T++ +GLD ++E E         +
Sbjct: 404 AEGCQ---LFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEGDTGNEFAS 460

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            D+  L LP  Q+ L+ K+    K P I+VL +G  ++     N     A+I A YPGQ 
Sbjct: 461 GDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQY 514

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFY--KG 612
           GG A+A+ILFG  +P GKLP+T+Y    +  LP  T+ +M+         RTYRF   + 
Sbjct: 515 GGKALAEILFGEVSPSGKLPVTFYKSADM--LPDFTDYSMK--------NRTYRFCDDES 564

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            V+YPFG+G++Y++F                   G I+   +  A+ VT           
Sbjct: 565 NVLYPFGYGLTYSHF-----------------ECGDISYKDNTLAVNVT----------- 596

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
                N GS+     L V+     G    +  L AFE+V +  G  + + INI    +  
Sbjct: 597 -----NTGSRSAEDVLQVYIKSENG--VKNHSLCAFERVSLFDGESRTISINIPEGAF-E 648

Query: 733 VVDRSGTRRIPLGEHNIHIGGTK 755
            VD +G R +  G + ++ G T+
Sbjct: 649 TVDDNGVRAVISGRYTLYAGFTQ 671


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  369 bits (946), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 259/786 (32%), Positives = 388/786 (49%), Gaps = 89/786 (11%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           +I+ +A+ S LA      C    A     P+    LP  +R++ LI  ++L+EK   L++
Sbjct: 1   MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVN 55

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
           G  A+ RLG+  Y++W+EALHGV+  G           AT FPQ I  A++F+  L    
Sbjct: 56  GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105

Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             V+SDEARA +N  +        +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +  
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165

Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
            + V GLQG     LK AA+ KHF  +     +G +  R  F+A  S +D+ +T+   F 
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
             V E  V +VM +YN+VNG P      +L   +R +W  +G++VSDC  +  ++     
Sbjct: 221 ALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T+   E+AA AI  G DL+CG       + AV+ GL+ E  I+  L   L  + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
            +    PY ++    V +  H ++A E A + IVLL+N+   LPL     R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
             +  ++GNY G++   T  L GI        TI ++QG        + +     +A + 
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457

Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
            D  I VMGL    +  E EA+      DR  L LP  Q   + K+   +  P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517

Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           G P+++        IA    AI++A YPGQ GG A+ADILFG  +P G+LP+T+      
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
              P +E  + P       GRTYR+     +YPFG G+SY               V  D 
Sbjct: 566 ---PKSEAQLPPYDDYSMQGRTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
                N T+       +  +     + V V+V N G ++    + L   TP AG   P  
Sbjct: 609 -----NITLGNTQALASKNELQE-NMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            L  F ++ + AG  ++V  NI   K+L  ++  G   +  G++++ +G    S    + 
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFNIP-KKHLYSINEQGKPVLLKGQYSVIVGNA--SPGNRSE 719

Query: 764 TLGVIK 769
           TLG  K
Sbjct: 720 TLGAAK 725


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 248/757 (32%), Positives = 379/757 (50%), Gaps = 47/757 (6%)

Query: 8   FFLGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           F    ++L AS  S   AA  P   D  +    T   C V+ P  +R   L+  +  QEK
Sbjct: 3   FLSTAVVLGASLVSKSYAAIGP---DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEK 59

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           +  L+S +  V RLG+  Y WW EALHGV+   PG KF   +  ATSFP  I  +++F+ 
Sbjct: 60  LDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAFDD 118

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            L   I  ++ +EARA  NGG A + YW+P++N  RD RWGR  E+PGED      Y  +
Sbjct: 119 DLIFKIANIIGNEARAFGNGGVAPMDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKA 178

Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
            + GL+G    R K+ A+CKH+  YD++ W G DR +F+AK++ QD+ + +  PF+ C  
Sbjct: 179 LLAGLEGDQAQR-KIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCAR 237

Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQHFT 302
           + KV S MCSYN VNGVPTCAD  +L+  +R  W     N YI SDC++V    +   + 
Sbjct: 238 DSKVGSFMCSYNAVNGVPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV 297

Query: 303 STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
            T  +  A A   G+DL C          A  +GLL+   I+ AL       +  G FDG
Sbjct: 298 ETLAQGTALAFAKGMDLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG 357

Query: 363 EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD 422
             ++  Y +L   D+ TP+ ++L+L+   +G+V+LKN   +LPL   +   VA+IG  ++
Sbjct: 358 AKAT--YANLSYNDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWAN 414

Query: 423 VTVTMIGNYAG--------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
            +  + G Y+G        +  G    L     +   I      D    +      A+DA
Sbjct: 415 DSSKLQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTN------ALDA 468

Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           + ++D  +   G D ++  E  DR  +  P  Q +L++K  +A  G  ++V+  G   D 
Sbjct: 469 AEKSDYILYFGGQDWTVAQEGYDRTTISFPQVQIDLLAK--LAKLGKPLVVITLGDMTDH 526

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
           +   +   I +IIWA +PGQ GG AI +++ G   P G+LP+T YP +Y+  L M +M +
Sbjct: 527 SPLLSMEGINSIIWANWPGQDGGPAILNVISGVHAPAGRLPITEYPADYV-KLSMLDMNL 585

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
           RP      PGRTYR++    V PFG G+ YT F    A+         +G    I  T+ 
Sbjct: 586 RPHAES--PGRTYRWFN-ESVQPFGFGLHYTTFEAGFASE--------EGLTYDIQETLD 634

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG-HWAPHKQLVAFEKVH- 712
               +     C    L  +V V N G++      L F     G    P K L+ + ++  
Sbjct: 635 -SCTQQYKDLCEVAPL--EVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRD 691

Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
           +  GA++   + + + + L+ VD+SG   I  GE+ +
Sbjct: 692 IHGGAKKSASLPLTLGE-LARVDQSGNTVIYPGEYTL 727


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  367 bits (943), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 251/725 (34%), Positives = 367/725 (50%), Gaps = 104/725 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   +L   +R  D+  RLS +EK +     A A  RLG   Y WWSE LHGV+  G   
Sbjct: 6   YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+       G   S EARA YN  +A        GLT W
Sbjct: 64  --------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPNVNIFRDPRWGRGQET GEDP L+     +Y +GLQG DG  L+ AA  KHF  +   
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQG-DGKVLRTAACAKHFAVH--- 171

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +     R  F+AK + +D+ +T+   F   V + KV SVM +YN+VNG P CA   ++ +
Sbjct: 172 SGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACASDFVMNK 231

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
               EW  +G+ VSDC ++  ++     T T  E+AA A++ G DL+CG    LH  +A 
Sbjct: 232 L--EEWGFDGHFVSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGNTY-LHLLAAF 288

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
             GL++E D+  + +  +  ++RLGMFD    S  Y  L    V   +H+E +L  + + 
Sbjct: 289 NEGLINEEDLRRSCIKLMRTRVRLGMFD---KSTEYDGLDYDIVACDEHKEFSLRCSERS 345

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-----GRY 448
           +VLLKN G  LPL   +++T+ VIGPN+D    + GNY G A  Y T L GI     GR 
Sbjct: 346 MVLLKNNGI-LPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREAHDGRV 404

Query: 449 ART----IHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------A 495
             T    +++  C  +A  DD+L  A I  +R    +  +  LD +IE E         +
Sbjct: 405 LYTEGSHLYKDRCMGLALPDDRLSEAEI-ITRTLRCSGSLCWLDATIEGEEGDTGNEFSS 463

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            D+  L LP  Q++LV  V MA   P I+V  +G  I+V     +    A+I A YPGQ 
Sbjct: 464 GDKNDLRLPESQRKLVKTV-MAKGKPVIIVTAAGSAINV-----EADCDALIQAWYPGQL 517

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPV 614
           GG A+A+ILFG  +P GKLP+T+Y  E  + LP  ++ +M+         RTYR+ +G +
Sbjct: 518 GGRALANILFGKVSPSGKLPVTFY--EDASKLPDFSDYSMK--------NRTYRYSEGNI 567

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL--GV 672
           ++PFG+G++Y                                    +  +C+ L+   GV
Sbjct: 568 LFPFGYGLTY------------------------------------SETECSELSFENGV 591

Query: 673 -QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
             V V N GS+     + ++    + +  P+  L  F++V + AG  + V I +    ++
Sbjct: 592 ATVKVTNTGSRFTEDVVQIYIKGYSENAVPNHSLCGFKRVALDAGESRIVQITLPERAFM 651

Query: 732 SVVDR 736
           +V ++
Sbjct: 652 AVNEK 656


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 230/618 (37%), Positives = 325/618 (52%), Gaps = 52/618 (8%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWSEALH 92
            PF   SLP  +RV DL+GRL L+E V  +  G       A  + RL I  Y W +E L 
Sbjct: 25  FPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIGPYSWNTECLR 84

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
           G  + GP          ATSFPQ    A++F+A L + I    + E RA YN        
Sbjct: 85  GDLSAGP----------ATSFPQAFGLAATFDAVLIKQIANATAYEVRAKYNNYTKHKEY 134

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           G   GL+ +SP +NI R P WGR QET GEDP LSG  AAS+V GLQG+    +   A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQGNHPRYVTANAGC 194

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF AY         R  F+AKVS +D+  TF   F  C+  G   S+MCSYN +NGVP 
Sbjct: 195 KHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-TYSLMCSYNSINGVPA 253

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           CA+  +L   +R EW   GY++SD  +V   YD  H+T    + A   + +GL+L+    
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313

Query: 325 LG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           L     + T  AV++G ++   +   +      +MRLG FD  P   PY  L    + + 
Sbjct: 314 LTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT- 439
           +HQEL+L+AA +  VLLKN+   LPL   +   +AV+GP  D  + + G+ +      T 
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPFGDNPIEIYGSKSPDVSNLTV 431

Query: 440 TPLQGIGRYAR--TIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
           TP  G+ + AR  T    GC   AC   D +    AID   + D  ++ +G    +E EA
Sbjct: 432 TPRYGLSKIARLATTFASGCLSPACTEYDPKSTKQAID---RVDMVVVCLGTGNEVENEA 488

Query: 496 LDRAGLLLPGRQQELVSK-VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
            DR+ L LPG+Q  L+   V+ A+  P IL+L + GP+D+ +A ++P I  I+   +P Q
Sbjct: 489 HDRSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITWAVSNPAIPVIVECFFPAQ 548

Query: 555 AGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
             GTA+  +   +  SNPGG+LP+TW P+      PM +  M         GRTYR++ G
Sbjct: 549 TTGTALYHLFVNSPGSNPGGRLPITW-PKSMSQVPPMEDYTME--------GRTYRYFNG 599

Query: 613 PVVYPFGHGMSYTNFVHT 630
             ++PFG+G+SYT F ++
Sbjct: 600 DPLFPFGYGLSYTTFHYS 617


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 258/786 (32%), Positives = 389/786 (49%), Gaps = 89/786 (11%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           +I+ +A+ S LA      C    A     P+    LP  +R++ LI  ++L+EK   L++
Sbjct: 1   MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVN 55

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
           G  A+ RLG+  Y++W+EALHGV+  G           AT FPQ I  A++F+  L    
Sbjct: 56  GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105

Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             V+SDEARA +N  +        +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +  
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165

Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
            + V GLQG     LK AA+ KHF  +     +G +  R  F+A  S +D+ +T+   F 
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
             + E  V +VM +YN+VNG P      +L   +R +W  +G++VSDC  +  ++     
Sbjct: 221 ALITEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T+   E+AA AI  G DL+CG       + AV+ GL+ E  I+  L   L  + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
            +    PY ++    V +  H ++A E A + IVLL+N+   LPL     R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
             +  ++GNY G++   T  L GI        TI ++QG        + +     +A + 
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457

Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
            D  I VMGL    +  E EA+      DR  L LP  Q   + K+   +  P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517

Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           G P+++        IA    AI++A YPGQ GG A+ADILFG  +P G+LP+T+      
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
              P +E  + P       GRTYR+     +YPFG G+SY               V  D 
Sbjct: 566 ---PKSEAQLPPYDDYSMQGRTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
                N T+ G    +      +  + V V+V N G ++    + L   TP AG   P  
Sbjct: 609 -----NITL-GNTQALASKNEPQENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            L  F ++ + AG  ++V  +I   K+L  ++  G   +  G++++ +G    S    + 
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFSIP-KKHLYSINEQGKPVLLKGQYSVIVGNA--SPGNRSE 719

Query: 764 TLGVIK 769
           TLG  K
Sbjct: 720 TLGAAK 725


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  365 bits (936), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 263/809 (32%), Positives = 376/809 (46%), Gaps = 146/809 (18%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           +C P++     + +   S P   R   L+ +LSL+EK  L+   + AV RLGIK Y WWS
Sbjct: 23  SCAPQE-----ISYTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWS 77

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-----AMY 143
           EALHGV+  G           AT FPQ I  A+SF+    E +   VSDEAR     A  
Sbjct: 78  EALHGVARNG----------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAE 127

Query: 144 NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLK 199
           +G     AGL++W+PN+NIFRDPRWGRG ET GEDP L G+   + VRGLQG  D D LK
Sbjct: 128 DGRVYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQGDPDADVLK 187

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
             A  KH+  +     N   R  F+A+VS++D+ +T+   F+  V +  V  VM +YN+ 
Sbjct: 188 THACAKHYAVHSGLESN---RHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRF 244

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGL 317
            G P  A   ++++ +R EW   G +VSDC ++  +++     F +T EEAAA A+  GL
Sbjct: 245 RGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVATGEEAAALAVANGL 304

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           D++CG        +A+ +GLL E D++  L+  LT + RLG  DGE    P+  L P  V
Sbjct: 305 DVECGSTFS-KIPAAIDQGLLKEEDLDRNLLRVLTERFRLGEMDGE---SPWDDLDPAIV 360

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI--- 434
             P+H+ L+L+ AR+ +VLL+N G  LPL       +A+IGPN+D      GNY  +   
Sbjct: 361 EGPEHRALSLDIARETMVLLRNNG-VLPLKA--GEKIALIGPNADDAQMQWGNYNPVPKS 417

Query: 435 -------------------ACG------------------------------YTTPLQGI 445
                              ACG                              Y   +  I
Sbjct: 418 TITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDI 477

Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL--------- 496
             Y R   +Q    +   D+    A +      D  +   G+   +E E +         
Sbjct: 478 KNYIRRDEEQRRSFMPALDE---AAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSG 534

Query: 497 -DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DR  + LPG Q+ L+  +  A K   +LV  SG  I            AI+ A YPGQ 
Sbjct: 535 GDRTDIELPGVQRRLLKALHDAGK-KVVLVNFSGCAI--GLVPETESCDAILQAWYPGQE 591

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GGTAIAD+LFG  NP GKLP+T+Y    +  LP  E            G TYR+++G  +
Sbjct: 592 GGTAIADVLFGDVNPSGKLPVTFYKN--VDQLPDVE-------DYNMEGHTYRYFRGEPL 642

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT+F                         + GK               +++D
Sbjct: 643 YPFGYGLSYTSFAFG-------------------EPKVKGK--------------NLEID 669

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           V N GS  G   + ++   P     P K L AF +V VPAG   +V I +    +L   +
Sbjct: 670 VTNTGSVAGTEVVQLYVRKPDDTAGPVKTLRAFRRVSVPAGQTVKVSIPLDKETFLWWSE 729

Query: 736 RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
           +        G + +  GG+  +  L + +
Sbjct: 730 KDQDMVPVRGRYELLCGGSSAASDLKSVS 758


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 259/786 (32%), Positives = 388/786 (49%), Gaps = 89/786 (11%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           +I+ +A+ S LA      C    A     P+    LP  +R++ LI  ++L+EK   L++
Sbjct: 1   MIIRNAAMSLLAL-----CVATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVN 55

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
           G  A+ RLG+  Y++W+EALHGV+  G           AT FPQ I  A++F+  L    
Sbjct: 56  GNVAIERLGLPEYDFWNEALHGVARNG----------RATVFPQAIGMAATFDQHLLLKA 105

Query: 132 GRVVSDEARAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
             V+SDEARA +N  +        +GLT+W+PN+NIFRDPRWGRGQET GEDP L+ +  
Sbjct: 106 ASVISDEARAKFNVSSEIGNRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMG 165

Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFR 241
            + V GLQG     LK AA+ KHF  +     +G +  R  F+A  S +D+ +T+   F 
Sbjct: 166 KAMVNGLQGDHPKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFE 220

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
             V E  V +VM +YN+VNG P      +L   +R +W  +G++VSDC  +  ++     
Sbjct: 221 ALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKV 280

Query: 302 TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           T+   E+AA AI  G DL+CG       + AV+ GL+ E  I+  L   L  + +LG FD
Sbjct: 281 TANAVESAALAINTGTDLNCGAVYNALPD-AVEAGLVDEKTIDKRLSKVLATKFKLGFFD 339

Query: 362 GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNS 421
            +    PY ++    V +  H ++A E A + IVLL+N+   LPL     R + V GP +
Sbjct: 340 PK-DDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFA 397

Query: 422 DVTVTMIGNYAGIACGYTTPLQGIG---RYARTI-HQQGCKDVACADDQLFGAAIDASRQ 477
             +  ++GNY G++   T  L GI        TI ++QG        + +     +A + 
Sbjct: 398 SSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQM 457

Query: 478 ADATILVMGLD---QSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
            D  I VMGL    +  E EA+      DR  L LP  Q   + K+   +  P I+VL +
Sbjct: 458 GDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA 517

Query: 529 GGPIDVAFAKNDPRIA----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           G P+++        IA    AI++A YPGQ GG A+ADILFG  +P G+LP+T+      
Sbjct: 518 GTPVNLT------EIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITF------ 565

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
              P +E  + P        RTYR+     +YPFG G+SY               V  D 
Sbjct: 566 ---PKSEAQLPPYDDYSMQERTYRYMTQEPMYPFGFGLSYAQ-------------VKFD- 608

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHK 703
                N T+ G    +      +  + V V+V N G ++    + L   TP AG   P  
Sbjct: 609 -----NITL-GNTQALASKNEPQENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLH 662

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            L  F ++ + AG  ++V  NI   K+L  ++  G   +  G++++ +G    S    + 
Sbjct: 663 SLKGFTRIKLAAGQTEQVLFNIP-KKHLYSINAQGKPVLLKGQYSVIVGNA--SPGNRSE 719

Query: 764 TLGVIK 769
           TLG  K
Sbjct: 720 TLGAAK 725


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 253/752 (33%), Positives = 367/752 (48%), Gaps = 113/752 (15%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGG 105
           +L   +R  DL+ R++L+EK+  +   A ++ RLG+  Y WW+EALHGV+  G       
Sbjct: 34  NLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV------ 87

Query: 106 DFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYWSPNV 157
               +T FPQ I  A++F+A L E    V+S E RA Y+            GLT+WSP +
Sbjct: 88  ----STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFWSPTI 143

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           NI RDPRWGRGQET GEDP L+ + A S++RG+QG  G  LK AA  KHF  +       
Sbjct: 144 NIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQGR-GRYLKAAACAKHFAVHSGPE--- 199

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +R  FNA+VS++D+ +T+   F   V E KVA VM +YN+VNG P C    +L   +RG
Sbjct: 200 SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCGSGTLLGDVLRG 259

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
           EW   GY+ SDC ++    +    T T EE++A A+++G DL+CG       + A + GL
Sbjct: 260 EWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYASLVK-AYRAGL 318

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           + E +I+ A+   +  +MRLGMFD  P   PY  +  +     +H+  ALE A + +VLL
Sbjct: 319 IGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAEHRAFALEVAEKSLVLL 377

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
           +N+   LPL   R R+VAVIGPN+D  V + GNY G A  Y T L GI    G  AR  +
Sbjct: 378 RNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIREAVGDRARVYY 437

Query: 454 QQGCKDVACADDQLFGAAIDASRQAD--------------ATILVMGLDQSIEAE----- 494
            +G          LF  ++    Q +                ++ +GL++ IE E     
Sbjct: 438 AEGS--------HLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEGDPS 489

Query: 495 ----ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
               A D+  L LPG Q+EL+  V  A+  P +LVL+SG  + V +A  D    A++ A 
Sbjct: 490 NEYPAGDKRDLRLPGLQEELLETVK-ATGTPVVLVLLSGSALAVNWA--DENADAVVQAW 546

Query: 551 YPG-QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
           YPG QA G   A  LFG   P G  P         T    T          R P      
Sbjct: 547 YPGAQAEGRRGA--LFGIIRPAGGFPSR------STVRTRTSRIFGTIHENRLP-----L 593

Query: 610 YKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
            +G  +YPFG+G+SYT F +  +  A + +    D                         
Sbjct: 594 LQGDPLYPFGYGLSYTKFQYGDLKLAASEIPAGEDAE----------------------- 630

Query: 669 TLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
              V V V+N G +D    + ++     S+ P   W    QL  F +VH+  G    V  
Sbjct: 631 ---VSVTVRNAGERDSDEVVQLYLQDLESSVPVPKW----QLAGFRRVHLKPGESAGVRF 683

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            +   + ++++D  G   +  G   ++ GG++
Sbjct: 684 TV-AARQMALIDEDGRCVLEPGGFRVYAGGSQ 714


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 248/760 (32%), Positives = 391/760 (51%), Gaps = 90/760 (11%)

Query: 26  EPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYE 85
           +P A + ++A     PF   S  I +RV DL+ RL+L EK   ++  A A+ RLGI  Y 
Sbjct: 26  QPSAVNAQNAVK--YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYS 83

Query: 86  WWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN- 144
           WW+EALHGV+  G           AT FP+ +  A++F+  L   IG+ +SDEA A YN 
Sbjct: 84  WWNEALHGVARTG----------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNI 133

Query: 145 -------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
                  G  +G+T+++PNVNIFRDPRWGRGQET GEDP L+ +   +YV+G+QG+D   
Sbjct: 134 AQRLENYGQYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGNDPKY 193

Query: 198 LKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
           LK AA  KH+  +     +G +  R  ++A+   +D  +T+   F   V EGKV SVMC+
Sbjct: 194 LKTAACAKHYVVH-----SGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCA 248

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN+  G P C    +L   +R +W   GY+ +DC ++  +Y          EA A AI++
Sbjct: 249 YNRTFGKPCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAIKS 308

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           G++L+CG     +  +AV++GL++E +++ AL   L  + RLG+FD  P+  PY  +  +
Sbjct: 309 GVNLNCGNEFN-YLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFDS-PNENPYAKIKEE 366

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
            + +  + +LA EAA + +VLL+N+  +LPL     +++ V+GP +     ++GNY G+ 
Sbjct: 367 VIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNGVN 425

Query: 436 CGYTTPLQGI-GRYA--RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE 492
              TT +Q I G+ +   +++ +   + +  +      +I  +  ADA + V G+    E
Sbjct: 426 SRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFE 485

Query: 493 AEA---------LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
            E           DR  L LP  Q + + ++    K P ILVL  G PI        P +
Sbjct: 486 GEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGGSPICT------PEL 539

Query: 544 A----AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQ 598
           A    AI++  YPGQ GG A+AD++FG  NP G+L +T +P+  ++ LP  E  +M+   
Sbjct: 540 ADMVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCIT-FPKS-VSQLPAFEDYSMK--- 594

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
                GRTYR+     +YPFG G+SYTN+ ++  N  T       G+   + AT+S    
Sbjct: 595 -----GRTYRYMTEEPLYPFGFGLSYTNYSYS--NIKTDKDKIKKGQSVHVTATVS---- 643

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGA 717
                              N G   G     ++ T   A    P   L   ++V + AG 
Sbjct: 644 -------------------NTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAGE 684

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
            + V   +   + + +V  +G + I  G+  ++I G+  S
Sbjct: 685 SKEVSFEV-TPQMMELVTVTGEKVIEPGDFKVYIAGSTPS 723


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 250/739 (33%), Positives = 365/739 (49%), Gaps = 74/739 (10%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  +S PI  R+ DLI R +LQEK   L      VPRLG+  +  W++ LHGV +  P  
Sbjct: 38  YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP-- 95

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG------LTYWSP 155
                    T FP      ++++  L   +   +SDEARA+YN    G      L Y SP
Sbjct: 96  --------TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSP 147

Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
            +NI RDPRWGR QE   EDP+L+G+   +YVRGLQG D   LK+AA+ KHF   ++++ 
Sbjct: 148 VINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQHLKLAATVKHFAVNNVES- 206

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
               R H NA V ++++ + +   +R  +ME    SVM SYN +NG+P   +  +L   +
Sbjct: 207 ---GRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVL 263

Query: 276 RGEWRLNGYIVSDCDSVGVYYDT---------QHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           R +W  +G++  D  +V +   T         QHF+  P  AAA AIRAG D D   F  
Sbjct: 264 RKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEF-E 322

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
            +   AVQRGLL+E D++ AL N L V  RLG +D  P +  Y  +G   V +  H++L+
Sbjct: 323 TNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQAHRDLS 381

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG-- 444
              A + + LL N+   LPL   + ++VAVIGP         GNY G     T+  +G  
Sbjct: 382 QRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGPAGGEAYE-TGNYYGTPAVKTSVTEGLR 440

Query: 445 --IGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
             +G   +  +++G   V  ADD+    A + +R++D  +L +G +  +EAE  DR  L 
Sbjct: 441 ALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLN 500

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LPG QQ L+  V  A+     LVLM+ GP+ V +A +   + AI+ A YPG+ GG AIA 
Sbjct: 501 LPGAQQRLLEAV-YAANPKVALVLMNAGPLGVTWAHD--HVPAILSAWYPGELGGAAIAR 557

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
            LFG +NPGG LP T Y      NL           S+ Y   TY+++KG  +YPFGHG+
Sbjct: 558 TLFGLNNPGGHLPYTVY-----ANLDGVPPQNEYDVSRGY---TYQYFKGVPLYPFGHGL 609

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SYT+F ++                           +KVT    +   + V   + N G  
Sbjct: 610 SYTHFDYS--------------------------KLKVTQTSGDHANVTVSFTLTNTGQS 643

Query: 683 DGAHTLLVFSTP-PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            GA    ++S    +    P + L  FE+V +  G  + V I+I               R
Sbjct: 644 AGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAVAISIPTSALGWYDTAVHNFR 703

Query: 742 IPLGEHNIHIGGTKHSVSL 760
           +  G  N  +G +   + L
Sbjct: 704 VEPGAFNFMVGSSSEDIRL 722


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 255/754 (33%), Positives = 386/754 (51%), Gaps = 105/754 (13%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R   L   ++ +E+   L   A A+ RLGI  Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+  L +    + S+EARA YN  T         GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGR  ET GEDP L+ +   + VRGLQG DG  +K AA  KHF  +   
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQG-DGKVMKAAACAKHFAVH--- 169

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R  F+AK   +D+E+T+   F   V E KV SVM +YN+VNG P CA   ++
Sbjct: 170 --SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLM 227

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
           ++    EW  +GY VSDC ++  +++    T+   E+AA A++AG D++CG     +  +
Sbjct: 228 EKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ-NLLA 284

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           A+ +GL+++  I  A V+ +  ++RLGMFD       +  +    V   +H+ ++LE A 
Sbjct: 285 ALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHTDFDDIPYSKVACAEHKAVSLECAE 341

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY-A 449
           + +VLLKN G  LPL   +++T+AVIGPN+D    + GNY G++  YTT L GI  R+  
Sbjct: 342 KSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEG 400

Query: 450 RTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE-------- 494
           R I  +GC         +A A D+ +  A+ A++ AD  I+ +GLD +IE E        
Sbjct: 401 RVIFAEGCHLYKKSISGLAQAGDR-YAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEF 459

Query: 495 -ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
            + D+ GL LP  Q+ LV K+ M+   P + V+ +G  I+      + +  A+I A YPG
Sbjct: 460 SSGDKNGLTLPPPQKILVEKI-MSVGKPVVTVVCAGSAINT-----ESQPDALIHAFYPG 513

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
             GG A+A++LFG  +P GKLP+T+Y  E    LP  T+ +M+        GRTYR+   
Sbjct: 514 AEGGKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTTD 563

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            +++PFG+G++Y            V AV        ++   SG+A +             
Sbjct: 564 NILFPFGYGLTY--------GGVKVNAVEYKDGKAVVSVENSGRATE------------- 602

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
             DV  +  KD     +           P+  L  F++V +  G +  V I I   K  +
Sbjct: 603 --DVIELYLKDYCEQAV-----------PNVSLCGFKRVKLGEGEKATVEIAIPE-KAFT 648

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
            VD +G R++  G     + GT    +L     G
Sbjct: 649 AVDNNGVRKV-FGSKFTLLAGTHQPDALSEKLTG 681


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 255/754 (33%), Positives = 386/754 (51%), Gaps = 105/754 (13%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R   L   ++ +E+   L   A A+ RLGI  Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+ F+  L +    + S+EARA YN  T         GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRG ET GEDP L+ +   + VRGLQG DG  +K AA  KHF  +   
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQG-DGKVMKAAACAKHFAVH--- 169

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R  F+AK   +D+E+T+   F   V E KV SVM +YN+VNG P CA   ++
Sbjct: 170 --SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLM 227

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
           ++    EW  +GY VSDC ++  +++    T+   E+AA A++AG D++CG     +  +
Sbjct: 228 EKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ-NLLA 284

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           A+ +GL+++  I  A V+ +  ++RLGMFD       +  +    V   +H+ ++LE A 
Sbjct: 285 ALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHTDFDDIPYSKVACAEHKAVSLECAE 341

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRY-A 449
           + +VLLKN G  LPL   +++T+AVIGPN+D    + GNY G++  YTT L GI  R+  
Sbjct: 342 KSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEG 400

Query: 450 RTIHQQGC-------KDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE-------- 494
           R I  +GC         +A A D+ +  A+ A++ AD  I+ +GLD +IE E        
Sbjct: 401 RVIFAEGCHLYKKSISGLAQAGDR-YAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEF 459

Query: 495 -ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
            + D+ GL LP  Q+ LV K+ M+   P + V+ +G  I+      + +  A+I A YPG
Sbjct: 460 SSGDKNGLTLPPPQKILVEKI-MSVGKPVVTVVCAGSAINT-----ESQPDALIHAFYPG 513

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
             G  A+A++LFG  +P GKLP+T+Y  E    LP  T+ +M+        GRTYR+   
Sbjct: 514 AEGSKALAEVLFGDVSPSGKLPVTFY--EDTDKLPEFTDYSMK--------GRTYRYTTD 563

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
            +++PFG+G++Y            V AV        ++   SG+A +             
Sbjct: 564 NILFPFGYGLTY--------GGVKVNAVEYKDGKAVVSVENSGRATE------------- 602

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
             DV  +  KD     +           P+  L  F++V +  G +  V I I   K  +
Sbjct: 603 --DVIELYLKDYCEQAV-----------PNVSLCGFKRVKLGEGEKATVEIAIPE-KAFT 648

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
            VD +G R++  G     + GT    +L     G
Sbjct: 649 AVDNNGVRKV-FGSKFTLLAGTHQPDALSEKLTG 681


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 241/729 (33%), Positives = 360/729 (49%), Gaps = 94/729 (12%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ ++++ EK+  +   A A+ RL I  Y +W+EALHGV+  G           AT FPQ
Sbjct: 17  IVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------ATVFPQ 66

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
            I  A++F+  L   I  V+  E RA YN  T         GLT+WSPNVNIFRDPRWGR
Sbjct: 67  AIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDPRWGR 126

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           G ET GEDP L+ K+  ++++GLQG     LK+AA+ KHF  +      G+ R  F+A V
Sbjct: 127 GHETYGEDPFLTSKFGVAFIKGLQGQ-AKYLKLAATAKHFAVH--SGPEGL-RHGFDAVV 182

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           S +D+ +T+   F+  V E  V S+M +YN V+GVP      +L+  +  +W   G++VS
Sbjct: 183 SDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDILHDKWSFEGHVVS 242

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNA 346
           D  +    ++   +T    E    AI+AGL+L  G     LH   A+ RGL++E +I NA
Sbjct: 243 DYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGHIEQSLH--EALNRGLVTEEEITNA 300

Query: 347 LVNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
           +++    ++RLGMF  D E  + PY     +   T  H  L+  AA +  VLLKN G  L
Sbjct: 301 VISLYATRVRLGMFATDNEYDAIPY-----EANDTKAHNNLSEIAAEKSFVLLKNDG-VL 354

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYART-------IH 453
           PL       +AV+GPN+   + ++GNY G      T L+GI    G   R        + 
Sbjct: 355 PLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGVF 414

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLP 504
           Q    +     D+    AI A+  +D  + V+GLD +IE E         A D+  L LP
Sbjct: 415 QDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLP 474

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ++L+ ++ +A   P +++L SG  + +   +N P + AI+   YPG  GG A+AD+L
Sbjct: 475 GRQRQLLERL-LAVGKPVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVL 533

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           FGT +P GKLP+T+Y      NLP  E            GRTYR+     +YPFG+G++Y
Sbjct: 534 FGTVSPSGKLPVTFYKN--TDNLPAFE-------DYNMAGRTYRYMTEEALYPFGYGLTY 584

Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
                                        S   +     K    T    V ++N G+ D 
Sbjct: 585 -----------------------------SSVELSDLQVKSYEETATATVTIQNTGNFDT 615

Query: 685 AHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
              + V+       +A P+ QL  F++V +  G++Q +  ++    +  V D  G   I 
Sbjct: 616 DEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITFDLRPQDF-EVFDEQGHNFID 674

Query: 744 LGEHNIHIG 752
                I +G
Sbjct: 675 SNRFEISVG 683


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 236/748 (31%), Positives = 369/748 (49%), Gaps = 69/748 (9%)

Query: 32  PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK-VKLLISGAAAVPRLGIKGYEWWSEA 90
           P    + + PFC   L +  R+ DL+ R+S  +   +L    +A +  +G+  Y W + A
Sbjct: 99  PCMGASSSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNA 158

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT-AG 149
           +HG+ N    T    D    TSFP     +++FN +L + +GR++  E RA YN     G
Sbjct: 159 IHGMQN----TACLADGQCPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFT 208
           L  WSP +N  RDPRWGR  E+PGE P + G+Y A+Y  GLQ G D D  +   + KH+ 
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQNGDDKDYTQAVVTLKHWV 274

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
           AY +++++ V R+ +NA VS+ D+ DT+   +   V   K   VMCSYN +NGVPTC +P
Sbjct: 275 AYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCGNP 334

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
             L   +R +W   GYI SD DS+   +   H+ S    A  D +  G D+D G     +
Sbjct: 335 -ALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGDTYADN 393

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            E+AV + L++   ++ AL N+  ++  LG+FD   ++  Y  +   +V     QE +L 
Sbjct: 394 LEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTN-AYDRISADEVGMSSSQETSLL 452

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
           AAR+ + LLKN G +LP +    + VAVIG +S+    ++GNY G  C       G    
Sbjct: 453 AARKSMTLLKNDGQTLPFA--TGKKVAVIGKSSNSAEDILGNYVGPICP-----SGAFDC 505

Query: 449 ARTIHQ------QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
            +T++Q      QG       D      AI  +  AD  +L +  +     E  DR  + 
Sbjct: 506 VQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEGKDRTYIG 564

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           L   QQELV+ V    K PT +V+++GG I + + K++ +  AI+ A  PG  GG A+A+
Sbjct: 565 LDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVAE 621

Query: 563 ILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP-----------SQSKRYPGRTYRFYK 611
            +FG +NPGGKLP+T Y  +Y+ ++    M+M+             +    PGR+Y++Y 
Sbjct: 622 TIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPGRSYKYYT 681

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
           G  +YPF +G+SYT F  + + AP +          +  +T+       T          
Sbjct: 682 GEPLYPFAYGLSYTTFNLSWSPAPPMT---------TFTSTLRSTTYTATVTN------- 725

Query: 672 VQVDVKNVGSKDGAHTLLVFSTP--------PAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                   GS  G   +  F  P        P G+  P K++  F++V +  G   +V  
Sbjct: 726 -------TGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQRVALGPGQSTQVTF 778

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
            ++  + L+ V   G R +  GE  I +
Sbjct: 779 ELNA-ETLAQVTLDGHRELHSGEFEIEL 805


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 247/747 (33%), Positives = 378/747 (50%), Gaps = 85/747 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--------AAAVPRLGIKGYEWWSEALH 92
           PF   SLP   RV DL+ RL+++E V  +  G        A AVPRLG+  + W +E L 
Sbjct: 27  PFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWNTECLR 86

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
           G    G           ATSFPQ +  A++F+  +   +    S E RA +N        
Sbjct: 87  GDVYAG----------NATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQRRKIY 136

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           G   G++ +SP +NI R P WGR QET GEDP LSG+ AA +V+ LQG D   ++  A C
Sbjct: 137 GDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQGDDPTYIRANAGC 196

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +       V RF F+AKVS++D   TF   F+ CV  G   S+MCS+N++NGVP 
Sbjct: 197 KHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSY-SLMCSFNRINGVPA 255

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           C +  +L   +R EW   GY+VSD +++       H+T+   + AA  ++AG +L+    
Sbjct: 256 CGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKAGCNLELSTN 315

Query: 325 ----LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
                  +   A++ G L + D+  ++      +MRLG FD  P   PY  +    + + 
Sbjct: 316 EVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFD-PPDHNPYNFIDLSVIQSE 374

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-GIACGY- 438
           +H+ ++L AA +  VLLKN+G  LP++ +   T++V+GP +D     IG+YA  +   Y 
Sbjct: 375 EHRAISLNAAMKSFVLLKNKGGFLPITKL-FDTISVLGPMADNKYQQIGSYAPDVMPSYT 433

Query: 439 TTPLQGIGRYARTI-HQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEAEA 495
           TTPLQG+ + ++ + +  GC D AC+  +      A+++S   D   + +G    IE E 
Sbjct: 434 TTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNSS---DIFFVCLGTGPMIENED 490

Query: 496 LDRAGLLLPGRQQELVSKVSMAS-KG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
            DRA + LPG+Q +L+    M S KG P +L+L +GGP+++ +A    R+ AI+   +P 
Sbjct: 491 HDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAIMECFFPA 550

Query: 554 QAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRF 609
           Q  G A+  ++  T   SNP G+LP TW   +Y   +P M   +M         GRTYR+
Sbjct: 551 QETGEAVLRVVTNTGNSSNPAGRLPYTW--PKYQDQIPSMVNYSME--------GRTYRY 600

Query: 610 YKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT 669
           + G  +YPFG+G+SY+ F  T A    +++   D                          
Sbjct: 601 FHGDPLYPFGYGLSYSTFNFTNAWMNPIISQGQD-------------------------- 634

Query: 670 LGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
           L V+V+V N G  DG   + V+          P  QLV FE+V + A       I +   
Sbjct: 635 LTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPIHQLVGFERVSLRAKETLSWLITVRA- 693

Query: 729 KYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           + ++V + S    I  G + ++IGG +
Sbjct: 694 ENMAVWNESRGFYIEPGRYRLYIGGQQ 720


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 235/624 (37%), Positives = 329/624 (52%), Gaps = 76/624 (12%)

Query: 73  AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWE 129
           A    RLG+  Y WWSE LHGV+   PG KF   G  F  ATSF   I   +SF+  L  
Sbjct: 8   ALGASRLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVY 66

Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
            +G  +S EARA  N G  GL YW+PNVN ++DPRWGRG ETPGEDP+    Y  + + G
Sbjct: 67  EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAG 126

Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
           L+G++  R KV A+CKH+ AYDL+ W+G+ R+ F A V+ QD+ + +  PF+ C  + KV
Sbjct: 127 LEGNETVR-KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKV 185

Query: 250 ASVMCSYN--------------QVN---GVPTCADPNILKRTIRGEWRL---NGYIVSDC 289
            S+MCSYN              ++N     P CA P ++   +R  W     N YI SDC
Sbjct: 186 GSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTEHNNYITSDC 244

Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNA 346
           +++  +  D  +F+ TP EAAA A +AG D  C       T+   A  + LL E  I+ A
Sbjct: 245 NAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTA 304

Query: 347 LVNTLTVQMRLGMFD------------GEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           L       +R G  D            G  SS  Y  L  +DV TP  QELAL +A +GI
Sbjct: 305 LRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGI 364

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           VLLKN G  LPL     + VA+IG  ++ T TM G Y+GI   Y  PL    +   +   
Sbjct: 365 VLLKNAGSLLPLD-FSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSY 423

Query: 455 QGCKDVACAD-DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                V  +D D     A+ A+  AD  +   G D ++ +E LDR  +  P  Q +L+S+
Sbjct: 424 ANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPETQMQLLSE 483

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
             +A  G  ++V+  G  +D +   N+  +++I+W GYPGQ+GGTA+ D+L G   P G+
Sbjct: 484 --LAGLGKPLVVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGR 541

Query: 574 LPMTWYPQEYITNLPMTEMAMRP------------------------------SQSKRYP 603
           LP+T YP+ Y+  +P+TEMA+RP                              +++   P
Sbjct: 542 LPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLTIQPRSTPGNKTLSSP 601

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNF 627
           GRTY++Y  PV+ PFG+G+ YT F
Sbjct: 602 GRTYKWYSSPVL-PFGYGLHYTTF 624


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 232/628 (36%), Positives = 325/628 (51%), Gaps = 79/628 (12%)

Query: 73  AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSFNATLWE 129
           A    R+G+  Y WWSE LHGV+   PG  F   G  F  ATSF   I   +SF+  L  
Sbjct: 8   ALGASRIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVY 66

Query: 130 AIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRG 189
            +G  +S EARA  N G  GL YW+PNVN ++DPRWGRG ETPGEDP+    Y  + + G
Sbjct: 67  EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAG 126

Query: 190 LQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
           L+G++  R KV A+CKH+ AYDL+ W+G+ R+ F A V+ QD+ + +  PF+ C  + KV
Sbjct: 127 LEGNETVR-KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKV 185

Query: 250 ASVMCSYNQV-----------------NGVPTCADPNILKRTIRGEWRL---NGYIVSDC 289
            S+MCSYN +                    P CA+   L   +R  W     N YI SDC
Sbjct: 186 GSIMCSYNALTIRDMAGGNPDEIINLTTAQPACAN-TYLMTILRDHWNWTEHNNYITSDC 244

Query: 290 DSVGVYY-DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--SAVQRGLLSEIDINNA 346
           +++  +  D  +F+ TP EAAA A +AG D  C       T+   A  + LL E  I+ A
Sbjct: 245 NAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTA 304

Query: 347 LVNTLTVQMRLGMFD------------GEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           L       +R G  D            G  SS  Y  L  +DV TP  QELAL +A +GI
Sbjct: 305 LRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGI 364

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           VLLKN G  LPL     + VA+IG  ++ T TM G Y+GI   Y  PL    +   +   
Sbjct: 365 VLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSY 424

Query: 455 QGCKDVACAD-DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSK 513
                V  +D D     A+ A+  AD  +   G D ++ +E LDR  +  P  Q +L+S+
Sbjct: 425 ANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPKAQMKLLSE 484

Query: 514 VSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGK 573
             +A  G  ++V+  G  +D +F   +  +++I+W GYPGQ+GGTA+ D+L G   P G+
Sbjct: 485 --LAGLGKPLVVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGR 542

Query: 574 LPMTWYPQEYITNLPMTEMAMRP----------------------------------SQS 599
           LP+T YP+ Y+  +P+TEMA+RP                                  +++
Sbjct: 543 LPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGSGSLTIQPRSTPGNKT 602

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
              PGRTY++Y  PV+ PFG+G+ YT F
Sbjct: 603 LSSPGRTYKWYSNPVL-PFGYGLHYTTF 629


>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
          Length = 303

 Score =  355 bits (910), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 165/222 (74%), Positives = 188/222 (84%), Gaps = 1/222 (0%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           + LL    S  + R  FACDP +  TRTL FC+ ++PI  RV DL+GRL+LQEK++ L++
Sbjct: 19  VFLLCLVHSSESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQEKIRNLVN 78

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            AAAVPRLGI GYEWWSEALHG+S+VGPG KFGG FPGATSFPQVITTA+SFN +LWE I
Sbjct: 79  NAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFNQSLWEEI 138

Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
           GRVVSDEARAMYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+++ KYAASYVRGLQ
Sbjct: 139 GRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQ 198

Query: 192 GS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDI 232
           G+  G+RLKVAA CKH+TAYDLDNWNGVDRFHFNAKV+   I
Sbjct: 199 GTAAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVNLLHI 240


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 242/729 (33%), Positives = 378/729 (51%), Gaps = 86/729 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q+   +I +L+L EK+  L   A  + RLGIK Y W +EALHGV   G           A
Sbjct: 33  QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
           T FPQ I   ++F+  +   IG  ++ E RA +            AGLT+W+PNVNIFRD
Sbjct: 83  TVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+G    ++V+G+QG D   LK AA  KHF  +     +G +R  
Sbjct: 143 PRWGRGMETYGEDPFLTGTLGTAFVKGMQGDDPFYLKAAACGKHFAVH-----SGPERTR 197

Query: 223 FNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             A V  +K+D+ +T+   F+M V +GKV S+M +Y ++ G        +L   +R +W 
Sbjct: 198 HTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWG 257

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G++VSDC +V   Y+      +  EA A AI+AGL+L+CG  +    + A+Q+ L++E
Sbjct: 258 FKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDAIQQKLITE 316

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
            D++ AL+  +  +++LG+   + +  PY       + +  ++++A +AA + +VLLKN 
Sbjct: 317 KDLDKALLPLMMTRLKLGILQPDAAC-PYNEFPESVIGSEANRKIAEQAAEESMVLLKNN 375

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI---HQQG 456
           G  LP++    RT+ V GP +     ++GNY G++  Y+T L+GI G+ +      ++QG
Sbjct: 376 G-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSNGTSVNYKQG 433

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPGRQ 507
              V    + +   ++  SR A+ +IL+MG   + E         AE  DR  L LP  Q
Sbjct: 434 FMQVFKNLNDV-NWSVSESRGAEVSILIMGNSGNTEGEEGDAIASAERGDRVNLRLPDSQ 492

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFG 566
            E + +VS       ++VL  G PIDV   K    +A A++ A YPGQ GG A+A++LFG
Sbjct: 493 MEYLREVSKDRTNKLVVVLTGGSPIDV---KEITELADAVVMAWYPGQEGGVALANLLFG 549

Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
            +N  G+LP+T+   E    LP   + +M+        GRTY++    ++YPFG+G+SY+
Sbjct: 550 DANFSGRLPVTF--PESADRLPAFDDYSMK--------GRTYKYMTDNILYPFGYGLSYS 599

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
              ++                   NA ++    K T        + V VDV N G     
Sbjct: 600 KVTYS-------------------NAAVTKMPTKTT-------PMTVYVDVTNNGDMPVD 633

Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
             + V+ STP AG+ +P + L+ F++V +           I + + L  V   GT ++  
Sbjct: 634 EVVQVYLSTPGAGNTSPIESLIGFKRVKIYPHITVTKDFQIPM-ELLETVQADGTSKLLK 692

Query: 745 GEHNIHIGG 753
           GE+ I I G
Sbjct: 693 GEYQIKISG 701


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 237/719 (32%), Positives = 354/719 (49%), Gaps = 86/719 (11%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           R   LI   +L+E V    + +  VPRLG+  Y+ W+EALHG+       +  G F  +T
Sbjct: 72  RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRAYFTDE--GQFSWST 129

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQET 171
           SFP  I T S+ N TL   +  ++S + RA  N G  GL  +SPN+N FR P WGRGQET
Sbjct: 130 SFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRHPVWGRGQET 189

Query: 172 PGEDP-VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           PGED   LS  YA  Y+ G+QG  D   LK+ A+ KH+  YD++NW+G  R   +  +++
Sbjct: 190 PGEDAYCLSSAYAYEYITGIQGGVDPKSLKLVATAKHYAGYDIENWDGHSRLGNDMNITQ 249

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL--NGYIVS 287
           QD+ + +   F +   + KV SVMCSYN VNGVP+CA+   L+  +R  +    +GYI S
Sbjct: 250 QDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISS 309

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCDS    ++   + +    AAAD+IRAG D+DCG     + + AV + LLS  DI   +
Sbjct: 310 DCDSAYNVFNPHEYAANVSSAAADSIRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGV 369

Query: 348 VNTLTVQMRLGMFDGEP----SSQPYG-HLGPKD-VCTPDHQELALEAARQGIVLLKNQG 401
           +   +  MRLG FD  P    S+Q  G + GP   + +P      L+A R          
Sbjct: 370 IRLYSNLMRLGYFDVGPWMNVSTQLQGNYFGPAPYLISP------LDAFRD--------- 414

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-GYTTPLQGIGRYARTIHQQGCKDV 460
                SH+            DV      N +  +  G++  L    +             
Sbjct: 415 -----SHL------------DVNYAFGTNISSNSTDGFSKALSAAKK------------- 444

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
             +D  +F   ID S +A               E LDR  +  PG+Q EL+ ++S   K 
Sbjct: 445 --SDAIIFAGGIDNSLEA---------------ETLDRMNITWPGKQLELIDQLSQLGK- 486

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P I++ M GG +D +  K++  + ++IW GYPGQ+GG A+ DI+ G   P G+L +T YP
Sbjct: 487 PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIITGKRAPAGRLVVTQYP 546

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
            EY T  P T+M++RP  +   PG+TY +Y G  VY FGHG+ YT F   V++A  V   
Sbjct: 547 AEYATQFPATDMSLRPHGNN--PGQTYMWYTGTPVYEFGHGLFYTTF--RVSHARAVKIK 602

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
           P      +I   ++       H +     L   VD+ N G     +T ++F+   AG   
Sbjct: 603 PT----YNIQDLLAQPHPGYIHVEQMPF-LNFTVDITNTGKASSDYTAMLFANTTAGPAP 657

Query: 701 -PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
            P K LV F+++     +  ++         ++  D  G R +  G++ + +   +  V
Sbjct: 658 YPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDELGNRVLYPGKYELALNNERSVV 716


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 243/729 (33%), Positives = 380/729 (52%), Gaps = 86/729 (11%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q+   +I +L+L EK+  L   A  + RLGIK Y W +EALHGV   G           A
Sbjct: 33  QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLTYWSPNVNIFRD 162
           T FPQ I+  ++F+  + + IG  ++ E RA +            AGLT+W+PNVNIFRD
Sbjct: 83  TVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+G    ++V+G+QG+D   LK AA  KHF  +     +G +R  
Sbjct: 143 PRWGRGMETYGEDPFLTGVLGTAFVKGMQGNDPFYLKAAACGKHFAVH-----SGPERTR 197

Query: 223 FNAKV--SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             A V  +K D+ +T+   F+M V +GKV S+M +Y ++ G        +L   +R +W 
Sbjct: 198 HTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWG 257

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
             G++VSDC +V   Y+      +  EA A AI+AGL+L+CG  +    + A+++ L++E
Sbjct: 258 FKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDALKQKLITE 316

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
            D++ AL+  +  +++LG+   + +  PY       + + D++ +A  AA + +VLLKN 
Sbjct: 317 KDLDKALLPLMMTRLKLGILQPDVAC-PYNEFPESVIGSIDNRNIAQRAAEESMVLLKND 375

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI---HQQG 456
           G  LP++    RT+ V GP +     ++GNY G++  Y+T L+GI G+ +      ++QG
Sbjct: 376 G-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGIVGKVSNGTSVNYKQG 433

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE---------AEALDRAGLLLPGRQ 507
              V    + +   ++  SR A+ +I++MG   + E         +E  DR  L LP  Q
Sbjct: 434 FMQVFKNLNDV-NWSVSESRGAEVSIIIMGNSGNTEGEEGDAIASSERGDRVDLRLPEPQ 492

Query: 508 QELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFG 566
            + + +VS       ++VL  G PIDV   K    +A A++ A YPGQ GG A+A++LFG
Sbjct: 493 MQYLREVSKDRTNKLVVVLTGGSPIDV---KEITELADAVVMAWYPGQEGGVALANLLFG 549

Query: 567 TSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYT 625
            +N  G+LP+T+   E    LP   + +M+        GRTY++    ++YPFG+G+SY 
Sbjct: 550 DANFSGRLPVTF--PETTDKLPSFDDYSMK--------GRTYKYMTDNILYPFGYGLSY- 598

Query: 626 NFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA 685
                             G+    NAT       VT       ++ V VD+ N G+    
Sbjct: 599 ------------------GKVAYGNAT-------VTKLPTKHSSMTVSVDLSNDGNMPVD 633

Query: 686 HTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
             + V+ STP AG  +P + LVAF++V +   A       I V + L  V   GT ++  
Sbjct: 634 EVVQVYLSTPSAGVTSPIESLVAFKRVKIAPHATVTTDFEIPVER-LETVQEDGTSKLLK 692

Query: 745 GEHNIHIGG 753
           GE+ + I G
Sbjct: 693 GEYRVMISG 701


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 249/710 (35%), Positives = 363/710 (51%), Gaps = 91/710 (12%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQE---KVKLLISG--AAAVPRLGIKGYEWWSEALHGV 94
            PF   SL    R +DL+ RL+L+E   + +    G    A+ RLGIK Y W +E L G 
Sbjct: 20  FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAGQ 79

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
            N             AT++PQ I  A+SF+  L   + R +S E RA +N   A      
Sbjct: 80  VNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128

Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
             GL+ +SP +NI R P WGR QET GEDP+LSG  A S+VRGLQG D   L+  A CKH
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDDPRYLRANAGCKH 188

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           F  +       V RF F+AKV+ +D   TF   F+MCV  G   S+MCSYN++NG+P CA
Sbjct: 189 FDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSY-SLMCSYNRINGIPACA 247

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           +  +L    R EW  +GYIVSD  ++    +  H+T++       AI+AG +L+ G    
Sbjct: 248 NKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLELGGGSN 307

Query: 327 LHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           ++      A+++GLL+E +I + +   L  ++RLG FD E +   Y  +G   + +P+H+
Sbjct: 308 MYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPE-AMVDYNKIGVDVIQSPEHR 366

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG-IACGYT--- 439
           E A++AA  G VLLKN    LP+   ++  +A++GP ++ T  + G Y+  +   +T   
Sbjct: 367 EQAVKAAYMGFVLLKNHNNLLPIKK-QYSKLAIVGPFTNATSELFGTYSSEVNLKFTSTI 425

Query: 440 ----TPLQGIGRYARTIHQQGCKDVACA----DDQLFGAAIDASRQADATILVMGLDQSI 491
               +PL G  R A      GC + AC+    DD     A      AD  I+ +G  Q  
Sbjct: 426 FEGLSPLGGSTRSA-----NGCTNSACSGYVRDDVETAVA-----GADLVIVALGSGQRF 475

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           E+E  DRA L L G Q +++      S G P ILVL++ GP+D+ +AK DP + AI+  G
Sbjct: 476 ESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCG 535

Query: 551 YPGQAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
           YP Q+ G A+   L  +   + P G+L  TW P        +T+  M+        GRTY
Sbjct: 536 YPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPKITDYTMQ--------GRTY 586

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
           R+Y G  +YPFG G+SYT+F +T                 SI+ ++  +   VT      
Sbjct: 587 RYYVGEPLYPFGFGLSYTSFSYT---------------RLSISPSVITQGDNVT------ 625

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAG 716
               V+V +KN GS D    + V+ + P   +  P   L AF +  + AG
Sbjct: 626 ----VEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFARPFISAG 671


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  350 bits (898), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 241/706 (34%), Positives = 357/706 (50%), Gaps = 94/706 (13%)

Query: 79  LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
           + I  Y +W+EALHGV+  G           AT FPQ I  A++F+  L   I  V+  E
Sbjct: 1   MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50

Query: 139 ARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
            RA YN  T         GLT+WSPNVNIFRDPRWGRG ET GEDP L+ K+  ++++GL
Sbjct: 51  GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110

Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
           QG     LK+AA+ KHF  +      G+ R  F+A VS +D+ +T+   F+  V E  V 
Sbjct: 111 QGQ-AKYLKLAATAKHFAVH--SGPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVE 166

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           S+M +YN V+GVP      +LK  +  +W   G++VSD  +    ++   +T    E   
Sbjct: 167 SIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMG 226

Query: 311 DAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF--DGEPSSQ 367
            AI+AGL+L  G     LH   A+ RGL++E +I NA+++    ++RLGMF  D E  + 
Sbjct: 227 LAIKAGLNLVAGHIEQSLH--EALDRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAI 284

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
           PY     +   T  H  L+  AA +  VLLKN G  LPL       +AV+GPN+   + +
Sbjct: 285 PY-----EANDTKAHNNLSEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIAL 338

Query: 428 IGNYAGIACGYTTPLQGI----GRYART--------IHQQGCKDVACADDQLFGAAIDAS 475
           +GNY G      T L+GI    G   R               + +A AD++   A I A+
Sbjct: 339 LGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVI-AA 397

Query: 476 RQADATILVMGLDQSIEAE---------ALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
             +D  + V+GLD +IE E         A D+  L LPGRQ++L+ ++ +A   P +++L
Sbjct: 398 EHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERL-LAVGKPVVVLL 456

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
            SG  + +   +N P + AI+   YPG  GG A+AD+LFG  +P GKLP+T+Y    + N
Sbjct: 457 ASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFYKN--VDN 514

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  E            GRTYR+     +YPFG+G++Y+          +V    L  + 
Sbjct: 515 LPAFE-------DYNMAGRTYRYMTDEALYPFGYGLTYS----------SVELSDLQVKS 557

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
               AT++       +   + +   VQV VK++GS+        F+ P A       QL 
Sbjct: 558 YEDTATVTATIQNTGNFDTDEV---VQVYVKDLGSE--------FAVPNA-------QLK 599

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            F++V++  GA+Q +  ++    +  V D  G   I      I +G
Sbjct: 600 GFKRVYLGKGAKQTITFDLRPQDF-EVFDAQGRNFIDSDRFEISVG 644


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 249/788 (31%), Positives = 382/788 (48%), Gaps = 91/788 (11%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           + ++ L++L   S+G AA+                F    + +  RV DL+ RL+L++K 
Sbjct: 5   YKYMNLLVLIFLSTGAAAQSI-------GIQNNPAFLNQEMSMEARVADLMSRLTLEQKA 57

Query: 67  KLLISGAAAVPRLGIK-GYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           +LL      V   G     + W++ LHGV    P           T+FP  I   ++++ 
Sbjct: 58  QLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEP----------TTNFPTSIALGATWDT 107

Query: 126 TLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQETPGEDP 176
            L   +  V+SDEARA+YNG            GL Y SP +NI R+P WGR  E  GEDP
Sbjct: 108 ELIHRVATVISDEARAIYNGWKQDPEFRGEHKGLIYRSPVINISRNPYWGRINEIFGEDP 167

Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
             +G+   +YV+GLQG D   LK+A++ KH+   +++    VDR   +A+V ++ + + +
Sbjct: 168 YHTGRMGVAYVKGLQGDDSHYLKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYW 223

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
              F+ C++EGK  SVM SYN +NGVP   +  +L   ++ +W   G++VSD   V    
Sbjct: 224 LPHFKDCIVEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMV 283

Query: 297 DTQHFTSTP-EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
           +  H      EEA   +I AG D     +   +   A+++G L+E  +N+AL   L V+ 
Sbjct: 284 EGHHQRQISCEEAVGRSIMAGCDFSDAEYEK-YIPDALRKGYLTEERLNDALRRVLLVRF 342

Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
           RLG FD +  S PY  + P  +   +H+ L+LEAAR+ IVLLKN+   LP+     + VA
Sbjct: 343 RLGEFD-DFKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVA 401

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACA-------- 463
           VIGP +D+     GNY G+     TPLQGI    G     ++ +G +             
Sbjct: 402 VIGPYADLFNQ--GNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIP 459

Query: 464 ----DDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
                +     A++ +R +D   L +G    IE E  DR  L+LPG Q ELV  V   +K
Sbjct: 460 PRFDKEAEMKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK 519

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              ++VLMS GP+ V   K +  I A++ A +PG  GG AIAD+LFG  NPGGKLP T Y
Sbjct: 520 -KVVVVLMSAGPVAVPEVKKN--IPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMY 576

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
             +    +P T+      +     G TY + K   ++ FGHG+SY+ F ++     + V 
Sbjct: 577 ASD--EQVPSTD------EYDISKGFTYMYLKKKPLFAFGHGLSYSKFHYSDLQISSPVV 628

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH 698
                   S+N T+S                 V + VKN+G + G   + L      A  
Sbjct: 629 --------SVNDTVS-----------------VVLKVKNMGKRTGEEVVQLYVRDVKAKV 663

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGTKHS 757
             P K+L  F+++ +    +Q + + + V K L+  D S G   +  G   I +G     
Sbjct: 664 VRPTKELRGFKRIALQPNEEQEIRLMLPV-KSLAFYDESIGDFLVEPGSFEILLGSASDD 722

Query: 758 VSLHAATL 765
           + L +  +
Sbjct: 723 IRLQSKLI 730


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 247/748 (33%), Positives = 366/748 (48%), Gaps = 76/748 (10%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPRLGIKGYEWWSEALH 92
            PF   SLPI +RV D++ +LSL + V+ +  G A        +P+  IK Y+W +E L 
Sbjct: 27  FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA-----MYNGGT 147
           G  N G           ATSFP  I  A+SFN  L + +    + E RA     + NG  
Sbjct: 87  GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136

Query: 148 A---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           A   GL+ WSP +NI RDPRWGR QET GEDP LSG    ++V GLQG D   +   A C
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQGDDPTYVIANAGC 196

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +       + R  F+A V+  D   TF   F+ CV  G + S+MCSYN++NGVP 
Sbjct: 197 KHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGAL-SLMCSYNRINGVPA 255

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           CA+  +L   +R EW   GY+VSD  ++       H+      AAADA  AG  L+ G  
Sbjct: 256 CANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAANAGTCLEDGNS 315

Query: 325 LGL------HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
            G       + + AV++GL+S   + +A+     V+ +LG FD   ++ PY ++    + 
Sbjct: 316 EGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPYANIPLSIIQ 375

Query: 379 TPDHQELALEAARQGIVLLKNQG---PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-GI 434
           + +H +L+++AA + IVL+KN     P LPL+    +   V+GP  +   TM G+Y+  +
Sbjct: 376 SDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADTMFGDYSPTM 435

Query: 435 ACGY-TTPLQGIGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQ 489
              Y  TPL GI           ++ GC D    +         A    D  I+  GL +
Sbjct: 436 MTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRTACEGVDLVIVTAGLSR 495

Query: 490 SIEAEALDRAGLLLPGRQQELVSKVSMAS-KGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
            +E E  D + + LPG Q  L++    AS   P IL+L +  P+D+++AK++PR AAI+ 
Sbjct: 496 YLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANPLDISYAKSNPRFAAILE 555

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
           A YPGQ  G AIA++L G+ NP G+LP TW P        M +  M+         RTYR
Sbjct: 556 AYYPGQEAGVAIANVLTGSYNPAGRLPNTW-PASLDQVPDMIDYTMKE--------RTYR 606

Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
           ++    +YPFG+G+S+T F ++  N  +              A  +G+            
Sbjct: 607 YFTQEPLYPFGYGLSFTTFNYSDLNVAST-------------ANTNGEG----------- 642

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVC 728
           ++ V V V N G+ DG      +        AP+ QLV   +  +  G    V   I   
Sbjct: 643 SIAVSVTVMNTGTMDGDEVTQAYVKWDNVAEAPNIQLVGVSRKFISKGQSITVSFTIKPE 702

Query: 729 KYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
           +    ++   G   IP G +++ +GG +
Sbjct: 703 QLQVWINGDDGKWSIPGGTYSLFVGGQQ 730


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  349 bits (895), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 235/670 (35%), Positives = 341/670 (50%), Gaps = 90/670 (13%)

Query: 10  LGLILLSAS--SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
           L L+LL AS  ++ + A  P AC  +    + L FC  SLP+  RV DL+ RL L EK  
Sbjct: 3   LPLLLLVASVVAASVDAEIPRACVGEH---QKLQFCNTSLPVSARVEDLLARLPLDEKAI 59

Query: 68  LLISGAAAVPR-----LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           LL   A A PR     +G+  Y W +  +HGV      +  G + P  TSFP  +  +  
Sbjct: 60  LLT--ARASPRGNMSSIGLPEYNWGANCVHGVR-----STCGTNCP--TSFPNPVNLS-- 108

Query: 123 FNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
                                             ++  RDPRWGR  ETP EDP+++ KY
Sbjct: 109 ----------------------------------IHRRRDPRWGRNTETPSEDPLVNSKY 134

Query: 183 AASYVRGLQG---SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 239
             +Y +GLQ     D   L+   + KH+ AY  +N+ G +R  FNA VS  D  DT+   
Sbjct: 135 GVAYTKGLQEGKHEDPRYLQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPA 194

Query: 240 FRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQ 299
           FR  +++G    VMCSYN VNGVP CA+  +  + +RG    +GYI SD  ++    D  
Sbjct: 195 FRSSIVDGNAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWL 254

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHT--ESAVQRGLLSEIDINNALVNTLTVQMRL 357
           H+  T  EAA  AI AG D++ G   G     +  V+   L    +++ L +TL ++  L
Sbjct: 255 HYVPTRCEAARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFEL 314

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+FD     QPY  + P DV T   ++L+L+ AR+ IVLL+N  P LPL   R   +AV+
Sbjct: 315 GLFD-PIEDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPLR--RGVKLAVV 371

Query: 418 GPNSDVTVTMIGNYAGIAC--------GYTTPLQGIGRY---ARTIHQQGCKDVACADDQ 466
           GP++     ++GNY G  C           TP + +      + T +  GC +V      
Sbjct: 372 GPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFEAVSASNGDSSTTYALGC-NVTGNSTA 430

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
            F  A+ A + A+A +L +G+D+S+EAE  DR  + LP  Q +L+ +V    K PT++VL
Sbjct: 431 GFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVL 489

Query: 527 MSGGPIDVA--FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           M+GG +       + D    A++ A YPG  G  A+ DILFG +NPGGKLP+T Y  +Y+
Sbjct: 490 MNGGVLTAEDIIGQTD----ALVEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDYV 545

Query: 585 TNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
             + M  M +       YPGR+YR++KG  V+PFG G+SYT+F     +A    A  +  
Sbjct: 546 NTVDMKSMNV-----TAYPGRSYRYFKGEPVFPFGWGLSYTSFSLKADDATATTAKSVS- 599

Query: 645 RHGSINATIS 654
              ++N TIS
Sbjct: 600 --ATMNTTIS 607


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  348 bits (894), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 243/728 (33%), Positives = 366/728 (50%), Gaps = 93/728 (12%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA-------AVPRLGIKGYEWW 87
           A  +  PF   +LP  +R+ DL+GRL+  E V  +  G A        +PRLGI  Y W 
Sbjct: 22  AEAQPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWN 81

Query: 88  SEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           +E L G            + PG AT+FPQ +  A++F+  L   +    + E RA +N  
Sbjct: 82  TECLRG----------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSF 131

Query: 147 TA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
            A        GL+ +SP +NI R P WGR QET GEDP L+ + A S+V+GLQG     +
Sbjct: 132 VAAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQHPRYI 191

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K +A CKHF+ +       V R  F+AKV ++D   TF   F+ CV  G   S MCSYN+
Sbjct: 192 KASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSY-SFMCSYNR 250

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD 318
           +NGVP CA+  +L   +RGEW   GY+VSD  +V +      +T T  E A  ++ AGL+
Sbjct: 251 INGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAGLN 310

Query: 319 LDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           L+    +     +H   A+  G ++   + + +      ++RLG FD  P+  PY  L  
Sbjct: 311 LELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFD-PPAMNPYNALEL 369

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI 434
             V + +H+ L+LEAA +  VLLKNQ  +LPL  +  + +AV+GP +D    + G+YA +
Sbjct: 370 SVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYAPV 429

Query: 435 --ACGYTTPLQGIGRY-ARTIHQQGCKDVAC---ADDQLFGAAIDASRQADATILVMGLD 488
                  TP +G+    A      GC++  C   + D++     +A R AD  ++ +G  
Sbjct: 430 PEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEV----ENAVRGADVVLVCLGTG 485

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAII 547
             +E EA DR  L LPG Q +L+     A+ G P IL+L + GP+DV++A+    + AI+
Sbjct: 486 IDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAIL 545

Query: 548 WAGYPGQAGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
              +P QA G AIA +L G   ++P G+LP TW         P     + P ++    GR
Sbjct: 546 ACFFPAQATGLAIASVLLGKQGASPAGRLPATW---------PAGMHQVPPMENYTMEGR 596

Query: 606 TYRFY-KGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           TYR+Y +   +YPFG+G+SYT F +  +  +P V+ +                       
Sbjct: 597 TYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPVLPI----------------------- 633

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVF------STPPAGHWAPHKQLVAFEKVHVPAGA 717
            C  L+  V V ++N G +D    + ++      S P      P  QLVAF +V VPAG 
Sbjct: 634 -CANLS--VSVVLENTGPRDSEEVVQLYLRWEQPSVP-----VPRWQLVAFRRVAVPAGG 685

Query: 718 QQRVGINI 725
             ++   +
Sbjct: 686 ATKLSFGV 693


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  348 bits (893), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 264/776 (34%), Positives = 378/776 (48%), Gaps = 92/776 (11%)

Query: 28  FAC--DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPR 78
           FAC     D      PF   SL I  RV D++  L+L+E V+ +  G A        +PR
Sbjct: 10  FACCLAISDHVEALFPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPR 69

Query: 79  LGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDE 138
           L I  Y+W +E L G  NV  G         ATSFP  I  A+SFN  L + +    + E
Sbjct: 70  LHINPYQWGTECLSG--NVSAG--------DATSFPMPIGMAASFNYDLLKRVTNATAYE 119

Query: 139 ARAMY-----NGGTA---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
            RA +     +G  A   GL+ WSP +NI RDPRWGR QET GEDP LSG    +YV GL
Sbjct: 120 VRAKHAAAVKDGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGL 179

Query: 191 QGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
           QG++   +   A CKHF  +         RF F+AKVS +D   TF   F+ CV  G + 
Sbjct: 180 QGNNSRYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGAL- 238

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           S+MCSYN++NGVP CA+  +L   +R EW   GY+VSD  ++       H+     +AAA
Sbjct: 239 SLMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAA 298

Query: 311 DAIRAGLDLDCG----PFLGL--HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
           DA  AG  L+ G     F  +  H   AV+  L+S   + NA+     V+M+LG FD  P
Sbjct: 299 DAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFD-PP 357

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN----QGPSLPLSHIRHRTVAVIGPN 420
            + PY ++    + +  H  L+L+AA + IVL+KN    + P LP+++   +   ++GP 
Sbjct: 358 DNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITN-EVKKACMVGPF 416

Query: 421 SDVTVTMIGNYA-GIACGYT-TPLQGIGRY---ARTI-HQQGCKDVACADDQLFGAAIDA 474
           SD    + G+Y+  +   Y  T L G+        T+ +  GC+D     +        A
Sbjct: 417 SDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRSA 476

Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT-ILVLMSGGPID 533
               +  I+  GL + +E+E  D + + LPG Q +L+     ASK  + IL+L +  P+D
Sbjct: 477 CDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPLD 536

Query: 534 VAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY-----ITNLP 588
           + +AK DPRI  I+ A YPGQ  G AIA++L G  NP G+LP TW P        ITN  
Sbjct: 537 IRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-PASLDQVPGITNYT 595

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
           M E             RTYR++    +YPFG+G+SYT F ++  N              S
Sbjct: 596 MKE-------------RTYRYFTQEPLYPFGYGLSYTTFHYSNLNI-------------S 629

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
             AT SG  +           + V V V N GS DG     V+       +AP  QLV  
Sbjct: 630 STATASGAGM-----------IAVSVLVTNTGSMDGTEVTQVYVWCNIS-YAPKLQLVGV 677

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            K  +  G    V  +I   + L V        IP+  +++ + G + + ++ A++
Sbjct: 678 NKDFISKGKTLEVSFSIKP-EQLQVWTDDDKWVIPVSTYDLSVSGQQPNQTVLASS 732


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  348 bits (892), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 220/600 (36%), Positives = 330/600 (55%), Gaps = 63/600 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ +++L EK+  +   A+A+ RL I  Y +W+E LHGV+  G           AT FPQ
Sbjct: 15  IVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------ATVFPQ 64

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGR 167
            I  A++F+  L + I  V+S E RA YN  T         GLT+WSPN+N+FRDPRWGR
Sbjct: 65  AIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDPRWGR 124

Query: 168 GQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           GQET GEDP L+ +   ++++GLQG +G  L++AA  KHF  +   +    DR +F+A V
Sbjct: 125 GQETYGEDPFLTAQIGVAFIKGLQG-EGKYLRLAACTKHFAVH---SGPEADRHYFDAVV 180

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           + +D+ + +   F+  + E  V S M +YN +NG P C +  ++ +T+ G+W   G++VS
Sbjct: 181 NPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLLGKWGFEGHVVS 240

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  ++   ++  H+T T  E  A A++ G +L  G       E AV +GL++E +I  ++
Sbjct: 241 DYAALEDVHENHHYTQTAAETMALAMKIGTNLCAGKISDALFE-AVGKGLVTETEITASV 299

Query: 348 VNTLTVQMRLGMF--DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           V   T  +RLGMF  D +  + PY     +   + +H+ L+L+AA + +VLLKN    LP
Sbjct: 300 VKLYTTHVRLGMFAEDNDYDTIPY-----EVNASAEHEMLSLKAAEKSMVLLKNDN-FLP 353

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCKDVA 461
           LS    ++VAVIGP +     + GNYAG A  Y T + GI +     AR  +  GC   A
Sbjct: 354 LSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVTYALGCHLYA 413

Query: 462 -CADDQLFGA------AIDASRQADATILVMGLDQSIEAE---------ALDRAGLLLPG 505
             A+  L  A      AI A+  AD  +L +GLD +IE E         + D+  L LPG
Sbjct: 414 DHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSGDKPSLSLPG 473

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           +Q+ L+ KV    K   ILVL SG  + +   +    + AII A YPG  GGTA+A+IL 
Sbjct: 474 QQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHGGTALANILL 532

Query: 566 GTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           G  +P GKLP+T+        LP  ++ +M          RTY+  +  V+YPFG+G++Y
Sbjct: 533 GKVSPSGKLPVTFCKD--TQGLPDFSDYSMAE--------RTYQNTQLEVLYPFGYGLTY 582


>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 241

 Score =  346 bits (888), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 160/199 (80%), Positives = 179/199 (89%), Gaps = 1/199 (0%)

Query: 30  CDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSE 89
           CD ++  TR + FCQ SL I +RV DLIGRL+L EK++LL++ A AVPRLGI+GYEWWSE
Sbjct: 34  CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93

Query: 90  ALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG 149
           ALHGVSNVGPGTKFGG FPGATSFPQVITTA+SFN +LW  IGRVVSDEARAMYNGGTAG
Sbjct: 94  ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAG 153

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-DRLKVAASCKHFT 208
           LTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA+YV+GLQG+DG  RLKVAA CKH+T
Sbjct: 154 LTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKRLKVAACCKHYT 213

Query: 209 AYDLDNWNGVDRFHFNAKV 227
           AYDLDNWNGVDR+HFNAKV
Sbjct: 214 AYDLDNWNGVDRYHFNAKV 232


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  345 bits (886), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 252/717 (35%), Positives = 358/717 (49%), Gaps = 81/717 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVP--RLGIKGYEWWSEALHGVSNVG 98
           PF   SL   +RV+DL+ RL+L E V  +   + A P  RLGIK Y W SE LHGV  V 
Sbjct: 27  PFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGV--VP 84

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGL 150
           P          AT+FPQ I  A+SF+  L   + + +  E RA +N            GL
Sbjct: 85  PDGL-------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQEHTGL 137

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
           + +SP +NI R P WGR QET GEDP L G+  ++YVRGLQG     +   A CKHF  +
Sbjct: 138 SCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGDHPRYVLANAGCKHFDVH 197

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
                  V RF F+AKV ++D + TF   F  CV  G V SVMCSYN++N VP CA+  +
Sbjct: 198 GGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMCSYNRINEVPACANTRL 256

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD----CGPFLG 326
           L   +R EW  +GY+VSD  +V     + H+T +  +  A A+ AG +LD     G  + 
Sbjct: 257 LTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGCNLDLAFPVGDGMY 316

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           +    AV  G + E  +   +      +MRLG FD  P   PY +L    V + +H+ELA
Sbjct: 317 IKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPYANLNLSVVQSEEHRELA 375

Query: 387 LEAARQ-----GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG--YT 439
           ++AA Q       VLLK +G  LPL  + ++ +AVIGP +D    + G+Y+         
Sbjct: 376 VKAALQSFVLLNFVLLKREGRVLPLDTLVNK-LAVIGPFADNPSYLFGDYSPNPDKEFVV 434

Query: 440 TPLQGIGRYAR-TIHQQGCKDVACAD--DQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
           TP +G+   AR T    GC    C     ++  AA+     AD  ++ +G    IEAE +
Sbjct: 435 TPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAVTG---ADLIVVCLGTGVKIEAEFV 491

Query: 497 DRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
           DR+ L LPG+Q +L+  V   + G P IL+L + GP+D+ +A  +P I  I+   +P QA
Sbjct: 492 DRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQVIVACFFPSQA 551

Query: 556 GGTAIADILF-------GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
            G A+  +         G  NPGG+LP+TW P+      PMT   M         GRTYR
Sbjct: 552 TGDALYRMFMNTHGVDTGNGNPGGRLPITW-PRSMNQVPPMTNYTME--------GRTYR 602

Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
           ++ G  ++PFG+G+SY +F ++     ++V  P         +TI           CN +
Sbjct: 603 YFNGDPLFPFGYGLSYGSFSYS-----SLVIWP---------STIPA---------CNGV 639

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
              V V V  +G      T +  S   A    P  QLVAF++ ++       V   I
Sbjct: 640 K--VSVTVYKLGPGGDEVTQVYMSWNNASVVVPKLQLVAFKRFYLETNGVTEVHFTI 694


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  343 bits (881), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 238/743 (32%), Positives = 370/743 (49%), Gaps = 84/743 (11%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           L F      I +R   L+ +++L+EK+  L + A A+ RL +  Y+WW+EALHGV+  G 
Sbjct: 19  LSFLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNGK 78

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--------NGGTAGLT 151
                     AT FPQ I   ++F+  L   +   +S EARA Y        +   AGLT
Sbjct: 79  ----------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLT 128

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +W+PNVNIFRDPRWGRGQET GEDP L  +   ++V+GLQG D + LK AA  KH+  + 
Sbjct: 129 FWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQGDDPNYLKSAACAKHYAVH- 187

Query: 212 LDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               +G +  R  FNA  ++QD+ +T+   F   V +  V  VM ++N V G P  A+  
Sbjct: 188 ----SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFGAPMAANKF 243

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
           +L   +R  W  +GY+V+DC ++        +  +   AAA A++AG +L+CG       
Sbjct: 244 LLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNLNCGATYK-EL 302

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           + A+ +GL++E  ++         + RLGMFD + S  PY  +GP+ + + +H ELA EA
Sbjct: 303 KKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSKEHIELAREA 362

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
           A++ IV+LKN+   LPL     +   V GP ++ +  ++G+Y G++ G  T L GI    
Sbjct: 363 AQKSIVMLKNKNNLLPLP-TDIKVPYVTGPFANSSDMLMGSYYGVSPGVVTILAGITDAV 421

Query: 450 RTIHQQGCKDVACADDQLFGA---AIDASRQADATILVMGLDQSIEAEAL---------D 497
                   +  A    +       A + +  +D TI V+GL    E E +         D
Sbjct: 422 SLGTSLNYRSGALPFQKNINPKNWAPNVAGMSDVTICVVGLTADREGEGVDAIASNHKGD 481

Query: 498 RAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           R  L LP  Q   V +++   K  P +LV+ SG P  V+    +    AI+   YPG+ G
Sbjct: 482 RLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSP--VSLEGIEEHCDAILQIWYPGEQG 539

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           G A+AD+LFG  +P G LPMT +P+  +  LP   + +M+        GRTY++     +
Sbjct: 540 GNAVADVLFGKVSPTGHLPMT-FPKS-VAQLPDYKDYSMK--------GRTYKYMTEEPM 589

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC-NRLTLGVQV 674
           +PFG G++Y+                              K + V  AK   + +L V V
Sbjct: 590 FPFGFGLTYSK--------------------------TEFKNLVVEDAKLRKKESLKVSV 623

Query: 675 DVKNVGSKDGAHTLLVFSTPPA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
           +V NVG  D    + ++ +P +   G   P   L AF++V +  G  Q+V   IH  + L
Sbjct: 624 EVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLKAFKRVALKKGETQKVEFTIH-PESL 682

Query: 732 SVVDRSGTRRIPLGEHNIHIGGT 754
            V++  G +    G + + +G +
Sbjct: 683 KVINVKGQKVWRKGAYKVTVGNS 705


>gi|110740481|dbj|BAF02134.1| xylosidase [Arabidopsis thaliana]
          Length = 284

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 169/282 (59%), Positives = 209/282 (74%), Gaps = 12/282 (4%)

Query: 485 MGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
           MGLDQSIEAE  DR GLLLPG QQ+LV++V+ AS+GP ILVLMSGGPIDV FAKNDPR+A
Sbjct: 1   MGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVA 60

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AIIWAGYPGQAGG AIA+I+FG +NPGGKLPMTWYPQ+Y+  +PMT MAMR   S  YPG
Sbjct: 61  AIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMR--ASGNYPG 118

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAP----TVVAVPLDGRHGSINATISGKAIKV 660
           RTYRFYKGPVV+PFG G+SYT F H++A +P    +V    L+  +  +N+  S  +IKV
Sbjct: 119 RTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLSNLNSANTILNS--SSHSIKV 176

Query: 661 THAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAG---HWAPHKQLVAFEKVHVPAG 716
           +H  CN    + + V+V N G  DG HT+ VF+ PP         +KQL+AFEKVHV AG
Sbjct: 177 SHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVMAG 236

Query: 717 AQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           A+Q V +++  CK+L VVD  G RRIP+GEH +HIG  KH++
Sbjct: 237 AKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTI 278


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 216/587 (36%), Positives = 317/587 (54%), Gaps = 53/587 (9%)

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDL 212
           SPN+NI RDPRWGR QE P EDP+L+G++   Y  GLQ G D    KV  + KH+ AY L
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGEDSRYTKVVVTLKHWDAYSL 70

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
           ++ +G  R +F+AKVS   + DT+   FR  VMEG    VMCSYN +NG PTC  P +L 
Sbjct: 71  EDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCTHP-LLT 129

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG-LDLDCGPFLGLHTES 331
           + +R  W+ +GY+ SD  ++   Y   H+T+    A A A+R G  D+D G         
Sbjct: 130 KVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYHDALLD 189

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           AV  G  S  D++ AL NTL ++  LG+FD     QPY  +    + T   Q+L ++   
Sbjct: 190 AVNSGECSMDDVDRALYNTLKLRFELGLFD-PIEDQPYWRINASSINTTYAQDLNMKITL 248

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC------GYTTPL--- 442
           + ++LL+N   +LP    + R VAVIGP+ +    ++GNY G  C        T+PL   
Sbjct: 249 ESMILLQNHNNALPFK--KGRKVAVIGPHINAQEALVGNYLGQLCPDDSFDCITSPLAAI 306

Query: 443 QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
           + I   + T+   G   +AC D  +   A++ ++ AD  +L++G++ +IEAE+ DR  + 
Sbjct: 307 EAINGMSNTVSAMGSGVLACTDASI-QEAVNVAKDADYVVLLIGINDTIEAESNDRTSID 365

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           LP  Q +L + ++  +K  T  VL++GG +  A  +   ++ AII AGYPG  GG AIA 
Sbjct: 366 LPQCQHKLTAAIAHLNK-TTAAVLINGGML--AIEQEKKQLPAIIEAGYPGFYGGAAIAK 422

Query: 563 ILFGTSNP-GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
            +FG +N  GGKLP T YP +YI  + M++M M  S     PGR+YR+Y G  ++PFG G
Sbjct: 423 TIFGDNNHLGGKLPYTVYPADYIHKINMSDMEMTNS-----PGRSYRYYTGQPLWPFGFG 477

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           ++YT F           +V   G   S  AT S              +  + V V N G 
Sbjct: 478 LAYTTF-----------SVQSPGPSASTFATGSNT------------SFSLPVHVVNTGK 514

Query: 682 KDGAHTLLVFSTP---PAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           + G   + V+  P   P   ++  KQL+AFE+VH+     QR+G+ I
Sbjct: 515 RTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLT--PNQRLGVTI 559


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 248/782 (31%), Positives = 386/782 (49%), Gaps = 93/782 (11%)

Query: 6   AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           AF F  + L S SS     + P A                   I  RVNDL+  ++L+EK
Sbjct: 7   AFLFAIVFLFSFSSIAQTWKNPNAS------------------IEDRVNDLLANMTLEEK 48

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           +    S    + RLGI  +EW+ EALHG+ +              T FPQ I   +++N 
Sbjct: 49  ISYCGSRIPEIKRLGIPYFEWYGEALHGIISWN-----------CTQFPQNIAMGATWNP 97

Query: 126 TLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
            L   +   +S+EARA+ N G   +  +SP VN+ RDPRWGR  E   EDP L  + A  
Sbjct: 98  DLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDPRWGRNGECYAEDPHLMSEMARM 157

Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
           YVRG+QG+D   +K   + KH+ A +++      R   ++ + K+D+ + +   ++ C++
Sbjct: 158 YVRGMQGNDPKYVKTVTTVKHYVANNVE----TKREWIHSNIGKKDLYEYYFPAYKTCIV 213

Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
           + +   +M + N +NG+P  A   ++   +R EW   GY+++D  +V        + S+ 
Sbjct: 214 DEEATGIMTALNGLNGIPCSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYASSQ 273

Query: 306 EEAAADAIRAGLDLDC-------GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
            +AAA AI+AG+D +C        P +    + A+Q+GL++E +++  +   L ++   G
Sbjct: 274 AQAAAMAIKAGVDQECFRNKVRQAPMVQALPD-ALQQGLITEKELDVTVKRLLRLRFMTG 332

Query: 359 MFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
            FD +PS  PY  + P  V   D H++LAL+AA Q IVLLKN    LPL     +++A+I
Sbjct: 333 DFD-DPSLNPYSAI-PTSVLECDAHKQLALKAAEQSIVLLKNDA-VLPLKK-DLKSIAMI 388

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--ARTIHQQGCKDVACADD-QLFGAAIDA 474
           GP +D     +G Y+G      +PL GI  Y  A+    QGC+  A  DD Q    A+  
Sbjct: 389 GPFAD--RCWMGIYSGHPKSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVAL 446

Query: 475 SRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +++++  ILV+G D++   E  DR  + LPG Q +L+  V   +K   ILVL+  GP  V
Sbjct: 447 AKKSEQVILVVGNDETTSTENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAV 505

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY-ITNLPMTEMA 593
            + + +  I  I+ A   GQ  GTA+A +LFG  NPGGKL  TWY  +  + N    +MA
Sbjct: 506 TWEQKN--IPGIVCAWPNGQEQGTALAKVLFGDVNPGGKLNATWYQSDKDLPNFHDYKMA 563

Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
                      RTY ++KG  +YPFG+G+SYTNF        T+  V ++ +    N  +
Sbjct: 564 G--------GNRTYMYFKGKPLYPFGYGLSYTNF--------TISDVSINKKTLQANEYV 607

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
           + KA KV +         VQV +++V SK+                 P K L  F+++ V
Sbjct: 608 TVKA-KVNNTGAVAGDEVVQVYIRDVKSKEK---------------TPLKALKGFQRISV 651

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG------TKHSVSLHAATLGV 767
            AGA + V I I    +     +     +  GE  I +G       T  ++ L   TL  
Sbjct: 652 AAGASKWVEIKIPYEAFSHYNTKKEALMVAKGEFEILVGNASDAIVTTKTIELKKGTLPK 711

Query: 768 IK 769
           +K
Sbjct: 712 LK 713


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 245/737 (33%), Positives = 363/737 (49%), Gaps = 95/737 (12%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--------AVPRL 79
           FA  P    +   PF  VSL   +RV+DL+GRL+L + V+ L  G A        A+  L
Sbjct: 14  FALTP--LASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENL 71

Query: 80  GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
           GI  Y+W +E L G    G           ATSFPQ I  A++F+  L   + +  + E 
Sbjct: 72  GIGPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEV 121

Query: 140 RAMYNGGT--------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
           RA +N            GL+ +SP VNI R P WGR QET GEDP LSG YA+ +V+GLQ
Sbjct: 122 RAKHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQ 181

Query: 192 GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
           G     ++  A CKHF A+         R  F+AKVS +D+  TF   F+ CV  G   S
Sbjct: 182 GDHDRYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAGAY-S 240

Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
           +MCSYN +NGVP C++  ++   +RGEW   GY+VSD  ++       H+ +  E+AAA 
Sbjct: 241 LMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAG 300

Query: 312 AIRAGLDLDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
           ++ AG +L+    L     +    AV+ G L E  + N +      +MRLG FD  P   
Sbjct: 301 SVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD-PPEMN 359

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS-LPLSHI----RHRTVAVIGPNSD 422
           PY  +    + + +H+ L+L AA + +VLLK   PS     H+        +AVIGP ++
Sbjct: 360 PYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKR--PSKFSKRHLIGGFPSERMAVIGPMAN 417

Query: 423 VTVTMIGNYAGIA--CGYTTPLQGIGRYARTI-HQQGCKD----VACADDQLFGAAIDAS 475
            T  + G+Y+         TPL+G+     ++ +  GC D    +  + D +  A +   
Sbjct: 418 NTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKTALVG-- 475

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
             AD  ++ +G  + +E+E +DR  ++LPG+Q +L+  V   +     L++ S GP+++ 
Sbjct: 476 --ADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNIT 533

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILF---GTSNPGGKLPMTWYPQEYITNLP-MTE 591
           +A+   R+  I+   YP Q+ G AI   L    G  NP G+LP TWY   Y   +P MT+
Sbjct: 534 WAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWY--RYTEQIPEMTD 591

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
            +M          +TYR++ G  +YPFG+G+SY+ FV +      +  +P          
Sbjct: 592 YSM--------ARKTYRYFTGVPLYPFGYGLSYSTFVFS-----KLYFLP---------- 628

Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEK 710
                  KV     N     VQV V N G  DG   L V+          P  QLVAFE+
Sbjct: 629 -------KVNAGDPNV----VQVRVFNEGPFDGDEVLQVYIKWMSTKERMPRVQLVAFER 677

Query: 711 VHVPAGAQQRVGINIHV 727
           V +   +QQ V I+  +
Sbjct: 678 VFI--RSQQYVDISTAI 692


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  339 bits (869), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 245/721 (33%), Positives = 357/721 (49%), Gaps = 86/721 (11%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA-------VPRLGIKGYEWWSEALHG 93
           PF    LP   R++DL+GRL+L+EK++ L  G AA       V RLGI  + W SE + G
Sbjct: 35  PFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECVTG 94

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA----- 148
           +     GT   G+ P  T+FPQ +  A++F+  L +     ++ E RA  N         
Sbjct: 95  L-----GTD--GNDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVVK 147

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GL+ WSP VNI R P WGR  ET GE PVLS   A S+V G+QG+       AA+CK
Sbjct: 148 FHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTRYYAAAAACK 207

Query: 206 HFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           H     LD + G D  R+ F+A VS+ D+  TF + F  C   G V   MCSYN + GVP
Sbjct: 208 H-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAG-VMGYMCSYNSIRGVP 261

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL---D 320
            CA+   +    R +W   GY+VSD  +V    ++ ++T+     A  A+ AG D+   D
Sbjct: 262 ACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGCDMEDSD 321

Query: 321 CGPFLGLHTES-AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
               +  +  S A+   L     I+ ++     V+MRLG FD  P + P+  L    V +
Sbjct: 322 DAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLNMSIVSS 380

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA------ 432
           P H E+A + A   IVLLKNQ  +LPLS   ++ +  ++GP +D    M+G Y+      
Sbjct: 381 PAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLMMGKYSPHGSTN 440

Query: 433 ---GIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQ--ADATILVMGL 487
                  G    LQ   + A   + +GC    C  D L  AA+    Q   D  +L +G 
Sbjct: 441 VTVTYRAGLAAALQNASQTASFQYLEGCTGPFC--DGLDTAAVTTFIQQGCDTVLLAVGT 498

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKV--SMASKGPTILVLMSGGPIDVAFAKNDPRIAA 545
              +E+E+LDR+ +  PG Q  LV  V  ++ +K   +L++ + GP+D+A  + D R+AA
Sbjct: 499 SYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQDTRVAA 558

Query: 546 IIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR 605
           I+   Y GQ  GTA+ADIL G ++P G+LP +W P +     P+ +  M+        GR
Sbjct: 559 ILDLIYLGQTAGTALADILLGETSPSGRLPFSW-PNKVSDVPPIDDYTMQ--------GR 609

Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
           TYRF +  V++PFG+G+SYT F  +   AP ++ V                        C
Sbjct: 610 TYRFAQADVLFPFGYGLSYTQFNLSHLAAPYILPV------------------------C 645

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
             L L   V+V N G   GA  L V+   P     P +QL    +V V A + + V ++I
Sbjct: 646 QALRLS--VNVTNTGRLSGAIPLQVYVEWPNAVGGPIRQLATTTRVFVDAASSKTVQLSI 703

Query: 726 H 726
            
Sbjct: 704 R 704


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  337 bits (865), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 215/608 (35%), Positives = 323/608 (53%), Gaps = 54/608 (8%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA-------AVPRLGIKG 83
           D + A    LPF   SLP  QR++DL+ RL + +    L  G A       A+ RL I  
Sbjct: 21  DDEFAQKSQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGK 80

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           Y W +E L G +  G           AT+FPQ +  +++F+  L   +      E RA Y
Sbjct: 81  YVWNTECLRGDAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKY 130

Query: 144 N--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG 195
           N            GL  +SP +NI R P WGR QET GEDP L+G+ A S+V GLQG+  
Sbjct: 131 NYYLQKGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNHP 190

Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
             L   A CKHF AY         RF F+AKVS +D++ TF   F+ C+  G   SVMCS
Sbjct: 191 RYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTY-SVMCS 249

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN VNG+P CA+  +L   +R EW   GY+VSD  ++ +     ++T++  + A  +++A
Sbjct: 250 YNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKA 309

Query: 316 GLDLDCG---PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL 372
           G +LD G   P +  +   AV+ G+L+  D+ +++      ++RLG FD  P   PY  L
Sbjct: 310 GCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-PPDHNPYVKL 368

Query: 373 GPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
               V  +P+HQE+AL+AA +  VL+KN G +LP+    H T+AV+GP ++ +  + G+Y
Sbjct: 369 NVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPIEGTIH-TLAVVGPFANNSKLLFGDY 427

Query: 432 AGIACG--YTTPLQGIGRYA-RTIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMG 486
           A        TT L+G+   A +T H  GC    C   D Q     ++A   AD  ++ +G
Sbjct: 428 APNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQ---GVLNAVTGADVVVVCLG 484

Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGPIDVAFAKNDPRIAA 545
               +E+E  DR  +LLPG+Q++L+   +  + G P IL+L + GP+++ +A + P + A
Sbjct: 485 TGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQA 544

Query: 546 IIWAGYPGQAGGTAIADILFGT---SNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRY 602
           I+   +P QA G A+  ++F     +NPGG+LP TW         P T   + P ++   
Sbjct: 545 IVECFFPAQATGVALR-MMFQNAPGANPGGRLPSTW---------PATVAQIPPMENYSM 594

Query: 603 PGRTYRFY 610
            GRTYR++
Sbjct: 595 DGRTYRYF 602


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 155/296 (52%), Positives = 209/296 (70%), Gaps = 3/296 (1%)

Query: 22  LAAREPFACDPKDA-TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG 80
           +A   PF+C P  A  +  L FC V+L   QR  DL+ RL+  EK+  L   A  VPRLG
Sbjct: 25  VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           + GY+WW+EALHG++  G G  F      ATSFPQV+ TA++F+  LW  IG+ +  EAR
Sbjct: 85  VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144

Query: 141 AMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
           A++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YA ++VRG+QG S    L
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLL 204

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           + +A CKH TAYDL++WNGV R+ F A+V++QD+EDTF+ PFR CV+E K + VMC+Y  
Sbjct: 205 QTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTA 264

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR 314
           +NGVP CA+ ++L  T+RG+W L+GY+ SDCD+V +  D Q +  TPE+A A +++
Sbjct: 265 INGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLK 320


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 198/538 (36%), Positives = 283/538 (52%), Gaps = 50/538 (9%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  + P   R   L+  +   EK++ +IS +A V RLG+  Y WWSEALHGV+   PG +
Sbjct: 65  CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAGA-PGIQ 123

Query: 103 FGGDFPG--ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIF 160
           F    P   ATS P  I  A++F+  L E +G ++  EARA  NG  +G+ +W+PN+N F
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWTPNINPF 183

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDR 220
           +DPRWGRG ETPGED +    Y A+ +RGL+G+   R ++ A+CKH+ A DL++WNGV R
Sbjct: 184 KDPRWGRGSETPGEDTLRLKGYVAALLRGLEGNKAQR-RIIATCKHYAANDLESWNGVTR 242

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
             F+AK+S QD+ + +  PF+ C  + KV S MCSYN VNGVP CA+  +L+  +R  W 
Sbjct: 243 HDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPACANKYLLQTILRDHWN 302

Query: 281 L---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
               N Y+ SDC++V       H+ ST     A A  AG D  C                
Sbjct: 303 WTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSSC---------------- 346

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
                               G FDG  S   Y  LG  DV TP  Q+LAL+A   GIV+L
Sbjct: 347 ------------------EAGYFDG--SKALYSSLGWSDVNTPQAQQLALQATVDGIVML 386

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRYARTIHQ 454
           KN G +LPL       VA+IG  +  +  + G Y+G A    TP+   Q +G +   +  
Sbjct: 387 KNDG-TLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVYAAQQLG-FTPNVAT 444

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
              +  A A D     A+ A+ ++D  +   GLD S  AE +DR  L  P  Q  L+ K+
Sbjct: 445 GPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSLEWPSAQLALIKKL 504

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
           S  + G  ++++  G  +D      +  +++I+WA +PGQ GG A+  I+ G  +P G
Sbjct: 505 S--ALGKPLIIIQEGDQMDNTPLLTNKGVSSILWASWPGQDGGPAVMQIISGAKSPAG 560


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  333 bits (853), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 239/759 (31%), Positives = 356/759 (46%), Gaps = 113/759 (14%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C  +   P R   L+ +L++ EK+  L+  +   PRLG+  Y WWSE LHGV+   PG  
Sbjct: 37  CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95

Query: 103 F---GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
           F   G  F  ATSF  VIT  ++ +  L   +G  +S EARA    G  GL YW+PN+N 
Sbjct: 96  FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           ++DPRWGRG ETPGEDP+    Y  + V GL+G +G   KV A+CKHF AYDL+ W G+ 
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEG-NGTVRKVIATCKHFAAYDLERWRGLT 214

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC------------------------- 254
           R+ F+A VS QD+ + +  PF+ C  + +V S+MC                         
Sbjct: 215 RYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRLVTRQSGNQV 274

Query: 255 -------SYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYY-DTQHFTS 303
                  SYN +NG P CA   ++   +R  W     N YI SDC+++  +  D  +F+ 
Sbjct: 275 DIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDFLPDNHNFSQ 334

Query: 304 TPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
           TP EAAA A  AG D  C      P+  +    A  + LLSE  I+ AL       +R G
Sbjct: 335 TPAEAAAAAYIAGTDTVCEVSGWPPYTDV--VGAYNQSLLSESVIDTALRRLYEGLIRAG 392

Query: 359 MFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
             D G P+S            +PD    +               P      +  +TVA+I
Sbjct: 393 YLDHGRPASS-----------SPDKAPFS--------------SPDFLPLDLTGKTVALI 427

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACAD-DQLFGAAIDASR 476
           G  ++ T T+ G Y+G+   Y  P+  + +   + +      V   D D    AA+ A+ 
Sbjct: 428 GHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDADTWTAAAMLAAE 487

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
            AD  +   G D ++ +E LDR  +  P  Q  L+ K  +A  G  ++V+  G  +D   
Sbjct: 488 SADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEK--LAQVGKPMVVIQLGDQVDDTP 545

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRP 596
             N+  I++I+W GYPGQ+GGTA+ D+L G     G+LP+T YP  Y+  +P+TEM +RP
Sbjct: 546 LLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYVDEVPLTEMGLRP 605

Query: 597 --------------------------------SQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
                                           +++   PGRTY++Y  PV+ PFG+G+ Y
Sbjct: 606 FNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPRPVL-PFGYGLHY 664

Query: 625 TNF-VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           T F +    +  +  +   D    SI + ++ +     H      +    V + N GS  
Sbjct: 665 TPFNISLSLSTSSNASSTTDNTSISIRSLLTSQTCTAIHLDLCPFS-PFSVSITNTGSHT 723

Query: 684 GAHTLLVFSTPPAG-HWAPHKQLVAFEKVHVPAGAQQRV 721
             +  L+F +   G    P K LV +++V      + RV
Sbjct: 724 SDYVALLFLSGKFGPKPDPLKTLVGYKRVKDIKPGETRV 762


>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
          Length = 534

 Score =  332 bits (852), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 213/565 (37%), Positives = 299/565 (52%), Gaps = 89/565 (15%)

Query: 192 GSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCV------ 244
           G  G R ++AA+CKH  AY L+     DRF+F+A  + + D E T+   F  CV      
Sbjct: 2   GGAGLRPRIAATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFL 57

Query: 245 MEGKVAS-------------VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           +E   AS             VMCSYN ++GVP CADP +LK  +R +W   G +VSDC +
Sbjct: 58  LEHYNASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWA 117

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
           V   +    F ++ EEA   A+R+G+DLDCG         A    LL E DI+ AL    
Sbjct: 118 VDNIHSNHRFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLF 177

Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN------QGPSLP 405
            V M LG FD   + +P       ++   +H +LALEAA Q IVLLKN       GP LP
Sbjct: 178 RVLMDLGYFD--ETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGP-LP 231

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADD 465
           LS  +H+ +A+ GP +D    ++GNY G+     TPL G+ +            V  A  
Sbjct: 232 LSLAKHKEIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMG----------VEVAFR 281

Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG---PT 522
           Q   A++       ATILV+GLDQS+EAE  DR  LLLP  Q++L+  +S  SK    P 
Sbjct: 282 Q--RASVCDFHGESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPV 339

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
           +LV++SGG +D++  KN   I A+I   YPGQ GG+A+A +L+G  NP GKL  T YP+ 
Sbjct: 340 VLVVVSGGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPES 399

Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
           Y+  + + +M MRP    ++PGRT+R+Y+G V+YPFG+G+SYT+F + +           
Sbjct: 400 YLNEVSLHDMRMRP--DGKFPGRTHRYYRGDVIYPFGYGLSYTSFRYAM----------- 446

Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGH-WA 700
                       G  +KVT              V N GS DG+  +L+F S P AG+   
Sbjct: 447 ---------EFLGGTVKVT--------------VSNSGSMDGSVAVLLFHSAPQAGNEQE 483

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINI 725
           P + L+ FEK++V  G  Q V  ++
Sbjct: 484 PFRSLIGFEKIYVSVGDSQLVSFDV 508


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  331 bits (849), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 248/755 (32%), Positives = 380/755 (50%), Gaps = 90/755 (11%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG-------AAAVPRLGIKGYEWWS 88
           T+   PF   SLPI +R+ DL+ RL+++E +  L  G       A A+ RLGI  Y+W +
Sbjct: 21  TSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGIGPYQWNT 80

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
           E L G +         GD   AT FPQ I  A++F+  L   + + V+ EARA +N    
Sbjct: 81  ECLRGYA-------MNGD---ATCFPQPIGLAATFDQGLIYKMAKTVALEARAKHNNFTK 130

Query: 145 ----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
               G   GL+ +SP +NI R P WGR QET GEDPVL+   A +YV GLQG D   L  
Sbjct: 131 NGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQG-DEIYLPA 189

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            A CKHF AY         RF F+A VS  DI  TF   FR CV  G    VMCSYN +N
Sbjct: 190 TAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYPAFRECVHAG-AQGVMCSYNAIN 248

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSD---CDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           GVP+CA+P +L+ T+R ++  +GY+VSD    +++ +Y++   FT +  E AA A+ AG+
Sbjct: 249 GVPSCANP-MLETTLRKKFHFDGYVVSDENALENIDLYFN---FTKSKLETAAVALNAGV 304

Query: 318 DLDCGPFLGLHTES----AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
           DL+   F   +  S    AV++GL++E  +  +       +M LG FD  P    + ++ 
Sbjct: 305 DLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFD-PPEFNHWLNVP 363

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-A 432
              V +  H++ A+E A +  VLLKN G  LPL  +  + V+++GP  + +  + G+Y A
Sbjct: 364 IDVVQSLAHRKQAVEVAAKSFVLLKNDG-ILPLKQLYDK-VSIVGPFINNSEALTGDYPA 421

Query: 433 GIACGY-TTPL--------QGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
                Y ++PL         G+ R+         +++            +    +D  ++
Sbjct: 422 EFNLKYFSSPLFAANSLSSSGVARFTTGCVGTNNQNLPICATYNSTNVKEVVTGSDIVLV 481

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
            +G  + +EAE+ DR  + LPG+Q +L+  V   + GP I+VL + GP+DV++   +   
Sbjct: 482 TLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANGPVIVVLFNAGPLDVSWVMGN--T 539

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
           AA+I   +  Q  G A+ ++L G  NP G+LP TW P       PMT+ +M         
Sbjct: 540 AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW-PASMEQVPPMTDYSMHE------- 591

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
            RTYR+     ++PFG+G+SYT F +  A     V  P          TI    I V   
Sbjct: 592 -RTYRYSTSSPLFPFGYGLSYTKFWYLDA-----VVEP---------TTIQRCQIPV--- 633

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVG 722
                   V+V ++N G  DG   + ++ T          +QLVAF++V + AG +  + 
Sbjct: 634 --------VRVLIQNTGHLDGEEVVQIYMTSKKKRDRELLRQLVAFQRVPIKAGEEVSIS 685

Query: 723 INI-HVCKYLSVV-DRSGTRRIPLGEHNIHIGGTK 755
           + I H+ + + +  +RS        E  +++GG +
Sbjct: 686 LPIPHLSRAIWMKHNRSFDYYFDADEFTLYVGGQQ 720


>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1212

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 253/758 (33%), Positives = 370/758 (48%), Gaps = 110/758 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           + Q      +R  DL+ R++L EKV +L  + A A+PRLG++ Y +WSE  HG++ +G  
Sbjct: 45  YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104

Query: 101 TKFGGDF--PGATSFPQVITTASSFNATLWEAIGRVVSDEAR----------AMYNGGTA 148
           +  GG    P ATSFP  +++  S++  L       +SDE R          A  N G +
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164

Query: 149 G-----LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD------R 197
                 LTYW+P VN+ RDPRWGR  E  GEDP L GK A ++V G QG   D       
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPY 224

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           LKVAA+ KHF    L+N N  DR   +A  S+ DI D +   FR  V +  VA +M SYN
Sbjct: 225 LKVAATAKHFA---LNN-NENDRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYN 280

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG------------------VYYDTQ 299
            +NG P+ AD        +  W  +GYI SDC +VG                  V   T 
Sbjct: 281 AINGTPSPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTS 340

Query: 300 HFTST------PEEAA--ADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVN 349
            +T+T      P +A   A A+RAG D +C  G     + E+A++  +LSE  I++ALV 
Sbjct: 341 TWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQ 400

Query: 350 TLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ------GP 402
             TV+M+ G FD  P+++  Y  +    + +P+HQ LA + A   +VLLKN         
Sbjct: 401 LFTVRMQTGEFD--PANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAK 458

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--------TIHQ 454
            LP +      V V+G  ++ TVT+ G Y+G        +QGI    +        T   
Sbjct: 459 VLPANPASLNNVVVVGDLAN-TVTL-GGYSGDPTLQVNAVQGITSAVKAANPNATVTFDA 516

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
            G    A A      A   A + AD  ++ +G D S   E+ DRA L +PG    L+S+V
Sbjct: 517 CGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQV 576

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           +      T+L + + GP+D+   K D    AI+++ Y G++ GTA+AD+LFG  NP G L
Sbjct: 577 AALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNPSGHL 634

Query: 575 PMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
             TWY  +  + LP +    + P+ +    GRTY+++ G   YPFG+G+SYT+F ++   
Sbjct: 635 DFTWYKDD--SQLPSIKNYGLNPADTGGL-GRTYQYFTGTPTYPFGYGLSYTDFAYSKVQ 691

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
           A             + +A   GKA              V+ DV N G   GA    ++ T
Sbjct: 692 A-------------TDHADAQGKAT-------------VRFDVTNTGKTPGATVAQLYIT 725

Query: 694 PPA--GHWAPHKQLVAFEKVHV-PAGAQQRVGINIHVC 728
           PP+  G   P +QL  F K  V   G  Q + +++++ 
Sbjct: 726 PPSVPGTQQPAEQLEGFAKTAVLKPGQTQHLSVSVNIA 763


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  329 bits (843), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 266/776 (34%), Positives = 378/776 (48%), Gaps = 102/776 (13%)

Query: 31   DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
            D +  +    P+C  +LPI  RV DL  R ++ E +  + + AAAVPRLG+    +  EA
Sbjct: 332  DVESPSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEA 391

Query: 91   LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN------ 144
            LHGV +    T   G  P  T FP      +SF+  LW A+G     EARA++       
Sbjct: 392  LHGVWS----TCAAGRCP--TQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHN 445

Query: 145  --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
                     G  GLT+++PNVN+ RDPRWGR +E P EDP+L+G Y A +VRG QG DG 
Sbjct: 446  ASDCARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQG-DGA 504

Query: 197  RLKVAASCKHFTAYDLD---------NWNGV-------DRFHFNAKVSKQDIEDTFDVPF 240
                 A  KHF  Y+L+         +W G        DR  F+A+VS +D E+T+  PF
Sbjct: 505  YRVANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEETYVGPF 564

Query: 241  RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
                +    A+ MCSYN VNG P C D  +L+  +RG     G + +DC ++        
Sbjct: 565  VA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAVARHK 623

Query: 301  FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
              +T  EAAA AI AG+D +CG  L      A+  GL+    +   L   L  ++RLG+ 
Sbjct: 624  RYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARLRLGLL 683

Query: 361  DGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
            D   +  P        V +P H+ LAL AAR+G+VLL+N    LPL      T+AVIGPN
Sbjct: 684  DDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDG--RGTLAVIGPN 741

Query: 421  SDVTVTMIGNYAGIACG--YTTPLQGIG---RYARTIHQQGCKDVACADDQLFGAAIDAS 475
            ++ ++ ++  Y G        +PLQ +    R  + ++  GC     A   L   A+D +
Sbjct: 742  ANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAATAAL-DEAVDLA 800

Query: 476  RQADATILVMGL------------DQSI----EAEALDRAGLLLPGRQQELVSKVSMASK 519
            + AD  +L +GL            D +     EAE++DR  L LPG Q+ L SK+    K
Sbjct: 801  KTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIWALGK 860

Query: 520  GPTILV-LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
               + V L+S G +D +FAK+    AA++ AGY G+ GG A+AD L G  NPGG L  T 
Sbjct: 861  PVAVAVFLVSAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNPGGALTATM 917

Query: 579  YPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP---FGHGMSYTNFVHTVANAP 635
             P   +   P  +MAMRPS +   PGRTYRF     V P   FG G+SYT F  ++A  P
Sbjct: 918  LPDAGLP--PFRDMAMRPSAAS--PGRTYRFLDERRVAPLWRFGFGLSYTAFAVSLAG-P 972

Query: 636  TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
            T                      +V      R +    V V+NVG+  G   +  F    
Sbjct: 973  T----------------------RVPRRAATRFS----VVVRNVGAVSGDVVVACFVAAV 1006

Query: 696  AGHWAPHKQLVAFEKVH-VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
                AP ++L  F +V  +   A  +V + +   + LS+VD +G R    G +++ 
Sbjct: 1007 GRPDAPLRELFDFARVRDLAPAASTKVSMELRP-RSLSLVDEAGVRSTTAGAYDVR 1061


>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
          Length = 333

 Score =  328 bits (842), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 160/334 (47%), Positives = 219/334 (65%), Gaps = 3/334 (0%)

Query: 427 MIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMG 486
           MIGNY G    YTTPLQG+     T +  GC +VAC   Q+   A   +  ADAT+L++G
Sbjct: 1   MIGNYEGTPGKYTTPLQGLTALVATTYLPGCSNVACGTAQI-DEAKKIAAAADATVLIVG 59

Query: 487 LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAI 546
           +DQSIEAE  DR  + LPG+Q  L+++V+ ASKG  ILV+MSGG  D++FAKND +I +I
Sbjct: 60  IDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSI 119

Query: 547 IWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT 606
           +W GYPG+AGG AIAD++FG  NP G+LP TWYPQ Y+  +PMT M MRP  +  YPGRT
Sbjct: 120 LWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGRT 179

Query: 607 YRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN 666
           YRFY G  +Y FG G+SYT F H +  AP  V++P++  H   ++    K++      C 
Sbjct: 180 YRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSCHSSKC--KSVDAVQESCQ 237

Query: 667 RLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
            L   + + V N G+  G+HT+ +FS+PP+ H +P K L+ FEKV V A A+  V   + 
Sbjct: 238 NLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAEALVRFKVD 297

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           VCK LS+VD  GTR++ LG H +H+G  KHS+++
Sbjct: 298 VCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 331


>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 883

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 176/444 (39%), Positives = 254/444 (57%), Gaps = 31/444 (6%)

Query: 29  ACDPKDATTRT--LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEW 86
           A  P  A TRT  LP+   +LP  QR  DL+GRL+L EK   L++ A  +PRLG+  Y++
Sbjct: 22  AQSPSPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDF 81

Query: 87  WSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG 146
           WSE LHG++  G           AT FPQ +  A++F+  L   IG V+S EARA YN  
Sbjct: 82  WSEGLHGIARSG----------YATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDA 131

Query: 147 TA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
            A        GLT WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V GLQG D +  
Sbjct: 132 VAHDLRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQGDDPNYY 191

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           +   + KHF  +   +    +R  FNA  S  D+ DT+   FR  ++EGK  S+MC+YN 
Sbjct: 192 RAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAYNA 248

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAG 316
           + G P CA   +L   +R +W   G++ SDC ++  ++  D  H++   E+A+ D IRAG
Sbjct: 249 IEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGIRAG 308

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
            D +CG     +  SAV++G++ E +++  L      + +LG+FD  PS   Y  +   +
Sbjct: 309 TDTNCGGTY-RNLASAVRKGMIQESELDVPLRRLFLARFKLGLFD-PPSQVKYASMPITE 366

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
             +  H ELAL+AAR+ +VLLKN+  +LPL   R +T+AVIGPN+   +++ GNY  I  
Sbjct: 367 NMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNYNAIPK 425

Query: 437 GYTTPLQGIGRY---ARTIHQQGC 457
                + GI R    A+ ++ QG 
Sbjct: 426 NPVMQVDGIAREFRDAKVLYAQGS 449



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/302 (32%), Positives = 152/302 (50%), Gaps = 50/302 (16%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSM 516
           L   A++A +QADA +  +GL   +E E +D          R  L+LP  QQ+L+ + + 
Sbjct: 602 LRAQAMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAK 660

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           AS  P ++VL++G  + V +A+      AI+ A YPGQAG  AIA+ L G +NP G+LP+
Sbjct: 661 ASGKPLVVVLLNGSALAVNWAQE--HADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPV 718

Query: 577 TWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T+Y    + +L P T+ AM          RTYR++KG  +Y FG+G+SY+ F ++     
Sbjct: 719 TFY--RSVNDLPPFTDYAM--------ANRTYRYFKGKPLYEFGYGLSYSTFSYS----- 763

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                         NA +S + +          TL V+ DVKN  +  G     ++ TPP
Sbjct: 764 --------------NAHLSKERLDAGD------TLRVEADVKNTSTLAGDEVAELYLTPP 803

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
                P + L  FE VH+  G  + V   +   + LS VD  G R +  G +++ +GG +
Sbjct: 804 QNGVYPLRSLEGFEHVHLLPGQSKHVSFTLD-PRQLSEVDEKGIRAVRAGVYSVTVGGGQ 862

Query: 756 HS 757
            S
Sbjct: 863 PS 864


>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
 gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
          Length = 1548

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 239/768 (31%), Positives = 357/768 (46%), Gaps = 129/768 (16%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAA-AVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG- 109
           R  DL+ R++L EKV+ L + +  A+PRLG++ Y +WSE  HGV+ +G     GG+    
Sbjct: 74  RAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGVNTLGANQDNGGNGGAV 133

Query: 110 -ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LTYW 153
            +TSFP    +  S++ +L       +SDEAR   +    G               LT+W
Sbjct: 134 HSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQNNLGPSAADYGSLTFW 193

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------SDGDRLKVAASCKHF 207
           +P VN+ RDPRWGR  E  GEDP L+   A ++V G +G      S    LKVAA+ KH+
Sbjct: 194 APTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTGQSKTGTLKVAATAKHY 253

Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
              D++     DR   ++ VS  D+ D +   F   +    V+ +M SYN +NG P+ AD
Sbjct: 254 ALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSGLMTSYNAINGTPSVAD 309

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYY--------------------DTQHFTST--- 304
                +  + ++  NGY+ SDC ++G  Y                     T  +T+T   
Sbjct: 310 TYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWTTDGKSSTGTWTNTATG 369

Query: 305 ---PEEAA--ADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
              P +A   A A+RAG DL+C  G        +A+  G+LSE  I+NALV   TV++  
Sbjct: 370 ATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEGVIDNALVKIFTVRVET 429

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS-------------- 403
           G FD    S PY  +    + +P HQ LA + A   +VLLKNQ P+              
Sbjct: 430 GEFD-PAGSNPYTGITKAQIQSPAHQALATKVADNSLVLLKNQPPAASGTSTTPPAASSA 488

Query: 404 -----------LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR-- 450
                      LPLS      + ++G  ++     +GNY+       +P+QGI    R  
Sbjct: 489 ASSAAAAAKPLLPLSAAATAKIVIVGDMANAVT--LGNYSSDPALKVSPVQGITAAVRKA 546

Query: 451 ------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
                 T    G    A A        +     ADA I+ +G +Q I  E  DR  + +P
Sbjct: 547 NPGASVTFDACGTSTTASAAASCSAQTLADVAGADAVIVFVGTNQQIADEGKDRTSIAMP 606

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           G    L+S+V+       +L + SGGP+ +   + D   A+I+++G+ G++ GTA+AD+L
Sbjct: 607 GNYDSLISQVAAVGNPRMVLAVQSGGPVRIDDVQKD--FASIVFSGFNGESQGTALADVL 664

Query: 565 FGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           FG  NP G L  TWY  +  + LP M+   + P+Q+    GRTY ++ G   YPFG+G+S
Sbjct: 665 FGAQNPDGHLDFTWYADD--SQLPAMSNYGLTPAQTGGL-GRTYMYFTGTPTYPFGYGLS 721

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y+ F        +   V  +GR    N + S                 V V VKN G   
Sbjct: 722 YSTF--------SFSGVHAEGRSVDANGSQS-----------------VSVTVKNTGKTA 756

Query: 684 GAHTLLVFSTPP---AGHWAPHKQLVAFEKVHV-PAGAQQRVGINIHV 727
           G+    +++ P    AG   P++QLV F K  V   G  Q + I  H+
Sbjct: 757 GSTVAQLYAQPKFTVAGQTFPNEQLVGFAKSKVLKPGESQHLTITAHI 804


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  325 bits (833), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 258/796 (32%), Positives = 365/796 (45%), Gaps = 147/796 (18%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   SLP  +R  DL+ RL+L+EK  L +  +A V  LGIK Y WWSEALHGV+  G  
Sbjct: 45  PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A+SF+  L   +   VSDEAR  Y         G   G+T+
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PN+NIFRDPRWGRG ET GEDP L+G+   + VRGLQG SD   LK  A  KH+  + 
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPSDSPVLKAHACAKHYAVHS 214

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
              WN   R  ++A+VS++D+ +T+   F+  V +  V  VM +YN+  G P  A   ++
Sbjct: 215 GPEWN---RHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEPCGASDYLI 271

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
              +RGEW   G I SDC +V  +Y       +P+   AAA A+ AG+D +CG     H 
Sbjct: 272 NTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTECGQAY-RHI 330

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
             AV+RGLL E D++  L+   T + +LG  D       +  L    +  P+H  L+ + 
Sbjct: 331 PEAVERGLLDEKDLDRNLIRLFTARYQLGEMD---DISLWDDLPASILEGPEHLALSRKM 387

Query: 390 ARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGI-------------- 434
           A++ +VLL+N+G  LPL+  +R   VA++GPN D      GNY  +              
Sbjct: 388 AQESMVLLQNKGGILPLAPDVR---VALVGPNGDDREMQWGNYNPVPGRTVTLYDALKER 444

Query: 435 --------ACG---------------------------------YTTPLQGIGRYARTIH 453
                    CG                                 Y   +Q I  Y R   
Sbjct: 445 FPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGVQDILNYVRRQE 504

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLL 503
           +     +   D Q   + +      D  I   G+    E E +          DR  + L
Sbjct: 505 RMQASFLPELDVQ---SVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKGGDRTDIQL 561

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           P  Q++L+  +  A K   ILV  SG  I            AI+ A YPG+ GG AI D+
Sbjct: 562 PQVQRDLMKALHDAGKK-VILVNFSGCAI--GLVPETESCDAILQAWYPGEEGGLAITDV 618

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NP GKLP+T+Y    + +LP  E       +    G TYR++KG  ++PFG+G+S
Sbjct: 619 LFGDVNPSGKLPVTFY--RSVEDLPDFE-------NYDMKGHTYRYFKGKPLFPFGYGLS 669

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           Y+ F +  A            R+ S+        I V +      T  VQV V+  G  D
Sbjct: 670 YSTFRYKRAKV----------RNNSL-------IIPVKNTGKREATEVVQVYVRRKGDPD 712

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
           G                P K L AF +V +PAG   +V I +    +L   + +    +P
Sbjct: 713 G----------------PVKTLRAFRRVTIPAGKTVKVCIPLEDETFLWWSEEA-QDMVP 755

Query: 744 L-GEHNIHIGGTKHSV 758
           L G++ +  GG+   V
Sbjct: 756 LPGKYELLYGGSSDQV 771


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  325 bits (832), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 209/628 (33%), Positives = 332/628 (52%), Gaps = 57/628 (9%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS--GAAAVP--RLGIKGYEWWSEALHGVS 95
            PF  V+L   +RV+DL+ RL+++E V + ++  G + +P  RLG+K Y++ +E + GV 
Sbjct: 20  FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGVR 79

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-------GTA 148
                      +  +T+FPQ I   +SF+  L   + + ++ E R  YN        G  
Sbjct: 80  -----------WENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFT 208
           G+  ++P +NI R P WGR QET GEDP LSG+ +  +V+GLQG     ++ +  CKHF 
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGDHPRYIQASGGCKHFD 188

Query: 209 AYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
            ++      V RF F+AKVS++D   TF   F+ CV  G + ++MCSYN++NGVP CA+ 
Sbjct: 189 VHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSI-NIMCSYNRINGVPACANK 247

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG-- 326
            +L   +R EW  NGY++SD  ++        +T T  EAAAD+++AG +++     G  
Sbjct: 248 KLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELTGATGSG 307

Query: 327 ---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
               +  +AV++ L+SE ++   L   +  +MR G FD      P+  +    V + +HQ
Sbjct: 308 VAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEFD-PVDMNPFTKIDMSVVLSQEHQ 366

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTP 441
           +LA++A+    VL+KN    LPL   R   +A+IGP +D   T+ G+Y         +TP
Sbjct: 367 DLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFADNAETLFGDYIPNWDPKFVSTP 425

Query: 442 LQGIGRYARTI-HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAG 500
            +G+      + +  GC D +C +     A   A + A    + +G+  ++E E  DRA 
Sbjct: 426 YEGLKSLGDDVRYASGCDDPSCTNYDP-KAIEKAVKGAQFVFVCLGVGSNLEREGHDRAD 484

Query: 501 LLLPGRQQELVSKVSMASK-GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTA 559
           L LPG Q +++      S+  P +LVL + GP+D+ + K  P +  II   YP    G A
Sbjct: 485 LDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIECFYPAMGTGKA 544

Query: 560 IADILFGTSN---PGGKLPMTWYPQEY----ITNLPMTEMAMRPSQSKRYPGRTYRFYK- 611
           +  ++  T +   P  +LP TW  Q +    IT+  MT             G TYR++  
Sbjct: 545 LYQVVTATGDDGVPAARLPSTWPAQLHQVPSITDYNMT-------------GHTYRYFDG 591

Query: 612 GPVVYPFGHGMSYTNF-VHTVANAPTVV 638
           G  +YPFG+G+SYT+F   TV+ +PT V
Sbjct: 592 GDPLYPFGYGLSYTSFHYQTVSVSPTSV 619


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  321 bits (823), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 177/419 (42%), Positives = 253/419 (60%), Gaps = 27/419 (6%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           TT  LPF    L   +R  DL+ R++L EKV  + + A A+PRLGI  Y+WW+EALHGV+
Sbjct: 20  TTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVA 79

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------- 148
             G           AT FPQ I  A++++ATL   I   +S EARA YN           
Sbjct: 80  RAGL----------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDDHSRY 129

Query: 149 -GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
            GLT+WSPN+NIFRDPRWGRGQET GEDP L+ + A ++++G+QG D    KV A+ KH 
Sbjct: 130 RGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGEDPHYYKVIATAKH- 188

Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
             Y + +     R  F+ K S +D+ DT+   FR  ++E +  S+MC+YN+V+G+P CA 
Sbjct: 189 --YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRVDGIPACAS 246

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGL 327
            ++L++ +RGEW   G++VSDC +V   +   H+      A+A A++AG DL CG     
Sbjct: 247 TDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDLTCGNEYRA 306

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
             + AV+ GL++E +IN +L      + +LGMFD  P   P+ ++   +V +  H+++AL
Sbjct: 307 LVD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDSAGHRKIAL 364

Query: 388 EAARQGIVLLKNQGPSLPL-SHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           EAAR+ IVLLKN G +LPL S I+   +AVIGP +D    ++GNY G +    TPL GI
Sbjct: 365 EAARKSIVLLKNDG-TLPLKSSIKK--IAVIGPAADDAEALLGNYNGFSSLQVTPLAGI 420



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 127/272 (46%), Gaps = 53/272 (19%)

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQEL 510
           A  D  L  AAI+A   AD T+  +GL+ S+E E +          DR  L LP  Q++L
Sbjct: 587 APPDAPLLAAAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKL 646

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           + + ++A+  P ++VL SG  + + FA      +A++   Y G+  GTAIAD L G +NP
Sbjct: 647 I-EAAIATGKPVVVVLASGSAVAMNFAAQ--HASALLETWYNGEETGTAIADTLAGINNP 703

Query: 571 GGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH 629
            G+LP+T+Y    +  L P  E AM+        GRTYR++ G  +Y FG G+SY+ F +
Sbjct: 704 SGRLPVTFY--RSVDQLPPFEEYAMK--------GRTYRYFNGDALYSFGFGLSYSKFQY 753

Query: 630 TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLL 689
           +                          A+K   A    +   V   V+N  S +G   + 
Sbjct: 754 S--------------------------ALKTRRAGSGTI---VASRVRNASSIEGDEVVQ 784

Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
           ++         P + L  F+++H+  G  + V
Sbjct: 785 LYVNGSGADGDPIRSLRGFQRIHLRPGESREV 816


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 175/428 (40%), Positives = 253/428 (59%), Gaps = 29/428 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  V+LP  +R  DL+ RLSL EK+ L++  + A+ RLGI  Y WW+EALHGV+  G   
Sbjct: 23  YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNGR-- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FP  I  A++++  L   I  V+S+EARA YN        G   G++ W
Sbjct: 81  --------ATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRG ET GEDP L+G+ A S+++GLQG D   LK  A+ KH   +   
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQDKKYLKTIATPKHLAVHSGP 192

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
                +R HFNA VS  D+ +T+   F+  +M+GK  SVMC+YN++ G   C    +L  
Sbjct: 193 E---PERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKACCGHDTLLTD 249

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESA 332
            +R +W   G +VSDC +V   +++     +PE+AAA A+ +G DL+CG  FL L  ++A
Sbjct: 250 ILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNTFLSL--KNA 307

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
            + GL++E +I++AL   L  + +LGMFD  P    Y  +    +    ++E+ALEAAR+
Sbjct: 308 YRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNREIALEAARK 366

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYART- 451
            IVLLKN    LPL    ++ +AVIGPN+D   +++GNY G    Y TPLQ I R  +  
Sbjct: 367 SIVLLKNDNKLLPLDSSINK-IAVIGPNADNLESLLGNYHGFPSEYITPLQAIRRVLKNG 425

Query: 452 --IHQQGC 457
              +++GC
Sbjct: 426 EVFYEKGC 433



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/292 (33%), Positives = 140/292 (47%), Gaps = 49/292 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A   + ++DA I+ MGL   +E EAL          DR  L LP  Q +L+ K+    K 
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P ILVL++GGPI   +   +  I AI+ A YPGQAGG AI D+++G  NP GKLP+T Y 
Sbjct: 650 PVILVLLNGGPISTVWESEN--IPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
            E   +LP       P ++    GRTYR++KG V+YPFG G++YT+              
Sbjct: 708 SE--NDLP-------PFENYDMEGRTYRYFKGEVLYPFGWGLNYTDI------------- 745

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                      TIS   +     K N  T+ V V +KN G+  G  T+ +++     +  
Sbjct: 746 -----------TISNIELSANEIKDND-TIRVVVKLKNNGNLAGEETVQLYTKALKDN-R 792

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             K L  FEK+ +  G +  V   +        VD  G   +P G + I +G
Sbjct: 793 TIKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843


>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 158/248 (63%), Positives = 184/248 (74%), Gaps = 18/248 (7%)

Query: 133 RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
           +VVS EARAMYN G AGLT+WSPNVNIF+DPRWGRGQETPGEDP+LS KYA+ YVRGLQ 
Sbjct: 77  QVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQ 136

Query: 193 SDG---DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
           SD    DRLKVAA CKH+TAYDLDNW GVD FHFNA V+ QD++DTF  PF+ CV++G V
Sbjct: 137 SDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNV 196

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
           ASVMCSYNQ           I K  ++       Y+VSDCDSV V+Y++QH+T TPEEA 
Sbjct: 197 ASVMCSYNQFYL--------IYKYHLQ-------YLVSDCDSVDVFYNSQHYTKTPEEAV 241

Query: 310 ADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY 369
           A AI AGLDL+CG FLG HTE+AV+ GL+ E  ++ A+ N     MRLG FDG PS   Y
Sbjct: 242 AKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIY 301

Query: 370 GHLGPKDV 377
           G LGPKDV
Sbjct: 302 GKLGPKDV 309


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 156/340 (45%), Positives = 215/340 (63%), Gaps = 31/340 (9%)

Query: 6   AFFFLGLILLSAS-SSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           + FF  ++L   S S+  + +  + C P    + T  FC  SL +P R + LI  L+L+E
Sbjct: 7   SLFFSCIVLFELSVSNSESHKLQYPCQPPLHNSYT--FCNQSLSVPTRAHSLISLLTLEE 64

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSF 123
           K+K L   A+ +PR GI  YEWWSE+LHG++  GPG  F  G    AT FPQVI +A++F
Sbjct: 65  KIKQLSDNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQVIISAAAF 124

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           N TLW  IG  ++ EARAM+N G +GLT+W+PNVNIFRDPRWGRGQETPGEDP+L+  YA
Sbjct: 125 NRTLWFLIGSAIAIEARAMHNVGQSGLTFWAPNVNIFRDPRWGRGQETPGEDPMLTSAYA 184

Query: 184 ASYVRGLQGSDGD--------------------------RLKVAASCKHFTAYDLDNWNG 217
             +V+G QG +                             L ++A CKH TAYDL+ W  
Sbjct: 185 IEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGN 244

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R+ FNA V++QD+EDT+  PFR C+ EGK + +MCSYN+VNGVP CA  ++L++  R 
Sbjct: 245 FSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACAREDLLQKA-RE 303

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           EW   GYIVSDCD+V   ++ Q+++ + E+A A A++AG+
Sbjct: 304 EWGFEGYIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 157/316 (49%), Positives = 206/316 (65%), Gaps = 11/316 (3%)

Query: 11  GLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI 70
            L+L+   +   A   PF+C      +  LPFC   LP  QR  DL+ R++  EK   L 
Sbjct: 69  ALLLMRVVAVVAAGAPPFSCG--GGPSLGLPFCNTKLPAAQRAADLVSRMTPAEKASQLG 126

Query: 71  SGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG-GDFPGATSFPQVITTASSFNATLWE 129
             A  VPRLG+  Y+WW+EALHGV+  G G     G    ATSFPQV+ TA+SFN  LW 
Sbjct: 127 DVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWF 186

Query: 130 AIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVR 188
            IG+    EARA YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA++VR
Sbjct: 187 RIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVR 246

Query: 189 GLQGSDGDR------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
           GLQGS  +       L  +A CKH TAYDL++W GV R+ F A V+ QD+ DTF+ PFR 
Sbjct: 247 GLQGSSSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRS 306

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG-YIVSDCDSVGVYYDTQHF 301
           CV++GK + VMC+Y  VNGVP+CA+ ++L +T RG W L+G Y+ +DCD+V +  ++Q +
Sbjct: 307 CVVDGKASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFY 366

Query: 302 TSTPEEAAADAIRAGL 317
             T E+  A  ++AG+
Sbjct: 367 RPTAEDTVATTLKAGM 382


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 181/448 (40%), Positives = 254/448 (56%), Gaps = 30/448 (6%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           T  F    LPI +RVNDL+GRL+L+EK+  +++ A A+ RLGI  Y WW+E LHGV+   
Sbjct: 23  TCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR-- 80

Query: 99  PGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GL 150
                   +P  TSFPQ I  A++++      +    SDE RA+Y+  T         GL
Sbjct: 81  ------SPYP-VTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGL 133

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
           TYWSPN+NIFRDPRWGRGQET GEDP L+     S+V+GLQG D   LK +A  KH+  +
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQGDDPVYLKSSACAKHYAVH 193

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
               WN   R  ++AKV+  D+ DT+   F+  V+EGKV  VMC+YN   G P C +  +
Sbjct: 194 SGPEWN---RHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPCCGNDLL 250

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           +   +R  W+  GY+ SDC +V  +Y+T         A+ADA+  G D +CG        
Sbjct: 251 MMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNGAYRALA 310

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEA 389
            AV RGL++E  I+ +L     ++ RLGMFD +    PY ++ P  V   D H+  AL+ 
Sbjct: 311 DAVLRGLITEKQIDESLKKLFEIRFRLGMFDPD-DRVPYSNI-PLSVLECDAHKAHALKI 368

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           ARQ IVLLKNQ   LPL+  + + +AV+GPN+D    ++ NY G     TT L+GI    
Sbjct: 369 ARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEGIQKKV 428

Query: 446 GRYARTIHQQGCKDVACADDQLFGAAID 473
           G     ++++G   V   DD +F +A +
Sbjct: 429 GNQVEVVYEKG---VNLTDDFVFTSAYE 453



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 140/298 (46%), Gaps = 52/298 (17%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A + AD  I V GL   +E E +          DR  + +P  QQ L+ ++    K P +
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
            V+M+G  + + +      + AI+ A Y GQAGG AIAD+LFG  NP G+LP+T+Y    
Sbjct: 654 FVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           + +LP  E             RTYR++ G  VYPFG+G+SYT F ++             
Sbjct: 710 VNDLPDFE-------DYSMENRTYRYFTGTPVYPFGYGLSYTTFQYSSL----------- 751

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
                +  +   +++KVT              + N G  +G     ++ + P     P +
Sbjct: 752 ----KLQPSPDKRSVKVT------------AKITNTGKMEGEEVAQLYVSNPRDFVTPIR 795

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSL 760
            L  F+++++  G  Q V   +   K LSVVD SG + +P+ G+  I +GG + S  +
Sbjct: 796 ALKGFKRINLKPGESQTVEF-VLTSKELSVVDISG-KSVPMKGKVQISLGGGQPSTQM 851


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  318 bits (815), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 169/406 (41%), Positives = 237/406 (58%), Gaps = 50/406 (12%)

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           LKV++ CKH+  YD+D+W           VS+QD+++TF  PF                 
Sbjct: 19  LKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE---------------- 53

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAG 316
                             R EW L+GYIVSDC  + V  D Q++ +  + +A A  ++AG
Sbjct: 54  ------------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAG 95

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
           LDL+CG +       +V  G +S+ +++ AL N   + MR+G FDG P+   Y  LG KD
Sbjct: 96  LDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKD 152

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           +C  DH ELA EAARQGIVLLKN    LPL     + + ++GP+++ T  MIGNYAG+  
Sbjct: 153 ICAADHIELAREAARQGIVLLKNDYEVLPLK--PGKKLVLVGPHANATEVMIGNYAGLPY 210

Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL 496
            Y +PL+         +  GC D +C++D  F  A +A++ A+ TI+ +G D SIEAE +
Sbjct: 211 KYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFV 270

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR   LLPG Q EL+ +V+  S GP ILV++SG  ID+ FAKN+PRI+AI+W G+PG+ G
Sbjct: 271 DRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQG 330

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM-AMRPSQSKR 601
           G AIAD++FG  NPGG+LP+TWY  +Y+  L    M A  PS  KR
Sbjct: 331 GHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIMDAKTPSPQKR 376


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  318 bits (814), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 183/462 (39%), Positives = 266/462 (57%), Gaps = 44/462 (9%)

Query: 8   FFLGLILLSASSSGLAA----REPFACDPKD--ATTRTLPFCQVSLPIPQRVNDLIGRLS 61
           F +G+I L     GL A      P A  P+D  A+T T PF   SL +  RV+DL+ RL+
Sbjct: 5   FLVGVITLGVILQGLTACSKSAAPNANQPQDTAASTATYPFRDASLSVDARVDDLVSRLT 64

Query: 62  LQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTAS 121
             EK+  + +   A+ RLGI  Y WW+E+LHGV+  G           AT +PQ I  AS
Sbjct: 65  TTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAGK----------ATVYPQAIGLAS 114

Query: 122 SFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPG 173
           +F+  L   +   +SDE RA Y+   +        GLT+WSPN+NIFRDPRWGRGQET G
Sbjct: 115 TFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGGLTFWSPNINIFRDPRWGRGQETYG 174

Query: 174 EDPVLSGKYAASYVRGLQGS--DGDRLKVAASCKHFTAYD-LDNWNGVDRFHFNAKVSKQ 230
           EDP L+G+ A ++V+G+QG   + D LK  A+ KH+  +   +     D +H     +++
Sbjct: 175 EDPFLTGRMAINFVKGIQGENDNSDYLKAVATIKHYAVHSGPEKTRHSDDYH----PTRK 230

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
           D+ +T+   FRM + E  V S+MC+YN+V+G P C +  +++  +RG+   NGY+VSDC 
Sbjct: 231 DLFETYLPAFRMAIAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCG 290

Query: 291 SVGVYYDTQ--HFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDI 343
           ++  +Y+++  H   +P EAAA A+++G DL+CG      +  LH   A+Q+GL++E  I
Sbjct: 291 AIADFYESRSHHVVDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHY--ALQQGLITEDYI 348

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+      +++LGMFD E    PY  +G   V +P H  L  EAA + IVLLKN G  
Sbjct: 349 DIAVKRLFKARIKLGMFD-EQDRVPYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNNG-V 406

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           LPL       VAVIGPN+     ++GNY G+      PL+GI
Sbjct: 407 LPLK--AGVKVAVIGPNAVDEDVLVGNYHGVPVKPVLPLEGI 446



 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 143/313 (45%), Gaps = 55/313 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+ A+R+AD  I + G+D  +E E +          DR  + LP  Q  L+ ++    K 
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P ++V  SG  +  A      ++ AI+ A YPG+A GTA+A+IL+G  +P G+LP+T+Y 
Sbjct: 679 PVVMVNFSGSAM--ALNWESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              + +LP                RTY+FY+G  +Y FGHG+ Y +F +       VVA 
Sbjct: 737 G--VDDLPAF-------NDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYN----NLVVA- 782

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG---SKDGAHTLLVFSTPPAG 697
                    N   +GKA            L + V V N G   ++D A   +     PA 
Sbjct: 783 ---------NTAEAGKA------------LPIAVSVTNTGKMQAEDVAQVYISLLDAPAN 821

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG-GTKH 756
              P + L AF++  + AG    +  N+   + L+ +D +G  +   G   + +G G K 
Sbjct: 822 --TPIRDLKAFKRTKLAAGESTELEFNLP-ARVLTYIDDNGKTQTYTGRVEVTVGSGQKG 878

Query: 757 SVSLHAATLGVIK 769
            V  +A  +  I 
Sbjct: 879 YVKENAIAVATIN 891


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 226/683 (33%), Positives = 342/683 (50%), Gaps = 88/683 (12%)

Query: 73  AAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG-ATSFPQVITTASSFNATLWEAI 131
           A  +PRLGI  Y W +E L G           G+ PG AT+FPQ +  A++F+  L   +
Sbjct: 6   APPIPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRV 55

Query: 132 GRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
               + E RA +N   A        GL+ +SP +NI R P WGR QET GEDP LSG+ A
Sbjct: 56  ANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELA 115

Query: 184 ASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMC 243
            S+V+GLQG     +K +A CKHF+ +        +   +   V ++D   TF   F+ C
Sbjct: 116 RSFVQGLQGPHPRYVKASAGCKHFSVHGGHE----NILLYLLTVLERDWRMTFLPQFQAC 171

Query: 244 VMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS 303
           V  G   S MCSYN++NGVP CA+  +L   +RGEW  +GY+VSD  +V +     H+T 
Sbjct: 172 VRAGSY-SFMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIMLGHHYTR 230

Query: 304 TPEEAAADAIRAGLDLDCGPFLG----LHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           +  E A  ++ AG +L+    +     +    A+  G ++   + + +      +MRLG 
Sbjct: 231 SFLETAVASVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFYTRMRLGE 290

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS--HIRHRTVAVI 417
           FD  P+  PY  L    V +P+H+ L+LEAA +  VLLKN   +LPL    +  + +AV+
Sbjct: 291 FD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSSQHLAVV 349

Query: 418 GPNSDVTVTMIGNYAGIACG--YTTPLQGIGRY-ARTIHQQGCKDVACADDQLFGAA--I 472
           GP +D    + G+YA +       TP +G+    A      GC +  C   Q +  A  +
Sbjct: 350 GPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRC---QRYSRAELV 406

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG-PTILVLMSGGP 531
                AD  ++ +G    +E EA DR+ L LPG Q EL+     A+ G P IL+L + GP
Sbjct: 407 KVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVILLLFNAGP 466

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGT--SNPGGKLPMTWYPQEYITNLPM 589
           +DV++A+    + AI+   +P QA G AIA +L G   ++P G+LP TW         P 
Sbjct: 467 LDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATW---------PA 517

Query: 590 TEMAMRPSQSKRYPGRTYRFY-KGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHG 647
               + P ++    GRTYR+Y +   +YPFG+G+SYT F +  +  +P V+ +       
Sbjct: 518 GMHQVPPMENYTMEGRTYRYYGQEAPLYPFGYGLSYTTFRYRDLVLSPPVLPL------- 570

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPH 702
                            C  L+  V V ++N G +D    + ++     S+ P   W   
Sbjct: 571 -----------------CANLS--VSVVLENTGLRDSEEVVQLYLRWEHSSVPVPRW--- 608

Query: 703 KQLVAFEKVHVPAGAQQRVGINI 725
            QLVAF +V VPAG + ++   +
Sbjct: 609 -QLVAFRRVAVPAGREAKLSFQV 630


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  311 bits (798), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 209/610 (34%), Positives = 307/610 (50%), Gaps = 56/610 (9%)

Query: 41  PFCQVSLPIP-QRVNDLIGRLSLQEKVKLLIS---------GAAAVPRLGIKGYEWWSEA 90
           P C     +P   +  LI  +   EK  LL +             V R+G+  Y+W   A
Sbjct: 27  PHCLTMFRVPIGWMQSLISLIHDDEKPPLLTAREGGGGSPGPPGNVSRIGLPEYDWGMNA 86

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-- 148
           +HGV +       G  +   TSFP  +    ++N + +  +GR++  E RA++  G    
Sbjct: 87  IHGVQSSCIKDDDGTVY-CPTSFPNPVNYGFTWNYSAYLELGRIIGVETRALWLAGAVEA 145

Query: 149 ---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
                    GL  WSPN+NI R P WGR QE PGEDP ++G++  +Y  GLQG D   L+
Sbjct: 146 STWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFMNGQFGKAYTLGLQGDDDTYLQ 205

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
              + KH+ AY L++ +G  R +FNA VS   + DT+   FR+ V EGK   VMCSYN V
Sbjct: 206 AIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAV 265

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAG-LD 318
           NG+PTCA P +L+  +R  W+ +GY+ SD  +V    D   +T +   AA  AIR G  D
Sbjct: 266 NGIPTCAHP-LLRTVLRDLWKFDGYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTD 324

Query: 319 LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
           +D G          V  G     D++NAL NTL ++  LG+FD    +Q Y H+    V 
Sbjct: 325 IDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLRFELGLFD-PVENQSYWHVPLAAVN 383

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-- 436
           T   +   +    + +VLL+N+   LPL+   +  VA+IGP++     M+GNY G  C  
Sbjct: 384 TNASRATNMLHTLESMVLLQNKNNVLPLA--SNTKVALIGPHAKAQEDMVGNYLGQLCPD 441

Query: 437 --------GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD 488
                    +   +  +G  A T +  G     C+   +   A+  +  AD  +L++G+D
Sbjct: 442 NNFDCVVSPHDALVSILGTDAVT-YAPGTNVTTCSQSHI-DEAVSVATAADVAVLMLGID 499

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
           +SIEAE+ DR  + LP  Q +L S +    K PT++VL++GG + +   K   +  AII 
Sbjct: 500 ESIEAESNDRKSIDLPECQHQLASAIFAVGK-PTVIVLLNGGMLAIENEKQ--QADAIIE 556

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR 608
           AGYPG  GGTAIA  L G +   G         +YI  + M++M M        PGRTYR
Sbjct: 557 AGYPGFYGGTAIAQTLTGQNEHLG---------DYINWINMSDMEMTSG-----PGRTYR 602

Query: 609 FYKGPVVYPF 618
           +YK   ++ F
Sbjct: 603 YYKNETLWAF 612


>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 517

 Score =  311 bits (798), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 187/517 (36%), Positives = 268/517 (51%), Gaps = 13/517 (2%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           +F LG     +      +R     D  +    T   C  +L  P+R   L+  LS++EK+
Sbjct: 5   WFILGASAFVSGQEDPESRRQSYPDCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKL 64

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSF 123
           + L+S +   PR+G+  Y WWSEALHGV+   PGT F     +F  +TS+P  +  A+ F
Sbjct: 65  QNLVSKSQGAPRIGLPAYNWWSEALHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGF 123

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           +  L E IG  +  EARA  N G AG  YW+PNVN F+DPRWGRG ETPGED +   +YA
Sbjct: 124 DDNLIEKIGTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYA 183

Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
               RGL G    ++ ++ ++CKH+   D ++WNG  R  FNAK++ QD+ + +  PF+ 
Sbjct: 184 EYITRGLDGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQ 243

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQ 299
           C  + KV S+MC+YN VNGVP+CA+  +L+  +R  W+    N Y+ SDC++V       
Sbjct: 244 CARDSKVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANH 303

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+  T     A    AG+D  C          A  +GLL E  ++ AL+      +R G 
Sbjct: 304 HYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGY 363

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDGE +   Y  L  + V + + Q LAL+AA +G+VLLKN G +LPL       +A+IG 
Sbjct: 364 FDGEEAM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGF 420

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGCKDVACADDQLFGAAIDASRQ 477
            +D    + G Y+G A    +P     +     T+          A D     A++A+  
Sbjct: 421 WADAPEKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASG 480

Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
           AD  +   GLD S   E LDR  L  P  Q  LV  V
Sbjct: 481 ADYILYFGGLDTSAAGETLDRTDLDWPEAQLTLVKVV 517


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  311 bits (797), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 198/511 (38%), Positives = 269/511 (52%), Gaps = 62/511 (12%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           A DP  A    + +   +L    R  DL+ R++L+EK   L   A A+PRLG+  Y WW+
Sbjct: 10  ALDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWN 69

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY----- 143
           E LHGV+  G           AT FPQ I  A++++  +   +G VVS E RA Y     
Sbjct: 70  EGLHGVARAGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVH 119

Query: 144 -NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
            +GGT    GLT WSPN+NIFRDPRWGRGQET GEDP L+ +    Y+ GLQG+D    K
Sbjct: 120 PDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGNDPKFFK 179

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
             A+ KHF  +     N   R   +   SK D+EDT+   FR  V EGK  SVMC YN V
Sbjct: 180 TVATSKHFAVHSGPESN---RHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYNAV 236

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAGL 317
            GVP CA   +++  +R  W   G++VSDC +    +  D  H+T T EE  A  ++AG+
Sbjct: 237 YGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKAGM 296

Query: 318 DLDCGPFLG-LHTE-----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
           DL CG +   + TE     +AV+ G L    ++ ALV     ++RLGMFD  P+S P+ H
Sbjct: 297 DLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPASLPFAH 355

Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           +   D  TP H  +AL+ A++ +VLLKN G  LPL     +T+AVIGPN+D    ++GNY
Sbjct: 356 ITADDSDTPAHHAVALDMAKKSMVLLKNDG-LLPLK-AEPKTIAVIGPNADSLDALVGNY 413

Query: 432 AGIACGYTTPLQGI-GRY--ARTIHQQG----------------CKDVAC---------- 462
            G      T L GI  R+  A+ ++ +G                C D AC          
Sbjct: 414 YGKPSKPVTVLDGIRARFPTAKIVYAEGTGLIGPAEPPVPDSVLCLDGACTKQGLTAEHF 473

Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEA 493
           A D L GA +    +A+A +   G D+S  A
Sbjct: 474 AGDDLQGAPVATRTEANARLDWQGEDKSSSA 504



 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 146/298 (48%), Gaps = 52/298 (17%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
           + G A+D ++ AD  + V GL   +E E +          DR  + LP  QQ+L+ KV +
Sbjct: 587 MAGQAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKV-I 645

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
            +  PT+LVLMSG  + V +A  D  + AII A YPG  GG A+A ++ G  +P G+LP+
Sbjct: 646 GTGKPTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPV 703

Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T+Y    +  LP  ++  M+         RTYR++ G V+YPFGHG+SYT F +  AN  
Sbjct: 704 TFY--RSVDALPGFSDYTMK--------NRTYRYFNGEVLYPFGHGLSYTTFAY--ANPK 751

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                                      +     ++ V VDV N G+ D    + ++ + P
Sbjct: 752 V-----------------------SAASVAAGSSVTVSVDVSNSGAMDSDEVVQLYVSHP 788

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            G     + L  F++V +  G  + V   +   + LSVVD  G R++  G+ ++ IGG
Sbjct: 789 GG--TAIRSLQGFQRVSLKKGETKTVQFKLD-DRALSVVDEHGGRKVQAGQVDLWIGG 843


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  311 bits (797), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 178/444 (40%), Positives = 249/444 (56%), Gaps = 29/444 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+     P   R  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV+  G  
Sbjct: 19  PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++   A        GLT+
Sbjct: 77  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQ   G   K+ A+ KHF  +  
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHFAVH-- 186

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
            +    DR HF+   S++D+ +T+   F+  V EGKVA+VM +YN+VNG    A    L+
Sbjct: 187 -SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESASASTR-LE 244

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
             +R +W  +GYIVSDC ++   +       TPE AAA  ++ G DLDCG        +A
Sbjct: 245 GILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPAA 303

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           V+ GL+ E  I+ AL   +T +MRLGMFD  P+  P+  +      +P H  LA   AR+
Sbjct: 304 VRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALARRTARE 362

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYA 449
            +VLLKN G  LPL     R +AVIGP +D  ++++GNY G      T LQGI      A
Sbjct: 363 SLVLLKNDG-VLPLKPTLKR-IAVIGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQA 420

Query: 450 RTIHQQGCKDVACADDQLFGAAID 473
           + I+ +G   V   +D    A ID
Sbjct: 421 QVIYARGSDLVEGREDPNAAAPID 444



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 143/302 (47%), Gaps = 53/302 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R A+  + V GL   +E E +D          R    LP  Q+EL+  +  A+  
Sbjct: 603 AVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 661

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GG+A+ D+LFG ++PGG+LP+T+Y 
Sbjct: 662 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYK 719

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           +     LP   + AMR        GRTYR+++G  +YPFGHG+SYT F ++         
Sbjct: 720 E--AERLPAFDDYAMR--------GRTYRYFQGKPLYPFGHGLSYTQFAYS--------- 760

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
              D R         G             TL   V +KN G + G   + ++  P  P  
Sbjct: 761 ---DLRLDRTTVAADG-------------TLTATVTLKNTGQRAGDEVVQLYLHPLKPQR 804

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
             A  K+L   +++ +  G Q+++   I     L + D +     +  G + + IG +  
Sbjct: 805 ERA-LKELHGLQRITLQPGEQRQLRFTIKAQDALRIYDEQRKAYAVDPGAYEVQIGASSA 863

Query: 757 SV 758
            +
Sbjct: 864 DI 865


>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
          Length = 515

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 186/514 (36%), Positives = 267/514 (51%), Gaps = 13/514 (2%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           +F LG     +      +R     D  +    T   C  +L  P+R   L+  LS++EK+
Sbjct: 5   WFILGASAFVSGQEDPESRRQSYPDCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKL 64

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF---GGDFPGATSFPQVITTASSF 123
           + L+S +   PR+G+  Y WWSEALHGV+   PGT F     +F  +TS+P  +  A+ F
Sbjct: 65  QNLVSKSQGAPRIGLPAYNWWSEALHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGF 123

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYA 183
           +  L E IG  +  EARA  N G AG  YW+PNVN F+DPRWGRG ETPGED +   +YA
Sbjct: 124 DDNLIEKIGTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYA 183

Query: 184 ASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
               RGL G    ++ ++ ++CKH+   D ++WNG  R  FNAK++ QD+ + +  PF+ 
Sbjct: 184 EYITRGLDGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQ 243

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL---NGYIVSDCDSVGVYYDTQ 299
           C  + KV S+MC+YN VNGVP+CA+  +L+  +R  W+    N Y+ SDC++V       
Sbjct: 244 CARDSKVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANH 303

Query: 300 HFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           H+  T     A    AG+D  C          A  +GLL E  ++ AL+      +R G 
Sbjct: 304 HYAPTNAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGY 363

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FDGE +   Y  L  + V + + Q LAL+AA +G+VLLKN G +LPL       +A+IG 
Sbjct: 364 FDGEEAM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGF 420

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--TIHQQGCKDVACADDQLFGAAIDASRQ 477
            +D    + G Y+G A    +P     +     T+          A D     A++A+  
Sbjct: 421 WADAPEKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASG 480

Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
           AD  +   GLD S   E LDR  L  P  Q  LV
Sbjct: 481 ADYILYFGGLDTSAAGETLDRTDLDWPEAQLTLV 514


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 173/444 (38%), Positives = 250/444 (56%), Gaps = 36/444 (8%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +        QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
                  GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD---------RLKVAASC 204
           SPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG D D           K+ A+ 
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRKLDATA 206

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +   +    DR HF+A  S++D+ +T+   F   V EGKV +VM +YN+V G   
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
            A   +L+  +R  W  +GY+VSDC ++   +      +T EEAAA A++ G +L+CG  
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                 +AV++GL+SE D++NAL   +  +MRLGMFD  P    +  +      +P+H  
Sbjct: 324 YST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEHDA 381

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LA   AR+ +VLLKN G  LPLS  + + +AV+GP +D T+ ++GNY G      T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440

Query: 445 IGRY---ARTIHQQGCKDVACADD 465
           I      A  ++ +G   V   DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464



 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/301 (30%), Positives = 142/301 (47%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R+AD  + V GL   +E E +          DR  L LP  Q+ L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFY- 746

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++          
Sbjct: 747 KESETLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAYS---------- 788

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
             D R         G+             L   + VKN G + G   + ++  P  P   
Sbjct: 789 --DLRLDRSKLAADGR-------------LHATLKVKNTGQRAGDEVVQLYLQPLSPQRE 833

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHS 757
            A  K L  F+++ +  G  + V   I     L + D +     +  G++ + +G +   
Sbjct: 834 RA-SKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVGASSSD 892

Query: 758 V 758
           V
Sbjct: 893 V 893


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 172/406 (42%), Positives = 243/406 (59%), Gaps = 33/406 (8%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           R   L+ +++L EKV  + + A A+PRLG+  Y+WWSE LHG++  G           AT
Sbjct: 32  RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YAT 81

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDP 163
            FPQ I  A+S++ +L  A+G V+S EARA +N        G   GLT WSPN+NIFRDP
Sbjct: 82  VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
           RWGRGQET GEDP L+G+ A ++VRG+QG D    +  A+ KHF A+   +     R  F
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQHPRAIATPKHFVAH---SGPEAGRDSF 198

Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
           +  VS  D+EDT+   FR  V++G   SVMC+YN ++G P CA+  +L   +R +W   G
Sbjct: 199 DVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTRLRKDWGFAG 258

Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES----AVQRGLLS 339
           Y+VSDCD+VG      +F     +A+  A++AG DLDCG     HT +    AV++G ++
Sbjct: 259 YVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCG-----HTYASLAQAVRQGDIA 313

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
           E  ++ +LV   T + RLG   G   + PY  +G   + +P H++LAL+AA + +VLLKN
Sbjct: 314 ESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQIDSPAHRKLALQAALESLVLLKN 372

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
              +LPL H   R +AVIGP++D   T+  NY G A    TPLQG+
Sbjct: 373 AHSTLPL-HAGMR-LAVIGPDADALETLEANYHGTARHPVTPLQGL 416



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 144/285 (50%), Gaps = 48/285 (16%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           ADA +  +GL   +E E L          DR  + LP  Q+ L+ + + AS  P I+VL+
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  + + +A+      AI+ A YPGQAGGTAIA +L G  NPGG+LP+T+Y      +L
Sbjct: 657 SGSAVALDWAQQ--HADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFY--RSTRDL 712

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P       P  S    GRTYR++ G  +YPFG+G+SYT F +    APT+ A        
Sbjct: 713 P-------PYVSYAMQGRTYRYFDGRPLYPFGYGLSYTRFTYA---APTLSA-------- 754

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
              AT+               TL V  +V+N G + G   + V+   P    AP   LV 
Sbjct: 755 ---ATLKAGG-----------TLQVSAEVRNAGQRAGDEVVQVYLDTPPSPLAPRHALVG 800

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           F ++H+ AG Q+ V   +   + LS VD +G R +  G++ + IG
Sbjct: 801 FRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIG 844


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 36/445 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+        QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------AS 203
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++VRGLQG  GD  K A         A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G  
Sbjct: 204 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
             A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA A++ G +L+CG 
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
                  +AV +GL+ E  I+ +L   +T +MRLGMFD  P   P+  +      +P H 
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GNY G      T LQ
Sbjct: 379 ALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 444 GI---GRYARTIHQQGCKDVACADD 465
           GI      A+ ++ +G   V   DD
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 57/304 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R A+  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 688 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
           +     LP   + AM         GRTYR++ G  +YPFGHG+SYT F ++    +  TV
Sbjct: 746 ES--EALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTV 795

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
            A       GS  AT++                     VKN G + G     ++  P  P
Sbjct: 796 AA------DGSFTATVT---------------------VKNTGQRAGDEVAQLYLHPLTP 828

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
               A  K+L  F++V +  G Q+ +   I+  + L + D +  T  +  G + + IG +
Sbjct: 829 QRERA-GKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKTYTVDPGAYEVQIGAS 887

Query: 755 KHSV 758
              +
Sbjct: 888 SADI 891


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 169/412 (41%), Positives = 242/412 (58%), Gaps = 33/412 (8%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR +DL+ R++L+EKV  + + A A+PRLG+  Y+WW+E LHGV+  G          GA
Sbjct: 43  QRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG----------GA 92

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +SDEARA ++        G   GLT+WSPN+NIFRD
Sbjct: 93  TVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFWSPNINIFRD 152

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
           PRWGRGQET GEDP L+ +   ++V+G+QG   D  K A         A+ KHF  +   
Sbjct: 153 PRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRKLDATAKHFAVH--- 209

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +    +R HF+A+ S++D+ +T+   F   V EGKV +VM +YN++ G    A   +L+ 
Sbjct: 210 SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGESASASKFLLRD 269

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R  W  +GY+VSDC ++   +      +T E+AAA A++ G  L+CG        +AV
Sbjct: 270 VLRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVKNGTQLECGQEYAT-LPAAV 328

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
           Q+GL+ E DI+ AL   +T +MRLGMFD  P    +  L      +P+H  LA   AR+ 
Sbjct: 329 QQGLIGETDIDAALRTLMTARMRLGMFD-PPGQLRWAQLPISVNQSPEHDALARRTARES 387

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +VLLKN G  LPLS  +H+ +AVIGP +D T+ ++GNY G      T LQGI
Sbjct: 388 LVLLKNDG-LLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVTILQGI 438



 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 149/303 (49%), Gaps = 51/303 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R+AD  + V GL   +E E +          DR  L LP  Q+EL+  +S   K 
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQE--HVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
                 LP   + AMR        GRTYR++ G  +YPFGHG+SYT F ++         
Sbjct: 745 AS--ETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------D 786

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
           + LD R  + +  +S   +KVT                N G++ G   + ++  P A   
Sbjct: 787 LRLDRRKVAADGQLSA-TLKVT----------------NTGTRAGDEVVQLYLHPLAPTR 829

Query: 700 APH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHS 757
           A   K+L  F+++ +  G  + V   I     L + D +     +  G++ + +G +   
Sbjct: 830 ARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKHYVVDPGDYELQVGASSAD 889

Query: 758 VSL 760
           V +
Sbjct: 890 VRV 892


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 36/445 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+        QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------GTAGLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------AS 203
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++VRGLQG  GD  K A         A+
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G  
Sbjct: 204 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
             A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA A++ G +L+CG 
Sbjct: 261 ASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
                  +AV +GL+ E  I+ +L   +T +MRLGMFD  P   P+  +      +P H 
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GNY G      T LQ
Sbjct: 379 ALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 444 GI---GRYARTIHQQGCKDVACADD 465
           GI      A+ ++ +G   V   DD
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 57/304 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R A+  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 688 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
           +     LP   + AM         GRTYR++ G  +YPFGHG+SYT F ++    +  TV
Sbjct: 746 ES--EALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRNTV 795

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
            A       GS  AT++                     VKN G + G     ++  P  P
Sbjct: 796 AA------DGSFTATVT---------------------VKNTGQRAGDEVAQLYLHPLTP 828

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
               A  K+L  F++V +  G Q+ +   I+  + L + D +  T  +  G + + IG +
Sbjct: 829 QRERA-GKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKTYTVDPGAYEVQIGAS 887

Query: 755 KHSV 758
              +
Sbjct: 888 SADI 891


>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1357

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 237/814 (29%), Positives = 379/814 (46%), Gaps = 121/814 (14%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHG 93
           A + T  +   S P   R  DL+ R++L EK  +L  + A A+PRLG++ Y + +EA HG
Sbjct: 42  AASSTPIYLNTSYPFEARAADLVSRMTLAEKAAQLNTTSAPAIPRLGVQQYTYQAEAQHG 101

Query: 94  VSNVGPGTKFG---GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG- 149
           ++ +G     G   G+ P ATSFP    ++ S++  L       VSDEAR + +    G 
Sbjct: 102 INYLGGDQNSGSVAGNPPVATSFPTNFASSMSWDPALVYQETTAVSDEARGLVDKSLFGT 161

Query: 150 --------------LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-- 193
                         LT+W+P VN+ RDPRWGR  E  GEDP L G+ A ++V G QG+  
Sbjct: 162 GQNNLGPSASDYGSLTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNGFQGNSM 221

Query: 194 -----DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGK 248
                DG  LK AA+ KH+   D++     +R   ++ VS  D+ D +   F   +    
Sbjct: 222 TGQSLDG-YLKAAATAKHYALNDVEQ----NRTGISSNVSDTDLRDYYTKQFADLIENSH 276

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY------------ 296
           VA +M SYN +NG P+ AD     +  +  +  NGY+ SDC +VG  Y            
Sbjct: 277 VAGLMTSYNAINGTPSVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGHAWAPP 336

Query: 297 -------DTQHFTSTPEEAA---------ADAIRAGLDLDCG--PFLGLHTESAVQRGLL 338
                  DT    +     A         A ++RAG  ++CG   F   + ++A+  G+L
Sbjct: 337 GWTTDGGDTNSIWTNTSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAISAGIL 396

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           SE  I++ L    T++M  G FD  P+S+ PY  +    + +P HQ LA   A   +VLL
Sbjct: 397 SEGVIDSDLTKLFTIRMETGEFD--PASKVPYTSITKAQIQSPAHQALATSVADNSLVLL 454

Query: 398 KN------QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC-------GYTTPLQG 444
           KN        P LP S  +   V ++G  ++     +G+Y+G          G TT ++ 
Sbjct: 455 KNANVSGTSAPLLPASASKLANVVILGDMANQVT--LGDYSGAPSLQVNAVQGLTTAIKA 512

Query: 445 IGRYARTIHQQGCKDVACADDQLFGAAIDAS-RQADATILVMGLDQSIEAEALDRAGLLL 503
               A  +                 +A  A+ ++AD  ++ +G +Q+   E  DR  L +
Sbjct: 513 ANPSANILFDAAGTSSTTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDRTTLNM 572

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PG    L+++ +      T LV+ S GP+ ++  +    + A++++GY G++ GTA+AD+
Sbjct: 573 PGNYDSLITQTTALGNPKTALVVQSDGPVKISDVQG--SVPAVVFSGYNGESQGTALADV 630

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           L G  NP G L  TWY  +  + LP M+   + P  +    GRTY+++ G   YPFG+G+
Sbjct: 631 LLGKQNPSGHLNFTWYADD--SQLPAMSNYGLTPGDTSGL-GRTYQYFTGTPTYPFGYGL 687

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGS 681
           SY+ F ++ A                           V +A  N   T+ V   V N GS
Sbjct: 688 SYSAFTYSAA--------------------------TVDNASPNADGTVNVSFKVTNSGS 721

Query: 682 KDGAHTLLVFST---PPAGHWAPHKQLVAFEKVHV-PAGAQQRVGINIHVCKYLSVVDRS 737
             GA    +++      +G   P K+LV F+K  V   GA Q++ I + +   LS  + +
Sbjct: 722 TAGATVAQLYAATQFTESGVQLPTKRLVGFQKTGVLNPGAAQQITIPVKISD-LSFWNAT 780

Query: 738 GTRRIPL-GEHNIHIGGTKHSV--SLHAATLGVI 768
             + +   G + + +G +   +  S++ A  G I
Sbjct: 781 TMKSVVYDGTYALQVGASASDIRTSVNVAVSGAI 814


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  308 bits (789), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 170/426 (39%), Positives = 247/426 (57%), Gaps = 31/426 (7%)

Query: 30  CDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           CD  + + +  P +   S  I +RVNDL+ RL+L+EKV  L   + A+ RL I  Y WW+
Sbjct: 21  CDNDEQSMQPPPLYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWN 80

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
           EALHGV+  G           AT FPQ I  A++F+  L   +G  +SDE RA ++   A
Sbjct: 81  EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLA 130

Query: 149 --------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
                   GLTYWSPN+NIFRDPRWGRGQET GEDP L+ + A +++ GLQG + + LK 
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNTEYLKS 190

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            A+ KH+  +   +   V R   +   SK+D+ +T+   F+  + + KVASVMC+YN VN
Sbjct: 191 VATLKHYAVH---SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVN 247

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGLD 318
           G P C +  +++  +R E+  +GYIVSDC ++  +YD  + +  +T  +AAA A++ G D
Sbjct: 248 GTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMALKTGTD 307

Query: 319 LDCGPFLG---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           L+CG   G    +   AV+ GL+ E D++ AL   +  + +LGMFD  P + PY      
Sbjct: 308 LNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSID 366

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
            V +  H  L  EAA++ +VLLKN+   LPL    +  VA+IGPN+D    ++GNY G+ 
Sbjct: 367 IVGSNKHLALTQEAAKKSLVLLKNE-QVLPLK--GNEKVALIGPNADNEAILLGNYNGMP 423

Query: 436 CGYTTP 441
               TP
Sbjct: 424 IVPITP 429



 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 131/279 (46%), Gaps = 51/279 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A++ + +AD  + V G+  ++E E +          DR  + LP  Q  L+ K+    K 
Sbjct: 599 ALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTGK- 657

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P +LV MSG  +  A    +  I AII   YPG+A G+A+  +L+G  +P GKLP+T+Y 
Sbjct: 658 PIVLVNMSGSAM--ALNWENENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITFYK 715

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +++LP   + +M+         RTY++Y+G V+YPFG G+SY +F +          
Sbjct: 716 S--VSDLPDFKDYSMK--------NRTYKYYEGEVLYPFGFGLSYADFKYKNTRH----- 760

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
             +D   G +N T +     +T+         VQV V               S P A   
Sbjct: 761 -SIDAGSGDLNLTTT-----ITNQSSFSADDVVQVYV---------------SMPDAPIK 799

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
            P+KQLV F+ + +   ++  +   I   K LS ++  G
Sbjct: 800 TPNKQLVGFKHITLKNESKNDIKFTIPKNK-LSYINEQG 837


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 173/444 (38%), Positives = 248/444 (55%), Gaps = 36/444 (8%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +        QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
                  GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASC 204
           SPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG   D  K A         A+ 
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRKLDATA 206

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +   +    DR HF+A  S++D+ +T+   F   V EGKV +VM +YN+V G   
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
            A   +L+  +R  W  +GY+VSDC ++   +      +T EEAAA A++ G +L+CG  
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                 SAV++GL+SE D++ AL   +  +MRLGMFD  P    +  +      +P+H  
Sbjct: 324 YST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEHDA 381

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LA   AR+ +VLLKN G  LPLS  + + +AV+GP +D T+ ++GNY G      T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440

Query: 445 IGRY---ARTIHQQGCKDVACADD 465
           I      A  ++ +G   V   DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464



 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 144/305 (47%), Gaps = 53/305 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R+AD  + V GL   +E E +          DR  L LP  Q+ L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFY- 746

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++          
Sbjct: 747 KESETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYS---------- 788

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
             D R         G+             L   + VKN G + G   + ++  P  P   
Sbjct: 789 --DLRLDRSKLAADGR-------------LHATLKVKNTGQRAGDEVVQLYLQPLSPQRE 833

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD--RSGTRRIPLGEHNIHIGGTKH 756
            A  K L  F+++ +  G  + V   I     L + D  R G    P G++ + +G +  
Sbjct: 834 RA-SKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDP-GDYELQVGASSS 891

Query: 757 SVSLH 761
            V + 
Sbjct: 892 DVRVR 896


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 191/500 (38%), Positives = 275/500 (55%), Gaps = 25/500 (5%)

Query: 43  CQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTK 102
           C      P+R   L+  L+++EK++ L+  +    RLG+  Y WWSEALHGV+   PG +
Sbjct: 39  CDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVA-ASPGVR 97

Query: 103 F----GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVN 158
           F    GG F  ATSF   IT +++F+  L   +   +S EARA  N G AGL YW+PN+N
Sbjct: 98  FNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAGLDYWTPNIN 157

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
            ++DPRWGRG ETPGEDPV    Y  + + GL+G D    KV A+CKH+ AYDL+ W G 
Sbjct: 158 PYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEGDDPSIRKVVATCKHYAAYDLERWQGT 217

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R  F+A VS QD+ + +  PF+ C  + KV S MCSYN +NG P CA   ++   +R  
Sbjct: 218 TRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKH 277

Query: 279 W---RLNGYIVSDCDSVGVYY---DTQHFTSTPEE--AAADAIRAGLDLDCG-PFLGLHT 329
           W     N YI SDC+++  +       +F+ST  E  AAA A +AG D  C  P    +T
Sbjct: 278 WGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVCEVPGWPPYT 337

Query: 330 E--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
           +   A  + LLSE  I+ AL       +R+G FD   S  PY  +G +DV TP+ QELAL
Sbjct: 338 DVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFD-PASGSPYRSIGWEDVNTPEAQELAL 396

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIG--PNSDVTVTMIGNYAGIACGYTTPLQGI 445
           ++   G+VLLKN G +LPL ++  +TVA+IG   NS     ++G Y+G      +P+   
Sbjct: 397 QSGTDGLVLLKNDG-TLPL-NLEDKTVALIGFWANSTNGGRILGGYSGFPPYIHSPVDAA 454

Query: 446 GRYARTIHQQG---CKDVA-CADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
            +   T H       +++   A D     A++ +++++  +   G D SI AE LDR  +
Sbjct: 455 EKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTSIAAEDLDRDSI 514

Query: 502 LLPGRQQELVSKVSMASKGP 521
             P  Q  ++  +S   + P
Sbjct: 515 AWPEIQLAVIEALSALRQAP 534


>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 878

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 172/421 (40%), Positives = 245/421 (58%), Gaps = 29/421 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           + +   PF    LP  +RVNDLI RL++ EK+  L+  + A+ RLGI  Y WW+E+LHGV
Sbjct: 19  SQSEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT------- 147
           +  G           AT FPQ IT A+S++  L   +  V+SDEARA ++          
Sbjct: 79  ARAG----------YATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDI 128

Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
             GLT+WSPN+NIFRDPRWGRG ET GEDP L+G     YV+GLQG++   LKV A+ KH
Sbjct: 129 YQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQGNNAKYLKVVATAKH 188

Query: 207 FTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           F  +     +G +  R  F+   S++D+ +T+   FR  V +G V S+M +YN++ G   
Sbjct: 189 FAVH-----SGPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAYNRIYGEAA 243

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
            A  N L   +R +W  NGY+VSDC ++   + T H      EA+A A++ G DL+CG  
Sbjct: 244 SAS-NSLYSILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAEASAMAVKEGCDLNCGNS 302

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
               T+ A+Q GL++E D++ AL   +  + +LGMFD +    PY  +       P H+ 
Sbjct: 303 YEKLTD-ALQDGLITEADLDVALHRLMRARFKLGMFDSD-EKVPYAKIPFSVNNNPKHKV 360

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LAL+AA++ IVLLKN+   LPLS    + +AVIGPN+D   ++ GNY G+     T L+G
Sbjct: 361 LALKAAQKSIVLLKNENAILPLSK-NLKNIAVIGPNADNIQSLWGNYNGMPKNPVTVLEG 419

Query: 445 I 445
           I
Sbjct: 420 I 420



 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 157/296 (53%), Gaps = 50/296 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+ A+ ++D  +L +GL++ +E E +          DR  L LP +Q EL+ +V +A+  
Sbjct: 592 AVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEV-VATGK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P +LVL++G  + + +A  +  I AII AGYPGQ GG AIA++LFG  NP G+LP+T+Y 
Sbjct: 651 PVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPAGRLPVTYYK 708

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              + +LP       P +     GRTY+++K   +YPFG+G+SYT F +      + + +
Sbjct: 709 S--VDDLP-------PFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKY------SNLEI 753

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW- 699
           PL+              IK+         + V V V N G  DG   + ++     G   
Sbjct: 754 PLE--------------IKINEP------IKVSVQVANEGDFDGDEVVQLYVRDEEGSTP 793

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            P  +LV F+++H+  GA+Q+V   I   + L+++++     I  G  +I +GG++
Sbjct: 794 RPICELVGFKRIHLKKGARQKVEFTIQ-PRELAMINKDDKFVIEPGWFSISVGGSQ 848


>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
           SP1PR4]
          Length = 885

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 185/466 (39%), Positives = 265/466 (56%), Gaps = 46/466 (9%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
            L L   +   SGLA ++P   DP             +L  P R  DL+ R++L+EK   
Sbjct: 11  LLSLTFAAFGQSGLA-QKPAYLDP-------------TLSPPARARDLVHRMTLEEKTAQ 56

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
           +I+ A A+ RLG+  Y++WSE LHGV+  G           AT FPQ I  A++++  L 
Sbjct: 57  MINTAPAIDRLGVPAYDFWSEGLHGVARSG----------YATLFPQAIGMAATWDEPLM 106

Query: 129 EAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
             IG VVS EARA YN            GLT WSPN+NIFRDPRWGRGQET GEDP L+ 
Sbjct: 107 HEIGTVVSTEARAKYNDAVQHGVHSIYFGLTIWSPNINIFRDPRWGRGQETYGEDPFLTA 166

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
           +   ++VRG+QG D +  +  A+ KHF  +         R  FN  VS+ D+ DT+   F
Sbjct: 167 RMGTAFVRGIQGDDPNYFRTIATPKHFAVHSGPE---STRHTFNVDVSQHDLWDTYLPAF 223

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT-- 298
           R  ++EGK  S+MC+YN+++G P CA   +LK+ +RG+W   G++ SDC ++  +Y    
Sbjct: 224 RSTIIEGKADSIMCAYNRIDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIG 283

Query: 299 QHFTSTPEEAAADAIRAGLDLDCGP-FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRL 357
            HF+   E+A+A  ++AG D  CG  +LGL   SAV+ GL++E +++ +L      ++RL
Sbjct: 284 HHFSKEKEDASAAGVKAGTDTACGKTYLGL--TSAVKSGLITEHEMDISLERLFEARIRL 341

Query: 358 GMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           G+FD +P+  PY  L   +V +P H+ LAL AAR+ IVLLKN    LPL  +++  +AVI
Sbjct: 342 GLFD-DPARMPYARLTMAEVNSPAHRALALRAARESIVLLKNANNLLPLHGVKN--IAVI 398

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCKDV 460
           GPN+     + GNY  IA     P+ GI      A+ ++ QG   V
Sbjct: 399 GPNAASLDALEGNYNAIARDPAMPVDGIAAAFPGAKVVYAQGAPYV 444



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 135/295 (45%), Gaps = 50/295 (16%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           D  +  +GL   +E E +          DR  + LP  Q EL+  V    K P I+VLM+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMN 678

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           G     A A  D    A++ A YPG+AG  AIA+ L G +NP G+LP+T+Y    I  LP
Sbjct: 679 GS----AIALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN--IDQLP 732

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
             +             RTYR++KG  +Y FG G+SYT F                 R+G 
Sbjct: 733 AFD-------DYSMANRTYRYFKGQPLYAFGGGLSYTTF-----------------RYGK 768

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
           +       ++  TH       L V+ +V N G   G     V+ TPP    AP   LV +
Sbjct: 769 V-------SLSATHLHAGE-DLTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPRFALVGY 820

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
           ++VH+  G  + +   +H  + LS VD  G R    G + I +GG+ +  +L AA
Sbjct: 821 QRVHLLPGQSKPMRFTLH-PRELSQVDAQGVRAASAGHYEIKVGGSSNVTTLSAA 874


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 181/453 (39%), Positives = 252/453 (55%), Gaps = 29/453 (6%)

Query: 35  ATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           A  +T P +   +    +R  DL+ R++L EKV  + + A A+PRLG+  Y+WW+EALHG
Sbjct: 35  AAKQTQPLYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHG 94

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------G 145
           V+  G           AT FPQ I  A++F+  L       +SDEARA YN        G
Sbjct: 95  VARAGE----------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRG 144

Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCK 205
              GLT+WSPN NIFRDPRWGRGQET GEDP L+ +   ++VRGL+G D    K+ A+ K
Sbjct: 145 RYEGLTFWSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQKLDATAK 204

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           HF  +   +    +R  F+   S++D+ +T+   F+  V +G V +VM +YN+V+GVP  
Sbjct: 205 HFAVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVPAT 261

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
           A   +L+  +R +W   GY+VSDCD+V   Y       T E+AAA A+  G DL+CG   
Sbjct: 262 ASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGTTY 321

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
               + AV  GL++E  I+ A+   +  + RLGMFD  P   P+  L    V +P H  L
Sbjct: 322 ATLVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGRVPWSTLPMSVVQSPQHDAL 379

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL  A++ +VLLKN G  LPLSH   R +AVIGP +D    ++GNY G      T LQGI
Sbjct: 380 ALRTAQESMVLLKNDG-LLPLSH-NVRRIAVIGPTADNVTALLGNYHGTPKAPVTILQGI 437

Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAIDAS 475
                 A+  + QG + V   D    G+ I A+
Sbjct: 438 REAVPNAQVTYVQGTELVQGFDGPDVGSVIGAA 470



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 145/307 (47%), Gaps = 53/307 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F AA+DA+R AD  I   GL   +E E +          DR  L LP  Q++L+  + + 
Sbjct: 623 FEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLLQALQVT 682

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P +LVL +G  + + +AK    + AI+ A YPGQ GG A+AD LFG  +P G+LP+T
Sbjct: 683 GK-PVVLVLTTGSALAIDWAKQ--HLPAILLAWYPGQDGGHAVADALFGNVDPAGRLPVT 739

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y        P  + AM+        GRTYR++ G  ++PFG G+SYT F ++       
Sbjct: 740 FYKSARQLP-PFDDYAMK--------GRTYRYFTGQPLFPFGFGLSYTRFAYS------- 783

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PA 696
             + LD               + T    +R+ + ++  VKN G + G   + ++  P  A
Sbjct: 784 -DLQLD---------------RDTLGPSDRMRISLR--VKNTGQRAGDEVVQLYLRPLRA 825

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHV---CKYLSVVDRSGTRRIPLGEHNIHIGG 753
            H    K L  F+++ +  G ++ V  +I      KY  V   +    +  G + + +G 
Sbjct: 826 PHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYA--VAPGRYQVQVGA 883

Query: 754 TKHSVSL 760
           +   + L
Sbjct: 884 SSADIRL 890


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 241/767 (31%), Positives = 372/767 (48%), Gaps = 120/767 (15%)

Query: 30  CDPKDATTRTLPFCQVS--------------LPIPQRVNDLIGRLSLQEKVKLLIS--GA 73
            DP++A  R++    VS              +PI +R++DL+ RL+L+EKV  L    G+
Sbjct: 46  MDPEEAYNRSMANAGVSQQVSKETPKYLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGS 105

Query: 74  AAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGR 133
             + RL I      +E LHG S             G+T FP  I   S+F+  L + +G+
Sbjct: 106 KGIARLKIPAM-LKTEGLHGQSYA----------TGSTIFPHGINMGSTFDTELIQEVGK 154

Query: 134 VVSDEARAMYNGGTAGL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
             + EA+A      A L   WSP +++ RD RWGR +ET GEDP L G+   ++++G QG
Sbjct: 155 ATAIEAKA------ANLRVSWSPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQG 208

Query: 193 SDGDRLKVAASCKHFTAY-------DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVM 245
                  + A  KHF  +       D  ++   DR   N  ++          PFR  + 
Sbjct: 209 E-----HMFACPKHFAGHGQPVGGRDSHDYGLSDRVMRNIHLA----------PFRDVIK 253

Query: 246 EGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP 305
           E     VM +Y   NGVP      +L++ +R EW   G++VSDC         Q    T 
Sbjct: 254 EANAFGVMAAYGLWNGVPDNGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTM 313

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
           EEAAA A+RAG+D++CG        SAV++G++ E +++  L      +MRLG+FD  PS
Sbjct: 314 EEAAAMAVRAGVDIECGSAYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPS 372

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
            +        +  TP+H+ LA + A +  VLLKN+   LPL     +T+AVIGPN+D   
Sbjct: 373 IENMVWNKLPEYDTPEHRALARKVAVKSTVLLKNENNLLPLDK-NIKTIAVIGPNADQGQ 431

Query: 426 TMIGNY-AGIACG-YTTPLQGIGRY----ARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           T  G+Y A  A G   + L+G+  +     + ++ QGC  +   D   F  A++ ++QAD
Sbjct: 432 T--GDYSAKYAPGQIISVLEGVKNHVSPSTKVLYAQGCTQLDM-DTTGFAEAVNIAKQAD 488

Query: 480 ATILVMGLD---------QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
           A ILV+G +         +S   E +D A L +PG Q++L+  V    K P +LVL++G 
Sbjct: 489 AVILVVGDNSNRHENGNKKSTTGENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVNGK 547

Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
           P  + +   D  I +I+   YPG+ GG A ADI+FG  NP G+LP++ +P+ +   LP+ 
Sbjct: 548 PFTLTW--EDENIESILETWYPGEEGGNATADIIFGDENPSGRLPIS-FPR-HPGQLPLW 603

Query: 591 EMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
                   +    GR Y +Y  P   +Y FGHG+SYT F                 R+ +
Sbjct: 604 -------YNYETSGRNYDYYDMPFTPLYRFGHGLSYTTF-----------------RYSN 639

Query: 649 INATI-SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP-PAGHWAPHKQLV 706
           + AT  SG    VT          V VD++N G + G     ++ T   A        L 
Sbjct: 640 LKATTKSGDPGFVT----------VSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLK 689

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            F++V +  G ++ V   ++    LS+++    R +  G+  +H+GG
Sbjct: 690 GFKRVFLKPGEKKTVTFELNPY-LLSLLNPDMKRVLEAGKFRMHVGG 735


>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
          Length = 864

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 175/445 (39%), Positives = 252/445 (56%), Gaps = 30/445 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F    LP+ +RVNDL+G L+L+EK+  +++ A A+ RLGI  Y WW+E LHGV+      
Sbjct: 26  FQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR----- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                +P  TSFPQ I  A++++      +    SDE RA+Y+        G   GLTYW
Sbjct: 81  ---SPYP-VTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGLTYW 136

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPN+NIFRDPRWGRGQET GEDP L+     ++V+GLQG D   LK +A  KH+  +   
Sbjct: 137 SPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQGDDPVYLKSSACAKHYAVHSGP 196

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
            WN   R  +NA+VS  D+ DT+   FR  V++ KV  VMC+YN     P C +  ++  
Sbjct: 197 EWN---RHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPCCGNDLLMMD 253

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R +W+ +GY+ SDC ++  +Y+T +      EA+ADA+  G D +CG         A+
Sbjct: 254 ILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNGAYRALADAI 313

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQ 392
            RGL++E  ++ +L     ++ RLGMFD +    PY  + P  V   D H+  AL+ ARQ
Sbjct: 314 VRGLITEEQVDVSLKKLFEIRFRLGMFDPD-DRVPYSDI-PISVLECDAHKAHALKMARQ 371

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRY 448
            IVLLKN+   LPL   + + +AV+GPN+D    ++ NY G     TT L+GI    G  
Sbjct: 372 SIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVTTVLEGIKGKVGDQ 431

Query: 449 ARTIHQQGCKDVACADDQLFGAAID 473
              I+++G   V   DD +F +A D
Sbjct: 432 VEVIYEKG---VNLTDDFIFTSAYD 453



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 140/293 (47%), Gaps = 52/293 (17%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  + V GL   +E E +          DR  + +P  QQ L+ ++    K P I +
Sbjct: 597 KDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELYATGK-PVIFI 655

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
           LM+G  + + +      + AI+ A Y GQAGG AIAD+LFG  NP G+LP+T+Y    + 
Sbjct: 656 LMTGSAVGLEWESE--HLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKN--VN 711

Query: 586 NLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
           +LP   + +M+         RTYR++ G  VYPFG+G+SYT+F +               
Sbjct: 712 DLPDFEDYSMK--------NRTYRYFTGIPVYPFGYGLSYTDFQYN-------------- 749

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQ 704
                        IKV     ++L++ V  +V NVG  +G   + ++ + P     P + 
Sbjct: 750 ------------TIKV-QPSLDKLSVKVTAEVSNVGKYEGEEVVQLYVSNPRDFVTPIRA 796

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
           L  F ++++  G  Q V   +   K LSVVD +G      GE  I +GG + S
Sbjct: 797 LKGFRRINLKPGESQMVEF-VLTSKELSVVDVAGNFVPMKGEVQISLGGGQPS 848


>gi|300785890|ref|YP_003766181.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|384149201|ref|YP_005532017.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|399537773|ref|YP_006550435.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|299795404|gb|ADJ45779.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|340527355|gb|AEK42560.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|398318543|gb|AFO77490.1| beta-glucosidase [Amycolatopsis mediterranei S699]
          Length = 1218

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 238/785 (30%), Positives = 356/785 (45%), Gaps = 122/785 (15%)

Query: 51  QRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
           +R  DL+ R++L EKV +L  + A A+PRLG++ Y +WSE  HG++ +G  T  G    G
Sbjct: 54  ERAADLVARMTLPEKVLQLRTNSAPAIPRLGVQQYTYWSEGQHGLNTLGANTNDGTVTGG 113

Query: 110 --ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LTY 152
             ATSFP  + +  S++  L +     +SDEAR M +    G               LTY
Sbjct: 114 VHATSFPTNLASTMSWDPELIQQETTAISDEARGMLDKSLWGVAQNNIGPDKNNYGSLTY 173

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD------RLKVAASCKH 206
           W+P VN+ RDPRWGR  E  GEDP L  K A ++V G QG           LKVAA+ KH
Sbjct: 174 WAPTVNLDRDPRWGRTDEGFGEDPYLVAKMAGAFVNGYQGQTASGRPATPYLKVAATAKH 233

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +   +++N    DR   ++  ++ ++ D +   FR  + +  V+ +M SYN +NG P+ +
Sbjct: 234 YALNNVEN----DRHADSSDTTEANLRDYYTKQFRNLIQDAHVSGLMTSYNAINGTPSPS 289

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFTSTPEEAAAD-------------- 311
           D        +  +  +GY  SDC +VG VY    H  + P    A               
Sbjct: 290 DTYTANAIAQRTYGFDGYTTSDCGAVGDVYAPGSHNWAPPGWTTATSNGGTQWTNTATGQ 349

Query: 312 -----------AIRAGLDLDCGPFLGL--HTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
                      A+RAG  L+C        + + A++ G+LSE  ++NALV+  T +M+ G
Sbjct: 350 QVAGAAGGQAYALRAGTQLNCTGTEATVANIQEAIKAGVLSEGVLDNALVHVFTTRMQTG 409

Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ------GPSLPLSHIRHR 412
            FD  P    Y  +    + +P+HQ LA + A   +VLLKN        P LP    +  
Sbjct: 410 EFD-PPDRVAYTKITKDVIQSPEHQALAAKVAAHSLVLLKNDPVPGTAAPLLPADPAKLG 468

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR--------TIHQQGCKDVACAD 464
           TV V+G  +      +G Y+G        +QGI    +        T    G        
Sbjct: 469 TVVVVGDLAGKVT--LGGYSGEPALQVNAVQGITSAVKAANPAATVTFDACGTSTATTTA 526

Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTIL 524
                  + A + AD  ++  G D ++  E  DR  + +PG    L+ +V  A    T L
Sbjct: 527 ASCSAETLAALKTADLVVVFAGTDGNVATEGRDRTTIAMPGNYDSLIDQVKAAGNPRTAL 586

Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
            + +GG + +  A   P    I+++GY G++ GTA+AD+LFG  NP G L  TWY  +  
Sbjct: 587 AVQAGGAVSLGHAAGIP---GIVFSGYNGESQGTALADVLFGKQNPSGHLNFTWYADD-- 641

Query: 585 TNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           + LP +    + PSQ+    GRTY+++ G   YPFG+G+SYT F ++  +A T  A    
Sbjct: 642 SQLPAIKNYGLTPSQTGGL-GRTYQYFTGTPAYPFGYGLSYTKFAYSRVHADTWAA---- 696

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGA-----HTLLVFSTPPAGH 698
                 N  ++                 V VDV N GS  GA     +    F  P  G 
Sbjct: 697 ----DANGQVT-----------------VHVDVTNTGSTPGATVAQLYAATAFGVP--GV 733

Query: 699 WAPHKQLVAFEKVHVPA-GAQQRVGINIHV--------CKYLSVVDRSGTRRIPLGEHNI 749
             P ++L  F K  V A G  Q + I + +         K   VV   G  R  +G  + 
Sbjct: 734 ELPRQRLAGFAKTDVLAPGRTQHLAIPVRIGDLAFWDEGKRREVV-HPGAYRFGVGADSA 792

Query: 750 HIGGT 754
           H+ GT
Sbjct: 793 HLAGT 797


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  306 bits (783), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 173/407 (42%), Positives = 237/407 (58%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N +L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D D  +  A+ KH   +     +G +  
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHIAVH-----SGPEPG 200

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 260

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A++RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRDLGT--AIERGDV 318

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 378 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422



 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 94/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP      
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
               S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 777 GNPLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 173/407 (42%), Positives = 237/407 (58%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N +L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D D  +  A+ KH   +     +G +  
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHIAVH-----SGPEPG 200

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 260

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A++RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRDLGT--AIERGDV 318

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 378 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422



 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 93/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTA+A +L G  NPGG+LP+T+Y      +LP      
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAMARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
               S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 777 GNPLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862


>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 747

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 229/716 (31%), Positives = 364/716 (50%), Gaps = 64/716 (8%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           LPF    LP  QR++DLIGR++L+EK+  + +  AAVPRLG+KG     E  HGV+  GP
Sbjct: 32  LPFQDPELPAEQRIDDLIGRMTLEEKIDCM-AMRAAVPRLGVKGSRH-IEGYHGVAQGGP 89

Query: 100 GTKFGGDFPGATS-FPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAGLTYWSP 155
            + +G   P AT+ FPQ     ++++  L   +    ++EAR ++       AGL   +P
Sbjct: 90  -SNWGRRNPTATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSPRYDRAGLIVRAP 148

Query: 156 NVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNW 215
           N ++ RDPRWGR +E  GEDP  +G  A ++VRGLQG D    K  +  KHF A    N 
Sbjct: 149 NADLARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQGDDPRYFKAVSLVKHFLA----NS 204

Query: 216 NGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTI 275
           N   R   ++  S++   + +  PF M +++G   ++M +YN VNG P    P +L+  +
Sbjct: 205 NEDGRESSSSNFSERQWREYYAKPFEMAIVDGGAPALMAAYNAVNGTPAHVHP-MLRDIV 263

Query: 276 RGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES---A 332
             EW+LNG + +D   + +  +  H       AAA  ++AG++     FL  H ++   A
Sbjct: 264 MAEWKLNGILCTDGGGLRLLVEKHHAFPDLPSAAAACVKAGINH----FLDRHKDAVTEA 319

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKDVCTPDHQELALE 388
           V RG ++E D++ AL     V ++LG+ D +    PY  +G     +    PD Q L  +
Sbjct: 320 VARGSITERDLDAALRGLFRVSLKLGLLDPD-ERVPYAAIGRNGEAEPWLRPDTQALVRK 378

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
             ++ IVLLKN G  LPL   + +TVA++GP   +  T++ ++ G    YT P   IG  
Sbjct: 379 VTQRSIVLLKNSGALLPLDRTKVKTVALVGP---LVNTVLPDWYGGTPPYTVP-PSIG-- 432

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD------------QSIEAEAL 496
              +  +G K    AD  +  AA++ +R ++  I+ +G D             S   EA+
Sbjct: 433 VEKVAGEGVKVGWLAD--MGDAAVELARTSEIAIVCVGNDPISAGGWELVRTPSEGKEAV 490

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  L LP  Q++ + +V +A+   TI+VL+S  P  + +      + AI+   +  Q  
Sbjct: 491 DRKDLALPRDQEKFIRRV-LAANPRTIVVLISNFPYAMPWVVK--HVPAIVHLTHASQEL 547

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+ D+L+G  NP GKL  TW P+      PM +  +         GRTY+++KG   +
Sbjct: 548 GHALGDVLWGEVNPDGKLAQTW-PKSLKQLPPMMDYDLT-------HGRTYQYFKGEPQF 599

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           PFG G+SYT F   ++N    + V LD  RH    A    ++        N + L + V+
Sbjct: 600 PFGFGLSYTTF--NLSN----LRVGLDVARHVGAGAETPAESPAPRTFAPNAI-LSIAVE 652

Query: 676 VKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           V N G++ G   + V++  P    + P KQL  F+++ V AG    V + +   ++
Sbjct: 653 VTNTGTRAGDEVVQVYARYPHSKVSRPLKQLCGFQRISVAAGETAHVRLQLPASRF 708


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  305 bits (781), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 176/451 (39%), Positives = 249/451 (55%), Gaps = 29/451 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T   P+    LP   R  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV
Sbjct: 33  AATPPPPYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGV 92

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
           +  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   A      
Sbjct: 93  ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKR 142

Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
             GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQ   G   K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +  +   +    DR HF+   S++D+ +T+   F+  V EG VA+VM +YN+VNG    A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
               L+  +R +W  +GYIVSDC ++   +       TPE AAA  ++ G DLDCG    
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
                AV+ GL+ E  I+ +L   +T +MRLGMFD  P+   +  +      +P H  LA
Sbjct: 319 A-LPKAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHDALA 376

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
              AR+ +VLLKN G  LPL     R +AV+GP +D  ++++GNY G      T LQGI 
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434

Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
                A  ++ +G   V   +D    A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  125 bits (314), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 89/302 (29%), Positives = 144/302 (47%), Gaps = 53/302 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R A+  + V GL   +E E +D          R    LP  Q+EL+  +  A+  
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + V +A+    + AI+ A YPGQ GG+A+ D+LFG ++PGG+LP+T+Y 
Sbjct: 682 PVVAVLTTGSALAVDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           +     LP   + AMR        GRTYR++ G  +YPFGHG+SYT F ++         
Sbjct: 740 E--AERLPAFDDYAMR--------GRTYRYFTGTALYPFGHGLSYTQFAYSDLRLDRTTL 789

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
               G  G++ AT+                      V+N G + G   + ++  P  P  
Sbjct: 790 ----GADGTLRATLK---------------------VRNTGKRAGDEVVQLYLHPLDPKR 824

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
             A  K+L  F+++ +  G Q+ V   +     L + D +  T  +  G + + IG +  
Sbjct: 825 ERA-GKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKTYAVDPGAYEVQIGASSS 883

Query: 757 SV 758
            +
Sbjct: 884 DI 885


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  305 bits (781), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 176/430 (40%), Positives = 245/430 (56%), Gaps = 30/430 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   +LP  +R  DL+ +++L+EKV  +   A A+ RLGI  Y WW+E LHGV   G   
Sbjct: 40  YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FPQ I  A+ ++      I   VSDEARA ++        G   GLT+W
Sbjct: 98  --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRG ET GEDP L+G+ A  Y++GLQG D   LK+ A+ KHF  +   
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDDDRYLKLVATSKHFLVHSGP 209

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
                DR HF+A+ S +D   T+   F+  + E  V SVMC+YN+ NG+P C    + + 
Sbjct: 210 E---PDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPCCGSKPV-EN 265

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTES 331
            +R EW   GYIVSDC +V  +Y   H    P  EEAAA A++AG DL+CG       + 
Sbjct: 266 LLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCGNSYPALVD- 324

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           AV++GL+SE +I+  +   +  ++RLGMFD  P   PY ++    V + +H+ELAL AAR
Sbjct: 325 AVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSVVDSKEHRELALIAAR 383

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY--- 448
           + +VLLKN   +LPL     + VAVIGPN++    ++ NY G      TPL GI +    
Sbjct: 384 KSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPSNPVTPLDGIRQKLPN 442

Query: 449 ARTIHQQGCK 458
           A   +  GC+
Sbjct: 443 ANVQYALGCR 452



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 147/295 (49%), Gaps = 49/295 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           AI  +  +D  ++ MGL  ++E E +          DR  + LP  Q +LV K  M+   
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLV-KAIMSLGK 668

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P +LVL++G  + + +   +  + AI+ A YPGQAGGTAIAD+LFG  NP G+LP+T+Y 
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFY- 725

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
            + +T LP       P +     GRTY+++KG  ++PFG+G+SYT+F +     P  +  
Sbjct: 726 -KSVTQLP-------PFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNLVVPDKLE- 776

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                        +GK +             V VDV N G++DG   + ++ + P    A
Sbjct: 777 -------------AGKEVT------------VHVDVTNTGNRDGDEVVQLYVSHPDVESA 811

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           P + L  F+++ + AG  + V   +   + L+V        +P G   + +GG +
Sbjct: 812 PIRSLQGFDRIALKAGETKTVSFTLK-PEQLAVYQPQNGLVVPAGNLKLSVGGRQ 865


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  305 bits (781), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 176/458 (38%), Positives = 250/458 (54%), Gaps = 36/458 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
             C    A   T P+         R  DL+ R++L+EK   + + A A+PRL +  Y+WW
Sbjct: 20  LGCATGAAQAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-- 145
           +EALHGV+  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 146 ------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
                    GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG  GD  K
Sbjct: 130 RQNQHERYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPK 189

Query: 200 VA---------ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
            A         A+ KHF  +        DR HF+A  S++D+ +T+   F   V +GKV 
Sbjct: 190 NAQGERYRKLDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKDGKVD 246

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           +VM +YN+V G    A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
            A++ G +L+CG        +AV++GL+ E  I+ AL   +T +MRLGMFD  P   P+ 
Sbjct: 307 LAVKHGTELECGEEYST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            +      +P H  LA   AR+ +VLLKN G  LPLS  R + +AVIGP +D T+ ++GN
Sbjct: 365 QIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRARLKRIAVIGPTADDTMALLGN 423

Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
           Y G      T LQGI      A+ ++ +G   V   DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 93/302 (30%), Positives = 148/302 (49%), Gaps = 53/302 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R A+  + V GL   +E E +          DR  L LP  Q++L+  +    K 
Sbjct: 628 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK- 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++ +    T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
                  GS+ AT++                     VKN G + G   + ++  P  P  
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQR 829

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
             A  K+L  F+++ +  G Q+ +G  I+    L + D +     +  G + + IG +  
Sbjct: 830 ERA-GKELHGFQRIALTPGEQRELGFTINAKDALRLYDEQRKAYVVDPGAYEVQIGASSA 888

Query: 757 SV 758
            +
Sbjct: 889 DI 890


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  305 bits (781), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 169/422 (40%), Positives = 241/422 (57%), Gaps = 33/422 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+ Q      QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG        + G+R  K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G  
Sbjct: 206 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
             A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA A+  G +L+CG 
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
                  +AV +GL+ E  I+ AL   +T +MRLGMFD  P   P+  +      +P H 
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA   AR+ +VLLKN G  LPLS    + +AVIGP +D T+ ++GNY G      T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439

Query: 444 GI 445
           GI
Sbjct: 440 GI 441



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 141/301 (46%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           +E  T     + AM         GRTYR++ G  +YPFGHG+SYT F ++          
Sbjct: 747 KESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
             D R      T  G              L   V VKN G + G   + ++  P  P   
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
            A  K+L  F+++ +  G Q+ +   I+    L + D +     +  G + + IG +   
Sbjct: 834 RA-GKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892

Query: 758 V 758
           +
Sbjct: 893 I 893


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  305 bits (781), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 169/422 (40%), Positives = 241/422 (57%), Gaps = 33/422 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+ Q      QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG        + G+R  K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G  
Sbjct: 206 AKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
             A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA A+  G +L+CG 
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
                  +AV +GL+ E  I+ AL   +T +MRLGMFD  P   P+  +      +P H 
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA   AR+ +VLLKN G  LPLS    + +AVIGP +D T+ ++GNY G      T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439

Query: 444 GI 445
           GI
Sbjct: 440 GI 441



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/301 (31%), Positives = 141/301 (46%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + V +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTAGSALAVDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           +E  T     + AM         GRTYR++ G  +YPFGHG+SYT F ++          
Sbjct: 747 KESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
             D R      T  G              L   V VKN G + G   + ++  P  P   
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
            A  K+L  F+++ +  G Q+ +   I+    L + D +     +  G + + IG +   
Sbjct: 834 RA-GKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892

Query: 758 V 758
           +
Sbjct: 893 I 893


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  305 bits (781), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 175/458 (38%), Positives = 254/458 (55%), Gaps = 36/458 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
             C    A T T P+         R  DL+ R++L+EK   + + A A+PRL +  Y+WW
Sbjct: 20  LGCATGAAQTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
           +EALHGV+  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
                    GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG       
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPK 189

Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
            + G+R  K+ A+ KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV 
Sbjct: 190 NAQGERYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVD 246

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           +VM +YN+V G    A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
            A++ G +L+CG        +AV++GL+ E  I+ AL   +T +MRLGMFD  P   P+ 
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            +      +P H  LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423

Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
           Y G      T LQGI      A+ ++ +G   V   DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 147/301 (48%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +    K 
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTAGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++ +    T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                  GS+ AT++                     VKN G + G   + ++  P A   
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829

Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
               K+L  F+++ +  G Q+ +G  I+    L + D +     +  G + + IG +   
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889

Query: 758 V 758
           +
Sbjct: 890 I 890


>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 896

 Score =  305 bits (780), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 173/417 (41%), Positives = 245/417 (58%), Gaps = 32/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   + PI +RV++L+ +++LQE+   +++ A A+PRLG+  Y WWSE LHG++  G  
Sbjct: 38  PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSG-- 95

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTY 152
                    AT FPQ I  +++F+      +G  VS EARA YN            GLT 
Sbjct: 96  --------YATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTL 147

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NI RDPRWGRGQET GEDP L+G  AA YV GLQG++   LK  A+ KHF+ Y  
Sbjct: 148 WAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGNNPKYLKTVATPKHFSVY-- 205

Query: 213 DNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              NG +  R   NA  S  D++DT+   FRM + +G   S+MCSYN V GVP+CA+  +
Sbjct: 206 ---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCAN-KL 261

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L   +RG+W  +GYI SDC ++  +Y       +P+   AAA A+ AG D DCG    + 
Sbjct: 262 LADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTGYKVL 321

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            +S VQ+GL+S+  I+ A+    T + RLGMFD + +  PY  +    V +  H+  ALE
Sbjct: 322 PQS-VQQGLISKAAIDRAVERLFTARFRLGMFDPK-ADVPYNSIPYSVVDSAAHRAQALE 379

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLLKN+G  LPL +   RT+AV+GPN+    ++ GNY  I    + P+ GI
Sbjct: 380 DASKSMVLLKNEGGILPLRNA--RTIAVVGPNAANLNSIEGNYNAIPSHPSLPVDGI 434



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 147/282 (52%), Gaps = 50/282 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           MGL  +  +E +          DR  L LP  QQ+L+  + +A+  P +LVL++G  + +
Sbjct: 633 MGLSPTFVSEEMPIKIPGFDGGDRTRLSLPQTQQDLLHAL-VATGKPVVLVLLNGSALSI 691

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMA 593
            +AK    +  I+ A YPG+AGG AI + L G ++PGGKLP+T+Y    + +L P T+ +
Sbjct: 692 DWAKQ--HVQGILEAWYPGEAGGEAIGETLSGQNDPGGKLPITFYTS--VKDLPPFTDYS 747

Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
           M+        GRTYR+Y G  ++PFG+G+SYT F ++                   +  +
Sbjct: 748 MK--------GRTYRYYTGKPLFPFGYGLSYTTFEYS-------------------HVRL 780

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
           S   +K          L V+ +VKN G   G     V+ TPP     P K+L  F++VH+
Sbjct: 781 STSNLKAGEP------LTVEAEVKNTGHVAGDAVTEVYVTPPQNGVNPLKELKGFDRVHL 834

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
             G  +++   ++  + LS+VD +G R +  G ++I +GG++
Sbjct: 835 APGQSRQLTFTLN-PRDLSLVDEAGKRSVQPGVYSIFVGGSQ 875


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 175/425 (41%), Positives = 243/425 (57%), Gaps = 38/425 (8%)

Query: 39  TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           TLP   ++ P        QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LH
Sbjct: 18  TLPCLALAAPTERAATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
           G++  G           AT FPQ I  A+S+N  L + +G VVS EARA +N  GG    
Sbjct: 78  GIARNG----------HATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKD 127

Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
               AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+
Sbjct: 128 HKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187

Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
            KH   +     +G +  R  F+  VS  D+E T+   FR  +++G+  SVMC+YN ++G
Sbjct: 188 PKHLAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHG 242

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            P CA   +L   +RG+W   G++VSDCD+V       +F      ++A A++AG DL+C
Sbjct: 243 TPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302

Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           G  +  L T  A++RG + E  ++ +LV     + RLG  +  P   PY  LG KDV   
Sbjct: 303 GHAYRELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H+ LAL+AA + IVLLKN   +LPL+      +AVIGPN+D    +  NY G +    T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417

Query: 441 PLQGI 445
           PL G+
Sbjct: 418 PLLGL 422



 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/287 (34%), Positives = 147/287 (51%), Gaps = 48/287 (16%)

Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           Q+DA +  +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 672

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           MSG  + + +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKD 728

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP          S    GRTYR++KG  ++PFG+G+SYT F +   +AP + +  L    
Sbjct: 729 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSSTTLQ--- 775

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
                  +G  ++VT              V+N G+  G     V+   P    +P + LV
Sbjct: 776 -------AGNPLQVT------------TTVRNTGTHAGDEVAQVYLQYPDRPQSPLRSLV 816

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            F++VH+ AG Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 817 GFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 167/426 (39%), Positives = 247/426 (57%), Gaps = 31/426 (7%)

Query: 30  CDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           C   + +T+T P +   S  I +RVNDL+ RL+L+EKV  L   + A+ RL +  Y WW+
Sbjct: 21  CAKDEQSTQTPPLYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWN 80

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT- 147
           EALHGV+  G           AT FPQ I  A++F+  L   +G  +SDE RA ++    
Sbjct: 81  EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLE 130

Query: 148 -------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
                   GLTYWSPN+NIFRDPRWGRGQET GEDP L+ + A +++ GLQG + + LK 
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNAEYLKS 190

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            A+ KH+  +   +   V R   +   S++D+ +T+   F+  + + KVASVMC+YN VN
Sbjct: 191 VATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVN 247

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD--TQHFTSTPEEAAADAIRAGLD 318
           G P C +  +++  +R E+  +GYIVSDC ++  +YD  + +  +T  +AAA A++ G D
Sbjct: 248 GTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTD 307

Query: 319 LDCGPFLG---LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           L+CG   G    +   AV+ GL+ E D++ AL   +  + +LGMFD  P + PY      
Sbjct: 308 LNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSID 366

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
            V +  H  L  EAA++ +VLLKN+   LPL    +  +A+IGPN+D    ++GNY G+ 
Sbjct: 367 VVGSNKHLALTQEAAQKSLVLLKNE-QVLPLK--GNEKIALIGPNADNEAILLGNYNGMP 423

Query: 436 CGYTTP 441
               TP
Sbjct: 424 IVPITP 429


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 24  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 73

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 74  TVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 134 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 188

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 189 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 248

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 306

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 366 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 669

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 670 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 720

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++ FG+G+SYT F +   +AP +    L           +G
Sbjct: 721 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 765

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
            +++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 766 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 813

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 814 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 850


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 169/407 (41%), Positives = 234/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+  +S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N            AGLT WSPN+NIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++ GLQG D D  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDLDHPRTIATPKHLAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  +VMC+YN ++G P CA   ++   +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL+      +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 381 NNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425



 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++PFG+G+SYT F +   +AP +          S  A  +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL----------SSTAVQAG 780

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 168/446 (37%), Positives = 252/446 (56%), Gaps = 32/446 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F C          PF   +LP+ QR+ +L+G L++ EK+ +++  + AVPRL I  Y WW
Sbjct: 11  FICAFSFGQNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWW 70

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
           +EALHGV+  G           AT FPQ I  A++++         ++SDEARA YN   
Sbjct: 71  NEALHGVARAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSF 120

Query: 145 ------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL 198
                 G   GLT+W+PN+NIFRDPRWGRGQET GEDP L+     + V+GLQG+D    
Sbjct: 121 DEASKTGRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKYF 180

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQ 258
           K  A  KHF  +    WN   R  +NA+VSK+D+ +T+   F+  V+EG V  VMC+YN 
Sbjct: 181 KTHACAKHFAVHSGPEWN---RHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYNA 237

Query: 259 VNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAG 316
            +G P CA   +L   +RG+W+ +G +VSDC ++  +Y  ++  + P+E   AADA++  
Sbjct: 238 FDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAADALKHS 297

Query: 317 LDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD 376
            DL+CG     +   ++  GL++E DI+ ++   L     LGM D  P S    +  P  
Sbjct: 298 TDLECGDTYN-NLNKSLAGGLITEKDIDISMRRILKGWFELGMLD--PKSSVLWNQIPYS 354

Query: 377 VCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
           V   D H++ AL+ A++ IVL+KN+   LP +    + +AV+GPN+D  +  +GNY G  
Sbjct: 355 VVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNGTP 413

Query: 436 CGYTTPLQGI-GRYART--IHQQGCK 458
               T L+GI  ++  T  I+++G +
Sbjct: 414 SSIVTILEGIKAKFPNTEIIYEKGSE 439



 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 137/311 (44%), Gaps = 39/311 (12%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F +  +  + AD  +   GL  S+E E +          D+  + LP  Q+EL++++   
Sbjct: 592 FASVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKT 651

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P + VL +G  +     +++     ++ A Y GQ+GGTA+AD+L G  NP G+LP+T
Sbjct: 652 GK-PVVFVLCTGSSL--GLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLPVT 708

Query: 578 WYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y   E + N        +  ++    GRTYR+     +Y FGHG+SY+ F +       
Sbjct: 709 FYKNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYG------ 762

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S  +I           + + V V N+  +DG   + V+     
Sbjct: 763 -------------NAKLSKNSISPNE------DIIITVPVTNISDRDGEEVVQVYVKRNN 803

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
              AP K L AFE+V + +   + + + I    +    +++       G++ I  GGT  
Sbjct: 804 DVLAPVKTLRAFERVLIRSKETKNIQLTISKESFKFYDEKADDLISKSGDYTILYGGTSA 863

Query: 757 SVSLHAATLGV 767
           +  L    L V
Sbjct: 864 NSGLKNMQLKV 874


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 175/458 (38%), Positives = 253/458 (55%), Gaps = 36/458 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
             C    A   T P+        QR  DL+ R++L+EK   + + A A+PRL +  Y+WW
Sbjct: 20  LGCATGAAQAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
           +EALHGV+  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
                    GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG       
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
            + G+   K+ A+ KHF  +        DR HF+A+ S++D+ +T+   F   V EGKV 
Sbjct: 190 NAQGEPYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVD 246

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           +VM +YN+V G    A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
            A++ G +L+CG        +AV++GL+ E  I+ AL   +T +MRLGMFD  P   P+ 
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            +      +P H  LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423

Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
           Y G      T LQGI      A+ ++ +G   V   DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 148/301 (49%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +  A+  
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++ +    T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                  GS+ AT++                     VKN G + G   + ++  P A   
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829

Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
               K+L  F+++ +  G Q+ +G  I+    L + D +     +  G + + IG +   
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889

Query: 758 V 758
           +
Sbjct: 890 I 890


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 169/407 (41%), Positives = 234/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+  +S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N            AGLT WSPN+NIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++ GLQG D D  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHLAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  +VMC+YN ++G P CA   ++   +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL+      +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 381 NNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425



 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++PFG+G+SYT F +   +AP +          S  A  +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL----------SSTAVQAG 780

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 169/407 (41%), Positives = 233/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+  +S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N            AGLT WSPN+NIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++ GLQG D D  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHLAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  +VMC+YN ++G P CA   ++   +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 381 NNANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425



 Score =  135 bits (340), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 95/278 (34%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++PFG+G+SYT F +   +AP +          S  A  +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTCFAY---DAPQL----------SSTAVQAG 780

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 781 STLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VD SG R +  G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDPSGQRAVEAGNYTLFVGG 865


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 175/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)

Query: 39  TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           TLP   ++ P        QR   L+  +S +EKV   ++ A A+PRLGI  YEWWSE LH
Sbjct: 18  TLPCLALAAPTERAATPEQRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
           G++  G           AT FPQ I  A+S+N  L + +G VVS EARA +N  GG    
Sbjct: 78  GIARNG----------HATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKD 127

Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
               AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+
Sbjct: 128 HKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187

Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
            KH   +     +G +  R  F+  VS  D+E T+   FR  +++G+  SVMC+YN ++G
Sbjct: 188 PKHLAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHG 242

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            P CA   +L   +RG+W   G++VSDCD+V       +F      ++A A++AG DL+C
Sbjct: 243 TPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302

Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           G  +  L T  A++RG + E  ++ +LV     + RLG  +  P   PY  LG KDV   
Sbjct: 303 GHAYRELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H+ LAL+AA + IVLLKN   +LPL+      +AVIGPN+D    +  NY G +    T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417

Query: 441 PLQGI 445
           PL G+
Sbjct: 418 PLLGL 422



 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 99/287 (34%), Positives = 148/287 (51%), Gaps = 48/287 (16%)

Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           Q+DA +  +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 672

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           MSG  + + +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKD 728

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP          S    GRTYR++KG  ++PFG+G+SYT F +   +AP +         
Sbjct: 729 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQL--------- 769

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S  A  +G  ++VT              V+N G++ G     V+   P    +P + LV
Sbjct: 770 -STTALQAGNPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLV 816

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            F++VH+ AG Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 817 GFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862


>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
 gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
          Length = 902

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 170/414 (41%), Positives = 238/414 (57%), Gaps = 29/414 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  +S     R  DL+  ++L+EK   + + A A+PRLG+  Y+WW+E LHGV+  G   
Sbjct: 47  YRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ-- 104

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FPQ I  A++F+  L   +   +SDEARA YN        G   GLTYW
Sbjct: 105 --------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRYEGLTYW 156

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPN+NIFRDPRWGRGQET GEDP L+ +   ++V GLQG +    K+ A+ KHF  +   
Sbjct: 157 SPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQGDNPTYRKLDATAKHFAVH--- 213

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +    DR HF+   S++D+ +T+   F+  V E  V +VM +YN+VNG P    P +L +
Sbjct: 214 SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRVNGEPATGSPRLLGQ 273

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R +W   GY+VSDC +V   Y       T E A+A A++ G+DLDCG       + AV
Sbjct: 274 ILRKDWGFKGYVVSDCGAVEDIYKHHKVVDTVEAASALAVKNGVDLDCGTEYAALVK-AV 332

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQ 392
             GL+ E +I+ AL   +  +MRLGMFD  P+S+ P+  +      +P H  LA  AAR+
Sbjct: 333 HDGLIKESEIDAALTRLMQARMRLGMFD--PASKVPWSDVPYSVNQSPQHDALARRAARE 390

Query: 393 GIVLLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            +VLLKN G  LPLS  I+H  +AVIGP +D  + ++GNY G      T L+GI
Sbjct: 391 SMVLLKNDG-VLPLSKDIKH--IAVIGPTADDVMALVGNYHGTPADPVTILRGI 441



 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 129/293 (44%), Gaps = 57/293 (19%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           +   GL   +E E +          DR  L LP  Q++L+  +    K P +LVL SG  
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           + V +A     + A++ A YPGQ GG A+AD+LFG ++P G+LP+T+Y       LP  +
Sbjct: 701 LAVDWANQ--HLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYKAS--EKLPAFD 756

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
                    R  GRTYR++KG  +YPFG+G+SYT F +                      
Sbjct: 757 -------DYRMDGRTYRYFKGEPLYPFGYGLSYTKFTYA--------------------- 788

Query: 652 TISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLV 706
                 +K+ H K  +   L V V V N G + G   + ++     TP   H   +K L 
Sbjct: 789 -----DLKLDHNKIGKNDKLHVTVKVHNAGKRAGDEVVQLYLRGVGTP---HERSNKDLR 840

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSV 758
             +++ +  G  + V  ++     L   D +     +  G + + IG +   +
Sbjct: 841 GIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIGASSADI 893


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 246/435 (56%), Gaps = 36/435 (8%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G          GA
Sbjct: 44  QRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GA 93

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +SDEARA ++            GLT+WSPN+NIFRD
Sbjct: 94  TVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRD 153

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
           PRWGRGQET GEDP L+ +   ++V+GLQG   D  K A         A+ KHF  +   
Sbjct: 154 PRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATAKHFAVHSGP 213

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
                DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G    A   +L+ 
Sbjct: 214 E---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQD 270

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R +W   GY+VSDC ++   +      +T E+AAA A++ G +L+CG        +AV
Sbjct: 271 VLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAV 329

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
            +GL+ E  I+ AL   +T +MRLGMFD  P   P+  +      +P H  LA   AR+ 
Sbjct: 330 HQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARES 388

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYAR 450
           +VLLKN G  LPLS  + + +AVIGP +D T+ ++GNY G      T LQGI      A+
Sbjct: 389 LVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQ 447

Query: 451 TIHQQGCKDVACADD 465
            ++ +G   V   DD
Sbjct: 448 VLYARGADLVEGRDD 462



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +    K 
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 688 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 744

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++    +  T+ 
Sbjct: 745 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 796

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
           A       GS+ AT++                     VKN G + G   + ++  P  P 
Sbjct: 797 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 829

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
              A  K+L  F+++ +  G Q+ +   +     L + D +     +  G + + IG + 
Sbjct: 830 RERA-GKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 888

Query: 756 HSV 758
             +
Sbjct: 889 ADI 891


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 174/451 (38%), Positives = 251/451 (55%), Gaps = 29/451 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T   P+   + P+  R  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV
Sbjct: 33  AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
           +  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   AG     
Sbjct: 93  ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142

Query: 150 ---LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
              LT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQ   G   K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +  +   +    DR HF+   S++D+ +T+   F+  V EG VA+VM +YN+VNG    A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
               L+  +R +W  +GYIVSDC ++   +       TPE AAA  ++ G DLDCG    
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
               +AV+ GL+ E  I+ +L   +  ++RLGMFD  P+  P+  +      +P H  LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHDALA 376

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
              AR+ +VLLKN G  LPL     R +AV+GP +D  ++++GNY G      T LQGI 
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434

Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
                A  ++ +G   V   +D    A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/304 (30%), Positives = 149/304 (49%), Gaps = 57/304 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R AD  + V GL   +E E +D          R    LP  Q+EL+  +  A+  
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
           ++    LP   + AMR        GRTYR++ G  +YPFGHG++YT F ++    +  TV
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTTV 789

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
            A       G++ AT+S                     VKN G + G   + ++  P  P
Sbjct: 790 AA------DGTLRATVS---------------------VKNTGQRAGDEVVQLYLHPLNP 822

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
               A  K+L  F+++ +  G  + V  NI   + L + D +     +  G + + IG +
Sbjct: 823 QRERA-RKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIGAS 881

Query: 755 KHSV 758
              +
Sbjct: 882 SADI 885


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 89  TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 381 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425



 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++ FG+G+SYT F +   +AP +    L           +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 780

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
            +++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 781 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 89  TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 204 RHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 381 NDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 425



 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 735

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++ FG+G+SYT F +   +AP +    L           +G
Sbjct: 736 --VSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 780

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
            +++VT              V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 781 SSLQVT------------TTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 829 GEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 865


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 176/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)

Query: 39  TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           TLP   ++ P        QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LH
Sbjct: 18  TLPCLALATPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
           G++  G           AT FPQ I  A+S+N +L + +G VVS EARA +N  GG    
Sbjct: 78  GIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKD 127

Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
               AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+
Sbjct: 128 HQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187

Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
            KH   +     +G +  R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G
Sbjct: 188 PKHIAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHG 242

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            P CA   +L   +RG+W   G++VSDCD+V       +F      ++A A++AG DL+C
Sbjct: 243 TPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302

Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           G  +  L T  A+ RG + E  ++ +LV     + RLG  +  P   PY  LG KDV   
Sbjct: 303 GHAYRELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H+ LAL+AA + IVLLKN   +LPL       +AVIGPN+D    +  NY G +    T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417

Query: 441 PLQGI 445
           PL G+
Sbjct: 418 PLLGL 422



 Score =  139 bits (349), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 144/279 (51%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 680

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP      
Sbjct: 681 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------ 730

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
            P  S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 731 -PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 185/479 (38%), Positives = 263/479 (54%), Gaps = 41/479 (8%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +    LP  +R  DL+ R++L+EK + +   A A+PRLG+  Y WW+E LHGV+  G   
Sbjct: 38  YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY------NGGT---AGLTY 152
                   AT FPQ I  A++++         V+  E RA Y      +G T    GLT 
Sbjct: 96  --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           WSPN+NIFRDPRWGRGQET GEDP L+G+   +++RGLQG D +  K  A+ KH+  +  
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPNFFKTIATAKHYAVHSG 207

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
              N   R   +   S  D+EDT+   FR  V EGKV +VMC+YN V+GVP CA  +++ 
Sbjct: 208 PESN---RHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACASEDLMD 264

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY--DTQHFTSTPEEAAADAIRAGLDLDCGPFLG-LHT 329
           + +R +W  +G++VSDC +    Y  D+  +  TPEE    A+ AG+DL CG +    +T
Sbjct: 265 QRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDYRADWNT 324

Query: 330 E-----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
           E     SAV++G+L E  ++ ALV     ++RLG+FD  P+  P+  +      TP+H+ 
Sbjct: 325 EAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQNDTPEHRA 383

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           ++LE A+  + LLKN G  LPL     R +AV+GPN+D    +IGNY G      T L G
Sbjct: 384 MSLEMAKASMTLLKNDG-VLPLKG-EPRRIAVVGPNADSVDALIGNYYGTPSNPVTVLAG 441

Query: 445 I-GRY--ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI-EAEALDRA 499
           I  R+  A  ++ +G   V  A   +     DA   ADA     GL Q + E  AL+ A
Sbjct: 442 IRARFPKAEVVYAEGTGLVGPASLPV----PDAVLCADAACRTKGLKQEVFEGVALEGA 496



 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 136/286 (47%), Gaps = 50/286 (17%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           D  + V GL   +E E +          DR  L LP  QQ+L+ ++    K P +LVLM+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL- 587
           G  + V +A  D  + AI+ A YPG  GG A+A +L G  +P G+LP+T+Y      +L 
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFY--RSAGDLP 727

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P  + AM+        GRTYR++ G V+YPFG+G+SYT F +    AP      L  R  
Sbjct: 728 PFADYAMK--------GRTYRYFGGEVLYPFGYGLSYTRFSY---GAPQ-----LSARSV 771

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
           S +  I+                 V   V N G  DG   + ++ + P     P + L  
Sbjct: 772 SADGEIT-----------------VTTQVTNTGGMDGEEVVQLYVSHPGRDGTPIRALQG 814

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           F+++ +  G  + V   +   + LSVVD  G RR+  G   + +GG
Sbjct: 815 FQRIGLKRGETRPVSFTLK-DRQLSVVDAEGNRRVEPGRVEVWVGG 859


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 172/417 (41%), Positives = 244/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L + QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+ +G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +   S+ P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + +  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +      K+T  +   LT    + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V +  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 176/425 (41%), Positives = 242/425 (56%), Gaps = 38/425 (8%)

Query: 39  TLPFCQVSLPIP------QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           TLP   ++ P        QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LH
Sbjct: 18  TLPCLALAAPAKSAGSPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLH 77

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT--- 147
           G++  G           AT FPQ I  A+S+N +L + +G VVS EARA +N  GG    
Sbjct: 78  GIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKD 127

Query: 148 ----AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
               AGLT WSPN+NIFRDPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+
Sbjct: 128 HQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIAT 187

Query: 204 CKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNG 261
            KH   +     +G +  R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G
Sbjct: 188 PKHIAVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHG 242

Query: 262 VPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC 321
            P CA   +L   +RG+W   G++VSDCD+V       +F      ++A A++AG DL+C
Sbjct: 243 TPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNC 302

Query: 322 G-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           G  +  L T  A+ RG + E  ++ +LV     + RLG  +  P   PY  LG KDV   
Sbjct: 303 GHAYRELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNA 359

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H+ LAL+AA + IVLLKN   +LPL       +AVIGPN+D    +  NY G +    T
Sbjct: 360 AHRALALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVT 417

Query: 441 PLQGI 445
           PL G+
Sbjct: 418 PLLGL 422



 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 94/279 (33%), Positives = 143/279 (51%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLERAKASGK-PLVVVLMSGSAVAL 680

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP      
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
               S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 825 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 171/435 (39%), Positives = 247/435 (56%), Gaps = 36/435 (8%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G          GA
Sbjct: 44  QRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GA 93

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ I  A++F+  L   +   +SDEARA ++            GLT+WSPN+NIFRD
Sbjct: 94  TVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRD 153

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLD 213
           PRWGRGQET GEDP L+ +   ++V+GL+G   D  K A         A+ KHF  +   
Sbjct: 154 PRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRKLDATAKHFAVHSGP 213

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
                DR HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G    A   +L+ 
Sbjct: 214 E---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQD 270

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV 333
            +R +W   GY+VSDC ++   +      +T E+AAA A++ G +L+CG        +AV
Sbjct: 271 VLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAV 329

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQG 393
           ++GL+ E  I+ AL   +T +MRLGMFD  P   P+  +      +P H  LA   AR+ 
Sbjct: 330 RQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARES 388

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYAR 450
           +VLLKN G  LPLS  + + +AVIGP +D T+ ++GNY G      T LQGI      A+
Sbjct: 389 LVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQ 447

Query: 451 TIHQQGCKDVACADD 465
            ++ +G   V   DD
Sbjct: 448 VLYARGADLVEGRDD 462



 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +  AD  + V GL   +E E +          DR  L LP  Q++L+  +    K 
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 688 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 744

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++    +  T+ 
Sbjct: 745 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 796

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
           A       GS+ AT++                     VKN G + G   + ++  P  P 
Sbjct: 797 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 829

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
              A  K+L  F+++ + AG Q+ +   +     L + D +     +  G + + IG + 
Sbjct: 830 RERA-GKELHGFQRITLQAGEQRALHFILDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 888

Query: 756 HSV 758
             +
Sbjct: 889 ADI 891


>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
 gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
          Length = 897

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 175/436 (40%), Positives = 246/436 (56%), Gaps = 28/436 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+  VS    QR   L+ +++L EK   + + A A+ RLG+  Y+WW+E LHGV+  G  
Sbjct: 37  PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+  L   +   +SDEARA ++        G   GLT+
Sbjct: 96  ---------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTF 146

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           WSPNVNIFRDPRWGRGQET GEDP L+ +   ++VRGLQG D    K+ A+ KH   +  
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDATAKHLAVHSG 206

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
                 DR HF+A+ S++D+ DT+   F   V EG V +VM +YN+V G    A   +L+
Sbjct: 207 PE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFLLR 263

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
             +R +W   GY+VSDC ++   +   H  +T E AAA A+R G +L+CG        SA
Sbjct: 264 DVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECGQEYAT-LPSA 322

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           V++GL+SE +I++A+    T +MRLGMFD  P    +  +       P H  LAL+AA+ 
Sbjct: 323 VRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHDALALKAAQA 381

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
            +VLLKN G  LPLS    R +AV+GP +D T+ ++GNY G      T LQGI   A+ +
Sbjct: 382 SLVLLKNDG-ILPLSRDIKR-IAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAAKGV 439

Query: 453 ---HQQGCKDVACADD 465
              + +G   V   DD
Sbjct: 440 EVRYARGVDLVEGRDD 455



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 143/300 (47%), Gaps = 51/300 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F  A+DA+R+AD  + V GL   +E E +          DR  L LP  Q+ L+  +   
Sbjct: 619 FDEALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHAT 678

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P ++VL  G  I V +A++   + AI+ + YPGQ GGTA+   LFG  NP G+LP+T
Sbjct: 679 GK-PVVMVLTGGSAIAVDWAQS--HLPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVT 735

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP   + AM         GRTYR+++G  +YPFGHG+SYT F +       
Sbjct: 736 FYKAS--EALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDYGT----- 780

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPP 695
              + LD   GS+ A   G+             LGV VDV N G++ G   + L      
Sbjct: 781 ---LRLDA--GSLRA--DGR-------------LGVAVDVTNAGTRSGDEVVQLYVRREH 820

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGT 754
           AG     ++L  F+++H+  G  + V   +   + L   D +     +  G + + +G +
Sbjct: 821 AGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEARAAYEVRPGAYEVRVGAS 880


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 174/451 (38%), Positives = 250/451 (55%), Gaps = 29/451 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T   P+   + P+  R  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV
Sbjct: 33  AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
           +  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   AG     
Sbjct: 93  ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142

Query: 150 ---LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
              LT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQ   G   K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +  +   +    DR HF+   S++D+ +T+   F+  V EG VA+VM +YN+VNG    A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
               L+  +R +W  +GYIVSDC ++   +       TPE AAA  ++ G DLDCG    
Sbjct: 260 S-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
               +AV+ GL+ E  I+ +L   +  ++RLGMFD  P+  P+         +P H  LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALA 376

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
              AR+ +VLLKN G  LPL     R +AV+GP +D  ++++GNY G      T LQGI 
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434

Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
                A  ++ +G   V   +D    A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/304 (30%), Positives = 149/304 (49%), Gaps = 57/304 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R AD  + V GL   +E E +D          R    LP  Q+EL+  +  A+  
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTV 637
           ++    LP   + AMR        GRTYR++ G  +YPFGHG++YT F ++    +  TV
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTTV 789

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
            A       G++ AT+S                     VKN G + G   + ++  P  P
Sbjct: 790 AA------DGTLRATVS---------------------VKNTGQRAGDEVVQLYLHPLNP 822

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGT 754
               A  K+L  F+++ +  G  + V  NI   + L + D +     +  G + + IG +
Sbjct: 823 QRERA-RKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIGAS 881

Query: 755 KHSV 758
              +
Sbjct: 882 SADI 885


>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 874

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 184/467 (39%), Positives = 252/467 (53%), Gaps = 34/467 (7%)

Query: 44  QVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
           Q    I  R+++LI ++++ E++  L   A A+ RLG+  Y WW+E LHG++  G     
Sbjct: 30  QTPQQIDARIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDG----- 84

Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGT-----AGLTYWSPN 156
                 AT FPQ I  A++++A L   +G VVS EARA +  +GG       GLT WSPN
Sbjct: 85  -----YATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPN 139

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +NIFRDPRWGRGQET GEDP L+      +V G+QG+D   LK  A+ KHF A+      
Sbjct: 140 INIFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQGNDPFYLKADATPKHFAAHSGPEEG 199

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
              R  FNA VS  D+ DT+   F         A++MCSYN+++G P+CA  N L+  +R
Sbjct: 200 ---RDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGNNLQDLVR 256

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRG 336
             W   GY+VSDCD+VG      HF +     AADA+ AG+DLDCG      ++S + + 
Sbjct: 257 ERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNTYAALSKS-LDQN 315

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           L +E  +N AL   L  ++RLGM D   S  PY  +G +++ +P H  LAL AA + IVL
Sbjct: 316 LTTEAKLNQALHRLLLARVRLGMLD-PLSCSPYRDIGAEELDSPAHHTLALRAAEESIVL 374

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G  LPL     + V+VIGP +D+   +  NY G A    TPL G   +    H   
Sbjct: 375 LKNDG-VLPLQASTQK-VSVIGPTADMVKVLEANYHGTALHPITPLDG---FRSRFH--- 426

Query: 457 CKDVACADDQLFGAAIDA--SRQADATILVMGLDQSIEAEALDRAGL 501
             DV+ A   L    + A   R A       G    ++AE  D+A L
Sbjct: 427 --DVSYAQGSLLAEGVSAPVPRNALRVAAAPGSSAGLQAEYFDKASL 471



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 140/300 (46%), Gaps = 50/300 (16%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
           L   A+  + ++D  +  +GL   +E EAL          DR  L LP  Q+ L+S+++ 
Sbjct: 594 LLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLTQ 653

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P I+VL SG    VA        A ++ A YPG+AGG A+A IL G  NP G+LP+
Sbjct: 654 LHK-PVIIVLTSGS--GVALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGRLPV 710

Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T+Y    + +LP  T+ +M          RTYR++ GPV++PFG+G+SY++F +      
Sbjct: 711 TFY--RSVDDLPAFTDYSM--------AHRTYRYFDGPVLFPFGYGLSYSHFQY------ 754

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                              G+    TH       L   V V N   ++G     ++  PP
Sbjct: 755 -------------------GQLRLSTHMLKTSEPLVAMVTVHNESQREGTEVAELYLQPP 795

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
               AP   L   ++V +  G  + +   +   + LS VD SG R +  GE+ + +GG +
Sbjct: 796 QASGAPRLTLQGVQRVALRPGETRELTFKLAPGQ-LSTVDTSGARTVRAGEYKLFVGGVQ 854


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 174/451 (38%), Positives = 249/451 (55%), Gaps = 29/451 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T   P+   + P+  R  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV
Sbjct: 33  AATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGV 92

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
           +  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   A      
Sbjct: 93  ARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKR 142

Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKH 206
             GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQ   G   K+ A+ KH
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKH 202

Query: 207 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
           +  +        DR HF+   S++D+ +T+   F+  V EG VA+VM +YN+VNG    A
Sbjct: 203 YAVHSGPE---ADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
               L+  +R +W  +GYIVSDC ++   +       TPE AAA  ++ G DLDCG    
Sbjct: 260 STR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
               +AV+ GL+ E  I+ +L   +  ++RLGMFD  P+  P+         +P H  LA
Sbjct: 319 A-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALA 376

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
              AR+ +VLLKN G  LPL     R +AV+GP +D  ++++GNY G      T LQGI 
Sbjct: 377 RRTARESLVLLKNDG-LLPLKPTLKR-IAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434

Query: 446 --GRYARTIHQQGCKDVACADDQLFGAAIDA 474
                A  ++ +G   V   +D    A IDA
Sbjct: 435 DAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 143/302 (47%), Gaps = 53/302 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R AD  + V GL   +E E +D          R    LP  Q+EL+  +  A+  
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQ-ATGT 681

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+ D+LFG ++PGG+LP+T+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           ++    LP   + AMR        GRTYR++ G  +YPFGHG++YT F ++         
Sbjct: 740 ED--ERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT- 788

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAG 697
           V  DG                        TL   V VKN G + G   + ++  P  P  
Sbjct: 789 VAADG------------------------TLRATVWVKNTGQRAGDEVVQLYLHPLNPQR 824

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKH 756
             A  K+L  F+++ +  G  + V   I   + L + D +     +  G + + IG +  
Sbjct: 825 ERA-RKELRGFQRITLQPGEHREVSFTITPREALRIYDEQRKAYAVDPGAYELQIGASSA 883

Query: 757 SV 758
            +
Sbjct: 884 DI 885


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 172/444 (38%), Positives = 248/444 (55%), Gaps = 36/444 (8%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +        QR  DL+ R++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLTYW 153
                  GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASC 204
           SPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG   D  K A         A+ 
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRKLDATA 206

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +   +    DR HF+A  S++D+ +T+   F   V EGKV +VM +YN+V G   
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
            A   +L+  +R  W  +GY+VSDC ++   +      +T E+AAA A+  G +L+CG  
Sbjct: 264 SASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTELECGEE 323

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                 +AV++GL+SE D++ AL   +  +MRLGMFD  P +  +  +      +P+H  
Sbjct: 324 YST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQSPEHDA 381

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GNY G      T LQG
Sbjct: 382 LARRTARESLVLLKNDG-VLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440

Query: 445 IGRY---ARTIHQQGCKDVACADD 465
           I      A  ++ +G   V   DD
Sbjct: 441 IREAAPDAEVLYARGADLVEGRDD 464



 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/306 (30%), Positives = 148/306 (48%), Gaps = 61/306 (19%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+DA+R+AD  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GG+A+AD+LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           +     LP   + AMR        GRTYR++ G  +YPFGHG+SYT F ++         
Sbjct: 748 ES--EKLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAYS--------D 789

Query: 640 VPLD----GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP- 694
           + LD       GS++AT+                      VKN G + G   + ++  P 
Sbjct: 790 LRLDRSKLATDGSLHATLK---------------------VKNTGQRAGDEVVQLYLHPL 828

Query: 695 -PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIG 752
            P    A  K+L  F+++ +  G  + V   I     L + D +     +  G++ + +G
Sbjct: 829 SPQRERA-RKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887

Query: 753 GTKHSV 758
            +   V
Sbjct: 888 ASSSDV 893


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 24  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 73

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N +L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 74  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 133

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 134 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 188

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 189 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 248

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 306

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 366 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 144/279 (51%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 610 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVAL 668

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP      
Sbjct: 669 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------ 718

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
            P  S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 719 -PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 764

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 765 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 812

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 813 AGEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 850


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 172/407 (42%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N +L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 200

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 260

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A+ RG +
Sbjct: 261 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIARGEV 318

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 377

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 378 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422



 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 142/279 (50%), Gaps = 48/279 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLERAKASGK-PLVVVLMSGSAVAL 680

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP      
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY---- 732

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
               S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +
Sbjct: 733 ---VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------A 776

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
           G  ++VT              V+N G++ G     V+   P    +P + LV F++VH+ 
Sbjct: 777 GNPLQVT------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 824

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           AG Q+ +  N+   + LS VDRSG R +  G + + +GG
Sbjct: 825 AGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVGG 862


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 173/416 (41%), Positives = 241/416 (57%), Gaps = 33/416 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 88

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 89  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D D  +  A+ KH   +     +G +  
Sbjct: 149 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLDHPRTIATPKHIAVH-----SGPEPG 203

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  +++G+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 204 RHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 263

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A +++AG DL+CG  +  L T  A++RG +
Sbjct: 264 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGYAYRALGT--AIERGEV 321

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KD+    ++ LAL+AA Q IVLLK
Sbjct: 322 DEALLDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIVLLK 380

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQ 454
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+ R    +HQ
Sbjct: 381 NDANTLPLKA--GARLAVIGPNADALAALEANYQGTSSTPVTPLLGL-RQRFGVHQ 433



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/278 (34%), Positives = 142/278 (51%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 684

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 685 WAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLP------- 733

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT+F +    AP + +  L  + GS       
Sbjct: 734 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---GAPQLSSTTL--QAGS------- 781

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
                        TL V   V+N G++ G     V+   P    +P + LV F++VH+  
Sbjct: 782 -------------TLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLKP 828

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +   +   + LS VDR+G R +  G++ + +GG
Sbjct: 829 GEQRTLTFTLD-ARALSDVDRTGQRAVEAGDYTLFVGG 865


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  302 bits (773), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 166/403 (41%), Positives = 245/403 (60%), Gaps = 28/403 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  ++ PI +R+ DL+ +L+++EK+ LL + +  + R+GI  Y   +EALHG+  + PG 
Sbjct: 23  YKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGI--IRPG- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
           KF       T FPQ I  AS +N  L   I  V+SDEARA +N    G          LT
Sbjct: 80  KF-------TVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLT 132

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSG    ++V+GLQG     LK  ++ KHF A +
Sbjct: 133 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKSVSTPKHFAANN 192

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +RF+ +A +++ D+ + +   F  C+ EGK  S+M +YN +NGVP  A+  +L
Sbjct: 193 EEH----NRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTE 330
            + ++ +W  NGYIVSDC + G+      +  TPE AA  AI+AGLDL+CG +  G    
Sbjct: 249 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECGDYVFGAPLL 308

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +A ++ ++S  +I++A  + L  +MRLGMFD +P   PY HL P+ V    H+ELALEAA
Sbjct: 309 NAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEKHKELALEAA 367

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ IVLLKNQ  +LPL+  + +++AV+G N+       G+Y+G
Sbjct: 368 RQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSG 408



 Score =  139 bits (349), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 97/296 (32%), Positives = 146/296 (49%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           ++G A    R++D  I VMG++QSIE E  DR+ + LP  QQ  + +   A+   TI+VL
Sbjct: 586 MYGDASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKANPN-TIVVL 644

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + V +   D  I AII A YPG+ GGTAIA++LFG  NP G+LP+T+Y    I +
Sbjct: 645 VAGSSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IED 700

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP    A      K    RTY +++G  +Y FG+G+SYT F               D R+
Sbjct: 701 LP----AFNDYNVKN--NRTYMYFEGKPLYAFGYGLSYTKF---------------DYRN 739

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
            +I              K +   + +   VKN G  +G     V+   P  G   P KQL
Sbjct: 740 LNI--------------KQDSQNITLNFSVKNSGKYNGDEVAQVYVQFPDLGIKTPLKQL 785

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F++VH+  GA +++ I I   +     D+      P G +N  +G +  ++ L 
Sbjct: 786 KGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYNFMVGKSSDNICLQ 841


>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
          Length = 712

 Score =  301 bits (772), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 201/599 (33%), Positives = 295/599 (49%), Gaps = 74/599 (12%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           R+   C  +    +R   L+  L+ +EKV  L+S A   PR+G+  Y WW+EALHGV+  
Sbjct: 37  RSNGICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVAG- 95

Query: 98  GPGTKFGGDFP--GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-LTYWS 154
            PG  +    P   ATSFP  +   S+F+  L   IG V+  EARA  NGG  G + YW+
Sbjct: 96  APGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYWT 155

Query: 155 PNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDN 214
           PNVN F+DPRWGRG ETPGED +    + + Y R ++                       
Sbjct: 156 PNVNPFKDPRWGRGSETPGEDAL----HVSRYARAME----------------------- 188

Query: 215 WNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRT 274
                                       C  + KV S+MCSYN VNG+P CA+  + +  
Sbjct: 189 ----------------------------CTRDAKVGSIMCSYNAVNGIPACANSYLQETL 220

Query: 275 IRGEWRL---NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
           +R  W     N +I SDC ++   +   ++T T  EAA  A   G D  C          
Sbjct: 221 LRKHWNWTHTNNWITSDCGAMQDIWQHHNYTKTGAEAAKAAFENGQDSSCEYTTTKDISD 280

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           + ++GLL+E  ++ AL       +  G FDG+ S   +  L   DV T   Q+LAL++A 
Sbjct: 281 SYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSE--WSSLDFDDVNTRHAQDLALQSAV 338

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL---QGIGRY 448
           +G VLLKN   +LPL+  +  +VA+IG  +D    + G Y+G A    TP    + +G  
Sbjct: 339 RGAVLLKNDN-TLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPHVRTPAYAAKMLGLN 397

Query: 449 ARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQ 508
                    ++ +  D+    A ++A++++D  + + GLD +   E  DR  L  P  Q 
Sbjct: 398 TNVAWGPTLQNSSVPDNWTTNA-LEAAKKSDYIVYLGGLDATAAGEERDRTDLDWPSTQL 456

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
            L+ K+S   K P ++V +     D    KN   + +I+W  YPGQ GGTA+ +++ G  
Sbjct: 457 TLLKKLSNLGK-PLVVVQLGDQVDDTPLLKNK-GVNSILWVNYPGQEGGTAVMELITGRK 514

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
            P G+LP+T YP +Y   + M EM +RP++S   PGRTYR+Y   V+ PFG G  YT F
Sbjct: 515 GPAGRLPLTQYPSKYTEQVGMLEMELRPTKSS--PGRTYRWYSDSVL-PFGFGKHYTTF 570


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  301 bits (772), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 172/417 (41%), Positives = 243/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+ +G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +   S+ P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + +  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +      K+T  +   LT    + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V +  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  301 bits (771), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 171/417 (41%), Positives = 243/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L + QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+ +G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++ L+SE  IN ++   L  +  LG  +   S+ P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + +  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +      K+T  +   LT    + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V +  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
          Length = 897

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 174/436 (39%), Positives = 246/436 (56%), Gaps = 28/436 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+  VS    QR   L+ +++L+EK   + + A A+ RLG+  Y+WW+E LHGV+  G  
Sbjct: 37  PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+  L   +   +SDEARA ++        G   GLT+
Sbjct: 96  ---------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTF 146

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           WSPNVNIFRDPRWGRGQET GEDP L+ +   ++VRGLQG D    K+ A+ KH   +  
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDATAKHLAVHSG 206

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
                 DR HF+A+ S++D+ DT+   F   V EG V +VM +YN+V G    A   +L+
Sbjct: 207 PE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFLLR 263

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESA 332
             +R +W   GY+VSDC ++   +      +T E AAA A+R G +L+CG        SA
Sbjct: 264 DVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECGQEYAT-LPSA 322

Query: 333 VQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQ 392
           V++GL+SE +I++A+    T +MRLGMFD  P    +  +       P H  LAL+AA+ 
Sbjct: 323 VRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHDALALKAAQA 381

Query: 393 GIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI 452
            +VLLKN G  LPLS    R +AV+GP +D T+ ++GNY G      T LQGI   A+ +
Sbjct: 382 SLVLLKNDG-ILPLSR-NTRRIAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAAKGV 439

Query: 453 ---HQQGCKDVACADD 465
              + +G   V   DD
Sbjct: 440 EVRYARGVDLVEGRDD 455



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + V GL   +E E +          DR  L LP  Q+ L+  +    K P ++VL  G  
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSA 691

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           I V +A+    + AI+ + YPGQ GGTA+   LFG  NP G+LP+T+Y            
Sbjct: 692 IAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFY---------KAG 740

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--ANAPTVVAVPLDGRHGSI 649
            AM         GRTYR+++G  +YPFGHG+SYT F +     +A ++ A   DGR    
Sbjct: 741 EAMPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLDADSLRA---DGR---- 793

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAF 708
                               LGV VDV N G++ G   + L      AG     ++L  F
Sbjct: 794 --------------------LGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQELRGF 833

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIHIGGTKHSV 758
           ++V +  G ++ V   +   + L   D +     +  G + + +G +   +
Sbjct: 834 QRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVGASSADI 884


>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 883

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/446 (39%), Positives = 250/446 (56%), Gaps = 40/446 (8%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           L+ LSA ++   A+ P        TT+T           QR  DL+ R+SL+EK   LI+
Sbjct: 18  LVCLSAPTA--QAQNPLESPAYQDTTKT---------AEQRAADLVSRMSLEEKAAQLIN 66

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            A A+PRLG++ Y WW+E LHGV+  G           AT FPQ +  A++F+  L   +
Sbjct: 67  DAPAIPRLGVREYNWWNEGLHGVAAHG----------YATVFPQAVGMAATFDEPLIHRV 116

Query: 132 GRVVSDEARAMY------NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
              +S E RA Y       GG+    GLT WSPN+NIFRDPRWGRGQET GEDP L+ + 
Sbjct: 117 ADTISVEFRAKYVASRHRFGGSDWFRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARI 176

Query: 183 AASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRM 242
             ++V+GLQG D    +  A+ KH+  +         R   N   S+ D+EDT+   FR 
Sbjct: 177 GVAFVKGLQGEDPVYYRTIATPKHYAVHSGPE---ASRHRDNINPSRYDLEDTYLPAFRA 233

Query: 243 CVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDT-QH 300
            ++EGK  S+MC+YN ++G P CA+ ++L + +R +W   G++VSDCD+VG +YY T  H
Sbjct: 234 TIVEGKAVSIMCAYNAIDGQPACANDDLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHH 293

Query: 301 FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGM 359
           +  TPEE    A +AG DL CG      H  SAV++G+L E  ++ ALV   + + +LG 
Sbjct: 294 YRPTPEEGVTVAYQAGTDLICGNANEADHVASAVRKGILPESLVDTALVRLFSARFKLGQ 353

Query: 360 FDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           FD  P +Q +  +   D  T  +++ +   A   +VLLKN G  LPL     RT+AVIGP
Sbjct: 354 FD--PPAQVFPAITADDYDTQANRDFSQHVAESAMVLLKNDG-LLPLKS-EPRTIAVIGP 409

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGI 445
           N+D   +++GNY G      T L GI
Sbjct: 410 NADTMDSLVGNYNGDPSHPVTVLAGI 435



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/294 (32%), Positives = 154/294 (52%), Gaps = 50/294 (17%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+ A++++D  I V GL Q +E E +          DR  L LP  QQ+++ +VS   K 
Sbjct: 600 AVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQVSATGK- 658

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P +LVL++G  + V +A  D  + AI+ A YPG  GG A+A ++ G  +P G+LP+T+Y 
Sbjct: 659 PVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSPAGRLPVTFY- 715

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
                 +P  T+  M+        GRTYR++KG  +YPFG+G+SYT F +    AP  ++
Sbjct: 716 -RSADQIPAFTDYTMK--------GRTYRYFKGEALYPFGYGLSYTKFSY----APAKLS 762

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                      A ++G   +VT          V VDV N G++DG   + ++ + P    
Sbjct: 763 A----------AKVAGNG-EVT----------VSVDVTNSGARDGDEVVQLYLSHPGQKD 801

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            P + L  F+++H+ AG  + V   +   + LS V+  G+R +  G+ N+ +GG
Sbjct: 802 TPIRALARFDRIHLKAGETKTVTFTLD-SRALSTVNADGSRSVKPGKVNLWLGG 854


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 173/458 (37%), Positives = 252/458 (55%), Gaps = 36/458 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
             C    A   T P+         R  DL+ R++L+EK   + + A A+PRL +  Y+WW
Sbjct: 20  LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
           +EALHGV+  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
                    GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG       
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
            + G+   K+ A+ KHF  +        DR HF+A+ S++D+ +T+   F   V +GKV 
Sbjct: 190 NAQGEPYRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVD 246

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           +VM +YN+V G    A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
            A++ G +L+CG        +AV++GL+ E  I+ AL   +T +MRLGMFD  P   P+ 
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            +      +P H  LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGN 423

Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
           Y G      T LQGI      A+ ++ +G   V   DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPKAQVLYARGADLVEGRDD 461



 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 149/305 (48%), Gaps = 59/305 (19%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +  A+  
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++ +    T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                  GS+ AT++                     VKN G + G   + ++  P     
Sbjct: 796 T-----DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHP----L 825

Query: 700 APH-----KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGG 753
           AP      K+L  F+++ +  G Q+ +G  I+    L + D +     +  G + + IG 
Sbjct: 826 APQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGA 885

Query: 754 TKHSV 758
           +   +
Sbjct: 886 SSADI 890


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 170/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N +L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHIAVH-----SGPEPG 200

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 201 RHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDW 260

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD++       +F      ++  A++AG DL+CG  +  L T  A+ RG +
Sbjct: 261 GFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVAALKAGHDLNCGHAYRELGT--AIARGEV 318

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 319 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAHRALALQAAAESIVLLK 377

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 378 NDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 93/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVLMSGSAVALN 681

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 682 WAKT--HADAIMAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFY--RSTKDLPAY----- 732

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++PFG+G+SYT F +   +AP +    L           +G
Sbjct: 733 --VSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQ----------AG 777

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++V               V+N G++ G     V+   P    +P + LV F++VH+ A
Sbjct: 778 NPLQVI------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAA 825

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +  ++   + LS VDRSG R +  G++ + +GG
Sbjct: 826 GEQRTLTFHLD-ARALSDVDRSGQRAVEAGDYTLFVGG 862


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+  G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +    + P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVIDCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 139/308 (45%), Gaps = 51/308 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + I  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +           
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY----------- 749

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGV-QVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                         GKA  +  +K N+    +  + V NVG +DG   + V+   P    
Sbjct: 750 --------------GKAT-LNQSKLNKGEKAILTIPVSNVGQRDGEEVVQVYICRPDDKE 794

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P K L  F++V++  G  Q V I +    +      + T R   G + I  G + +   
Sbjct: 795 GPQKTLRGFQRVNIAKGKTQNVSIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNEND 854

Query: 760 LHAATLGV 767
           L   ++ +
Sbjct: 855 LQTCSIQI 862


>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
 gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
          Length = 882

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 173/450 (38%), Positives = 249/450 (55%), Gaps = 33/450 (7%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
           +F   +L +  S+G A +   A +P DA      +   S P   R  DL+ R++L+EK  
Sbjct: 6   YFTAAVLAALMSAG-ALQPALAQNPLDAPG----YQDASKPPEARAADLVSRMTLEEKTA 60

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
            LI+ A A+PRL ++ Y WW+E LHGV+  G           AT FPQ +  A++++  L
Sbjct: 61  QLINDAPAIPRLNVREYNWWNEGLHGVAAAG----------YATVFPQAVGLAATWDEPL 110

Query: 128 WEAIGRVVSDEARAMY------NGGT---AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
              +   +S E RA Y       GG+    GLT WSPN+NIFRDPRWGRGQET GEDP L
Sbjct: 111 IHRVAETISVEFRAKYLKERHRFGGSDWFGGLTVWSPNINIFRDPRWGRGQETYGEDPYL 170

Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
           + +   ++VRGLQG D    +  A+ KH+  +         R   N   S  D+ DT+  
Sbjct: 171 TARMGVAFVRGLQGDDPVYYRTVATPKHYAVHSGPE---AGRHRDNVNPSPYDLADTYLP 227

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYD 297
            FR  + EG+  S+MC+YN +NG P CA+ ++L + +R +W   GY+VSDCD+VG +YY 
Sbjct: 228 AFRATITEGQAGSIMCAYNAINGQPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYK 287

Query: 298 TQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQM 355
           T H +  TPEE    A + G DL CG      H   AV++GLL E  ++ AL+   T + 
Sbjct: 288 TSHAYRPTPEEGVTAAYQVGTDLICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARF 347

Query: 356 RLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
           +LG FD  P ++ +  +  +D  TP +++ + + A   +VLLKN+   LPL     R +A
Sbjct: 348 KLGQFD--PPAKVFPKITAEDYDTPANRDFSQKVAESAMVLLKNENNLLPLKG-EPRQIA 404

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           VIGPN+D   +++GNY G      T L GI
Sbjct: 405 VIGPNADSMDSLVGNYNGDPSHPVTVLSGI 434



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 151/295 (51%), Gaps = 50/295 (16%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           +A+ A+++AD  + V GL Q +E E +          DR  L LP  QQ+++ +VS A K
Sbjct: 598 SAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK 657

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVL++G  + + +A  D  + AII A YPG  GG A+A ++ G  +P G+LP+T+Y
Sbjct: 658 -PVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSPAGRLPVTFY 714

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
                  LP   +  M+        GRTYR++KG  +YPFG+G+S+T F +    AP   
Sbjct: 715 --RSADQLPAFNDYNMK--------GRTYRYFKGEALYPFGYGLSFTTFRY----AP--- 757

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
            + L  R  + +  +S                 V  DV N GS+D    + ++ + P   
Sbjct: 758 -LTLSARQVAGDGQVS-----------------VSADVTNSGSRDSDEVVQLYVSYPGQK 799

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            AP + L  FE++H+ AG  + V   +   + LS V+  G+R +  G+  + +GG
Sbjct: 800 LAPIRALARFERIHLKAGETKTVRFTLD-PQALSTVNADGSRSVKPGKVELWLGG 853


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+  G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +    + P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 138/307 (44%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + I  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +   K  K            + + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQSKLAKGEKAI----------LTIPVSNVGQRDGEEVVQVYICRPDDKGG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V++  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNENDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 178/440 (40%), Positives = 249/440 (56%), Gaps = 43/440 (9%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           LPF    LP+ +RV DL+ RL+L EKV L+   +++VPRLGIK Y WW+EALHGV   G 
Sbjct: 22  LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
                     AT FPQ I  A++F+    + +   VSDEARA Y+            GLT
Sbjct: 82  ----------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLT 131

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRGLQG S+    K+ A  KH+  +
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYDKLHACAKHYALH 191

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  F+ + +S +D+ +T+   F+  V +G V  VMC+YN+  G P C    
Sbjct: 192 SGPEWN---RHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCGSNR 248

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT--STPEEAAADAIRAGLDLDCG-PFLG 326
           +L   +R EW  +G +VSDC ++  +Y   H    ST E A A A++AG DLDCG  +  
Sbjct: 249 LLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAVKAGTDLDCGVDYQS 308

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
           L  E AV++G+++E  I+ +L   L  +  LG+ D E     S  PY       V +  H
Sbjct: 309 L--EKAVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPY-----TVVDSEKH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRH-RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           +  ALE AR+ + LLKN+  +LPLS  +H   + VIGPN++ ++ M GNY G      T 
Sbjct: 362 RAKALEVARKSMTLLKNKNGTLPLS--KHCGKIVVIGPNANDSIMMWGNYNGFPSHTVTI 419

Query: 442 LQGIGRY---ARTIHQQGCK 458
           L+GI       + I+ +GC+
Sbjct: 420 LEGITHKLDAGQVIYDKGCE 439



 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 89/293 (30%), Positives = 140/293 (47%), Gaps = 47/293 (16%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           A+A + V G+   +E E L          DR  + LP  Q++L+ ++    K P IL+L 
Sbjct: 600 AEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILILC 658

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  I ++ A+ D    AII A YPGQAGGTA+AD+LFG  NP G+LP+T+Y        
Sbjct: 659 SGSAIGLS-AEVD-LADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFYK------- 709

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
             T   +   +     GRTYR++KG  ++PFG+G+SYT+F                    
Sbjct: 710 --TTEQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFEIG----------------- 750

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
              A +S K I          ++ + + +KN G +DG   + V+         P K L A
Sbjct: 751 --KAQLSKKRIHANE------SVNLDLWIKNTGERDGEEVIQVYIRKLKDKEGPLKTLRA 802

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           F++VHV +G ++++ I++    +          R+  GE+ +  G +     L
Sbjct: 803 FKRVHVKSGEKKQISIHLPNDSFEFFDPEFNVMRVMAGEYEVLYGTSSEGKDL 855


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 171/417 (41%), Positives = 242/417 (58%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYN-----GGTAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+  G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +    + P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 138/307 (44%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + I  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +   K  K            + + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQSKLAKGEKAI----------LTIPVSNVGQRDGEEVVQVYICRPDDKGG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V++  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVNIAKGKTQNVNIELPYDSFEWFDTATNTIRPLSGTYKILYGNSSNENDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 173/458 (37%), Positives = 251/458 (54%), Gaps = 36/458 (7%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
             C    A   T P+         R  DL+ R++L+EK   + + A A+PRL +  Y+WW
Sbjct: 20  LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--- 144
           +EALHGV+  G          GAT FPQ I  A++F+  L   +   +SDEARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 145 -----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------- 192
                    GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG       
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 193 -SDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVA 250
            + G+   K+ A+ KH   +        DR HF+A+ S++D+ +T+   F   V EGKV 
Sbjct: 190 NAQGEPYRKLDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVD 246

Query: 251 SVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAA 310
           +VM +YN+V G    A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA
Sbjct: 247 AVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAA 306

Query: 311 DAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
            A++ G +L+CG        +AV++GL+ E  I+ AL   +T +MRLGMFD  P   P+ 
Sbjct: 307 LAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWS 364

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            +      +P H  LA   AR+ +VLLKN G  LPLS  + + +AVIGP +D T+ ++GN
Sbjct: 365 TIPASVNQSPAHDALARRTARESLVLLKNDG-LLPLSRAKFKRIAVIGPTADDTMALLGN 423

Query: 431 YAGIACGYTTPLQGI---GRYARTIHQQGCKDVACADD 465
           Y G      T LQGI      A+ ++ +G   V   DD
Sbjct: 424 YYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 148/301 (49%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +  A+  
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQ-ATGR 686

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 743

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVA 639
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++ +    T +A
Sbjct: 744 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 795

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                  GS+ AT++                     VKN G + G   + ++  P A   
Sbjct: 796 T-----DGSLAATVT---------------------VKNTGQRAGDEVVQLYLHPLAPQR 829

Query: 700 A-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
               K+L  F+++ +  G Q+ +G  I+    L + D +     +  G + + IG +   
Sbjct: 830 ERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIGASSAD 889

Query: 758 V 758
           +
Sbjct: 890 I 890


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 174/447 (38%), Positives = 259/447 (57%), Gaps = 41/447 (9%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT +  P+   +L   +R  DL+ R++L+EK+  + +G+ A+ RLGI  Y+WW+EALHGV
Sbjct: 18  ATAQNEPYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGV 77

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYN--------G 145
           +  G           AT FPQ I  A++F N  ++E    +VSDEARA Y+        G
Sbjct: 78  ARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGERG 126

Query: 146 GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
           G  GLT+W+PN+NI+RDPRWGRG ET GEDP L+     + V+GLQG+   +   A +C 
Sbjct: 127 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGNGAGKYDKAHACA 186

Query: 205 KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           KH+  +    WN   R  F++K +S++D+ +T+   F+  V EGKV  VMC+YN+  G P
Sbjct: 187 KHYAVHSGPEWN---RHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEP 243

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDC 321
            C++  +L R +R +W  +  +VSDC ++G +Y   H  + P  E A+ADA+ +G DL+C
Sbjct: 244 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLEC 303

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDV 377
           G       E AV++GL++E  IN ++   L  + +LGMFD +     S  PY  +  K  
Sbjct: 304 GGSYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYSVVESK-- 360

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
              +H + ALE AR+ +VLL N+  SLPLS    R VAV+GPN++ +V +  NY G    
Sbjct: 361 ---EHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPTK 416

Query: 438 YTTPLQGIGRY---ARTIHQQGCKDVA 461
             T L+GI          +++GC  V+
Sbjct: 417 SVTILEGIRSKLPEGAVYYEKGCDFVS 443



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 137/298 (45%), Gaps = 47/298 (15%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           I V GL  ++E E +          DR  + LP  Q+E++  +    K P I V+ SG  
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGST 663

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           + + +   +  + A++ A YPGQ GGTA+AD+LFG  NP G+LP+T+Y  +  ++LP  E
Sbjct: 664 LALPWEAEN--LDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD--SDLPDFE 719

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
                        RTYR++KG  ++PFG+G+SYT F +                     A
Sbjct: 720 -------DYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYG-------------------KA 753

Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
            +  K+IK         ++ + + +KN G  DG   + V+   PA    P K L AF +V
Sbjct: 754 KVDKKSIKTGD------SMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGPIKMLRAFRRV 807

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
            + AG  + + I +    +      +    I  G + +  GGT    SL    + + K
Sbjct: 808 SLKAGQAENIQIELPASTFECFNPATNRMEILPGNYELLYGGTSDGKSLQKVAVTLKK 865


>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
           4393]
          Length = 883

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 176/448 (39%), Positives = 252/448 (56%), Gaps = 32/448 (7%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           A D  DA     P+   S     R   L+ +++L+EK   + + A A+ RLG+  Y+WW+
Sbjct: 15  AADAVDAK----PWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWN 70

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
           EALHGV+  G           AT FPQ I  A++F+  L   +   +SDEARA ++    
Sbjct: 71  EALHGVARAGQ----------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLR 120

Query: 145 ----GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKV 200
               G   GLT+WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG D    K+
Sbjct: 121 EGAHGRYQGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDDPVYRKL 180

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            A+ KHF  +   +    DR HF+A+ SK+D+ DT+   F   V EGKV +VM +YN+V 
Sbjct: 181 DATAKHFAVH---SGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVY 237

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           G    A   +L+  +R +W   GY+VSDC ++   +   H   + E AAA A++ G +L+
Sbjct: 238 GESASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELE 297

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
           CG        +AV++GL+ E +I++A+    T +MRLGMFD  P    +  +       P
Sbjct: 298 CGQEYAT-LPAAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVP 355

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H  LAL+AA++ +VLLKN G  LPLS    R +AV+GP +D T+ ++GNY G      T
Sbjct: 356 AHDALALQAAQESLVLLKNDG-VLPLSRTLKR-IAVVGPTADDTMALLGNYFGTPAAPVT 413

Query: 441 PLQGIGRYARTI---HQQGCKDVACADD 465
            LQGI   A+ I   + +G   V   DD
Sbjct: 414 ILQGIRDAAKGIEVRYARGVDLVEGRDD 441



 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 145/312 (46%), Gaps = 59/312 (18%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F  A+DA+R AD  + V GL   +E E +          DR  L LP  Q+ L+  +   
Sbjct: 605 FDEALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHAT 664

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P ++VL  G  + V +A+    + AI+ + YPGQ GGTA+   LFG  NP G+LP+T
Sbjct: 665 GK-PVVMVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVT 721

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV--ANA 634
           +Y  +    LP   + AM         GRTYR+++G  +YPFGHG+SYT F +     +A
Sbjct: 722 FYRAD--QALPAFDDYAME--------GRTYRYFRGTPLYPFGHGLSYTRFDYGKLHLDA 771

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
           P +     DGR                        L +QV+V N G + G     ++   
Sbjct: 772 PRIAD---DGR------------------------LKLQVEVANTGKRAGDEVAQLYVRR 804

Query: 695 PAGHWAP---HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS-GTRRIPLGEHNIH 750
            A   AP    + L  F++VH+  G ++ +   +   + L   D + G   +P G + + 
Sbjct: 805 LAA--APGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVR 862

Query: 751 IGGTKHSVSLHA 762
           IGG+     + A
Sbjct: 863 IGGSSADARVRA 874


>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
 gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
          Length = 869

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 179/415 (43%), Positives = 238/415 (57%), Gaps = 37/415 (8%)

Query: 57  IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
           I R+++++K   + + A  +P  G+  YEWW+E LHGV+  G           AT FPQ 
Sbjct: 40  IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89

Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRG 168
           I  A+++N  L + +G VVS EARA +N            GLT WSPN+NIFRDPRWGRG
Sbjct: 90  IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           QET GEDP L+ + A  +V GLQG D    KV AS KH   +         R  F A VS
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHPKVVASVKHLAVHSGPE---AGRHGFAASVS 206

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
             D+E T+   FR  VM  K  SVMC+YN V GVP CA   +LK  +R  W   GY+V+D
Sbjct: 207 PYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAWGFKGYVVTD 266

Query: 289 CDSVGVYYDTQHF---TSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           CD++   YD   F        E++A++++AG+DL+CG       E AVQ+GL+ E  ++ 
Sbjct: 267 CDAI---YDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAALPE-AVQKGLIPESLMDQ 322

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
           +L   L V+ RLG+ DG PS  P+  + P+ + TP  Q LAL+AA Q +VLLKN G  LP
Sbjct: 323 SLNRLLDVRKRLGI-DGAPS--PWARISPEAINTPQAQGLALQAAEQSLVLLKNNG-VLP 378

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
           L     +TVAVIGPN+D   T+ GNY GIA    TPL G+      A+ ++ QG 
Sbjct: 379 LKP--GQTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAKVLYAQGA 431



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 133/272 (48%), Gaps = 45/272 (16%)

Query: 482 ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           ILV G D+       DR  L LP  Q++L+  V    K P ++VL+SG  + + +A    
Sbjct: 620 ILVPGFDRG------DRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALNWADAHA 672

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
                 W  YPG+AGGTAIA  L G +NP G+LP+T+Y    + +LP       P    R
Sbjct: 673 DAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFY--RSVQDLP-------PFIDYR 721

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
             GRTYR++KG  +YPFGHG+SYT F ++         + LD      +   +G+ ++V+
Sbjct: 722 MEGRTYRYFKGKPLYPFGHGLSYTQFSYS--------DLKLD-----TSTLTAGQPLRVS 768

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                       V V+N G + G   + ++   P   +  +  L AF +V + AG  + V
Sbjct: 769 ------------VRVRNNGQRAGDEVVQLYVKRP-DTFGLNASLAAFARVSLKAGESRTV 815

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            + I   + LS V   G R I  G + + +GG
Sbjct: 816 VMTID-PRDLSTVTLEGERAIRAGAYGLSVGG 846


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  298 bits (764), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 169/446 (37%), Positives = 246/446 (55%), Gaps = 32/446 (7%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           + +LS     +A   PFA     A T   P+    L    RVNDL+ R++L+EKV  +++
Sbjct: 1   MKILSKICILVATMLPFAVI---AQTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLN 57

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAI 131
            + A+PRL I  Y+WW+E LHGV+     T F       T +PQ I  A++F+      +
Sbjct: 58  SSPAIPRLKIPAYDWWNEVLHGVAR----TPFK-----VTVYPQAIAMAATFDRQSLNQM 108

Query: 132 GRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
               + E RA++N             GLTYW+PN+NIFRDPRWGRGQET GEDP L+G  
Sbjct: 109 ADYAALEGRAVHNKALQMRKPGEKYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAM 168

Query: 183 AASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPF 240
            +++V GLQG+D   LK AA  KH+  +     +G +  R  FNA +S  D+ DT+   F
Sbjct: 169 GSAFVSGLQGNDPKYLKAAACAKHYAVH-----SGPEPLRHVFNADISTYDLWDTYLPAF 223

Query: 241 RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
           +  V++ KVA VMC+YN     P C    ++   +R +W+ +GY+ SDC  +  ++    
Sbjct: 224 KKLVVDDKVAGVMCAYNAFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHK 283

Query: 301 FTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
             +T E+A+ DA+  G D++CG        +AV+ G +SE  I+ ++     ++ RLGMF
Sbjct: 284 THATAEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMF 343

Query: 361 DGEPSSQPYGHLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGP 419
           D  PS        P  V  +P+HQ  AL+ ARQ +VLLKN   +LPLS    R + V+GP
Sbjct: 344 D--PSDVVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSKTI-RKIVVLGP 400

Query: 420 NSDVTVTMIGNYAGIACGYTTPLQGI 445
           N+D  + ++GNY G     TT  QGI
Sbjct: 401 NADNPIAILGNYNGTPSNLTTVYQGI 426



 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 91/298 (30%), Positives = 139/298 (46%), Gaps = 57/298 (19%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           ADA + V G+   +E E +          DR  + LP  Q  L+ K   A+  P + V+M
Sbjct: 603 ADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQATGKPVVFVMM 661

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           +G  +   +   +  I AI+ A Y GQA GTA+AD+LFG  NP G+LP+T+Y  +  T+L
Sbjct: 662 TGSALATPWEAEN--IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVTFYKSD--TDL 717

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P  T+ +M          RTYR++KG  +Y FG+G+SYT F +     P  V        
Sbjct: 718 PDFTDYSMT--------NRTYRYFKGIPLYGFGYGLSYTQFKYDKLIVPATVK------- 762

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQL 705
                  SGKAI ++            V V N G   G   + ++    +     P K L
Sbjct: 763 -------SGKAIHLS------------VTVTNSGQIAGDEVVQIYMKHHSQRIKVPLKAL 803

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTK---HSVS 759
             F +V++ AG  +R  +N  +      V  S    +P+ G+  I  GG++   H+V+
Sbjct: 804 KGFARVYLKAG--ERRTLNFILSPDDLAVTSSNGGLVPIKGKITISAGGSQPDEHNVT 859


>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 737

 Score =  298 bits (763), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 230/716 (32%), Positives = 341/716 (47%), Gaps = 85/716 (11%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVG 98
           + PF    L +  RV+DL+ R++L+EKV  L S   +VPRLGIKG     E  HGV+  G
Sbjct: 39  SYPFQNADLDMETRVDDLLSRMTLEEKVSAL-STDPSVPRLGIKGAPH-IEGYHGVAMGG 96

Query: 99  PGT--KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAGLTYW 153
           P      G +    T FPQ     +++N  L    G + S EAR ++        GL   
Sbjct: 97  PANWAPKGDERVPTTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEISKGGLVVR 156

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN ++ RDPRWGR +E  GEDP L G  + ++ +GLQG D    + A+  KHF A    
Sbjct: 157 APNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQGDDEKYWRTASLLKHFLA---- 212

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           N N   R   ++    Q   + +   FR  ++EG   + M +YN VNGVP    P + K 
Sbjct: 213 NSNENTRDSSSSNFDTQLFYEYYGATFRRAILEGGSNAYMTAYNAVNGVPAHIHP-MHKE 271

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
                W +NG I +D     +             AA   I+AGL+     FL  + E   
Sbjct: 272 ISMARWGVNGIICTDGGGYTLLVRAHKAYDDYYRAAEGVIKAGLN----QFLDNYREGVW 327

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKDVCTPDHQELA 386
            A+  G L+E D++  L     V ++LG  D +    PY  +G    P    +P+HQE A
Sbjct: 328 GALAHGYLAEEDLDEVLKGVYRVMIKLGQLDPQ-DKVPYASIGRDGKPAPWTSPEHQEAA 386

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
           L+ AR+ +VLLKN+  +LPL+      VAVIG  +D    ++  Y+G+    +TPL GI 
Sbjct: 387 LQMARESVVLLKNEKQTLPLAGDELGKVAVIGHLAD--TILLDWYSGMPPFMSTPLDGIK 444

Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL-------------DQSIEA 493
                  + G   V  A D  + AA++A+ QAD  I+V+G              D  +  
Sbjct: 445 ------EKMGADKVLFAPDNDYNAAVEAASQADVAIVVLGNHPYCDSERWGDCPDPGMGR 498

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           EA+DR  L L     E +++    +   TILVL S  P  + +++ +  + AI+   + G
Sbjct: 499 EAVDRKTLRL---TDEWLAQRVFEANPNTILVLQSSFPYGINWSQEN--LPAIVHITHNG 553

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKG 612
           Q+ GTA+AD+LFG  NPGGKL  TW   E    LP M E  +R        G TY ++ G
Sbjct: 554 QSTGTALADVLFGDYNPGGKLTQTWPKSE--EQLPDMMEYDIR-------KGHTYMYFNG 604

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             +YPFG G+SYT+F                     ++  I+G ++     K N   + V
Sbjct: 605 EPLYPFGFGLSYTSFEW-------------------VDMEITGSSV-----KSNEEEVIV 640

Query: 673 QVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHV 727
            V +KNVG   G   + ++++ P      P K L  F++V +  G  + V I + +
Sbjct: 641 TVKLKNVGQVKGDEVIQLYASFPETSSRRPDKALKGFKRVTLEPGESKNVQIPVKL 696


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  298 bits (762), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 166/403 (41%), Positives = 244/403 (60%), Gaps = 28/403 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  ++ PI +RV DL+ +L+++EKV LL + +  + R+GI  Y   +EALHG+  + PG 
Sbjct: 14  YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGI--IRPG- 70

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
           KF       T FPQ I  AS +N  L   I  V+SDEARA +N    G          LT
Sbjct: 71  KF-------TVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLT 123

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSG    ++V+GLQG     LK  A+ KHF A +
Sbjct: 124 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKAVATPKHFAANN 183

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +RF+ +A +++ D+ + +   F  C+ EGK  S+M +YN +NGVP  A+  +L
Sbjct: 184 EEH----NRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 239

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
            + ++ +W  NGYIVSDC + G+      +  TPE AA  AI+AGLD++CG ++  +   
Sbjct: 240 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFANPLL 299

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +A ++ ++S  +I++A    L  +MRLGMFD +P   PY HL P+ V    H +LALEAA
Sbjct: 300 NAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-DPEKNPYNHLSPEIVGCKKHHDLALEAA 358

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ IVLLKNQ  +LPL+  + +++AV+G N+       G+Y+G
Sbjct: 359 RQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSG 399



 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 146/296 (49%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           ++G A    R++D  I VMG++QSIE E  DR  + LP  QQ  + +   A+   TI+VL
Sbjct: 577 MYGDASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKANPN-TIVVL 635

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AII A YPG+ GGTAIA++LFG  NP G+LP+T+Y    I +
Sbjct: 636 VAGSSMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IED 691

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP    A      K    RTY +++G  +Y FG+G+SYT F +   N      +  D ++
Sbjct: 692 LP----AFDDYNVKN--NRTYMYFEGKPLYAFGYGLSYTKFDYRNLN------IKQDTQN 739

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
            ++N +I                       KN G  +G     V+   P  G   P KQL
Sbjct: 740 VTLNFSI-----------------------KNSGKYNGDEVAQVYVKFPDQGIKTPLKQL 776

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F++VH+  GA +++ I I   +     D+      P G ++  +G +  ++ L 
Sbjct: 777 KGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYHFMVGKSSDNICLQ 832


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 135/308 (43%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  297 bits (761), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 162/433 (37%), Positives = 248/433 (57%), Gaps = 32/433 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R+ +L+  L+ +EK+ +++  + AVPRL I  Y WW+EALHGV+  G  
Sbjct: 45  PFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI- 103

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---------GGTAGLT 151
                    AT FPQ I  A++++         ++SDEARA YN         G   GLT
Sbjct: 104 ---------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGLT 154

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +W+PN+NIFRDPRWGRGQET GEDP L+     + V+GLQG+D    K  A  KHF  + 
Sbjct: 155 FWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKFFKTHACAKHFAVHS 214

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
              WN   R  +NA++SK+D+ +T+   F+  V EG V  VMC+YN  +G P CA+  +L
Sbjct: 215 GPEWN---RHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPCCANNTLL 271

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLHT 329
              +RG+W+ +G +VSDC ++  ++  ++  + P+E   AADA++   DL+CG     + 
Sbjct: 272 TEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECGDTYN-NL 330

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALE 388
             ++  GL++E DI+ ++   L     LGM D  P S  + +  P  V  + +H++ AL+
Sbjct: 331 NKSLASGLITEKDIDESMRRILKGWFELGMLD--PKSSVHWNTIPYSVVDSEEHKKQALK 388

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
            A++ IVL+KN+   LPL+    + +AV+GPN+D  +  +GNY G      T L GI   
Sbjct: 389 MAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNGTPSSIVTILDGIKTK 447

Query: 449 ---ARTIHQQGCK 458
              A  I+++G +
Sbjct: 448 FPNAEIIYEKGSE 460



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 78/311 (25%), Positives = 138/311 (44%), Gaps = 39/311 (12%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F +  +  + AD  +   GL  S+E E +          D+  + LP  Q++L++++   
Sbjct: 613 FASVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKT 672

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P + VL +G  +     +++    A++ A Y GQ+GGTA+AD+L G  NP GKLP+T
Sbjct: 673 GK-PVVFVLCTGSAL--GLEQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPIT 729

Query: 578 WYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y   E + N           ++    GRTYR+     +YPFGHG+SY+ FV+       
Sbjct: 730 FYKNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYG------ 783

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        ++ +S  +I V         + + + V N+  ++G   + V+     
Sbjct: 784 -------------DSKLSKNSISVNE------NVTITIPVTNISEREGEEVVQVYIKRNN 824

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
              AP K L AFE+  + +   + + + +    +    +++       G++ I  GGT  
Sbjct: 825 DAQAPVKTLRAFERTPIKSKETKNIQLILSKDSFAFYDEKADDLVSKPGDYTIFYGGTSD 884

Query: 757 SVSLHAATLGV 767
              L    L V
Sbjct: 885 DAGLKNVLLKV 895


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  297 bits (761), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q+ L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT F +  A       
Sbjct: 708 N--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 170/410 (41%), Positives = 237/410 (57%), Gaps = 32/410 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S  EKV   ++ A A+PRL I  YEWWSE LHG++  G           A
Sbjct: 24  QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG----------YA 73

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 74  TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 133

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 134 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHIAVH-----SGPEPG 188

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  +++G+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 189 RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRVRGDW 248

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A++RG +
Sbjct: 249 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIERGEV 306

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 307 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 365

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+ ++
Sbjct: 366 NTATTLPLKA--GTRLAVIGPNADALAALEANYQGTSATPITPLLGLRQH 413



 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/287 (34%), Positives = 147/287 (51%), Gaps = 48/287 (16%)

Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           Q+DA +  +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VL
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLERAKASGK-PLVVVL 660

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           MSG  + + +AK +    AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +
Sbjct: 661 MSGSAVALNWAKAN--ADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKD 716

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP          S    GRTYR++KG  ++PFG+G+SYT+F +   +AP +         
Sbjct: 717 LPAY-------VSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---DAPRL--------- 757

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S     +G  ++VT              V+N GS+ G     V+   P    +P + LV
Sbjct: 758 -STRTLQAGNPLQVT------------TTVRNTGSRAGDEVAQVYLQYPDRPQSPLRSLV 804

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            F++VH+  G Q+ +   +   + LS VDRSG R +  GE+ + +GG
Sbjct: 805 GFQRVHLKPGEQRELTFTLD-ARALSDVDRSGQRAVEAGEYRVFVGG 850


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q+ L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG  DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQCDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 881

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 160/419 (38%), Positives = 237/419 (56%), Gaps = 32/419 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R  DL+ R++++EKV  L + + AVPRL +  Y+WWSEALHGV+      
Sbjct: 30  YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQ----- 84

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                 PG T +PQ +  A++F+    + + R +  E R  +  G          GL +W
Sbjct: 85  ------PGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRGQET GEDP L+ +   +YV+GLQG D       ++ KH+  +   
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDDPKYYLAISTPKHYAVH--- 195

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +     R   + KVSK D  DT+   FR  V E K  SVMC+YN +NG P C +  +L+ 
Sbjct: 196 SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVNEFLLQD 255

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
            +RG+W   GY+VSDC+++   Y    FT T  EA+A A++ G+D +C  F     +   
Sbjct: 256 QLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDFGKQKDDHDY 315

Query: 331 ----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
                A ++G+L E +I+ ALV   T +M+LGMFD  P   PY  + PK++ + +H+ELA
Sbjct: 316 RPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESAEHRELA 374

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
              A + +VLLKN G +LPL     + +AVIGP ++ T  ++GNY G      + L+G+
Sbjct: 375 RTLANESMVLLKNDG-TLPLKKSGLK-IAVIGPLAEQTRYLLGNYNGTPSHTVSVLEGL 431



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 147/295 (49%), Gaps = 50/295 (16%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           AA+ A++ AD  I V+G+   +E E +          DR  L LP  +Q+L+  +S A K
Sbjct: 602 AAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAGK 661

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVL +G  + V +A+      AI+   YPG+ GGTAIA  L G +NP G+LP+T+Y
Sbjct: 662 -PVVLVLSNGSALSVNWAQQ--HANAILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTFY 718

Query: 580 P-QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
              E +   P  + AM+        GRTYR+++G  +YPFG+G+SYT F +     P   
Sbjct: 719 TGTEQLP--PFEDYAMK--------GRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPK-- 766

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
             PL+          +G  +              QV V N G  +G     ++ + P   
Sbjct: 767 -APLN----------AGDPVT------------AQVTVTNTGKVEGDEVAQLYLSFPNIA 803

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            AP + L  F ++H+ AG  Q +   +   + LS+V+ +G   I  GE+++ +GG
Sbjct: 804 GAPLRALRGFRRIHLKAGESQTIKFELK-DRDLSMVNEAGDPIIAEGEYSVSVGG 857


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 175/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+ADA+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGVNFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWW+E LHG++  G           A
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YA 87

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 88  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAYRALGT--AIERGEV 320

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG     P    Y  LG KD+    ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 380 NANATLPLKASTR--LAVIGPNADALAALEANYQGTSSQPVTPLLGL 424



 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/287 (33%), Positives = 145/287 (50%), Gaps = 48/287 (16%)

Query: 477 QADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           Q+DA +  +GL   +E E L          DR  + LP  QQ L+ +   + K P ++VL
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVL 674

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           MSG  + + +AK      AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +
Sbjct: 675 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKD 730

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP       P  S    GRTYR++KG  ++PFG+G+SYT F +     P + A  L    
Sbjct: 731 LP-------PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAY---ETPRLSATTLQ--- 777

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
                  +G  ++VT              V+N G + G     V+   P    +P + LV
Sbjct: 778 -------AGSPLQVT------------TTVRNTGERAGDEVAQVYLQYPERPQSPLRSLV 818

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            F++VH+  G Q+ +   +   + LS VDR+GTR +  G++ + +GG
Sbjct: 819 GFQRVHLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 88  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A +++AG DL+CG  +  L T  A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG     P    Y  LG KD+    ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424



 Score =  139 bits (349), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 143/278 (51%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +LP       
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT+F +   +AP + +  L           +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAY---DAPQLSSTTLQ----------AG 779

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G++ G     V+   P    +P + LV F++VH+  
Sbjct: 780 SPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +   +   + LS VDR+GTR +  G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 88  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A +++AG DL+CG  +  L T  A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG     P    Y  LG KD+    ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 92/278 (33%), Positives = 137/278 (49%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +LP       
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT F +                  S+    +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAYETPRL-------------SVTTLQAG 779

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G + G     V+   P    +P + LV F++VH+  
Sbjct: 780 SPLQVT------------TTVRNTGERAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +   +   + LS VDR+GTR +  G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRVVEAGDYRLFVGG 864


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 181/466 (38%), Positives = 262/466 (56%), Gaps = 51/466 (10%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            FF+  +L  + S+G                +  PF    L    RV DL+ RL+++EK+
Sbjct: 10  LFFIAFVLSMSISNG----------------QQYPFQNPELDDSARVADLLERLTVEEKI 53

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             L+  + A+ RLGI  Y WW+E+LHGV+  G           AT FPQ IT A+++++ 
Sbjct: 54  DQLLYTSPAIERLGIPEYNWWNESLHGVARAG----------YATVFPQSITIAAAWDSD 103

Query: 127 LWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
           L + +   +SDEARA ++        G   GLT+WSPN+NIFRDPRWGRG ET GEDP L
Sbjct: 104 LLKEVADAISDEARAKHHEYIRRGQRGIYQGLTFWSPNINIFRDPRWGRGHETYGEDPYL 163

Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTF 236
           +G+   +YV+GLQG+D + LK+ A+ KHF  +     +G +  R  F+   SK+D+ +T+
Sbjct: 164 TGQLGIAYVKGLQGNDPNYLKLVATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETY 218

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
              FR  V +G V SVM +YN+V G    A  + L   +R  W  +GY+VSDC ++   +
Sbjct: 219 LPAFRYLVKQGDVKSVMTAYNRVYGEAASAS-DTLFTILRDYWDFDGYVVSDCFAISDIW 277

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMR 356
                     EA+A A+  G DL+CG         A Q+G+++E DI+ AL   +  +++
Sbjct: 278 KYHKIAKDAAEASAMAVIEGCDLNCGDSYE-KLNQAYQQGMVTEKDIDIALSRLMEARIK 336

Query: 357 LGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVA 415
           LGMFD E    PY  + P +V T + H +LAL+AA++ IVLLKNQG  LPLS    ++VA
Sbjct: 337 LGMFDPE-QLVPYAQI-PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPLSK-DLKSVA 393

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
           VIGPN+D   ++ GNY G      T LQGI    G     ++Q+G 
Sbjct: 394 VIGPNADNIQSLWGNYNGNPKDPITVLQGIQNALGPQTTVVYQEGS 439



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 144/284 (50%), Gaps = 50/284 (17%)

Query: 483 LVMGLDQSIEAEALD----------RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
           +V+GL++ +E E +D          R  L LP  Q+ L+ +V+   K P +LVL++G  +
Sbjct: 607 MVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSAL 665

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
            + +A  +  I AI+ AGY GQ GG A+A++LFG  NP  +LP+T+Y    + +LP  E 
Sbjct: 666 SINWAAEN--IPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS--VEDLPDFE- 720

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
                      GRTYR+++   +YPFG+G+SYT F ++    P+ +          +N +
Sbjct: 721 ------DYNMDGRTYRYFEKEPLYPFGYGLSYTTFDYSKFQLPSKI---------DMNES 765

Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKV 711
           I                  + V+V N G+ DG   + V+ T   G    P ++LV F+++
Sbjct: 766 IE-----------------LSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRI 808

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           H+  G  Q+V   I   + LS++D  G   I  G  +I +GG +
Sbjct: 809 HLKKGESQKVQFTIE-PRQLSMIDDKGDLVIEPGVFSISVGGEQ 851


>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
           tundricola MP5ACTX9]
          Length = 892

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 169/436 (38%), Positives = 251/436 (57%), Gaps = 33/436 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           + LP+   +L   QRV+DL+ R++L+EKV   I+ A A+ RL +  Y++WSE LHG++  
Sbjct: 30  KPLPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARS 89

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
           G           AT FPQ I  A++++A L + IG V+S EARA +N            G
Sbjct: 90  G----------YATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHSIYYG 139

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           LT WSPN+NIFRDPRWGRGQET GEDP L+G+   ++V+G+QG D +  +  A+ KHF  
Sbjct: 140 LTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQGPDPNYFRAIATPKHFAV 199

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
           +   +     R   N + +  D+ DT+   FR  + E    S+MC+YN V G P CA   
Sbjct: 200 H---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPACASKL 256

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDT----QHFTSTPEEAAADAIRAGLDLDCGPFL 325
           +L+ T+R +W   G++ SDC ++  +Y T     H +   E AAA  I+AG D +CG   
Sbjct: 257 LLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAGIKAGTDSNCGQTY 316

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
            L   SAV++GL++E +I+ AL +  T + +LG+FD  P+++  +  +   +V +P HQ 
Sbjct: 317 -LTLGSAVKKGLVTEAEIDTALKHLFTARFQLGLFD--PAAKVAFNAIPFSEVNSPAHQA 373

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LAL+AA + IVLLKN   +LP      RT+AVIGP++     + GNY  I      PL G
Sbjct: 374 LALKAAEESIVLLKNDAHTLPFKP-SVRTIAVIGPSAATLNNLEGNYNAIPLHPVLPLDG 432

Query: 445 I---GRYARTIHQQGC 457
           I    + ++ ++ QG 
Sbjct: 433 ILTQFKSSKVLYAQGS 448



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 136/281 (48%), Gaps = 48/281 (17%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL   +E E +          DR  + LP  QQ+++  V+   K P ++VL++G  + V
Sbjct: 630 LGLSPELEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAV 688

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM 594
            +A +    AAI+ A YPGQAGGTAIA+ L G +NP G+LP+T+Y    I  +P  +   
Sbjct: 689 NWAND--HAAAILEAWYPGQAGGTAIAETLAGKNNPAGRLPVTFYSS--IDQIPAFD--- 741

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
                     RTYR+ K   ++ FG+G+SYT F ++                   N  +S
Sbjct: 742 ----DYSMANRTYRYSKAKPLFEFGYGLSYTTFTYS-------------------NIKLS 778

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
            + +   HA      L V+ DV+N G   G     ++ TPP    +P + L AF +VH+ 
Sbjct: 779 TQTL---HAGD---PLTVEADVRNTGRVAGDEVAELYLTPPHTAVSPQRALSAFTRVHLA 832

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            G  + V   +   + LS VD  G R +  G + + + GT+
Sbjct: 833 PGELRHVTFTLD-PRTLSQVDEKGARAVTPGNYTLSVSGTQ 872


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 168/407 (41%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S +EKV   ++ A A+PRLGI  YEWWSE LHG++  G           A
Sbjct: 38  QRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YA 87

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 88  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 147

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 148 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHIAVH-----SGPEPG 202

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  ++EG+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 203 RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDW 262

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A +++AG DL+CG  +  L T  A++RG +
Sbjct: 263 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAYRALGT--AIERGEV 320

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG     P    Y  LG KD+    ++ LAL+AA + IVLLK
Sbjct: 321 DEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLK 379

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N   +LPL       +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 380 NANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424



 Score =  139 bits (349), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 143/278 (51%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  + LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 683

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +LP       
Sbjct: 684 WAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFY--RSTKDLP------- 732

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT+F +   +AP + +  L           +G
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAY---DAPQLSSTTLQ----------AG 779

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
             ++VT              V+N G++ G     V+   P    +P + LV F++VH+  
Sbjct: 780 SPLQVT------------TTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQP 827

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G Q+ +   +   + LS VDR+GTR +  G++ + +GG
Sbjct: 828 GEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVGG 864


>gi|62321294|dbj|BAD94522.1| beta-xylosidase - like protein [Arabidopsis thaliana]
          Length = 287

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 137/287 (47%), Positives = 192/287 (66%), Gaps = 7/287 (2%)

Query: 479 DATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
           D+ +LV+G DQSIE E  DR  L LPG+QQELV++V+MA++GP +LV+MSGG  D+ FAK
Sbjct: 1   DSVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAK 60

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
           ND +I +I+W GYPG+AGG AI D++FG  NP G LPMTWYPQ Y+  +PM+ M MRP +
Sbjct: 61  NDKKITSIMWVGYPGEAGGLAIPDVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDK 120

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
           SK YPGR+YRFY G  VY F   ++YT F H +  AP +V++ LD  H   ++    +++
Sbjct: 121 SKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSEC--QSL 178

Query: 659 KVTHAKCNRLTLG-----VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHV 713
                 C     G     V ++VKN G + G+HT+ +F+T P  H +P KQL+ FEK+ +
Sbjct: 179 DAIGPHCENAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRL 238

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
               +  V  N++VCK LSVVD +G R+I LG H +H+G  KHS+++
Sbjct: 239 GKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 285


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 165/422 (39%), Positives = 239/422 (56%), Gaps = 33/422 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   +    QR  DL+ R++L+EK   + + A A+PRL +  Y+WW+EALHGV+  G  
Sbjct: 36  PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                   GAT FPQ I  A++F+  L   +   +SDEARA ++            GLT+
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--------SDGDRL-KVAAS 203
           WSPN+NIFRDPRWGRGQET GEDP L+ +   ++V+GLQG        + G+   K+ A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDAT 205

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF  +        +R HF+A+ S++D+ +T+   F   V +GKV +VM +YN+V G  
Sbjct: 206 AKHFAVHSGPE---AERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
             A   +L+  +R +W   GY+VSDC ++   +      +T E+AAA A+  G +L+CG 
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGE 322

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
                  +AV +GL+ E  I+ AL   +T +MRLGMFD  P   P+  +      +P H 
Sbjct: 323 EYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 380

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA   AR+ +VLLKN G  LPLS    + +AVIGP +D T+ ++GNY G      T LQ
Sbjct: 381 ALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439

Query: 444 GI 445
           GI
Sbjct: 440 GI 441



 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/301 (30%), Positives = 140/301 (46%), Gaps = 51/301 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q+EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 690 PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 746

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           +E  T     +  M         GRTYR++ G  +YPFGHG+SYT F ++          
Sbjct: 747 KESETLPAFDDYTMH--------GRTYRYFGGTPLYPFGHGLSYTQFAYS---------- 788

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PAGH 698
             D R      T  G              L   V VKN G + G   + ++  P  P   
Sbjct: 789 --DLRLDRSTLTADG-------------ALTATVAVKNTGQRAGDEVVQLYLHPLKPQRE 833

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHS 757
            A  K+L  F+++ +  G Q+ +   I+    L + D +     +  G + + IG +   
Sbjct: 834 RA-GKELRGFQRLALQPGEQRELRFTINATDALRIYDAQRKAYTVDPGAYEVQIGASSAD 892

Query: 758 V 758
           +
Sbjct: 893 I 893


>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1343

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 230/733 (31%), Positives = 339/733 (46%), Gaps = 112/733 (15%)

Query: 51  QRVNDLIGRLSLQEKV-KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
           +R  DL+ R++L EK  +L  + A A+PRLG++ Y +WSE  HGV+ +G  +  G D  G
Sbjct: 58  ERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGADSNRG-DVTG 116

Query: 110 ---ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------------LT 151
              ATSFP       S++  L       VSDE R   +    G               LT
Sbjct: 117 GVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGPSASDYGALT 176

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDR----LKVAASCK 205
           +W+PNVN+ RDP WGR  E+ GEDP L+   A ++V G QG    G +    LKVAA+ K
Sbjct: 177 FWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMTGQQQTPYLKVAATAK 236

Query: 206 HFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           H++  ++++     R   ++  +  +I D +   F   V +  V+ +M SYN VNG P+ 
Sbjct: 237 HYSLNNIED----SRHTGSSDTTDANIRDYYTKQFASLVRDAHVSGIMTSYNAVNGTPSP 292

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYD------------------TQHFTSTPEE 307
           AD   +   ++  +   GY  SDC ++G  Y                   T + T     
Sbjct: 293 ADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPGWTSNGTSWTNNATGRQIS 352

Query: 308 AAAD----AIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD 361
           AAA     AIRAG  L+C  G     +  +A+  GLLS   ++  L    TV+M  G FD
Sbjct: 353 AAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVVDATLTRLFTVRMETGEFD 412

Query: 362 GEPSSQPYGHLG----PKD-VCTPDHQELALEAARQGIVLLKNQGPS------LPLSHIR 410
                 P G +G     KD + +P HQ LA + A   IVLL+N   S      LP+   +
Sbjct: 413 ------PAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQNGAVSGTSAKLLPVDPAK 466

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDV-ACADDQLF- 468
             +V ++G  ++     +G Y+G        +QGI    +  +        AC       
Sbjct: 467 TDSVVIVGDLANKVT--LGGYSGEPTHEVNAVQGITAAVQAANPSATVTFDACGTGTQIT 524

Query: 469 -----GAAIDAS-RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
                 AA  A+ + A   ++V G D S+  EA DR+ L LPG    L+S+VS      T
Sbjct: 525 TPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYDSLISQVSALGNPRT 584

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
            LV+ + GP D+  A+ D    AI+++GY GQ+ GTA+A +LFG  NP G L  TWY  +
Sbjct: 585 ALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQNPAGHLDFTWYSGD 642

Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
                PM    + PSQ+    GRTY+++ G   YPFG+G SY++F ++            
Sbjct: 643 SQL-APMDNYGLTPSQTGGL-GRTYQYFTGTPTYPFGYGQSYSSFAYSHVQV-------- 692

Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWA 700
               G  N    G             T+ V  DVKN G+  G     +++ PP    +  
Sbjct: 693 ----GPQNTNADG-------------TVHVSFDVKNTGTVAGTTVAQLYAAPPGAGTNDT 735

Query: 701 PHKQLVAFEKVHV 713
             +QL  F+K + 
Sbjct: 736 TREQLAGFQKTNT 748


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  295 bits (755), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 170/432 (39%), Positives = 247/432 (57%), Gaps = 32/432 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   +L   +R NDL+GRL+L+EK  L+ + + A+PRLGIK Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
                    AT FPQ I   +SFN  L   +   VSDEARA        GG     GLT 
Sbjct: 84  ---------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
           W+PN+NIFRDPRWGRGQET GEDP L+G+   + VRGLQG +G++  K+ A  KH+  + 
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 194

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 195 GPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
           L + +R EW     +VSDC ++  +Y+     + P++  A+A A+ +G D++CG      
Sbjct: 252 LMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVLSGTDVECGDSYASL 311

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            E AV+ GL+ E  I+ +L   +  +  LG  D EPS   +  +    V + +H+ELAL 
Sbjct: 312 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
            AR+ +VLL+N    LPL+  ++  VAV+GPN++ +V   GNY G      T L+GI  Y
Sbjct: 370 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 427

Query: 449 ---ARTIHQQGC 457
              ++ I++ GC
Sbjct: 428 LPESQIIYEPGC 439



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 135/302 (44%), Gaps = 49/302 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           +D  ++AD  I   G+  ++E E +          DR  + LP  Q  L++++  A K  
Sbjct: 594 VDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGGTAIA++LFG  NP G+LP+T+Y  
Sbjct: 653 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710

Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
                LP   + +M+        GRTYR+     ++PFGHG+SYT F +           
Sbjct: 711 --TKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQYG---------- 750

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                    NA+++   IK       ++TL   + V N G  DG   + V+   P     
Sbjct: 751 ---------NASLNTSEIK----DGEQVTL--TIPVSNTGKYDGEEVVQVYLRHPGDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P   L AF++V +  GA   V I +    +      + T R   G++ I  GGT     L
Sbjct: 796 PSHALRAFKRVAIAKGATNNVTIPLSKENFEWFDTSTNTMRPIEGDYEILYGGTSELKQL 855

Query: 761 HA 762
            +
Sbjct: 856 KS 857


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 170/432 (39%), Positives = 247/432 (57%), Gaps = 32/432 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   +L   +R NDL+GRL+L+EK  L+ + + A+PRLGIK Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
                    AT FPQ I   +SFN  L   +   VSDEARA        GG     GLT 
Sbjct: 84  ---------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
           W+PN+NIFRDPRWGRGQET GEDP L+G+   + VRGLQG +G++  K+ A  KH+  + 
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 194

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 195 GPEWN---RHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
           L + +R EW     +VSDC ++  +Y+     + P++  A+A A+ +G D++CG      
Sbjct: 252 LMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDVECGDSYASL 311

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            E AV+ GL+ E  I+ +L   +  +  LG  D EPS   +  +    V + +H+ELAL 
Sbjct: 312 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
            AR+ +VLL+N    LPL+  ++  VAV+GPN++ +V   GNY G      T L+GI  Y
Sbjct: 370 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 427

Query: 449 ---ARTIHQQGC 457
              ++ I++ GC
Sbjct: 428 LPESQIIYEPGC 439



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 135/302 (44%), Gaps = 49/302 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           +D  ++AD  I   G+  ++E E +          DR  + LP  Q  L++++  A K  
Sbjct: 594 VDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGGTAIA++LFG  NP G+LP+T+Y  
Sbjct: 653 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710

Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
                LP   + +M+        GRTYR+     ++PFGHG+SYT F +           
Sbjct: 711 --TKQLPDFEDYSMK--------GRTYRYMTENPLFPFGHGLSYTTFQYG---------- 750

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                    NA+++   IK       ++TL   + V N G  DG   + V+   P     
Sbjct: 751 ---------NASLNTSEIK----DGEQVTL--TIPVSNTGKYDGEEVVQVYLRHPGDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P   L AF++V +  GA   V I +    +      + T R   G++ I  GGT     L
Sbjct: 796 PSHALRAFKRVAIAKGATNNVTIPLSKENFEWFDTSTNTMRPIEGDYEILYGGTSELKQL 855

Query: 761 HA 762
            +
Sbjct: 856 KS 857


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 216/690 (31%), Positives = 349/690 (50%), Gaps = 98/690 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   I  V+  + R +  G    L   SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG +    ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++G+P   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  + E   +A++ GL+ E  I+ 
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+ +         DH+  ELAL+ AR+ IVLLKN+   
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQGI +    ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
            +GC D+A    + F  AI+ +RQAD  I +MG         +D          Q++  E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L LPG Q+EL+ ++    K P ILVL++G P+ ++   N   + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIAD++FG  NPGG+LP+T+         PM +    P    R P   R Y   + 
Sbjct: 557 EGGNAIADVIFGDYNPGGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             ++ FG+G+SYT F ++                   N  ++ K I           + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            +DVKNVG  +G   + ++ +      A P K+L  F K+H+  G ++RV   I   + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  D      +  GE+ + IG +  ++ L 
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 167/426 (39%), Positives = 241/426 (56%), Gaps = 36/426 (8%)

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           ++L+EK   + + A A+PRLG+  Y+WW+EALHGV+  G          GAT FPQ I  
Sbjct: 1   MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50

Query: 120 ASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQET 171
           A++F+  L   +   +SDEARA ++            GLT+WSPN+NIFRDPRWGRGQET
Sbjct: 51  AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110

Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVA---------ASCKHFTAYDLDNWNGVDRFH 222
            GEDP L+ +   ++V+GLQG   D  K A         A+ KHF  +        DR H
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATAKHFAVHSGPE---ADRHH 167

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+A+ S++D+ +T+   F   V +GKV +VM +YN+V G    A   +L+  +R +W   
Sbjct: 168 FDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFK 227

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           GY+VSDC ++   +      +T E+AAA A++ G +L+CG        +AV++GL+ E  
Sbjct: 228 GYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAVRQGLIDEAQ 286

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           I+ AL   +T +MRLGMFD  P   P+  +      +P H  LA   AR+ +VLLKN G 
Sbjct: 287 IDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKNDG- 344

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---GRYARTIHQQGCKD 459
            LPLS  + + +AVIGP +D T+ ++GNY G      T LQGI      A+ ++ +G   
Sbjct: 345 LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADL 404

Query: 460 VACADD 465
           V   DD
Sbjct: 405 VEGRDD 410



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 146/303 (48%), Gaps = 55/303 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+D +R AD  + V GL   +E E +          DR  L LP  Q++L+  +    K 
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P + VL +G  + + +A+    + AI+ A YPGQ GGTA+AD LFG +NPGG+LP+T+Y 
Sbjct: 636 PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFY- 692

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT--VANAPTVV 638
           +E  T     + AMR        GRTYR++ G  +YPFGHG+SYT F ++    +  T+ 
Sbjct: 693 KESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTTIA 744

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--PA 696
           A       GS+ AT++                     VKN G + G   + ++  P  P 
Sbjct: 745 A------DGSLTATVT---------------------VKNTGQRAGDEVVQLYLHPLTPQ 777

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTK 755
              A  K+L  F+++ +  G Q+ +   +     L + D +     +  G + + IG + 
Sbjct: 778 RERA-GKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIGASS 836

Query: 756 HSV 758
             +
Sbjct: 837 ADI 839


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 215/690 (31%), Positives = 347/690 (50%), Gaps = 98/690 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   I  V+  + R +  G    L   SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG +    ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++G+P   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  ++E   +A+  GL+ E  I+ 
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPLVNALTEGLVPESLIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+ +         DH+  ELAL+ AR+ IVLLKN+   
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQG+ +    ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
            +GC D+A    + F  AI+ +RQAD  I VMG                     Q++  E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L LPG Q+EL+ ++    K P ILVL++G P+ ++   N   + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIAD++FG  NPGG+LP+T+         PM +    P    R P   R Y   + 
Sbjct: 557 EGGNAIADVIFGDYNPGGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             ++ FG+G+SYT F ++                   N  ++ K I           + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            +DVKNVG  +G   + ++ +      A P K+L  F K+H+  G ++RV   I   + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  D      +  GE+ + IG +  ++ L 
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 161/328 (49%), Positives = 203/328 (61%), Gaps = 30/328 (9%)

Query: 133  RVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG 192
            + VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q 
Sbjct: 916  QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975

Query: 193  SDGDR------------LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPF 240
              G              LK +A CKH+ AYDLD+W+   RF F+A+V ++D+ +TF  PF
Sbjct: 976  IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035

Query: 241  RMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQH 300
             MCV +G V+SVMCSYN+VNG+P CAD  +L +TIR +W L+GYIVSDCD+V V  D   
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095

Query: 301  FTS-TPEEAAADAIRAGLDLDCGP-------------FLGLHTESAVQRGLLSEIDINNA 346
            +   T  EA+A A++AGLDLDCG              FL  +   AV +G + E DI+NA
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155

Query: 347  LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
            L N     MRLG FD       Y  LG +D+CT  H+ LAL+ ARQGIVLLKN    LPL
Sbjct: 1156 LTNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPL 1212

Query: 407  SHIRHRTVAVIGPNSDV-TVTMIGNYAG 433
               +   V V GP+       M G+Y G
Sbjct: 1213 DANKVGFVNVRGPHVQAPEKIMDGDYTG 1240


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 169/432 (39%), Positives = 246/432 (56%), Gaps = 32/432 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   +L   +R NDL+GRL+L+EK  L+ + + A+PRLGIK Y+WW+EALHGV   G  
Sbjct: 41  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 99

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
                    AT FPQ I   +SFN  L   +   +SDEARA        GG     GLT 
Sbjct: 100 ---------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKRYQGLTM 150

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
           W+PN+NIFRDPRWGRGQET GEDP L+ +   + VRGLQG +G++  K+ A  KH+  + 
Sbjct: 151 WTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYDKLHACAKHYAVHS 210

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 211 GPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 267

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCGPFLGLH 328
           L   +R EW     +VSDC ++  +Y+     + P++  A+A A+ +G D++CG   G  
Sbjct: 268 LMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDIECGDSYGSL 327

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
            E AV+ GL+ E  I+ +L   +  +  LG  D EPS   +  +    V + +H+ELAL 
Sbjct: 328 PE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALR 385

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
            AR+ +VLL+N    LPL+  ++  VAV+GPN++ +V   GNY G      T L+GI  Y
Sbjct: 386 MARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREY 443

Query: 449 ---ARTIHQQGC 457
              ++ I++ GC
Sbjct: 444 LPESQIIYEPGC 455



 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 137/302 (45%), Gaps = 49/302 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  ++AD  I   G+  ++E E +          DR  + LP  Q  L++++  A K  
Sbjct: 610 VNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 668

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGGTAIA++LFG  NP G+LP+T+Y  
Sbjct: 669 IVFVNFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 726

Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              + LP   + +M+         RTYR+     ++PFGHG+SYT F             
Sbjct: 727 --TSQLPGFEDYSMKE--------RTYRYMTEAPLFPFGHGLSYTTF------------- 763

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
               R+G  +A+++ + +K            + + V NVG  DG   + V+   P     
Sbjct: 764 ----RYG--DASLNTQEVKDGEQTI------LTIPVSNVGEYDGEEVVQVYLRRPGDKEG 811

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P   L AF++ ++  GA   V +++    +      + T R   G++ I  GGT     L
Sbjct: 812 PSHALRAFKRANIAKGATSNVTVSLSKEDFEWFDTETNTMRPIEGDYEILYGGTSELKQL 871

Query: 761 HA 762
            A
Sbjct: 872 KA 873


>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 916

 Score =  293 bits (749), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 175/445 (39%), Positives = 248/445 (55%), Gaps = 37/445 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   SL   +R   L+ R++L+EK   + + + A+ RLG+  Y+WW+EALHGV+  G  
Sbjct: 49  PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                   GAT FPQ I  A+SF+  L + +   +SDEARA ++        G   GLT+
Sbjct: 107 --------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTF 158

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD---GDRL-----KVAASC 204
           WSPN+NIFRDPRWGRGQET GEDP L+ +   S+VRGLQG D   G  L     K+ A+ 
Sbjct: 159 WSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDPKYRKLDATA 218

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF  +   +    DR  F+   SKQD+ DT+   F   V E  V +VM +YN+V G   
Sbjct: 219 KHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
                +L  T+R +W  +GY++SDC ++   +       TPEEAAA A++ G +L+CG  
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKNGTELNCGST 335

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQ 383
              H   AV++GL+SE ++++AL      +M LGMFD  P  Q      P  V  + +H 
Sbjct: 336 YADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD--PPEQVRWAQVPYSVNQSAEHD 393

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            LA + A++ +VLLKN G  LPLS    R +AV+GP +D T+ ++GNY G      T L+
Sbjct: 394 ALARKMAQESLVLLKNDG-VLPLSKDIRR-LAVVGPTADDTMALLGNYYGTPADPVTILR 451

Query: 444 GIGRYA---RTIHQQGCKDVACADD 465
           GI   A     ++ +G   V   DD
Sbjct: 452 GIREAAPGVDVVYARGVDLVEGRDD 476



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 154/314 (49%), Gaps = 55/314 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F  A++A+  ADA + V GL   +E E +          DR  + LP  QQ+L+  V   
Sbjct: 640 FETALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHAT 699

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P ++VL +G  + + +A+ +  +  I+ A YPGQ GGTA+ + LFG  NPGG+LP+T
Sbjct: 700 GK-PVVMVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVT 756

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y  +     P  + AM+         RTYR++ G  ++PFGHG+SYT+F ++       
Sbjct: 757 FYSADEKLP-PFDDYAMKE--------RTYRYFTGQPLFPFGHGLSYTSFGYS------- 800

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP--P 695
             + LD +        +G   +VT          V V VKN G + G   + ++  P  P
Sbjct: 801 -GLKLDRKR-------AGAGDEVT----------VSVTVKNQGKRAGDEVVQLYLAPVKP 842

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR-RIPLGEHNIHIGGT 754
               A  K+L  F++VH+  G  + V  +I   + L V D +  R  +  G + + +G +
Sbjct: 843 QRERA-LKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDPGRYEVQVGAS 901

Query: 755 ----KHSVSLHAAT 764
               + SV L  AT
Sbjct: 902 SADIRASVPLEVAT 915


>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
 gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
          Length = 882

 Score =  293 bits (749), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 166/407 (40%), Positives = 236/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q    L+ +++LQEK+   ++ A A+PRLGI  Y+WWSE LHG++  G           A
Sbjct: 32  QHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G V S EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 82  TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFR 141

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A S++RGLQG+  D  +  A+ KHF  +     +G +  
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGNIPDHPRTIATPKHFAVH-----SGPEPG 196

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  +++G   SVMC+YN ++G P CA   +L   +R +W
Sbjct: 197 RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDW 256

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
             NG++VSDCD++        F      A+A A+++G DL+CG  +  L+   A+ RG +
Sbjct: 257 GFNGFVVSDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGNTYRDLN--QAIARGDI 314

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ AL+     + RLG         PY  +G K + TP H+ LAL+AA Q +VLLK
Sbjct: 315 DEALLDQALIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRALALQAAVQSLVLLK 373

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N G +LPL+     T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 374 NSGNTLPLT--PGTTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418



 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 94/290 (32%), Positives = 139/290 (47%), Gaps = 48/290 (16%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A   ADA +  +GL   +E E L          DR  + LP  Q+ L+  V    K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +VLMSG  + + +A++     AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y    
Sbjct: 666 VVLMSGSAVALNWAQH--HANAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
             +LP       P  S    GRTYR++KG  +YPFG+G+SYT F +      T      D
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFTYEAPQLSTATLKAGD 774

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
                                    TL V   V+N G++ G   + ++  PP    AP +
Sbjct: 775 -------------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQAPLR 809

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            LV F++V +  G  + +   +   + LS V ++G R +  G +++ +GG
Sbjct: 810 NLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGG 858


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  293 bits (749), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 215/690 (31%), Positives = 347/690 (50%), Gaps = 98/690 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   I  V+  +AR +  G    L   SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV--GVNQCL---SPVLDVCKDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG +    ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++G+P   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  ++E   +A+  GL+ E  I+ 
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPLVNALTEGLVPESLIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+ +         DH+  ELAL+ AR+ IVLLKN+   
Sbjct: 327 AVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQG+ +    ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
            +GC D+A    + F  AI+ +RQAD  I VMG                     Q++  E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L LPG Q+EL+ ++    K P ILVL++G P+ ++   N   + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIAD++FG  NP G+LP+T+         PM +    P    R P   R Y   + 
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             ++ FG+G+SYT F ++                   N  ++ K I           + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            +DVKNVG  +G   + ++ +      A P K+L  F K+H+  G ++RV   I   + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  D      +  GE+ + IG +  ++ L 
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730


>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 906

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 178/463 (38%), Positives = 258/463 (55%), Gaps = 37/463 (7%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           +LG+ LL    +G+        + +D T     F  +     +RV+ L+ ++SL+EKV  
Sbjct: 25  YLGMGLLIYLMAGI-----MRVNAQDGTL-DFSFLDMEKNFEERVDILVDQMSLEEKVSQ 78

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
           +++ + A+PRL +  Y WW+E LHGV+  G           AT FPQ I+ A+SF+  L 
Sbjct: 79  MMNASPAIPRLKVPEYNWWNECLHGVARAGY----------ATVFPQSISVAASFDKNLM 128

Query: 129 EAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSG 180
           + IG V+SDEARA ++        G   GL +WSPN+NIFRDPRWGRG ET GEDP L+G
Sbjct: 129 KDIGSVISDEARAKHHEFIRNGKRGIYTGLDFWSPNINIFRDPRWGRGHETYGEDPYLTG 188

Query: 181 KYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDV 238
           + A+ ++ GLQ SDG  LK  A+ KHF  +     +G +  R  F+  VS +D+ +T+  
Sbjct: 189 ELASQFIEGLQDSDGKYLKTIATSKHFAVH-----SGPEPLRHTFDVDVSDRDLYETYLP 243

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
            FR  V E KV S+M +YN+  G        +L + +R +W   GY+VSDC ++   +  
Sbjct: 244 AFRKTVKEAKVYSIMGAYNRFRGESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTG 303

Query: 299 QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG 358
               ST  EAAA  +  G DL+CG +   H   AV  GL+SE +I+ A+      + RLG
Sbjct: 304 HKIASTAAEAAAIGVSGGCDLNCGNYY-THLTEAVAEGLISEEEIDIAVKRLFLARFRLG 362

Query: 359 MFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           MFD E +   Y  +    VC+  H  LA +AA++ +VLLKNQ   LPLS  + + +AVIG
Sbjct: 363 MFDPEEAVS-YAQIPFGIVCSEAHNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIG 421

Query: 419 PNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
           PN+D   +++GNY GI     T L GI    G  A  ++ +G 
Sbjct: 422 PNADNVESLLGNYHGIPKKPVTFLDGIKHKVGPKAEVLYTEGV 464



 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 160/321 (49%), Gaps = 52/321 (16%)

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALD----------RAGLLLPGRQQEL 510
           A  D      A+  ++ AD  ++V+GL Q +E E++D          R  + LP +Q+ L
Sbjct: 610 AMPDVSKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEAL 669

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           +  V    K P ILVL +G  + + +AK +  + AII AGYPG+ GG A+AD++FG  NP
Sbjct: 670 LKAVKETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNP 726

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT 630
            G+LP+T+Y  + + +LP       P +     GRTYR+++G  +YPFG+G+SYT F + 
Sbjct: 727 AGRLPITYY--QSVEDLP-------PFEDYDMKGRTYRYFEGKPLYPFGYGLSYTRFSYK 777

Query: 631 VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-L 689
               P             +NA   G  ++++            V V N+GS+ G   + L
Sbjct: 778 DLEVP-----------AKVNA---GDPVQIS------------VTVTNIGSRAGDEVVQL 811

Query: 690 VFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNI 749
             +   A    P +QL  F+++H+  G  + V   +   + LS+++    R I  G  +I
Sbjct: 812 YLNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLS-ARQLSMINGESKRVIEEGVFSI 870

Query: 750 HIGGTKHSV--SLHAATLGVI 768
           H+GG +      L A T  V+
Sbjct: 871 HVGGEQPGFDGKLQAETTEVV 891


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  292 bits (747), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 246/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGGTA--- 148
              G           AT FPQ I   +SFN +L   +    SDEAR    ++    A   
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKR 128

Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  +  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 134/308 (43%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q+ L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  292 bits (747), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 220/687 (32%), Positives = 340/687 (49%), Gaps = 90/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTAGLTYWSPNVNIFRDPRWG 166
           GAT FPQ I  AS+F   L   +  V+    RA  ++ G        SP ++I RDPRWG
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMRAANVHQG-------LSPVLDIPRDPRWG 158

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           R +ET GEDP L  + AA YV+GLQG D  R  + A+ KHFTAY +       R    AK
Sbjct: 159 RTEETFGEDPYLVSRMAAEYVKGLQGEDW-REGIIATVKHFTAYGISEGA---RNLGPAK 214

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           V ++++ + F  PF + + EG+  S+M +Y++++GVP  +   +L + +R EW   GY+V
Sbjct: 215 VGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVV 274

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDI 343
           SD  ++ +  +        +EAA  A+ AG+D++  P +  + E    AV+ GL+SE  I
Sbjct: 275 SDYIAIRMLENFHRVAKDAKEAAVLALEAGIDIEL-PSVDCYGEPLIQAVKEGLISEEVI 333

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           N ++   L  +  LG+FDG+    P       D   P+ +EL+ E AR+ IVLLKN G  
Sbjct: 334 NASVERVLRAKFMLGLFDGDLEKDPKKVYDIFD--KPEFRELSREVARRSIVLLKNDG-I 390

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA-----------------CGYTTP--LQG 444
           LPLS    RTVAVIGPN+D    + G+Y+  A                 C   T   L+G
Sbjct: 391 LPLSK-NIRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEG 449

Query: 445 IGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-----QSIEAEA 495
           I        + ++ +GC ++     + F  AI+ +++AD  I VMG +     + I  E 
Sbjct: 450 IKNKVSAETQVLYAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHRGISGEG 508

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DR  L L G Q++L+ ++    K P +LVL++G P  + +   +  + AI+ A YPG+ 
Sbjct: 509 NDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN--LNAILEAWYPGEE 565

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD++FG  NP GKLP++ +P   +T         +PS    Y   + +      +
Sbjct: 566 GGDAVADVIFGDYNPSGKLPIS-FPA--VTGQVPVYYNRKPSAFTDYVEESAK-----PL 617

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQV 674
           YPFGHG+SYT F ++                           +K+   K N L  + +  
Sbjct: 618 YPFGHGLSYTTFEYS--------------------------NLKIHPEKVNALEKVEISF 651

Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
            +KN G ++G   + L      A    P K+L  F+K+H+  G  +RV   ++  + L+ 
Sbjct: 652 TIKNTGVREGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPGESKRVTFILY-PEQLAF 710

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            D      +  G   I IG +   + L
Sbjct: 711 YDEFMRFVVEKGIFEIMIGSSSEDIRL 737


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  291 bits (746), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 215/690 (31%), Positives = 348/690 (50%), Gaps = 98/690 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   I  V+  + R +  G    L   SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG +    ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++G+P   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  + E   +A++ GL+ E  I+ 
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+ +         DH+  ELAL+ AR+ IVLLKN+   
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQGI +    ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
            +GC D+A    + F  AI+ +RQAD  I +MG         +D          Q++  E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L LPG Q+EL+ ++    K P ILVL++G P+ ++   N   + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIAD++FG  NP G+LP+T+         PM +    P    R P   R Y   + 
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             ++ FG+G+SYT F ++                   N  ++ K I           + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            +DVKNVG  +G   + ++ +      A P K+L  F K+H+  G ++RV   I   + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  D      +  GE+ + IG +  ++ L 
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  291 bits (746), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 136/309 (44%), Gaps = 51/309 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSV 758
            P   L AF++VH+PAG  + V I +   +     D       PL G + +  GGT    
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRN 851

Query: 759 SLHAATLGV 767
            L    + V
Sbjct: 852 KLKTIVMNV 860


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  291 bits (746), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 215/690 (31%), Positives = 348/690 (50%), Gaps = 98/690 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   I  V+  + R +  G    L   SP +++ +DPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV--GVNQCL---SPVLDVCKDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG +    ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++G+P   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  + E   +A++ GL+ E  I+ 
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPLVNALKEGLVPESLIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+ +         DH+  ELAL+ AR+ IVLLKN+   
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRY---ARTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQGI +    ++ ++
Sbjct: 382 LPLSKNVNK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMG---------LD----------QSIEAE 494
            +GC D+A    + F  AI+ +RQAD  I +MG         +D          Q++  E
Sbjct: 441 AKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L LPG Q+EL+ ++    K P ILVL++G P+ ++   N   + A+I A +PG+
Sbjct: 500 GNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIAD++FG  NP G+LP+T+         PM +    P    R P   R Y   + 
Sbjct: 557 EGGNAIADVIFGDYNPSGRLPITF---------PM-DTGQIPLYYNRKPSSFRPYVMLRS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             ++ FG+G+SYT F ++                   N  ++ K I           + +
Sbjct: 607 SPLFTFGYGLSYTQFEYS-------------------NLEVTPKEIGPNS------NIAI 641

Query: 673 QVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYL 731
            +DVKNVG  +G   + ++ +      A P K+L  F K+H+  G ++RV   I   + L
Sbjct: 642 SIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRVKF-ILPTEAL 700

Query: 732 SVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           +  D      +  GE+ + IG +  ++ L 
Sbjct: 701 AFYDSFMRLVVEKGEYQLLIGNSSENIILR 730


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  291 bits (745), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 136/309 (44%), Gaps = 51/309 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSV 758
            P   L AF++VH+PAG  + V I +   +     D       PL G + +  GGT    
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRN 851

Query: 759 SLHAATLGV 767
            L    + V
Sbjct: 852 KLKTIVMNV 860


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  291 bits (745), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 137/308 (44%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I++    +    + + T       + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAISLTHESFEWFDEATNTMHPVADTYELLYGGTSEQNQ 852

Query: 760 LHAATLGV 767
           L + T+ V
Sbjct: 853 LKSVTVHV 860


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 243/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 134/301 (44%), Gaps = 49/301 (16%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  +   G+  S+E E +          DR  + LP  Q++L+  +    K   + +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK-KVVFINY 656

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y    +  L
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A              
Sbjct: 713 PDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL------------ 752

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S N    G+ + +T            + V NVG +DG   + V+   P     P   L 
Sbjct: 753 -SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKEGPRYTLR 799

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
           AF++VH+PAG  + V I++    +      S T R   G + +  GGT     L    + 
Sbjct: 800 AFKRVHIPAGKTESVAISLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNKLKTIVMN 859

Query: 767 V 767
           V
Sbjct: 860 V 860


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 174/441 (39%), Positives = 245/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY----NGGT--- 147
              G           AT FPQ I   +SFN +L   +    SDEAR       + G    
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKR 128

Query: 148 -AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +  R  K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+RAG DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 134/302 (44%), Gaps = 51/302 (16%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K   + +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y    +  L
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A              
Sbjct: 713 PDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL------------ 752

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S N    G+ + +T            + V NVG +DG   + V+   P     P   L 
Sbjct: 753 -SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKEGPRYTLR 799

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHAATL 765
           AF++VH+PAG  + V I +   +     D       PL G + +  GGT     L    +
Sbjct: 800 AFKRVHIPAGKTESVAIPL-TGENFEWFDVESNTMCPLEGTYELLYGGTSDRNKLKTIVM 858

Query: 766 GV 767
            V
Sbjct: 859 NV 860


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 218/691 (31%), Positives = 347/691 (50%), Gaps = 100/691 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   +   +  + R +  G    L   SP +++ RDPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI--GVNQCL---SPVLDVCRDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG      ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++GVP   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  + E   +A++ GL+SE  I+ 
Sbjct: 268 YDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPLVTAIKEGLVSEAIIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+      P+ +     +ELAL+AAR+ IVLLKN+   
Sbjct: 327 AVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNENNM 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRYA---RTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQGI +     + ++
Sbjct: 382 LPLSKNINK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
            +GC D+A    + F  AI+ ++QAD  I VMG                     Q++  E
Sbjct: 441 AKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DRA L L G Q+EL+ ++    K P ILVL++G P+ ++   N   + AII A +PG+
Sbjct: 500 GNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIADI+FG  NP G+LP+T+         PM +    P    R P   R Y     
Sbjct: 557 EGGNAIADIIFGDYNPSGRLPITF---------PM-DTGQIPLYYSRKPSSFRPYVMLHS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LG 671
             ++ FG+G+SYT F ++                           ++VT  +   L+ + 
Sbjct: 607 SPLFTFGYGLSYTQFEYS--------------------------NLEVTPKEVGPLSYIT 640

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           + +DVKNVG+ +G   + ++ +      A P K+L  F KVH+  G ++RV   + + + 
Sbjct: 641 ILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRVKFALPM-EA 699

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L+  D      +  GE+ I IG +  ++ L 
Sbjct: 700 LAFYDNFMRLVVEKGEYQILIGNSSENIILK 730


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 218/691 (31%), Positives = 347/691 (50%), Gaps = 100/691 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           +T+FPQ I  AS++N  L   +   +  + R +  G    L   SP +++ RDPRWGR +
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI--GVNQCL---SPVLDVCRDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GEDP L      +Y+ GLQG      ++ A+ KHF A+   +    + + H    V 
Sbjct: 156 ETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFPEGGRNIAQVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ +TF  PF + V  GKV S+M +Y++++GVP   +P +L   +R EW  +G +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINN 345
            D +          S   EAA  A+ +G+D++  P +  + E   +A++ GL+SE  I+ 
Sbjct: 268 YDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPLVTAIKEGLVSEAIIDR 326

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           A+   L ++ RLG+ D      P+      P+ +     +ELAL+AAR+ IVLLKN+   
Sbjct: 327 AVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNENNM 381

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACG--YTTPLQGIGRYA---RTIH 453
           LPLS   ++ +AVIGPN++    M+G+Y       I  G    T LQGI +     + ++
Sbjct: 382 LPLSKNINK-IAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKVLY 440

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEAE 494
            +GC D+A    + F  AI+ ++QAD  I VMG                     Q++  E
Sbjct: 441 AKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGE 499

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DRA L L G Q+EL+ ++    K P ILVL++G P+ ++   N   + AII A +PG+
Sbjct: 500 GNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPGE 556

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKG 612
            GG AIADI+FG  NP G+LP+T+         PM +    P    R P   R Y     
Sbjct: 557 EGGNAIADIIFGDYNPSGRLPITF---------PM-DTGQIPLYYSRKPSSFRPYVMLHS 606

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLT-LG 671
             ++ FG+G+SYT F ++                           ++VT  +   L+ + 
Sbjct: 607 SPLFTFGYGLSYTQFEYS--------------------------NLEVTPKEVGPLSYIT 640

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           + +DVKNVG+ +G   + ++ +      A P K+L  F KVH+  G ++RV   + + + 
Sbjct: 641 ILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRVKFALPM-EA 699

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L+  D      +  GE+ I IG +  ++ L 
Sbjct: 700 LAFYDNFMRLVVEKGEYQILIGNSSENIILK 730


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 174/419 (41%), Positives = 238/419 (56%), Gaps = 31/419 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
               F   +L   QRV+DL+ R+S+ EK+  L+  + A+ RLG+  Y WW+E+LHGV+  
Sbjct: 22  ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAG 149
           G           AT FPQ I+ ASS++  L   +  V+SDEARA ++        G   G
Sbjct: 82  G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           LT+WSPNVNIFRDPRWGRG ET GEDP L+G+    YV GLQG++   LKV A+ KH+  
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTNEKYLKVIATAKHYAV 191

Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
           +     +G +  R  FNA+ S  D+ +T+   FR  V EG V SVM +YN+  G    A 
Sbjct: 192 H-----SGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFRGESCSAS 246

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLG 326
           P  L   +R  W  +GYIVSDC +V   +     T     A+A A++ GLDL+CG  F  
Sbjct: 247 P-FLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALALKDGLDLECGSSFKS 305

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           L  + A+ R L+SE DI+ A+    T + +LGMFD E     Y  +         H  LA
Sbjct: 306 L--KEAIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVS-YAQIPYSVNNNSAHDWLA 362

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
             A+++ IVLLKNQ  +LPLS    +TVAVIGPN++   ++ GNY+G+     T L+GI
Sbjct: 363 RVASQKSIVLLKNQNNTLPLSR-DIKTVAVIGPNANDVQSLWGNYSGVPSNPITVLKGI 420



 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/332 (31%), Positives = 157/332 (47%), Gaps = 57/332 (17%)

Query: 442 LQGIGRYARTIHQQGCKDVACAD-------DQLFGAAIDASRQADATILVMGLDQSIEAE 494
           L+   +Y  T+  Q     A A        + +   A+  + QADA +LV+GL++ +E E
Sbjct: 556 LEAEKKYKITVKYQNFYGDAIAQLLWAEPQENVLQEAVQVAGQADAIVLVLGLNERLEGE 615

Query: 495 AL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
            +          DR  L LP  Q+EL+ K   A+  P ILVL++G  + + +A +   + 
Sbjct: 616 EMKVEADGFEGGDRTSLDLPSNQEELM-KAMTATGKPVILVLINGSALSINWAND--HVP 672

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+ AGYPGQ GG AIAD+LFG  NP G+LP+T+Y       LP  E       +    G
Sbjct: 673 AILTAGYPGQQGGNAIADVLFGDYNPAGRLPVTYYKS--TEQLPAFE-------NYDMKG 723

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           RTYR+++   +YPFG G+SYT F ++    PT V    D                     
Sbjct: 724 RTYRYFQKKPLYPFGFGLSYTKFKYSNLKLPTNVTPEKD--------------------- 762

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                  + VDV N+G +DG   + L      A    P  QL  FE+V++  G  + V  
Sbjct: 763 -----FEILVDVTNIGERDGDEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRF 817

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
            I   + LS++++ G R I  G   I +GG +
Sbjct: 818 TI-TPRQLSLINKKGQRVIEPGWFTISVGGKQ 848


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 171/441 (38%), Positives = 243/441 (55%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG  D    K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D +P+   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   +  +AV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--TNLKIAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDAESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  289 bits (739), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 174/451 (38%), Positives = 248/451 (54%), Gaps = 39/451 (8%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +  P+   SL   QR +DL+ RL+L+EKV L+ + + A+PRLGIK Y+WW+EALHGV   
Sbjct: 23  QQFPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRA 82

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------G 149
           G           AT FPQ I  A+SF+  L   +   VSDEARA Y   +         G
Sbjct: 83  GI----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQG 132

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFT 208
           LT+W+PN+NIFRDPRWGRGQET GEDP L+ +   + VRGLQG D  +  K+ A  KH+ 
Sbjct: 133 LTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLHACAKHYA 192

Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            +    WN   R  FNA+ ++ +D+ +T+   F+  V E  V  VMC+YN+  G P C  
Sbjct: 193 VHSGPEWN---RHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEGEPCCGS 249

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFL 325
             +L + +R EW+  G IVSDC ++  ++      + P  E A+A A+ +G DL+CG   
Sbjct: 250 NRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDLECGNNY 309

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP--SSQPYGHLGPKDVCTPDHQ 383
               E AVQ+GL+ E  I+ ++   LT +  LG  D      S PY  +  K      H+
Sbjct: 310 KSLPE-AVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYSVVDSK-----AHK 363

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
           +LALE AR+ IVLL+N+   LPL       +A+IGPN++ +V   GNY G     +T  +
Sbjct: 364 DLALEIARKSIVLLQNRNNILPLKE--DMKIALIGPNANDSVMQWGNYNGFPSHTSTLYE 421

Query: 444 GIGRYARTIHQQGCKDVACADDQLFGAAIDA 474
            +    R    Q   D  C  D+  G ++++
Sbjct: 422 ALKE--RIPANQLIYDFGC--DRTSGISLES 448



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 144/312 (46%), Gaps = 51/312 (16%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           A+ID  + AD  +   G+  S+E E +          DR  + LP  Q+ L+S++    K
Sbjct: 593 ASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKLGK 652

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P I V  SG    V          AI+ A YPGQAGGTA+AD+LFG  NP GKLP+T+Y
Sbjct: 653 -PIIFVNYSGSA--VGLEPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFY 709

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
             ++   LP   + +M+        GRTYR+     +Y FGHG+SYTNF +     P  +
Sbjct: 710 --KHTDQLPDFQDYSMK--------GRTYRYMTESPLYSFGHGLSYTNFTY----GPATL 755

Query: 639 AVPLDGRHGSINATIS-GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
           +            TIS GK + +T            + V+N G+ DG   + V+ +    
Sbjct: 756 S----------QQTISQGKEVTLT------------IPVQNTGNYDGEEVVQVYLSCSGD 793

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
              P   L AF++VH+  G +  V   +    +      + T R+  G + +  GGT   
Sbjct: 794 KEGPSHTLRAFKRVHIAKGQRANVSFTLDSETFQWFDTNTNTMRMVEGNYELLYGGTSRI 853

Query: 758 VSLHAATLGVIK 769
             L   ++ VI+
Sbjct: 854 DQLQKISIKVIE 865


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  289 bits (739), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 165/403 (40%), Positives = 245/403 (60%), Gaps = 40/403 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DLI RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 89  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V GLQG D   LK+ ++ KHF A + ++   
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +P +L++ +R 
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQ 261

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+C      GP L     +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           A ++ ++S+ DI++A  + LT +M+LG+FD GE    PY  + P  + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ IVLLKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 91/298 (30%), Positives = 147/298 (49%), Gaps = 45/298 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+A++LFG  NP G+LP+T+Y    +  
Sbjct: 651 LVAGS-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++           L  + 
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
           G    T+S +                   +KN G ++G     V+   P  G   P K+L
Sbjct: 752 GVGEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792

Query: 706 VAFEKVHVPAGAQQRVGI--NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +V + +G  +RV I  N    +Y  V    G   +P G  ++ +G +   + L 
Sbjct: 793 KGFRRVPLKSGESRRVEIKLNKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 161/399 (40%), Positives = 231/399 (57%), Gaps = 32/399 (8%)

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           +++ +EKV   ++ A A+PRLG+  YEWW+E LHG++  G           AT FPQ I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116

Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
            A+++N  L E +G V S EARA +N  GG        AGLT WSPN+NIFRDPRWGRG 
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
           ET GEDP L+G+ A  ++RGLQG D    +  A+ KH   +     +G +  R  F+  V
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 231

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           S  D+E T+   FR  +++G+  +VMC+YN ++G P CA   +L   +RG+W   G++VS
Sbjct: 232 SPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWGFTGFVVS 291

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCD+V       +F +    ++A A++AG DL+CG +       A+ RG   E  ++ +L
Sbjct: 292 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEAVLDQSL 350

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
           V     + RLG    +    PY  LG KDV +  H+ LAL+AA+Q IVLL+N+  +LPL 
Sbjct: 351 VRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 409

Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
             +R   +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 410 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 445



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 139/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  L LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 704

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 705 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAYV---- 756

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++ FG G+SYT F +    AP + A  L             
Sbjct: 757 ---SYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTL------------- 797

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
                  A  N   L V+  V+N G++ G   + V+  PP G  +P + LV F++V +  
Sbjct: 798 ------QAGAN---LQVRTQVRNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQP 848

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G  + VG  +   + LS VDR+G R +  G++ + +GG
Sbjct: 849 GEAREVGFEL-TPRQLSDVDRAGQRAVQPGDYRVFVGG 885


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 161/418 (38%), Positives = 229/418 (54%), Gaps = 29/418 (6%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
             F   SL   QRV+DL+ RL+L+EKV  +++ +  + RLGI  Y+WW+E LHGV+    
Sbjct: 27  FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVARTPF 86

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GL 150
            T         T +PQ I  A++F+      +    + E RA+YN             GL
Sbjct: 87  KT---------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
           TYW+PN+NIFRDPRWGRGQET GEDP L+     ++V+GLQG D   LK AA  KH+  +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAAACAKHYAVH 197

Query: 211 DLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADP 268
                +G +  R  F+  V+  ++ DT+   FR  + E  VA VMC+YN     P CA  
Sbjct: 198 -----SGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRTQPCCASD 252

Query: 269 NILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH 328
            ++   +R EW+ +GY+ SDC ++  ++         E AAADA+  G D+DCG      
Sbjct: 253 ILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDCGTDAYKA 312

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELAL 387
              AV+ G +SE  I+ ++     ++ RLGMFD  P S       P  V  + +HQ  AL
Sbjct: 313 LVQAVKNGKISEKQIDISVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLESKEHQLHAL 370

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           + ARQ IVLLKN+   LPL+    + + V+GPN+D  ++++GNY G     TT LQGI
Sbjct: 371 KMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNYNGTPSKLTTVLQGI 427



 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 137/301 (45%), Gaps = 54/301 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F   I+  + ADA I   G+   +E E +          DR  +L P  Q +L+ K   +
Sbjct: 596 FADLIEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLL-KALQS 654

Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           S  P +  +M+G  I + + A+N P I  I W G  GQ+ GTA AD++FG  NP G+LP+
Sbjct: 655 SGKPVVFAMMTGSAIAIPWEAENIPAILNI-WYG--GQSAGTAAADVIFGDYNPAGRLPV 711

Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T+Y  +  ++LP   +  M          +TYR++KG  +Y FG+G+SYT+F ++    P
Sbjct: 712 TFYKND--SDLPSFVDYKM--------DNKTYRYFKGTPLYGFGYGLSYTSFKYSDLKTP 761

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTP 694
                                 +K+   +    ++ + V V N G  +G     L     
Sbjct: 762 ----------------------VKIKKGQ----SVSILVKVANTGKTEGEEVAQLYLINQ 795

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
                 P K L  FE+ ++  G  + +  N+   + LS V   G+ +   G+  I IGG+
Sbjct: 796 DTAIKTPLKSLKGFERFNLKPGENKTITFNLS-PEDLSYVTPEGSLKQYEGKIKISIGGS 854

Query: 755 K 755
           +
Sbjct: 855 Q 855


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 164/403 (40%), Positives = 245/403 (60%), Gaps = 40/403 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DLI RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 89  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V GLQG D   LK+ ++ KHF A + ++   
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +P +L++ +R 
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQ 261

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+C      GP L     +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           A ++ ++S+ DI++A  + LT +M+LG+FD GE    PY  + P  + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ +VLLKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 375 RQCVVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 90/298 (30%), Positives = 147/298 (49%), Gaps = 45/298 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+A++LFG  NP G+LP+T+Y    +  
Sbjct: 651 LVAGS-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++           L  + 
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
           G    T+S +                   +KN G ++G     V+   P  G   P K+L
Sbjct: 752 GGGEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792

Query: 706 VAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +V + +G  +RV I +     +Y  V    G   +P G  ++ +G +   + L 
Sbjct: 793 KGFRRVPLKSGESRRVEIKLDKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848


>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 902

 Score =  288 bits (738), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 173/437 (39%), Positives = 248/437 (56%), Gaps = 43/437 (9%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   + P  +R +DL+ R++L EK   L   A A+PRLG+  Y+ WSEALHGV+  G   
Sbjct: 38  YRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAG--- 94

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A++++  + + +G V+S EAR  YN            GLT+W
Sbjct: 95  -------HATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIFWGLTFW 147

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           SPN+NIFRDPRWGRGQET GEDP L+GK   +++ G+QG D    K  A+ KHF  +   
Sbjct: 148 SPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPDAAHPKAVATSKHFAVH--- 204

Query: 214 NWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             +G +  R  F+ KVS +D+E+T+   FR  V +G V SVMC+YN V+G+  CA+  +L
Sbjct: 205 --SGPESLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMGACANKMLL 262

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYD-TQHFTSTPE--EAAADAIRAGLDLDCGPFL-GL 327
           +  ++  W   G++VSDC   G   D TQ   + P+   AAA ++ AG DL C  +  G 
Sbjct: 263 EEHLKQAWGFKGFVVSDC---GAIMDVTQGHKNAPDIVHAAAISLAAGTDLSCSIWEPGF 319

Query: 328 HT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           +T   AV++GL++E  +  A       +  LGMFD EP S P   +    V + +H+  A
Sbjct: 320 NTLADAVRKGLVTEDMVTRAAERLYAARFELGMFD-EPGSNPNDKIDMSQVASEEHRAEA 378

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
           L+AA + IVLLKN G  LPL +   +T+AVIGP +++  ++ GNY G      TPL GI 
Sbjct: 379 LKAAEESIVLLKNDG-LLPLKNA--KTIAVIGPTAELLASLEGNYNGQPVRPVTPLDGI- 434

Query: 447 RYARTIHQQGCKDVACA 463
                + Q G ++V  A
Sbjct: 435 -----VKQFGAENVRYA 446



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 133/283 (46%), Gaps = 55/283 (19%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL  ++E E +          DR  + LP  Q++L+  +  A K P ++V +SG  + + 
Sbjct: 640 GLSPNLEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALN 698

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAM 594
           +A       AI+ A YPG  GGTAIA  L G SNP G+LP+T+Y    + +LP  TE AM
Sbjct: 699 WANQ--HAGAILQAWYPGVEGGTAIAKTLAGESNPAGRLPVTFYAS--VQDLPAFTEYAM 754

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN-ATI 653
           +         RTYR+Y G  ++ FG G+SY+ F                 ++G +  A+ 
Sbjct: 755 K--------NRTYRYYAGKPLWGFGFGLSYSTF-----------------KYGEVKLAST 789

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVH 712
           S  A K         +L   V V N     G   +  +  TP  G   P   LV F++V 
Sbjct: 790 SVDAGK---------SLTATVTVTNTSQVAGDEVVEAYLKTPQKG--GPSHSLVGFQRVP 838

Query: 713 VPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
           +  G  + V I +   + LS VD SG R I  GE+ + IG T+
Sbjct: 839 LNPGESREVAIEVS-PRSLSAVDDSGKRSILAGEYRLSIGSTQ 880


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  288 bits (737), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 165/453 (36%), Positives = 241/453 (53%), Gaps = 41/453 (9%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           + F  +  +LL+  S+    ++ F             F    L   +RV+DL+ RL+L+E
Sbjct: 6   LIFQIVLFVLLAIQSTAQKKQKEFL------------FQNPDLSFEKRVDDLVNRLTLEE 53

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           KV  +++ + A+PRL I  Y+WW+E LHGV+     T F       T +PQ I  A++F+
Sbjct: 54  KVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR----TPFK-----VTVYPQAIAMAATFD 104

Query: 125 ATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGED 175
                 +    + E RA+YN             GLTYW+PN+NIFRDPRWGRGQET GED
Sbjct: 105 KNSLYKMADFSALEGRAIYNKAVESGRTNERYLGLTYWTPNINIFRDPRWGRGQETYGED 164

Query: 176 PVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIE 233
           P L+G    S+V+GLQG D   LK AA  KH+  +     +G +  R  F+  V+  ++ 
Sbjct: 165 PYLTGVLGDSFVKGLQGDDPKYLKAAACAKHYAVH-----SGPEPLRHTFDVDVTPYELW 219

Query: 234 DTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG 293
           DT+   F+  V E KVA VMC+YN     P CA   ++   +R +W+  GY+ SDC ++ 
Sbjct: 220 DTYLPAFQKLVTESKVAGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAID 279

Query: 294 VYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTV 353
            ++         E A+ADA+  G D+DCG         AV+ G +SE  I+ ++     +
Sbjct: 280 DFFKNHKTHPDAESASADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMI 339

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPD-HQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
           + RLGMFD  P         P  V   D H+  AL+ ARQ IVLL+N+  +LPLS  + +
Sbjct: 340 RFRLGMFD--PVEMVKYAQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPLSK-KLK 396

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            + V+GPN D  + ++GNY G     TT L+GI
Sbjct: 397 KIVVLGPNVDNAIAILGNYNGTPSKLTTVLEGI 429



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 139/301 (46%), Gaps = 54/301 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F   ++  + ADA + V G+   +E E +          DR  +LLP  Q +L+  +   
Sbjct: 598 FNTLVNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTT 657

Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
            K P + V+M+G  I + + A+N P IA    A Y GQA GTA+AD+LFG  NP G+LP+
Sbjct: 658 GK-PIVFVMMTGSAIAIPWEAENIPAIAN---AWYGGQAAGTAVADVLFGNYNPAGRLPV 713

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAP 635
           T+Y          ++  + P    +   RTYR++KG  +Y FG+G+SYT F +  +  AP
Sbjct: 714 TFYK---------SDADLSPFVDYKMDNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAP 764

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS-TP 694
           +V               I GK + +T            V V N G   G   + ++    
Sbjct: 765 SV---------------IKGKNVPIT------------VKVTNTGKVSGEEVVQLYVINQ 797

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
                AP K L  FE++ + AG  + +   +   + LS +   G  +   G+  I IGG 
Sbjct: 798 NTAIKAPLKTLKGFERISLKAGKSKTITFTLS-PEDLSYITAEGNHQQYNGKIKIAIGGN 856

Query: 755 K 755
           +
Sbjct: 857 Q 857


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  288 bits (737), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 172/441 (39%), Positives = 241/441 (54%), Gaps = 34/441 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV
Sbjct: 19  ACKQLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I   +SFN +L   +    SDEAR              
Sbjct: 79  GRAGL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG  D    K+ A  K
Sbjct: 129 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+A+ +  +D+ +T+   F+  V +  V  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+R G DL+CG
Sbjct: 246 CGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                    AV+ GL+ E +I+ +L   LT +  LG  D + +   +  +    + + +H
Sbjct: 306 SEYA-SLADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSA---WSEIPTSVLNSKEH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           Q LAL  AR+ +VLL+N+   LPL+   H  VAV+GPN++ +V   GNY GI     T L
Sbjct: 362 QALALRMARESLVLLQNKNNILPLN--THLKVAVMGPNANDSVMQWGNYNGIPAHTVTLL 419

Query: 443 QGIGRY---ARTIHQQGCKDV 460
           + +       + I++ GC  V
Sbjct: 420 EAVRAKLPEGQIIYEPGCDRV 440



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 136/308 (44%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A+     AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I            AI+ A YPGQAGGTAI D L+G  NPGG+LP+T+Y 
Sbjct: 651 -VVFINYSGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              +  LP   + +M+        GRTYR+ +   ++PFGHG+SYT+F +  A       
Sbjct: 708 D--VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTYGEAKL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G+ + +T            + V NVG +DG   + V+   P    
Sbjct: 753 --------SKNTIAKGENVVLT------------IPVSNVGQRDGEEVVQVYLRRPGDKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P   L AF++VH+PAG  + V I +    +      S T R   G + +  GGT     
Sbjct: 793 GPRYTLRAFKRVHIPAGKTESVAIPLTGENFEWFDVESNTMRPLEGTYELLYGGTSDRNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  288 bits (737), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 166/419 (39%), Positives = 238/419 (56%), Gaps = 34/419 (8%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           LP+    L   QR  DL+ RL+++EK+ L+ + +  +PRLGI+ YEWW+EALHGV+  G 
Sbjct: 14  LPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAGL 73

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
                     AT FPQ I  A+SFN +L + +   VSDEARA               GLT
Sbjct: 74  ----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLT 123

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+ +   + V+GLQG D  R  K+ A  KHF  +
Sbjct: 124 MWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLHACAKHFAVH 183

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  FNA+ ++ +D+ +T+   F+  V E  V  VMC+YN+  G P C    
Sbjct: 184 SGPEWN---RHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCGSNR 240

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGL 327
           +L + +R EW  NG +VSDC ++  ++  +   + P+   A+ADA+ +G DL+CG     
Sbjct: 241 LLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLECGSNYRK 300

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLG-MFDGEPSSQPYGHLGPKDVCTPDHQELA 386
            T+ AV+ G++SE  I+ ++   L  +  LG M +  P + PY       V  P+H+ LA
Sbjct: 301 LTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYSI-----VDCPEHRHLA 354

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           L+ A + + LL+N+   LPL   +H  VAVIGPN++ +V   GNY G     +T L  +
Sbjct: 355 LQIAHETMTLLQNKENILPLD--KHAKVAVIGPNANDSVMQWGNYNGTPSHTSTLLSAL 411



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 153/361 (42%), Gaps = 56/361 (15%)

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQL-------FGAAIDASRQ 477
           V   GN       YT   Q    Y   I  +  KD A  +  L         A ++  + 
Sbjct: 527 VIKTGNIKNPENLYTLQAQAGKSYEIVIEFKQIKDGAYFNFDLVEDIPLNMNATLEKLKD 586

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
            +  I   G+   +E E +          DR  + LP  Q+ +++ +  A K   I V  
Sbjct: 587 TEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK-KVIFVNF 645

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  +  A         AI+ A YPGQ GGTA+AD+LFG  NP G+LP+T+Y    +  L
Sbjct: 646 SGSAM--ALTPETENCDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFYKN--MEQL 701

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P   + +M+        GRTYR+ K   ++PFG+G+SYT F +  A A        D + 
Sbjct: 702 PDFEDYSMQ--------GRTYRYMKEAPLFPFGYGLSYTTFTYGKARA--------DKKR 745

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S     +G+ + +T            + V N+GS+DG   + V+         P K L 
Sbjct: 746 IS-----TGEKMTLT------------IPVSNIGSRDGEEVVQVYLRREDDPEGPTKTLR 788

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLG 766
           AF++V +  G    V I +    +    + + T     GE+ +  GG+  +  L +  + 
Sbjct: 789 AFKRVEITKGKSLNVKIELPYTAFEWFDNSTHTMHSMKGEYEVLYGGSSRTEDLQSIKMQ 848

Query: 767 V 767
           +
Sbjct: 849 I 849


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  288 bits (737), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 248/445 (55%), Gaps = 40/445 (8%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C  K      LP+   SL   QR  DL+ RL+L+EKV L+ + + A+PRLGIK Y+WW
Sbjct: 18  FSCAQK------LPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWW 71

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM----- 142
           +EALHGV   G           AT FPQ I   +SFN +L   +   VSDEAR       
Sbjct: 72  NEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFS 121

Query: 143 YNG---GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRL 198
            NG      GLT+W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG  +G   
Sbjct: 122 ENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYD 181

Query: 199 KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           K+ A  KHF  +    WN   R  F+A+ ++ +D+ +T+   F+  V +  V  VMC+YN
Sbjct: 182 KLHACAKHFAVHSGPEWN---RHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCAYN 238

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRA 315
           +  G P C    +L + +R EW   G +VSDC ++  +Y      + P  E A+A A+ +
Sbjct: 239 RFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVLS 298

Query: 316 GLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK 375
           G DL+CG   G     AV+ GL+ E  I+ +L   LT +  LG  D +P+   +  +   
Sbjct: 299 GTDLECGGEYG-SLADAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPA---WAEIPAS 354

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
            + + +HQ+LAL  AR+ +VLL+N+   LPL+      VAV+GPN++ +V   GNY GI 
Sbjct: 355 TLNSKEHQDLALRMARESLVLLQNKNDILPLN--TDLKVAVMGPNANDSVMQWGNYNGIP 412

Query: 436 CGYTTPLQGIGRY---ARTIHQQGC 457
               T L+ +       + +++ GC
Sbjct: 413 GHTVTLLEAVRSKLPEGQVMYEPGC 437



 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 137/308 (44%), Gaps = 49/308 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A++  + AD  +   G+  S+E E +          DR  + LP  Q++L+  +  A K 
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + +  SG  I +    N     AI+   YPGQAGGTAI D+LFG  NP G+LP+T+Y 
Sbjct: 651 -VVFINYSGSAIGLVPESN--TCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
                 LP   + +M+        GRTYR+ +   ++PFGHG+SYT F +  A+      
Sbjct: 708 D--AGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEADL----- 752

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                   S N    G  + +T            + V N G +DG   + V+    A   
Sbjct: 753 --------SKNTIGDGGTVTLT------------IPVSNAGQRDGDEVVQVYLRCMADKE 792

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            PH  L AF++VH+PAG  ++V I +    +      + T     G + +  GG+     
Sbjct: 793 GPHYTLRAFKRVHIPAGETKQVTIPLTYESFEWFDTATNTVHPLKGTYELLYGGSSDKNK 852

Query: 760 LHAATLGV 767
           L    + V
Sbjct: 853 LKTIVMNV 860


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  288 bits (737), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 171/482 (35%), Positives = 258/482 (53%), Gaps = 47/482 (9%)

Query: 6   AFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK 65
           A FFL L+ +  ++          C  +  + +  P    +L    RV DL+ RL+L+EK
Sbjct: 17  AIFFLSLLTILIAN---------VCQAQ--SFKGFPMWDTNLSFEVRVKDLVSRLTLEEK 65

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V  +++ A A+PRLGI  Y+WW+E LHGV+     T F       T +PQ I  A+ +++
Sbjct: 66  VGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR----TPFH-----VTVYPQAIGMAAGWDS 116

Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
           T    +    + E RA++N  TA         GLTYW+PN+NIFRDPRWGRGQET GEDP
Sbjct: 117 TSLAMMAHYSALEGRAVFNKATALGRNNERYLGLTYWTPNINIFRDPRWGRGQETYGEDP 176

Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIED 234
            L+     ++VRGLQG D   LK AA  KHF  +     +G +  R   N   S  D+ D
Sbjct: 177 FLTSMLGRAFVRGLQGDDPKYLKAAACAKHFAVH-----SGPEPSRHSDNFSPSNYDLWD 231

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
           T+   F+  V + KV  VMC+YN  +G P C    ++   +R +W+  GY+ SDC ++  
Sbjct: 232 TYLPAFKELVTKAKVEGVMCAYNAFHGQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDD 291

Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
           ++           A+ DA+  G D++CG  +       V++G+++E  ++ +L+   T +
Sbjct: 292 FFKFHKTHPDATSASVDAVLHGTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTR 351

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVC-TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
            RLGMFD  P S       P+ +  T +H+  +L+ A+Q IVLLKN+G +LPLS    + 
Sbjct: 352 YRLGMFD--PVSMVKYAQTPESILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSK-NIKK 408

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQLFG 469
           +AV+GPN+D  + ++GNY G      T LQGI    G+    I++   K +   +D L  
Sbjct: 409 IAVLGPNADNRIVVLGNYNGQPSEIITALQGIKNKLGQEVELIYE---KAINFTNDTLLA 465

Query: 470 AA 471
            A
Sbjct: 466 YA 467



 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 142/301 (47%), Gaps = 54/301 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
             A ++  + ADA + V G+   +E E +          DR  +LLP  Q EL+  +   
Sbjct: 609 LSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLKGT 668

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K P + V+M+G  I  A    D  I AI+ A Y GQ+ GTAIAD+LFG  NP G+LP+T
Sbjct: 669 GK-PLVFVVMTGSAI--ALPYEDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLPVT 725

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y  +  ++LP         +S     RTYR++KG  +YPFGHG+SYT+F ++    P  
Sbjct: 726 FYKAD--SDLP-------DFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTP-- 774

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPA 696
                    G I    SG + KV+              + N G KDG   + L  + P  
Sbjct: 775 ---------GKIK---SGASFKVS------------ATLTNTGKKDGDEVVQLYLAYPEV 810

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
              AP + L  F ++ + AG  + V   +    C+   +V+  G    P G+  I +GG 
Sbjct: 811 AGKAPIRALKGFNRIRLKAGESKTVSFTLSPEQCQ---LVNEEGALYQPKGKMEISLGGA 867

Query: 755 K 755
           +
Sbjct: 868 Q 868


>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 861

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 171/452 (37%), Positives = 247/452 (54%), Gaps = 33/452 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           + +P+   +L   +R  DL+ RL+L+EKV L+   +  V RLG+K + WWSEALHGV+N 
Sbjct: 19  QIMPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQ 78

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------- 148
           G          G T FP+ I  A+SFN  L   +   +SDEARA ++             
Sbjct: 79  G----------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDN 128

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHF 207
           GL+ W+PNVNIFRDPRWGRGQET GEDP L+ +   S V GLQG  D    K+ A  KH+
Sbjct: 129 GLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYKKLLACAKHY 188

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    WN   R   N   +  + + +T+   F++ V +  V+ VMC+Y++ +  P C 
Sbjct: 189 AVHSGPEWN---RHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDDPCCG 245

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           + ++LKR +R EW     +VSDC ++  +Y +   +S    +A   + AG D++CG    
Sbjct: 246 NNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVECGFGYT 305

Query: 327 LHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            H    AV RGL+ E DI+ +++  LT + RLG FD + S  P+ ++    +    HQ L
Sbjct: 306 YHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFD-DNSIVPWANIPDTIINCKKHQAL 364

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           ALE ARQ + LL+N+   LPLS    + +AVIGPN+D    M GNY GI     T L+GI
Sbjct: 365 ALEMARQSMTLLQNKNNILPLSS--KKKIAVIGPNADDAKLMWGNYNGIPVKTVTILEGI 422

Query: 446 GRYA--RTIHQQGCKDVACADDQLFGAAIDAS 475
              A     +++GC  V   DD +  + I  S
Sbjct: 423 KSIAGKDIFYEKGCDIV---DDMILESYITRS 451



 Score = 99.8 bits (247), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 134/313 (42%), Gaps = 52/313 (16%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +   +D  +  D  +   G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 589 YKGLVDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKA 648

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   I+V  SG  I            AI+ A Y GQ+GG AIA++LFG  NP GKLP+T
Sbjct: 649 GK-RVIMVNCSGSAI--GLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPIT 705

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    I  LP   E  M+        GRTYR+ +   ++PFG+G+SYT F    A A +
Sbjct: 706 FYKN--IDQLPDFEEYDMK--------GRTYRYLEDKPLFPFGYGLSYTTFDIGRATASS 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
           + A              +G+ IK+             + VKN G + G+ T+ V+     
Sbjct: 756 ISAK-------------AGEKIKLV------------IPVKNTGKRTGSETVQVY-VKKV 789

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTK 755
               P K L +F+++ +P    Q +   +    +    D +  R   L G++ I  G + 
Sbjct: 790 DSGGPIKTLRSFKRIELPPNVSQDLTFELEPS-FFEWYDPATLRMNVLPGDYEILYGTSS 848

Query: 756 HSVSLHAATLGVI 768
            +  L    + +I
Sbjct: 849 DNKDLQTLNITLI 861


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 162/402 (40%), Positives = 243/402 (60%), Gaps = 38/402 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DLI RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 34  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 86

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L   +  V+SDEARA +N    G          LT+WSP V
Sbjct: 87  ---TVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLTFWSPTV 143

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 144 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAANNEEH--- 200

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +L++ +R 
Sbjct: 201 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNAWLLQKVLRK 259

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +++AGLDL+C      GP L     +
Sbjct: 260 DWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAGLDLECGDDVYDGPLL-----N 314

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           A ++ ++S+ DI++A  + LT +M+LG+FDG     PY  + P  + + +HQ++AL+AAR
Sbjct: 315 AYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPSVIGSKEHQQIALDAAR 373

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           Q IVLLKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 374 QCIVLLKNQKNMLPLNASKLKSIAVVGINA--GKCEFGDYSG 413



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 148/298 (49%), Gaps = 45/298 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  I VMG+++SIE E  DR  + LP  Q+E + ++   +    I++L
Sbjct: 591 LYGEAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVIL 649

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  + AI+ A YPG+ GGTA+A++LFG  NP G+LP+T+Y    +  
Sbjct: 650 VAGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 705

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++                
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGEVLYPFGYGLSYSSFKYS---------------- 743

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
              +  +  +A +V           V   +KN G ++G     V+   P  G   P K+L
Sbjct: 744 ---DLRVKDEADEV----------AVSFRLKNTGKRNGDEVTQVYVRIPETGGIVPVKEL 790

Query: 706 VAFEKVHVPAGAQQRVGI--NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +V + +G  +RV I  N    +Y  V    G   +P G  +I +G +   + L 
Sbjct: 791 KGFRRVPLKSGESRRVEIRLNKEQLRYWDV--GKGQFVVPKGTFDIMVGASSKDIRLQ 846


>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 865

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 169/453 (37%), Positives = 241/453 (53%), Gaps = 45/453 (9%)

Query: 7   FFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
            F + +IL+S +  G                + LPF    L   +R  DLI RL++QEK 
Sbjct: 5   LFLIAVILISMTVKG----------------QVLPFQNPDLSSEERAKDLISRLTVQEKA 48

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
           +LL   + A+PRLGIK + WWSEALHG +N              T FPQ I  A+SFN  
Sbjct: 49  RLLCDQSEAIPRLGIKKFNWWSEALHGYANND----------SVTVFPQPIGMAASFNEE 98

Query: 127 LWEAIGRVVSDEARAMY---------NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPV 177
           L   I   +SDEARA Y         N     L+ W+PNVNIFRDPRWGRGQET GEDP 
Sbjct: 99  LVFEIFNAISDEARAKYHQAQRRGEENRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPY 158

Query: 178 LSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDT 235
           L+ +     V+GLQG  D    K+ A  KH+T +    W+   R   N   VS ++  +T
Sbjct: 159 LTSRMGVQVVKGLQGPEDAKYRKLLACAKHYTVHSGPEWS---RHELNINDVSPREFYET 215

Query: 236 FDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVY 295
           +   F+  V +  V  VMC+Y++++  P C++  IL+R +R EW     +V+DC ++  +
Sbjct: 216 YMPAFKALVQKADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDF 275

Query: 296 YDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDINNALVNTLT 352
           Y T   +STP  AAA  + AG DL+C  +   H +    A+++ L++E DI+ +L+  L 
Sbjct: 276 YTTHGISSTPVHAAATGLLAGTDLEC-IWDNYHYKMLPEALEKDLITEKDIDRSLMRVLK 334

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
            +  LG  D + S  P+  + P  +    H++LA + A+Q IVLL+N+   LPL      
Sbjct: 335 GRFDLGEMD-DNSLVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSIN 393

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            +AV+GPN+D  V + GNY G      T L GI
Sbjct: 394 KIAVVGPNADDEVVLWGNYNGTPIRTITVLDGI 426



 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 119/278 (42%), Gaps = 50/278 (17%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D    +D  F A I      +  I V GL   +E E +          DR  + LP  Q+
Sbjct: 584 DFGREEDIDFTALIKKLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQR 643

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
             +  +  A K   I V  SG  I  A         AII A Y G++GG AIAD+LFG  
Sbjct: 644 NCLKALKEAGK-TVIFVNCSGSAI--ALEPETESCDAIIQAWYGGESGGQAIADVLFGDY 700

Query: 569 NPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           NP GKLP+T+Y      NL    + +M         GRTYR Y    ++PFG G+SYTNF
Sbjct: 701 NPSGKLPVTFYRNS--DNLGDFEDYSME--------GRTYR-YTNNHLFPFGFGLSYTNF 749

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
                                  A +S   IK         T+ +++ VKN G +DG   
Sbjct: 750 EIG-------------------KARLSKSTIKADE------TISIKIPVKNTGKRDGTEI 784

Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
           + V+         P K L  F+++ VPAG  ++  I++
Sbjct: 785 VQVYVRKVNDIDGPLKTLKGFQRIAVPAGKTRQANISL 822


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 162/397 (40%), Positives = 240/397 (60%), Gaps = 28/397 (7%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DLI RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 35  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 87

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + I  V+SDEARA +N    G          LT+WSP V
Sbjct: 88  ---TVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 144

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 145 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAANNEEH--- 201

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +L++ +R 
Sbjct: 202 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLLQKVLRR 260

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      +  +A ++ 
Sbjct: 261 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLLNAYKQY 320

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           + SE DI++A  + LT +M+LG+FDG     PY  + P  + + +HQ +AL AAR+ IVL
Sbjct: 321 MASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTVALNAARECIVL 379

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 380 LKNQKNMLPLNVKKLKSIAVVGINAG--KCEFGDYSG 414



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 144/300 (48%), Gaps = 42/300 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +    ILVL
Sbjct: 592 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVL 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + V +   +  + AI+ A YPG+ GGTA+A++LFG  NP G+LP+T+Y    +  
Sbjct: 651 VAGSSLAVNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LEQ 706

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++K  V+YPFG+G+SYT F                 ++
Sbjct: 707 LPAFD------DYDITKGRTYQYFKKDVLYPFGYGLSYTTF-----------------KY 743

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQ 704
            ++    +GK            T+ V   +KN G + G     V+   P  AG     +Q
Sbjct: 744 SNLKVDDAGK------------TVNVSFTLKNTGKRAGDEVAQVYVRLPEIAGSTQAIRQ 791

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
           L  F +V + AG  ++V I +   +     ++     +P G     +G +   + L   T
Sbjct: 792 LKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRLENTT 851


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 216/690 (31%), Positives = 341/690 (49%), Gaps = 96/690 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTAGLTYWSPNVNIFRDPRWG 166
           GAT FPQ I  AS+F   L   +  V+    +A  ++ G        SP ++I RDPRWG
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMKAANVHQG-------LSPVLDIPRDPRWG 158

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN-- 224
           R +ET GEDP L  + A  YV+GLQG D  R  + A+ KHFTAY      G+     N  
Sbjct: 159 RTEETFGEDPYLVSRMATEYVKGLQGEDW-REGIVATVKHFTAY------GISEGARNLG 211

Query: 225 -AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
            AKV ++++ + F  PF + + EG+  S+M +Y++++GVP  +   +L + +R EW   G
Sbjct: 212 PAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKG 271

Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSE 340
           Y+VSD  +V +  +        +EAA  A+ AG+D++  P +  + E    AV+ GL+SE
Sbjct: 272 YVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIEL-PSVDCYGEPLIQAVKEGLISE 330

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             IN ++   L  +  LG+FD      P       D   P+ ++L+ E AR+ IVLLKN 
Sbjct: 331 EVINASVERVLRAKFMLGLFDDNLEKDPKKVYEVFD--KPEFRDLSREVARRSIVLLKND 388

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA------GIACGY-------------TTP 441
           G +LPLS    + VAVIGPN+D    + G+Y+       IA G               + 
Sbjct: 389 G-TLPLSK-NLKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVVRTVSI 446

Query: 442 LQGIGRYA----RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-----QSIE 492
           L+GI          ++ +GC  ++ + D  F  AI+ +++AD  I VMG +     + I 
Sbjct: 447 LEGIRNKVSPETEVLYAKGCDIISDSKDG-FAEAIEMAKEADVIIAVMGEESGLFHRGIS 505

Query: 493 AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYP 552
            E  DR  L L G Q++L+ ++    K P +LVL++G P  + +   +  + AI+ A YP
Sbjct: 506 GEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEHEN--LNAILEAWYP 562

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           G+ GG A+AD++FG  NP GKLP++ +P   +T         +PS    Y   + +    
Sbjct: 563 GEEGGNAVADVIFGDYNPSGKLPIS-FPA--VTGQIPVYYNRKPSAFSDYIDESAK---- 615

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLG 671
             +YPFGHG+SYT F ++                           +K++  K N L  + 
Sbjct: 616 -PLYPFGHGLSYTTFEYS--------------------------DLKISPEKVNSLEKVE 648

Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           +   +KN G++DG   + L      A    P K+L  F+K+++  G  +RV   ++  + 
Sbjct: 649 ISFTIKNTGNRDGEEVVQLYIHDQVASLERPVKELKGFKKIYLKPGESKRVTFTLY-PEQ 707

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           L+  D      +  G   + IG +   + L
Sbjct: 708 LAFYDEFMRFIVEKGVFEVMIGSSSEDIRL 737


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 158/397 (39%), Positives = 242/397 (60%), Gaps = 28/397 (7%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DL+ RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 86  ---TVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 142

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGEDPRYLKIVSTPKHFVANNEEH--- 199

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV +GK AS+M +YN +N VP   +  +L++ +R 
Sbjct: 200 -NRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAWLLQKVLRQ 258

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      +  +A ++ 
Sbjct: 259 DWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGDDVYDEYLLNAYKQY 318

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++S+ DI++A  + L  +M+LGMFD +    PY  + P  + + DHQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVALDAARECIVL 377

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 378 LKNQKNMLPLNVDKLKSIAVVGINA--GTCEFGDYSG 412



 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A  + +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 590 LYGEAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+AD+LFG  NP G+LP+T+Y    +  
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++                
Sbjct: 705 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYS---------------- 742

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
              +  +     KVT          V   +KN G + G     V+   P  G   P K+L
Sbjct: 743 ---DLKVKDSTDKVT----------VSFRLKNTGRRKGDEVAQVYVRIPETGGIVPIKEL 789

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
             F +V +  G  + + I +   + L   D +  + I P G  ++ +G +   + L 
Sbjct: 790 KGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVGASSKDIRLQ 845


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 216/685 (31%), Positives = 333/685 (48%), Gaps = 89/685 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GR ++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRRMGRAIATEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GED  L+G   A+ V+G QG     + KV A+ KHF AY    W        +A V
Sbjct: 223 EETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHV 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +++E+    PFR  V  G + SVM SYN+++G+P  A+ N+L   ++  W+  G++VS
Sbjct: 280 GNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP--FLGLHTESAVQRGLLSEIDINN 345
           D  ++G   +      T  EAA  A+ AG+D D G   + G    +AV+RG + E+ IN 
Sbjct: 339 DLYAIGGLRE-HGVADTDYEAAVKAVNAGVDSDLGTNVYAG-QLVNAVKRGDVQEVVINK 396

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPS 403
           A+   L ++  +G+FD      P+     P+  V + +H ELA E ARQ I+LLKN+   
Sbjct: 397 AVSRILALKFHMGLFD-----HPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNEL 451

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRYART----IHQQGC 457
           LPL+  + +T+AVIGPN+D    M+G+Y          T L GI +        I+ +GC
Sbjct: 452 LPLNK-KMKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGC 510

Query: 458 KDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEAE 494
             V  +    F  AI+A+RQ+D  ++VMG                       +      E
Sbjct: 511 A-VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGE 569

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L L GRQ+EL+ +V   +K P +LVL+ G P+       +  + AI+ A YPG 
Sbjct: 570 GYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPL--LLEGIEAEVDAIVDAWYPGM 626

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     +Y        +G  
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTIS-VPRS-VGQLPVYYNTKRKGNRSKYIEE-----EGTP 679

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            YPFG+G+SYT+F ++   A  V A                         C    + + V
Sbjct: 680 RYPFGYGLSYTSFNYSDLKAEVVEA----------------------EDSC---LVNISV 714

Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
            V+N GS+DG   + L      A    P KQL  F+++H+  G  + +   +   K L++
Sbjct: 715 KVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLD-KKSLAL 773

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSV 758
             ++    +  G   + +GG+   +
Sbjct: 774 YMQNEEWAVEPGRFTLMLGGSSEQI 798


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 164/418 (39%), Positives = 246/418 (58%), Gaps = 29/418 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  ++ P  +R+ DL+ +L+++EK+ LL + +  +PRL I  Y   +EALHGV  V PG 
Sbjct: 28  YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPGN 85

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
                    T FPQ I  A+ +N  L   I  V+SDEARA +N    G          LT
Sbjct: 86  --------FTVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLT 137

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSGK   S+V+GLQG D   LK+ ++ KHF A +
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAANN 197

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +RF  N  +S++D+ + +   F  C++EGK AS+M +YN +N VP   +  +L
Sbjct: 198 EEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLL 253

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
           K+ +R +W  +GY+VSDC           +  T E AAA +I+AGLDL+CG  + +    
Sbjct: 254 KKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGDEVYMEPLL 313

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +A ++ ++SE +I++A  + L  +MRLG+FD +P+  PY  + P  V    H +LALEAA
Sbjct: 314 NAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSKLALEAA 372

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGR 447
           RQ IVLLKN+   LPL   + +++AV+G N+    +  G+Y+G       + L+GI +
Sbjct: 373 RQSIVLLKNEKKFLPLDSKKIKSIAVVGINA--GNSEFGDYSGTPVNQPVSILEGIKK 428



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/299 (30%), Positives = 149/299 (49%), Gaps = 45/299 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A D  R+ D T+ V+G+++SIE E  DR  + LP  QQ  + +    +   T++VL
Sbjct: 591 LYGEAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPN-TVVVL 649

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AI+ A YPG+AGGTA+A++LFG  NPGGKLP+T+Y    +  
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDE 705

Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           LP   +  +R        GRTY+F++G  +Y FGHG+SYT F +               +
Sbjct: 706 LPAFDDYDIRK-------GRTYQFFEGDPLYAFGHGLSYTTFSY---------------K 743

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG---AHTLLVFSTPPAGHWAPH 702
             SI+A  +G  + V+              +KN G  +G   A   + +    +    P 
Sbjct: 744 KLSIDA--AGDVVSVSFT------------LKNTGKYEGDEVAQLYVKYQGSDSQVKLPL 789

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           KQL  FE++H+  G  +++ + +   +     +  G    P G++   +G    ++ L 
Sbjct: 790 KQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVGTASDAIQLQ 848


>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 895

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 161/399 (40%), Positives = 230/399 (57%), Gaps = 32/399 (8%)

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           +++ +EKV   ++ A A+PRLG+  YEWW+E LHG++  G           AT FPQ I 
Sbjct: 53  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 102

Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
            A+++N  L E +G V S EARA +N  GG        AGLT WSPN+NIFRDPRWGRG 
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
           ET GEDP L+G+ A  ++ GLQG D    +  A+ KH   +     +G +  R  F+  V
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 217

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           S  D+E T+   FR  +++G+  SVMC+YN ++G P CA   +L   +RG+W   G++VS
Sbjct: 218 SPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVS 277

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCD+V       +F +    ++A A++AG DL+CG +       A+ RG   E  ++ +L
Sbjct: 278 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDQSL 336

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
           V     + RLG    +    PY  LG KDV +  H+ LAL+AA+Q IVLL+N+  +LPL 
Sbjct: 337 VRLFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 395

Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
             +R   +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 396 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 431



 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 138/278 (49%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  L LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 690

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 691 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAY----- 741

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
              S    GRTYR++KG  ++ FG G+SYT F +    AP + A  L             
Sbjct: 742 --VSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTL------------- 783

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
                  A  N   L V+  V N G++ G   + V+  PP G  +P + LV F++V +  
Sbjct: 784 ------QAGAN---LQVRTQVSNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQP 834

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G  + VG  +   + LS VDR+G R +  G++ + +GG
Sbjct: 835 GEAREVGFEL-TPRQLSDVDRAGQRAVQPGDYRVFVGG 871


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 163/404 (40%), Positives = 230/404 (56%), Gaps = 28/404 (6%)

Query: 52  RVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGAT 111
           R   L+ +++  EK+   ++ A A+PRLGI  YEWWSE LHG++  G           AT
Sbjct: 55  RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------AT 104

Query: 112 SFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGTA-------GLTYWSPNVNIFRD 162
            FPQ I  A+S+N  L  A+G V S EARA +N  GG         GLT WSPN+NIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRG ET GEDP L+G+ A  ++ GLQG D    +  A+ KH   +   +     R  
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPTHPRTIATPKHLAVH---SGPESGRHG 221

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+  VS  D E T+   FR  ++EG   SVMC+YN ++G+P CA   ++   +RG W   
Sbjct: 222 FDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAADWLIDGRVRGNWGFK 281

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           G++VSDCD++       ++ +    +AA A++AG DL+CG +      +A+ RG   E  
Sbjct: 282 GFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRDLGTALDRGEAEEAM 340

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           ++ +LV     + RLG      S  PY  LG KD+ +P H+ LAL+AA+Q +VLL+N+  
Sbjct: 341 LDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQAAQQSLVLLQNRND 399

Query: 403 SLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +LPL   +R   +AVIGPN+D    +  NY G +    TPLQG+
Sbjct: 400 TLPLRPGLR---LAVIGPNADALAALEANYQGTSVAPVTPLQGL 440



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 95/278 (34%), Positives = 138/278 (49%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  L LP  QQ L+ +   + K P I+VLMSG  + + 
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLERAKASGK-PLIVVLMSGSAVALN 699

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +LP       
Sbjct: 700 WAKQ--HADAILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFY--RSTKDLP------- 748

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT+F +T   AP + +  L             
Sbjct: 749 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTHFAYT---APQLSSTTLQAGD--------- 796

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
                        TL V   V+N G++ G   + V+   P    +P + LV F++V +  
Sbjct: 797 -------------TLHVTTTVRNTGARAGDEVVQVYLQYPPRAQSPLRALVGFQRVSLQP 843

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G  + +   +   + LS VDRSG R +  G++ + +GG
Sbjct: 844 GEARTLSFALE-PRQLSDVDRSGQRAVEAGDYRLFVGG 880


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 162/410 (39%), Positives = 240/410 (58%), Gaps = 29/410 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A T+ L +  ++ P  +R+ DL+ RL+++EK+ LL + +  +PRLGI  Y   +EALHGV
Sbjct: 24  AQTKEL-YKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV 82

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----- 149
             V PG          T FPQ I  AS +N  L   I   +SDEAR  +N    G     
Sbjct: 83  --VRPGN--------FTVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKG 132

Query: 150 -----LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
                LT+WSP VN+ RDPRWGR  ET GEDP LSGK   ++V+GLQG+D   LK+ ++ 
Sbjct: 133 FFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKIVSTP 192

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF A + ++    +RF  N  +S++++ + +   F  C+ EGK  S+M +YN +N VP 
Sbjct: 193 KHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPC 248

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
             +P +L + +R EW  NGY+VSDC   G       +  TPE AA  +I+AGLDL+CG  
Sbjct: 249 TLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDN 308

Query: 325 LGLH-TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
           + +    +A ++ ++++ DI+ A    L  +M LG+FD +P   PY  + P  V    H+
Sbjct: 309 VYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHR 367

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           +LALEAARQ +VLLKN+   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 368 QLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVGINA--GNCEFGDYSG 415



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 144/296 (48%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ D T+ V+G+++SIE E  DR  + LP  QQ  + +   A+   T++VL
Sbjct: 593 LYGEAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPN-TVVVL 651

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AI+ A YPG+ GGTA+A+ LFG  NPGG+LP+T+Y    +  
Sbjct: 652 VAGSSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY--RSLDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY +++   +YPFG+G+SYT F                  +
Sbjct: 708 LPAFD------DYDIQKGRTYMYFENKPLYPFGYGLSYTRF-----------------DY 744

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
            ++ + +S  A+             ++  VKN G   G     V+   P +G   P KQL
Sbjct: 745 KNLKSEVSDDAV------------NLKFTVKNTGKYAGDEVAQVYVRFPESGIKVPLKQL 792

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             FE+VH+  G   +V ++I   +     ++ G    P G +   +G +   + L 
Sbjct: 793 KGFERVHIGKGKSAQVSVSIPKKELRLWDEKDGKFYTPSGNYIFMVGSSSDDIRLQ 848


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 171/432 (39%), Positives = 242/432 (56%), Gaps = 32/432 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   +L   +R  DL+GRL+L+EK  L+ + + A+PRLGIK Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM-----YNGGTA---GLTY 152
                    AT FPQ I   +SFN  L   +   VSDEARA        GG     GLT 
Sbjct: 84  ---------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKRYQGLTM 134

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG +G +  K+ A  KHF  + 
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYDKLHACAKHFAVHS 194

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  F+A+ V  +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 195 GPEWN---RHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRL 251

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLH 328
           L + +R EW  +G IVSDC ++  +++     + P  E A+A A+  G D++CG      
Sbjct: 252 LVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPDKEHASAKAVLTGTDVECGESYA-S 310

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
              AV+ GL+ E  I+ +L   +  +  LG  D  P    +  +    V + +H+ELAL 
Sbjct: 311 LPQAVKAGLIDEKKIDISLKRLMKARFELGEMDN-PELVSWAQIPYSVVDSKEHRELALR 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
            AR+ +VLL+N    LPL+  +   VAV+GPN++ +V   GNY G      T L+GI +Y
Sbjct: 370 MARESLVLLQNNQNVLPLN--KSLKVAVVGPNANDSVMQWGNYNGFPGHTVTLLEGIRQY 427

Query: 449 ---ARTIHQQGC 457
              A+ I++ GC
Sbjct: 428 LPEAQLIYEPGC 439



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 89/302 (29%), Positives = 136/302 (45%), Gaps = 49/302 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           I   + AD  +   G+  ++E E +          DR  + LP  Q  L++++  A K  
Sbjct: 594 IQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGGTAIA++LFG  NP G+LP+T+Y  
Sbjct: 653 VVFVNFSGSAI--ALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYKS 710

Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              + LP   + +M+        GRTYR+     ++PFGHG+SYT F             
Sbjct: 711 --TSQLPDFEDYSMK--------GRTYRYMAEAPLFPFGHGLSYTTF------------- 747

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
               R+G  +A++S + +K            + + V N G +DG   + V+   P     
Sbjct: 748 ----RYG--DASLSTQEVKEGEQAI------LTIPVSNTGERDGEEVVQVYLRRPGDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P   L AF++V++  G    V I++    +      + T R   G++ I  GGT     L
Sbjct: 796 PSHALRAFKRVNIAKGTTGNVTISLSKEDFEWFDTETNTMRPIEGDYEILYGGTSELKQL 855

Query: 761 HA 762
            A
Sbjct: 856 KA 857


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DL+ RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 36  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 89  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 202

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +LK+ +R 
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 261

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      +  +A ++ 
Sbjct: 262 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 321

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++S+ DI++A  + LT +M+LG+FDG     PY  + P  + + +HQ++AL+AAR+ IVL
Sbjct: 322 MVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAARECIVL 380

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LKN+   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 381 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 415



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/296 (29%), Positives = 142/296 (47%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+AD+LFG  NP G+LP+T+Y    +  
Sbjct: 651 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++            DG +
Sbjct: 708 LPAFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 754

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                 T+ V   +KN G + G     V+   P  G   P K+L
Sbjct: 755 ----------------------TISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 792

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F ++ + +G  + V I +   +        G   +P G  +I +G +   + L 
Sbjct: 793 KGFRRIPLKSGESRVVDIELDKEQLRYWDAGLGQFIVPQGAFDIMVGASSKDIRLQ 848


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 178/458 (38%), Positives = 247/458 (53%), Gaps = 39/458 (8%)

Query: 10  LGLILLSASSSGLAARE---PFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKV 66
           LGL L S   +G +      P     +        +   +L I  RV+DL+ R+ L EK+
Sbjct: 12  LGLTLASLLFTGCSPDNNPVPKPVSERSTANEQPAYMDTTLDIDTRVDDLVSRMDLAEKI 71

Query: 67  KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNAT 126
             + + + A+  LGI  Y+WW+EALHGV+  G           AT FPQ I  A+ ++  
Sbjct: 72  SQMYNESPAIEHLGIAEYDWWNEALHGVARAG----------KATVFPQAIGMAAMWDRE 121

Query: 127 LWEAIGRVVSDEARAMY-----NG---GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
               I   VSDEARA +     NG      GLT+WSPN+NIFRDPRWGRGQET GEDP L
Sbjct: 122 TMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQETYGEDPYL 181

Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
           +G+ A  Y+ GLQG +   LK AA  KHF  +         R   N   S +D+ +T+  
Sbjct: 182 TGELALPYISGLQGENPKYLKTAAMAKHFAVHSGPE---KSRHSDNYIASPKDLNETYLP 238

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY-- 296
            F   V+EG V SVMC+YN+VN  P C +  +LK T+RG+W   G++VSDC ++  +Y  
Sbjct: 239 AFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAP 298

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSEIDINNALVNTL 351
           +  H    P  AAA A+R+G DL+CG      F  LH   A+QR ++++ +I+ ++   +
Sbjct: 299 EAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHF--ALQREMITQDEIDQSVKRLM 356

Query: 352 TVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
             + +LGMFD  P  Q PY  +    V +  H  L  +AA +  VLLKN G  LPL   +
Sbjct: 357 KTRFKLGMFD--PDDQVPYSKIPMDVVGSQAHLALTQKAAEKSFVLLKNSG-ILPLK--K 411

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY 448
              VA+IGPN+     ++GNY G      TPL GI +Y
Sbjct: 412 SSKVAIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQY 449



 Score =  115 bits (289), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 43/272 (15%)

Query: 486 GLDQSIEAEALD---RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPR 542
           G + S+E E  D   R  + LP  Q++L++ +   +K P +LV  SG  I + +A N+  
Sbjct: 636 GEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN-- 692

Query: 543 IAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKR 601
           + AI+   YPG+A GTA+A IL+G  +P G+LP+T+Y    + +LP   + AM       
Sbjct: 693 VDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY--RSLDDLPGFKDYAMT------ 744

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
              RTY++Y+G V+YPFG+G+SYT F ++  +AP  +A              SG+ + +T
Sbjct: 745 --NRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPATMA--------------SGEPLAIT 788

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
               N            V S +     +    P  G   P ++L  F+++++  GA Q V
Sbjct: 789 AQVSNS---------GKVASDEVVQVYVSMKVP--GLSLPQRELKEFKRIYLEPGASQTV 837

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
             +I   K LS VD  G R    G   + +GG
Sbjct: 838 EFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGG 868


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 178/440 (40%), Positives = 245/440 (55%), Gaps = 43/440 (9%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           LPF    LP+ +R  DL+ RL+LQEKV L+   ++ +PRLGIK Y WW+EALHGV   G 
Sbjct: 22  LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT--------AGLT 151
                     AT FPQ I  A++F+          VSDEARA Y+            GLT
Sbjct: 82  ----------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLT 131

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRGLQG S+    K+ A  KH+  +
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYDKLHACAKHYALH 191

Query: 211 DLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  F+   +S +D+ +T+   F+  V +G V  VMC+YN+  G P C    
Sbjct: 192 SGPEWN---RHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCGSNR 248

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG-PFLG 326
           +L   +R EW  +G +VSDC ++  +Y   H  + P  E A A A++AG DLDCG  +  
Sbjct: 249 LLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDCGVDYYA 308

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
           L  + AV+ G+++E  I+ +L   L  +  LG+ D E     S  PY       V +  H
Sbjct: 309 L--QKAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIPY-----TVVDSEKH 361

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRH-RTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           +E ALE AR+ + LLKN   +LPLS  +H   +AVIGPN++ +V M GNY G      T 
Sbjct: 362 REKALEMARKSMTLLKNDHGTLPLS--KHCGKIAVIGPNANDSVMMWGNYNGFPSHTVTI 419

Query: 442 LQGIGRY---ARTIHQQGCK 458
           L+GI       + I+ +GC+
Sbjct: 420 LEGITHKLGAEQIIYDKGCE 439



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/307 (30%), Positives = 146/307 (47%), Gaps = 51/307 (16%)

Query: 474 ASRQADATILVM--GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           A+R  DA ++V   G+   +E E L          DR  + LP  Q++L+ ++    K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            IL+L SG  I ++ A+ D    AII A Y GQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 VILILCSGSAIGLS-AEVD-LADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710

Query: 582 EYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
                LP   + +M+        GRTYR+++G  ++PFG+G+SYT+F             
Sbjct: 711 --TEQLPDFEDYSMQ--------GRTYRYFEGEALFPFGYGLSYTSFEIG---------- 750

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                     A +S K I+   +   +LT      V+N G  DG   + ++         
Sbjct: 751 ---------KARLSKKRIRENESVSLKLT------VENTGKLDGDEVIQIYIRKLQDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++ H+ AG ++ V  ++    +      S T R+  GE+ I  G +     L
Sbjct: 796 PLKTLRAFKRFHLRAGEKKDVTFHLQNDHFNFFDTESNTMRVMPGEYEILYGASSLEKDL 855

Query: 761 HAATLGV 767
               + +
Sbjct: 856 RRINIKI 862


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DL+ RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 86  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 199

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +LK+ +R 
Sbjct: 200 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 258

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      +  +A ++ 
Sbjct: 259 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 318

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++S+ DI++A  + LT +M+LG+FDG     PY  + P  + + +HQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPSVIGSKEHQQIALDAARECIVL 377

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LKN+   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 378 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 412



 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 142/296 (47%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+AD+LFG  NP G+LP+T+Y    +  
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++            DG +
Sbjct: 705 LPAFD------DYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 751

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                 T+ V   +KN G + G     V+   P  G   P K+L
Sbjct: 752 ----------------------TVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 789

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F ++ + +G  + V I +   +        G   +P G  +I IG +   + L 
Sbjct: 790 KGFRRIPLKSGESRVVEIELDKEQLRYWDAGLGQFIVPQGAFDIMIGASSKDIRLQ 845


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 159/397 (40%), Positives = 243/397 (61%), Gaps = 28/397 (7%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DL+ RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 85

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 86  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++   
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAANNEEH--- 199

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +LK+ +R 
Sbjct: 200 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQ 258

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRG 336
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      +  +A ++ 
Sbjct: 259 DWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLLNAYKQY 318

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++S+ DI++A  + LT +M+LG+FDG     PY  + P  + + +HQ++AL+AAR+ IVL
Sbjct: 319 MVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPSVIGSKEHQQIALDAARECIVL 377

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LKN+   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 378 LKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSG 412



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/296 (29%), Positives = 142/296 (47%), Gaps = 41/296 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 647

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG+ GGTA+AD+LFG  NP G+LP+T+Y    +  
Sbjct: 648 LVAGS-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDE 704

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++            DG +
Sbjct: 705 LPAFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVK-------DGAN 751

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                 T+ V   +KN G + G     V+   P  G   P K+L
Sbjct: 752 ----------------------TVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKEL 789

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F ++ + +G  + V I +   +        G   +P G  +I +G +   + L 
Sbjct: 790 KGFRRIPLKSGESRVVEIELDKEQLRYWDAGLGRFIVPQGAFDIMVGASSKDIRLQ 845


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 216/685 (31%), Positives = 333/685 (48%), Gaps = 89/685 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GR ++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRRMGRAIATEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GED  L+G   A+ V+G QG     + KV A+ KHF AY    W        +A V
Sbjct: 223 EETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHV 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +++E+    PFR  V  G + SVM SYN+++G+P  A+ N+L   ++  W+  G++VS
Sbjct: 280 GNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP--FLGLHTESAVQRGLLSEIDINN 345
           D  ++G   +      T  EAA  A+ AG+D D G   + G    +AV+RG + E+ IN 
Sbjct: 339 DLYAIGGLRE-HGVADTDYEAAVKAVNAGVDSDLGTNVYAG-QLVNAVKRGDVQEVVINK 396

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPS 403
           A+   L ++  +G+FD      P+     P+  V + +H ELA E ARQ I+LLKN+   
Sbjct: 397 AVSRILALKFHMGLFD-----HPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNEL 451

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRYART----IHQQGC 457
           LPL+  + +T+AVIGPN+D    M+G+Y          T L GI +        I+ +GC
Sbjct: 452 LPLNK-KTKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGC 510

Query: 458 KDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEAE 494
             V  +    F  AI+A+RQ+D  ++VMG                       +      E
Sbjct: 511 A-VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGE 569

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR+ L L GRQ+EL+ +V   +K P +LVL+ G P+       +  + AI+ A YPG 
Sbjct: 570 GYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPL--LLEGIEAEVDAIVDAWYPGM 626

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     +Y        +G  
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTIS-VPRS-VGQLPVYYNTKRKGNRSKYIEE-----EGTP 679

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            YPFG+G+SYT+F ++   A  V A                         C    + + V
Sbjct: 680 RYPFGYGLSYTSFNYSDLKAEVVEA----------------------EDSC---LVNISV 714

Query: 675 DVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
            V+N GS+DG   + L      A    P KQL  F+++H+  G  + +   +   K L++
Sbjct: 715 KVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLD-KKSLAL 773

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSV 758
             ++    +  G   + +GG+   +
Sbjct: 774 YMQNEEWAVEPGRFTLMLGGSSEQI 798


>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 909

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 161/399 (40%), Positives = 230/399 (57%), Gaps = 32/399 (8%)

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           +++ +EKV   ++ A A+PRLG+  YEWW+E LHG++  G           AT FPQ I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116

Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWGRGQ 169
            A+++N  L E +G V S EARA +N  GG        AGLT WSPN+NIFRDPRWGRG 
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
           ET GEDP L+G+ A  ++ GLQG D    +  A+ KH   +     +G +  R  F+  V
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHLAVH-----SGPEPGRHGFDVDV 231

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           S  D+E T+   FR  +++G+  SVMC+YN ++G P CA   +L   +RG+W   G++VS
Sbjct: 232 SPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVS 291

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DCD+V       +F +    ++A A++AG DL+CG +       A+ RG   E  ++ +L
Sbjct: 292 DCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDKSL 350

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
           V     + RLG    +    PY  LG KDV +  H+ LAL+AA+Q IVLL+N+  +LPL 
Sbjct: 351 VRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLR 409

Query: 408 -HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
             +R   +AVIGPN+D    +  NY G +    TPL G+
Sbjct: 410 PGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLGL 445



 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 93/279 (33%), Positives = 136/279 (48%), Gaps = 50/279 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  L LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 704

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK      AI+ A YPGQ+GGTAIA +L G  NPGG+LP+T+Y      +LP       
Sbjct: 705 WAKQ--HADAIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFY--RSTKDLPAY----- 755

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD-GRHGSINATIS 654
              S    GRTYR++KG  ++ FG G+SYT F +    AP + A  L  G H        
Sbjct: 756 --VSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA---APQLSATTLQAGAH-------- 802

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
                          L V+  V+N G++ G   + V+   P    +P + LV F++V + 
Sbjct: 803 ---------------LQVRTQVRNSGTRAGDEVVQVYLEFPQRAQSPLRTLVGFQRVTLQ 847

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            G  + V   +   + LS VDR+G R +  G++ + +GG
Sbjct: 848 PGEARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGG 885


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 170/436 (38%), Positives = 247/436 (56%), Gaps = 43/436 (9%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  +T  F  ++ P  +R+ DL+ RL+++EKV LL++ A  +PRL I  Y   +EALHGV
Sbjct: 22  AQNKTELFRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGV 81

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---------- 144
             V PG          T FPQ I  A+++N  L   +   +SDEAR  +           
Sbjct: 82  --VRPGE--------FTVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIA 131

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
           GG+  LT+WSP VN+ RDPRWGR  ET GEDP LSG+    +V+GLQG +   LK  ++ 
Sbjct: 132 GGSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGDNPRYLKTVSTP 191

Query: 205 KHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           KHF A + ++    +R   NA++S++D+ + +   F  C+++GK  S+M +YN VN VP 
Sbjct: 192 KHFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPC 247

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-- 322
             +  ++K+ +RG+W  NGYIVSDC +         +    E AA  A++AGLDL+CG  
Sbjct: 248 TVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDR 307

Query: 323 ----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC 378
               P L  + E  V     SE +I++A  + L  +M LG+FD +PS  PY  + P  + 
Sbjct: 308 VYTAPLLKAYNEYMV-----SEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIG 361

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
             +HQELALE ARQ +VLLKNQ   LPL+  + R++AV+G ++       G+Y+G     
Sbjct: 362 CKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNP--K 417

Query: 439 TTP---LQGIGRYART 451
            TP   L GI +YA  
Sbjct: 418 NTPVSVLDGIKKYAEN 433



 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 88/296 (29%), Positives = 149/296 (50%), Gaps = 42/296 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           LFG A   +++ D T+ V+G+++SIE E  DR  L LP  QQE + ++   +   T++VL
Sbjct: 592 LFGDAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPN-TVVVL 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  + AI+ A YPG+ GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 651 VAGSSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS--LDE 706

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +             RTY++++G  +Y FG+G+SYTNF +                 
Sbjct: 707 LPAFD-------DYSVKNRTYQYFEGKPLYEFGYGLSYTNFKY----------------- 742

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                    K+I  ++      T+ +  ++ NVG  DG     V+   P  G + P KQL
Sbjct: 743 -------KKKSIMQSND-----TVDITFNLSNVGKYDGDEVAQVYVRYPETGTYMPLKQL 790

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +VH+  G    + I+I   +     +++     P GE+   +GG+  ++S+ 
Sbjct: 791 KGFSRVHLKKGKSADITISIPKKELRYWDEKTRQFVTPTGEYVFQVGGSSENISIE 846


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 164/403 (40%), Positives = 244/403 (60%), Gaps = 40/403 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P+ +RV DLI RL+++EK+ LL + +  +PRLGI  Y   +EALHGV  V PG +F    
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG-RF---- 88

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 89  ---TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V GLQG D   LK+ ++ KHF A + ++   
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAANNEEH--- 202

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV EGK AS+M +YN +N VP   +  +L++ +R 
Sbjct: 203 -NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSWLLQKVLRQ 261

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC------GPFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+C      GP L     +
Sbjct: 262 DWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL-----N 316

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           A ++ ++S+ DI++A  + LT +M+LG+FD GE    PY  + P  + + +HQ++AL+AA
Sbjct: 317 AYKQYMVSDADIDSAACHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEHQQIALDAA 374

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ IVLLKNQ   LPL+  + +++AV+G N+       G+Y+G
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSG 415



 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 147/298 (49%), Gaps = 45/298 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIIVV 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  + AI+ A YPG+ GGTA+A++LFG  NP G+LP+T+Y    +  
Sbjct: 651 LVAGS-SLAVNWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SY++F ++           L  + 
Sbjct: 708 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYS----------DLQVKD 751

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
           G    T+S +                   +KN G ++G     V+   P  G   P K+L
Sbjct: 752 GGDEVTVSFR-------------------LKNTGKRNGDEVAQVYVRIPETGGIVPLKEL 792

Query: 706 VAFEKVHVPAGAQQRVGINI--HVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +V + +G  +RV I +     +Y  V    G   +P G  ++ +G +   + L 
Sbjct: 793 KGFRRVPLKSGESRRVEIKLDKEQLRYWDV--EKGQFVVPKGAFDVMVGASSKDIRLQ 848


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 157/403 (38%), Positives = 239/403 (59%), Gaps = 28/403 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  +  P  +R+ DL+ RL+++EK+ LL + +  +PRL I+ Y   +EALHG+  V PG 
Sbjct: 27  YKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGI--VRPGN 84

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LT 151
                    T FPQ I  AS +N      I  V+SDEARA +N    G          LT
Sbjct: 85  --------FTVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLT 136

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSGK   ++V+GLQG+D   LKV ++ KHF A +
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKVVSTPKHFAANN 196

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +RF  N ++S++D+ + +   F  C+++GK  S+M +YN +N VP   +  +L
Sbjct: 197 EEH----NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTWLL 252

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TE 330
           K+ +R +W  NGY+VSDC +  +      +  TPE AA  A++AGLDL+CG  + +    
Sbjct: 253 KKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEPLM 312

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +A ++ ++SE +I+ A    L  +M LG+FD +P+  PY  L P  V    H+ +ALEAA
Sbjct: 313 NAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALEAA 371

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           RQ +VLLKN+   LP++  + +++AV+G N+       G+Y+G
Sbjct: 372 RQSLVLLKNENNFLPINPKKIKSIAVVGINA--GNCEFGDYSG 412



 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 90/297 (30%), Positives = 139/297 (46%), Gaps = 43/297 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A ++ D TI VMG+++SIE E  DR  + LP + QEL  + +        +VL
Sbjct: 590 LYGDAKKAIQECDMTIAVMGINKSIEREGRDRDHIELP-KDQELFIEEAYKLNPKMAVVL 648

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + V +   D  + AI+ A YPG+ GGTA+A+ LFG  NP G+LP+T+Y    + +
Sbjct: 649 VAGSSLAVNWM--DEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRS--LDD 704

Query: 587 L-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           L P  + A++ +       RTY ++ G  +Y FG+G+SYT F               D R
Sbjct: 705 LPPFDDYAVQKN-------RTYMYFTGKPLYAFGYGLSYTKF---------------DYR 742

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQ 704
             S++       +  T              +KN G  +G     V+   P  G   P KQ
Sbjct: 743 KLSVDQDAENVRLSFT--------------IKNSGKYNGDEVAQVYVQFPEIGVKVPIKQ 788

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L  FE+VH+  G    V I +   +     +R G    P G +   +G +   + L 
Sbjct: 789 LKGFERVHIAKGKTLPVTITVPKKELRIWNERKGEFFTPSGNYVFMVGASSDDIRLQ 845


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 168/415 (40%), Positives = 239/415 (57%), Gaps = 39/415 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           P   R+ DL+ RL+++EK+ LL + +  +PRL I  Y   +E+LHGV  V PG +F    
Sbjct: 36  PTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGV--VRPG-RF---- 88

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  AS +N  L   I   +SDEAR  +N    G          LT+WSP V
Sbjct: 89  ---TVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTV 145

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++VRGLQG D   LK+ ++ KHF A + ++   
Sbjct: 146 NMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDDPRYLKIVSTPKHFAANNEEH--- 202

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F MCV +GK AS+M +YN +N VP  A+P +L + +R 
Sbjct: 203 -NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCTANPWLLTKVLRH 261

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
           +W  NGY+VSDC    +      +  T E AA  +I+AGLDL+CG      P L  + + 
Sbjct: 262 DWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDVYMQPLLNAYNQY 321

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
            V R      DI+ A    L  +M LG+FD +P   PY  + P  V + +H++LALEAAR
Sbjct: 322 MVSRA-----DIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHKQLALEAAR 375

Query: 392 QGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
           Q IVLLKN   +LPL+  + +++AV+G N+    +  G+Y+GI A    + LQGI
Sbjct: 376 QSIVLLKNNNRTLPLNPKKVKSIAVVGINA--GNSEFGDYSGIPANAPVSILQGI 428



 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 91/296 (30%), Positives = 147/296 (49%), Gaps = 43/296 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           ++G A  A R+ +  I V+G++++IE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 593 MYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVN--PNIVVV 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ-EYIT 585
           +  G   +A    D  + AI+ A YPG+ GGTA+A++LFG  NPGG+LP+T+Y   E I 
Sbjct: 651 LVAGS-SLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYYNSLEEIP 709

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           +    ++           GRTY+++KG  +YPFG+G+SYT F +                
Sbjct: 710 SFDDYDITK---------GRTYQYFKGKPLYPFGYGLSYTTFAYK--------------- 745

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQ 704
             ++    +G  IKV+             ++KN G  DG     V+   P +G + P K+
Sbjct: 746 --NLQINDNGNNIKVS------------FELKNTGRMDGDEVSQVYVKIPSSGIFMPIKE 791

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           L  F++  +  GA + V INI         D + T   P GE+   IG +   + L
Sbjct: 792 LKGFQRSTLKKGATKNVEINIRKDLLRYWDDATETFITPKGEYEFMIGTSSQDIQL 847


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 170/454 (37%), Positives = 241/454 (53%), Gaps = 37/454 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LPI +R++DL+ RL+ +EKV  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 86  ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ +   + V+GLQG D    K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+  V+ +D+  T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFT------STPEEAAADAIRAGLDLDCG-PFL 325
             +R  W     I+SDC ++  +++    T         E A+ADA+  G DL+CG  + 
Sbjct: 254 DILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQE 384
            L    A++ G +SE D++ +L   L  +  LGMFD  P  Q PY  +    V +P+H  
Sbjct: 314 AL--VKALKDGKISENDLDVSLRRLLKGRFELGMFD--PDEQVPYAQIPYNVVESPEHVA 369

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
            ALE A + +VLLKN+  +LPLS    R +AV+GPN+  +  +  NY G      T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428

Query: 445 IGRY---ARTIHQQGCKDVACADDQLFGAAIDAS 475
           I         I++ GC   A    Q  G+ I +S
Sbjct: 429 IRNKVPDTEVIYELGCNHAADFVIQDLGSHITSS 462



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 100/311 (32%), Positives = 141/311 (45%), Gaps = 49/311 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A     + AD  + V G+   +E E +          DR  + LP  QQE+V K   A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + VL +G  +  A    +  I AI+ A Y GQ  GTA+ADILFG  NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEEANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    I  LP   + +M+        GRTYR+     +YPFG+G+SYTNF +       
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I    AK   +TL    D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSSGKI----AKDQSVTL--TFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AF +VHV AG  Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATLGV 767
             +L    L +
Sbjct: 857 DKALQRLELTI 867


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F +          + LD  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                     + IKV        T  + + V N G++DG   + V+         P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 852

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 171/421 (40%), Positives = 242/421 (57%), Gaps = 29/421 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F  +  P  +R+ DL+ RL+++EK+ LL++ A A+ RLGI  Y   +EALHGV  V PG 
Sbjct: 29  FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG- 85

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
               DF   T FPQ I  A+ +N  L   I   +SDEAR  +           G +  LT
Sbjct: 86  ----DF---TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSG    ++V+GLQG+    LK  ++ KHF   +
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGNHPRYLKTVSTPKHFAVNN 198

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +R   NAKVS++D+ + +   F  C+ EGK  S+M +YN VN VP   +  ++
Sbjct: 199 EEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLI 254

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
           K  +RG+W  NGYIVSDC +        H+  T E AA  A++AGLDL+CG    G    
Sbjct: 255 KNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKAGLDLECGNQVYGEGLL 314

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
            A ++ ++SE DI++A    L  +M LG+FD +PS  PY  + P  V    HQ+LALEAA
Sbjct: 315 KAYRQYMVSEADIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPSVVGCKAHQDLALEAA 373

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGRYA 449
           RQ +VLLKN+   LPL+  + +++AV+G ++       G+Y+G       T L GI +YA
Sbjct: 374 RQSMVLLKNKDNFLPLNPQKVKSIAVVGISA--GHCEFGDYSGTPKNEPVTILDGIKQYA 431

Query: 450 R 450
            
Sbjct: 432 E 432



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 147/296 (49%), Gaps = 44/296 (14%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           +G A   + + D T+ V+G+++SIE E  DR  L LP  QQE + ++   +   T++VL+
Sbjct: 594 YGDAGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPN-TVVVLV 652

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           +G  + V +   D  + AI+ A YPG+ GG A+A++LFG  NPGG+LP+T+Y        
Sbjct: 653 AGSSLAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------ 704

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
            + E+    + S +  GRTY++++G  +Y FG+G+SYT F                 R+ 
Sbjct: 705 -LDEIPAFDNYSVK--GRTYQYFEGQPLYEFGYGLSYTKF-----------------RYK 744

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLV 706
           S   +++            R T+ V  +V N G  DG     V+   P  G + P KQL 
Sbjct: 745 SKGVSVA------------RDTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLKQLH 792

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
            F++VH+  G   +V + +   K L   D    + + P GE+   +G +   +   
Sbjct: 793 GFKRVHIKKGKTSKVTVGVPK-KDLRYWDEQERKFVTPKGEYTFMVGASSEDIKFR 847


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F +          + LD  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                     + IKV        T  + + V N G++DG   + V+         P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 727

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 225/725 (31%), Positives = 355/725 (48%), Gaps = 109/725 (15%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK--GYEWWSEALHGVSN 96
           T PF    LP  +R+++L+  ++L EKV  L S    VPRLGI+  G+   SE LHG++ 
Sbjct: 24  TFPFQNTGLPDNERLDNLLSLMTLDEKVNAL-STNLGVPRLGIRNTGH---SEGLHGMAL 79

Query: 97  VGPGTKFGGDFPG------ATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---NGGT 147
            GPG  +GG   G       T FPQ      +++  L + +  + + E R      N   
Sbjct: 80  GGPGN-WGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQK 138

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
            G+   +PN ++ RDPRWGR +E+ GED  L  +   ++V+GLQG+D    K A+  KHF
Sbjct: 139 GGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQGNDPKYWKSASLMKHF 198

Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            A    N N   R   ++   ++   + +  PF   + EG   + M SYN  NGVP   +
Sbjct: 199 LA----NSNEDGRDSTSSNFDERLFREYYSFPFYKGITEGGSRAFMASYNAWNGVPMTVN 254

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG- 326
           P ILK+  R EW  NG I +D  ++ +  +  H   T  E AA  ++A +    G FL  
Sbjct: 255 P-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAAVVKASV----GQFLDN 309

Query: 327 --LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP---- 380
              +   A+++GLL+E +I+N +     V ++LG+ D + S  PY  +G  D  +P    
Sbjct: 310 FRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIGVTDTVSPWNKQ 369

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
           D +    +   + +VLLKN    LPL+  + +++AVIGP ++    ++  Y+G     T 
Sbjct: 370 DTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EVLLDWYSG-----TP 422

Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAID-------ASRQADATILVMGLD----- 488
           P      YA +I  QG K+    D ++F A  D       A+R+AD  I+ +G       
Sbjct: 423 P------YAVSI-LQGIKNAVGKDIEVFYAPSDEMDKATLAARKADVAIVCVGNHPYGTD 475

Query: 489 --------QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
                    S   EA+DR  + L   +QE + K+ M +   T++VL+S  P  + +++ +
Sbjct: 476 ARWKISPVPSDGREAVDRKSITL---EQEDLVKLVMQANPKTVMVLVSNFPFAINWSQEN 532

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQS 599
             + AI+      Q  G  +AD++FG  +P G+   TW   + IT+L PM +  +R    
Sbjct: 533 --VPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWV--KSITDLPPMMDYDIR---- 584

Query: 600 KRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIK 659
               GRTY+++K   +YPFG G+SYT+F ++             G   S N T++     
Sbjct: 585 ---HGRTYQYFKSKPLYPFGFGLSYTSFEYS-------------GLETS-NPTLTD---- 623

Query: 660 VTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQ 718
                    ++ V V VKN+G +DG   + L  S P +    P KQL  F++V +PAG  
Sbjct: 624 ---------SIFVSVKVKNIGKRDGDEVIQLYVSYPDSKVERPMKQLKGFKRVFIPAGKS 674

Query: 719 QRVGI 723
           + V I
Sbjct: 675 KTVEI 679


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F +          + LD  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                     + IKV        T  + + V N G++DG   + V+         P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F +          + LD  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NIKLD-- 757

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                     + IKV        T  + + V N G++DG   + V+         P K L
Sbjct: 758 ----------QTIKVGE------TAKMVIPVTNAGNRDGEEVVQVYLKKQEDAEGPAKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  285 bits (728), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/301 (31%), Positives = 133/301 (44%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F +        + V     
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYDNIKLEQTIKV----- 762

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                    G+  K+             + V N G++DG   + V+         P K L
Sbjct: 763 ---------GETAKMV------------IPVTNTGNRDGEEVVQVYLKKQEDTEGPTKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1073

 Score =  285 bits (728), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 163/409 (39%), Positives = 237/409 (57%), Gaps = 38/409 (9%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF   +L   +R+ DL+ RL++ EK+ LL + + A+PRLGI  Y   +EALHGV  V PG
Sbjct: 28  PFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGNEALHGV--VRPG 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGL 150
            KF       T FPQ I  AS +N    + +   +SDEAR  +N          G +  L
Sbjct: 86  -KF-------TVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNELNQGKDQTAGASDLL 137

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
           T+WSP +N+ RDPRWGR  ET GEDP L+G    ++VRGLQG+D   +KV ++ KHF A 
Sbjct: 138 TFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQGNDPKYIKVVSTPKHFAAN 197

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
           + ++    +R   NA +S++D+ + +   F  C+ EG+  SVM +YN VNG+P   +  +
Sbjct: 198 NEEH----NRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGIPCTLNKWL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PF 324
           L   +R +W  +GY+VSDC +        H+  T EEAA+  I+AGLDL+CG      P 
Sbjct: 254 LTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECGDNVYITPL 313

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
           L     +A  RG+++  +I++A    L  +MRLG+FD +P+  PY  + P  V    H+E
Sbjct: 314 L-----NAYNRGMVTMSEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSIVGCEKHRE 367

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LALEAARQ +VLLKN    LP+     +++AV+G N+       G+Y+G
Sbjct: 368 LALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSG 414



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 148/296 (50%), Gaps = 43/296 (14%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           +G A +  R +D TI V+G+D++IE E  DR+ + LP  QQ  + +   A+   T++VL+
Sbjct: 733 YGDAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKANPN-TVVVLV 791

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           +G  + + +   D  I A++ A YPG+ GGTA+A+ LFG  NPGG+LP+T+Y    +++L
Sbjct: 792 AGSSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFYNS--LSDL 847

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P   +  +R +       RTY +++G  +YPFG+G+SYT+F +                 
Sbjct: 848 PAFDDYNVRNN-------RTYMYFEGKPLYPFGYGLSYTDFAY----------------- 883

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                    + + VT  + N   + V+  V N G+ DG     V+   P  G   P KQL
Sbjct: 884 ---------RGLDVTQDEEN---VTVKFFVSNTGNYDGDEVAQVYIQFPDQGTTLPLKQL 931

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F++VH+  G +  + + I   +     + +     P G +   +G +   + L 
Sbjct: 932 KGFKRVHISKGQETEITVRIPKKELRLWSENNSEFYTPEGNYIFLVGASSEDIRLQ 987


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  285 bits (728), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 175/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS   +  A +                    +L   +R  DL+ +L
Sbjct: 1   MRNRILYLFASLVILSGCGNQPAYKNS------------------NLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ +D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V EGKV  VMC+YN++ G P C    +L + +R EW   G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +PS   +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 136/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    I 
Sbjct: 659 NFSGSPI--AMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F                  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
           +G+I      + IKV        T  + + V N G++DG   + V+         P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKMVIPVTNTGNRDGEEVVQVYLKKQEDTEGPAKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 162/427 (37%), Positives = 235/427 (55%), Gaps = 35/427 (8%)

Query: 31  DPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEA 90
           DPK       PF    L   +RV +L+ +L+L+EKV  +++ A A+PRLGI  Y+WW+E 
Sbjct: 24  DPK------FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNET 77

Query: 91  LHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA-- 148
           LHGV+     T         T FPQ I  A++F+      +    + E RA+YN      
Sbjct: 78  LHGVARTPFKT---------TVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELN 128

Query: 149 -------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
                  GLTYW+PN+NIFRDPRWGRGQET GEDP L+     ++V+GLQG D   LK A
Sbjct: 129 RTKERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAA 188

Query: 202 ASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           A  KH+  +     +G +  R  F+  V+  ++ DT+   F+  V   KVA VMC+YN  
Sbjct: 189 ACAKHYAVH-----SGPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAF 243

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
              P CA   ++   +R +W+  GY+ SDC ++  ++           A+ADA+  G D+
Sbjct: 244 RTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDI 303

Query: 320 DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC- 378
           DCG         AV+ G ++E  I+ ++     ++ RLGMFD  P S       P  V  
Sbjct: 304 DCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLE 361

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
           + +H+E AL+ ARQ IVLLKN+  +LPLS  + + + V+GPN+D +++++GNY G     
Sbjct: 362 SEEHKEHALKMARQSIVLLKNEKNTLPLSK-KLKKIVVLGPNADNSISILGNYNGTPSKL 420

Query: 439 TTPLQGI 445
           TT LQGI
Sbjct: 421 TTVLQGI 427



 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 132/300 (44%), Gaps = 52/300 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F   I+  + ADA I   G+   +E E +          DR  +LLP  Q  L+ K   +
Sbjct: 596 FANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLL-KALQS 654

Query: 518 SKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           S  P + ++M+G  I V + A+N P I  I W G  GQ+ GTA AD++FG  NP G+LP+
Sbjct: 655 SGKPVVFLIMTGSAIAVPWEAENIPAILNI-WYG--GQSAGTASADVIFGDYNPAGRLPV 711

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           T+Y           +  +      +   +TYR++KG  +Y FG+G+SYT F ++    P 
Sbjct: 712 TFYK---------GDSDLSSFVDYKMDNKTYRYFKGIPLYGFGYGLSYTEFKYSGLKTPD 762

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPP 695
            +                G+ + ++            V V N G  +G     L    P 
Sbjct: 763 KIK--------------KGQPVTIS------------VKVTNTGKMEGEEVAQLYLINPN 796

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
               +P K L  FE+ ++  G    V   +   + LS V  SG  +   G+  I +GG++
Sbjct: 797 TSIKSPLKSLKGFERFNLKPGQSTVVNFTLS-PEDLSYVTESGNLKPYEGKIQIAVGGSQ 855


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  FNA+ + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG++Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 156/419 (37%), Positives = 229/419 (54%), Gaps = 31/419 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP   R  DL+ R++L+EK   L++ A A+PRL +  Y WWSEALHGV+      
Sbjct: 39  YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA------ 92

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                  G T FP+ I   ++F+      +   +  E R +Y             GL +W
Sbjct: 93  -----VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFW 147

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRGQET GEDP L+GK   ++V G+QG +    +V A+ KHF   D+ 
Sbjct: 148 APNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGDNPKYYRVIATPKHF---DVH 204

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           +     R   +  VS  D  DT++  FR  +M+G   SVMCSYN +NG P CA+   L+ 
Sbjct: 205 SGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNAINGQPACANQFTLQH 264

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE--- 330
            +RG W   GY+VSDCD+V   Y    +  T  +AAA ++  G+D DC  F     +   
Sbjct: 265 QLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMDNDCADFAQPKGDDDY 324

Query: 331 ----SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
                AVQ+G LS+  ++ ALV   T +++LG+FD +    PY      ++ +P H+  A
Sbjct: 325 KAYIDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYADTPHSELNSPAHRAYA 383

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            + A + +VLLKN G +LPL      ++AV+GP +D T  ++GNY G+     + L+G+
Sbjct: 384 RKLADESMVLLKNDG-TLPLKPGSVHSIAVVGPLADQTAVLLGNYNGVPTHTVSFLEGL 441



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 133/282 (47%), Gaps = 48/282 (17%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           I V+G+   +E E +          DR  L +P  ++ LV  V+   K P ++VLM+G  
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           + V +        A++ A Y G+ GG AIAD L G ++P G+LP+T+Y  + +  LP  E
Sbjct: 686 LAVNWISQ--HANAVLEAWYSGEEGGAAIADTLSGKNDPAGRLPVTFY--KSVNQLPNFE 741

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
                        RTYR++KG  +YPFG+G+SYT F ++  + P          H +++A
Sbjct: 742 -------DYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIP----------HATVDA 784

Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
              G+ ++ +              V N G   G   + ++   P    AP   L  F+++
Sbjct: 785 ---GQPVEAS------------ATVTNTGKVAGDEVVQLYLKFPKVDGAPDIALRGFQRI 829

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           H+  G  Q+V   +   + LS+V   G   +  G++ + IGG
Sbjct: 830 HLEPGQSQQVHFELK-KRDLSMVTALGQIIVAQGDYTLSIGG 870


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 166/428 (38%), Positives = 241/428 (56%), Gaps = 29/428 (6%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F C  +    +  PF    L +  R+NDL+ R++L+EK+  L+S A A+ RL I  Y WW
Sbjct: 12  FCCCIQIEAQKQFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWW 71

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           +E+LHGV+  G           AT FPQ I+ A+S++A L   +   +SDEARA ++   
Sbjct: 72  NESLHGVARAGY----------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYL 121

Query: 148 --------AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK 199
                    GLT WSPN+NIFRDPRWGRG ET GEDP L+G   A YV+GLQG D + LK
Sbjct: 122 RRDQHDIYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGDDPEYLK 181

Query: 200 VAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           V A+ KHF  +     +G +  R +F+A  S++D+ +T+   FRM V + +V SVM +YN
Sbjct: 182 VVATAKHFAVH-----SGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAYN 236

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
           +  G    ++  +L   +R +W  +GY+VSDC ++   ++    T+    A+A A+  G 
Sbjct: 237 RFRGEAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALALETGT 295

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDV 377
           DL+CG       E A+  GL++E  IN A+      +++LGMFD E  +  Y  +     
Sbjct: 296 DLNCGATYKSLKE-AIANGLITEEKINIAIERLFRARLKLGMFDTE-ENLSYATIPFSVN 353

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
               H  LA +AA++ IVLLKN+   LPLS    + +AVIGPN+    ++ GNY G    
Sbjct: 354 TNASHTALARKAAQESIVLLKNEAHMLPLSK-DLKQIAVIGPNAHNVQSLWGNYNGTPKN 412

Query: 438 YTTPLQGI 445
             T +QGI
Sbjct: 413 PVTVVQGI 420



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 155/315 (49%), Gaps = 59/315 (18%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           A++ +  +D TILV+GL++ +E E +          DR  L LP  Q+EL+ +  +A+  
Sbjct: 592 AVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELM-RALVATGK 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
           P +LVL++G  + + +A+    + AI+ AGYPGQ GG AIAD+LFG  NP G+LP+T+Y 
Sbjct: 651 PIVLVLLNGSALAINYAQE--HVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVTYYK 708

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              + +LP   + +M+        GRTYR+++G  +YPFG+G+SYT F            
Sbjct: 709 S--VDDLPDFEDYSMK--------GRTYRYFEGEALYPFGYGLSYTQF------------ 746

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGH 698
                         S  AIK +        L VQV V N G +DG   + L      A  
Sbjct: 747 --------------SYDAIKTSGRLAADKVLNVQVTVTNSGDRDGDEVVQLYLKDEVAST 792

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG----- 753
             P  QLV F+++H+  G  Q V   +   ++  + D+      P G   ++ GG     
Sbjct: 793 TRPQVQLVGFKRIHLQKGETQTVEFRLDARQFSMINDQEQLVVEP-GWFTLYAGGGQPNK 851

Query: 754 --TKHSVSLHAATLG 766
             T+ SVS+     G
Sbjct: 852 KQTESSVSMRYEVTG 866


>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
 gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
          Length = 846

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 184/465 (39%), Positives = 252/465 (54%), Gaps = 38/465 (8%)

Query: 57  IGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQV 116
           I +  L EKV  L S A A   +G+  Y+WW+E LHG++  G           AT FPQ 
Sbjct: 17  IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNGE----------ATVFPQA 66

Query: 117 ITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRG 168
           I  A++F+  L E +G V+S EARA +N            GLT WSPN+NIFRDPRWGRG
Sbjct: 67  IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           QET GEDP LSG  A  +++GLQG+D    +V A+ KHF  +         R  F+  VS
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTDPAHPRVIATPKHFAVHSGPE---AGRDSFDVDVS 183

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            QD E T+   FR  V EG+  SVMC+YN  +G P CAD  ++ + +R +W   G++VSD
Sbjct: 184 PQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADDALINQRLRKDWGFKGFVVSD 243

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
           CD++   +   H+ +   EA+A +I+AG D DCG       + AV+RGL+ E  I+ AL 
Sbjct: 244 CDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAALPQ-AVERGLVDEATIDRALA 302

Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
            +L  +  LG+  G  ++ P+G + P  V T   + LALEAAR+ IVLL+N    LPL  
Sbjct: 303 RSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALEAARKSIVLLQNDNARLPLKP 360

Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLF 468
                +AV+G N+D    +  +Y G A    TPLQGI R      + G  +VA A   + 
Sbjct: 361 --GTKLAVVGANADDLSVIEASYHGTAADPITPLQGIRR------RFGAANVAYAQGSVL 412

Query: 469 --GAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
             GAA+     A A+    GL    + E LD  G ++  RQ   V
Sbjct: 413 AEGAAVIVPETALASQGKPGL----KGEYLDATGKVVLTRQDRRV 453



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 153/301 (50%), Gaps = 63/301 (20%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           AI A+R ADA + V+GL   +E EAL          DR+ + LP  Q +L+  +    K 
Sbjct: 565 AIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEALHATGK- 623

Query: 521 PTILVLMSGGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
           P I+VL SG  I +     DP++A A++ A YPG+ GGTA+A+ L G +NP G+LP+T+Y
Sbjct: 624 PVIVVLTSGSAIAI-----DPKLADAVLAAWYPGEEGGTALAETLGGINNPSGRLPVTFY 678

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
                ++LP   + +M+         R+YR++ G  ++ FGHG+SYTNF +   +AP  V
Sbjct: 679 --RSTSDLPAYVDYSMKE--------RSYRYFTGTPLWGFGHGLSYTNFGY---DAPKAV 725

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
           A  +            G  + VT            V V N G + G   +  +  PP   
Sbjct: 726 ATGI------------GDPVSVT------------VTVHNTGVRAGEDVVQAYVVPPTIE 761

Query: 699 WAP-------HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
             P        +QL AF+++ +  G  + + +N+   + +SVVDR G RR+  GE+ + +
Sbjct: 762 PEPIMTQAVLQRQLAAFQRIALAPGQARAITLNLD-PREISVVDRRGIRRLVPGEYRVWV 820

Query: 752 G 752
           G
Sbjct: 821 G 821


>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 863

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 170/417 (40%), Positives = 241/417 (57%), Gaps = 31/417 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR +DL+ RL+L+EKV L+ + + A+PRLGIK YEWW+EALHGV+  G  
Sbjct: 27  PYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGG-----TAGLTY 152
                    AT FPQ I  A+SFN  L   +   VSDEARA    +N         GLT 
Sbjct: 86  ---------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTM 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYD 211
           W+PNVNIFRDPRWGRGQET GEDP LSG+   + VRGLQG  D +  K+ A  KHF  + 
Sbjct: 137 WTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHFAVHS 196

Query: 212 LDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+  G P C    +
Sbjct: 197 GPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNRL 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLH 328
           L + +R +W   G +V+DC ++G ++  +   + P+   A+ADA+ +G DL+CG      
Sbjct: 254 LTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKSI 313

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
           T+ AV++GL+SE  IN ++   L  +  LG  +   S+ P+ ++    +  P H+ELAL+
Sbjct: 314 TD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELALK 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            A + +VLL+N   +  L   R   VAVIGPN++ +V   GNY G      T L+GI
Sbjct: 370 MAHESLVLLQNN--NNLLPLNRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGI 424



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 140/307 (45%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  + AD  I   G+   +E E++          DR  + LP  Q+E+++ +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-K 652

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
           T+ V  SG  +  A         AI+ A YPGQAGGTA+AD+LFG  NP G+LP+T+Y  
Sbjct: 653 TVFVNFSGSAM--AIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY-- 708

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
           + +  LP  E  +M+        GRTYRF     +YPFG+G+SYT F +  A        
Sbjct: 709 KSMQQLPDYEDYSMK--------GRTYRFMTKTPLYPFGYGLSYTRFSYGKA-------- 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                  ++N +      K+T  +   LT    + V NVG +DG   + V+   P     
Sbjct: 753 -------TLNQS------KLTKGEKAILT----IPVSNVGQRDGEEVVQVYICRPDDKEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L  F++V +  G  Q V I +    +      + T R   G + I  G + +   L
Sbjct: 796 PQKTLRGFQRVSIAKGKTQNVQIELPYDSFEWFDAATNTIRPLNGTYKILYGNSSNEKDL 855

Query: 761 HAATLGV 767
              ++ +
Sbjct: 856 QTCSIQI 862


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  FNA+ + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG++Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
 gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
           49814]
          Length = 897

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 167/439 (38%), Positives = 241/439 (54%), Gaps = 45/439 (10%)

Query: 33  KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALH 92
           ++A +    F   SL   +R  DL+  ++L+EK   +   AAA+PRLG+  Y WW+EALH
Sbjct: 35  EEAKSSEFRFMDPSLSPKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALH 94

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-------- 144
           GV+  G           AT FPQ I  A++++  L   +  V+SDE RA ++        
Sbjct: 95  GVARAG----------HATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYANEDVY 144

Query: 145 GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
               GLT+WSPN+NIFRDPRWGRGQET GEDP L+G+ A +++ GLQG D    K  A+ 
Sbjct: 145 AMYGGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGRMAVNFINGLQGDDDKYFKSVATV 204

Query: 205 KHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
           KH+  +     +G +  R   N   +  D+ +T+   F+    E +VASVMC+YN V G 
Sbjct: 205 KHYAVH-----SGPEPSRHRDNYIATDADLYETYLPAFKTAFDETEVASVMCAYNAVWGD 259

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQH------------FTSTPEEAA 309
           P C    ++K  +R E   +GY+VSDC ++G  YYD +                T  +AA
Sbjct: 260 PACGSERLMKDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPYAAHDHVDTRAQAA 319

Query: 310 ADAIRAGLDLDCGPFLGLHTES---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSS 366
           A ++  G DL+CG   G   ++   AV+ GL++E  I+ ++V   +   +LGM+D +PS 
Sbjct: 320 ALSVNMGTDLNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYSALFKLGMYD-DPSL 378

Query: 367 QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
            P+ ++    V +P H E + EAAR  +VLLKN G  LPL       VAVIGPN+D   T
Sbjct: 379 VPWSNISIDTVASPSHLEKSEEAARASLVLLKNDG-ILPLK--PDTKVAVIGPNADNWWT 435

Query: 427 MIGNYAGIACGYTTPLQGI 445
           ++ NY G      T L+GI
Sbjct: 436 LVANYYGQPTAPVTALKGI 454



 Score = 99.4 bits (246), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 79/286 (27%), Positives = 127/286 (44%), Gaps = 54/286 (18%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           +   G+D ++E E +          DR  + LP  Q++L+ ++    K P +LV  SG  
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFSGSA 692

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MT 590
           +  A    D  + AI+ A YPG+  GTAIAD+L+G  +P G+LP+T+Y    +  +P   
Sbjct: 693 M--ALNWEDENLPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS--LEGMPAFD 748

Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
           + +M          RTY++Y+G  +YPFGHG+SYT+F ++                    
Sbjct: 749 DYSME--------NRTYKYYEGEQLYPFGHGLSYTSFEYS-------------------- 780

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP--AGHWAPHKQLVAF 708
                  +K+  A      L V V V N G K     +  + T    A    P  +L AF
Sbjct: 781 ------DLKLETAYAANENLQVSVKVTNSGDKASREIVQAYVTRDTLANVSTPRVELAAF 834

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGG 753
           + + +     Q V ++I     +   + +G    P  G   + IGG
Sbjct: 835 DAIELAPKESQTVTLSIK-PDAIGYFNENGKLTFPEDGSFTLSIGG 879


>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
          Length = 563

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 168/423 (39%), Positives = 237/423 (56%), Gaps = 34/423 (8%)

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           L+  +K   L S A A P  G+  Y+WW+E LHG++  G           AT FPQ I  
Sbjct: 20  LTTAQKAAQLQSSAPADPAAGLPAYDWWNEGLHGLARNG----------YATVFPQAIGM 69

Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQET 171
           A++++A L   IG VV+ EARA +N            GLT WSPN+NIFRDPRWGRGQET
Sbjct: 70  AATWDAPLLNRIGEVVATEARAKFNAQPVAADRKIYQGLTIWSPNINIFRDPRWGRGQET 129

Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
            GEDP L+G    +++ GLQG D    KV A+ KHF  +         R  F+   S QD
Sbjct: 130 YGEDPYLTGHLGVAFITGLQGPDPKHPKVIATPKHFVVHSGPE---AGRDGFDVDPSPQD 186

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           +E T+   FR+ V EGK  S+MC+YN ++GVP CA+  ++   +R +W   G+ VSDCD+
Sbjct: 187 LEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANAPLMVDRLRHDWGFTGFTVSDCDA 246

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTL 351
           V   +   H+      AAA +++ G DL+CG       E A+QRGL++  DI+ A    +
Sbjct: 247 VANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAALPE-ALQRGLVTSQDIDTAFGRAM 305

Query: 352 TVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRH 411
             +  LG+  G  ++ P+  + P+D+ TP+H  +ALEAAR+ IVLLKN+G  LPL     
Sbjct: 306 AARKALGIAYG--ATSPWARISPRDIGTPEHHAVALEAARKAIVLLKNEGDRLPLK--TG 361

Query: 412 RTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLF--G 469
             +A+IG ++D    + GNY G A    TPL+G+ +      Q G ++V  A   L   G
Sbjct: 362 TRIALIGADADDIGVLQGNYHGTAIAPVTPLEGLRQ------QFGAENVLYAQGSLLADG 415

Query: 470 AAI 472
           AA+
Sbjct: 416 AAV 418


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 195/621 (31%), Positives = 315/621 (50%), Gaps = 64/621 (10%)

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           +TYWSPNVNIFRDPRWGRGQET GEDP L+ +   +YVRGLQG+D   LK AA  KH+  
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQGNDPFFLKAAACAKHYAV 60

Query: 210 YDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
           +     +G +  R  FNA  SK+D+ +T+   F   V E KV +VM +YN+V G      
Sbjct: 61  H-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGESASGS 115

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLG 326
             +L   +R +W   G++VSDC +V   Y          EA+A A+++GL+L+C G F  
Sbjct: 116 FFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCGGSFHA 175

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           L  + A++R L++E+D++NAL+  +  +++LG    +  S PY ++    + +  H  +A
Sbjct: 176 L--KEALERKLITEVDLDNALMPLMMTRLKLGNLTDDDES-PYKNISDSVIASYTHAMVA 232

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG 446
            E A++ +VLLKN   +LPL     +T+ V GP +  T  M+GNY G++    T LQGI 
Sbjct: 233 REVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPRSNTFLQGIA 291

Query: 447 RYAR---TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL---DQSIEAEAL---- 496
                  +I+ +        +       +   R A+  I+V+GL   D+  E +A+    
Sbjct: 292 AKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGEEGDAIASSH 351

Query: 497 --DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             D+  L LP  Q + +  +S       + V+  G PID+   +      A+I A YPGQ
Sbjct: 352 RGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDL--EEVSELSDAVIMAWYPGQ 409

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            GG A+ D+LFG  +  G++P+T+         P+    +   +     GRTY++    +
Sbjct: 410 EGGMALGDLLFGDVSFSGRMPVTF---------PINSDWLPAFEDYNMQGRTYKYMTDNI 460

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCN-RLTLGVQ 673
           +YPFG+G++Y +                          +S   +K+ + K + +  + VQ
Sbjct: 461 MYPFGYGLTYGD--------------------------VSYSDVKILNPKYDGKQEIHVQ 494

Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
             ++N G+ +    + L  S P AG   P   L+ F++V + +   Q V   I   + L 
Sbjct: 495 ATLRNNGNNEVEEVVQLYLSAPGAGVITPISSLIGFKRVTLESHLSQTVEFIIKPDQ-LK 553

Query: 733 VVDRSGTRRIPLGEHNIHIGG 753
           +V   G++ +  G++ I + G
Sbjct: 554 MVMEDGSKNLLKGKYTIIVSG 574


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  FNA+ + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG++Q V I +    + S  D + T  I  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 164/439 (37%), Positives = 239/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  FNA+ + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG++Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 169/455 (37%), Positives = 256/455 (56%), Gaps = 44/455 (9%)

Query: 5   IAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQE 64
           I FF +GL+ +  +          A   KD       +  ++ P  +R+ DL+ +L+++E
Sbjct: 6   ILFFIVGLVCIFTAQ---------AQKGKDV------YLDMNAPQHERILDLLSKLTIEE 50

Query: 65  KVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFN 124
           K+ LL + +  +PRL I  Y   +EALHGV  V PG          T FPQ I  A+ +N
Sbjct: 51  KISLLRATSPGIPRLQIDKYYHGNEALHGV--VRPGN--------FTVFPQAIGLAAMWN 100

Query: 125 ATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQETPGE 174
             L   I   +SDEARA +N    G          LT+WSP VN+ RDPRWGR  ET GE
Sbjct: 101 PQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMARDPRWGRTPETYGE 160

Query: 175 DPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIED 234
           DP LSGK   S+V+GLQG D   LK+ ++ KHF A + ++    +RF  N  +S++D+ +
Sbjct: 161 DPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAANNEEH----NRFECNPIISEKDLRE 216

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
            +   F  C++EGK AS+M +YN +N VP   +  +LK+ +R +W  +GY+VSDC +   
Sbjct: 217 YYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDF 276

Query: 295 YYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH-TESAVQRGLLSEIDINNALVNTLTV 353
                 +  T E AA  +I+AGLDL+CG  + +    +A ++ +++E +I++A  + L  
Sbjct: 277 LVTHHKYVKTLEAAATLSIQAGLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRA 336

Query: 354 QMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRT 413
           +MRLG+FD +P+  PY  + P  V    H +LALEAARQ IVLLKN+   LPL   + ++
Sbjct: 337 RMRLGLFD-DPNLNPYNKISPSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKS 395

Query: 414 VAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGR 447
           +AV+G N+       G+Y+G       + L+GI +
Sbjct: 396 IAVVGINA--GNCEFGDYSGTPVNQPVSILEGIKK 428



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/299 (32%), Positives = 151/299 (50%), Gaps = 45/299 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+GAA DA R+ D TI V+G+++SIE E  DR  + LP  QQ  + +    +   T++VL
Sbjct: 591 LYGAAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPN-TVVVL 649

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AI+ A YPG+AGGTA+A++LFG  NPGGKLP+T+Y    +  
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDE 705

Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           LP   +  +R        GRTY+F++G  +Y FGHG+SYT F                  
Sbjct: 706 LPAFDDYDIR-------KGRTYQFFEGNPLYAFGHGLSYTTF-----------------S 741

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG---AHTLLVFSTPPAGHWAPH 702
           +  +N   +G A+KV+ A            +KN G  DG   A   + +    +    P 
Sbjct: 742 YKKLNIDSTGDAVKVSFA------------LKNTGKYDGDEVAQLYVKYQGNDSLVKLPL 789

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           KQL  FE+VH+  G  +RV + +   +     +  G    P G++   +G    ++ L 
Sbjct: 790 KQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVGTASDAIQLQ 848


>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
 gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
          Length = 851

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 170/421 (40%), Positives = 239/421 (56%), Gaps = 29/421 (6%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F  +  P  +R+ DL+ RL+++EK+ LL++ A A+ RLGI  Y   +EALHGV  V PG 
Sbjct: 29  FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG- 85

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
               DF   T FPQ I  A+ +N  L   I   +SDEAR  +           G +  LT
Sbjct: 86  ----DF---TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP LSG    ++V+GLQG     LK  ++ KHF   +
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGDHPRYLKTVSTPKHFAVNN 198

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +R   NAKVS++D+ + +   F  C+ EGK  S+M +YN VN VP   +  ++
Sbjct: 199 EEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLI 254

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTE 330
           K  +RG+W  NGYIVSDC +        H+  T E AA  A++ GLDL+CG    G    
Sbjct: 255 KNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGNQVYGEGLL 314

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
            A ++ ++SE DI++A    L  +M LG+FD  PS  PY  + P  V    HQ+LALEAA
Sbjct: 315 KAYRQYMVSEADIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQDLALEAA 373

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQGIGRYA 449
           RQ +VLLKN+   LPL+  + +++AV+G ++       G+Y+G       T L GI +YA
Sbjct: 374 RQSMVLLKNKDNFLPLNPKKVKSIAVVGISAG--HCEFGDYSGTPKNEPVTILDGIKQYA 431

Query: 450 R 450
            
Sbjct: 432 E 432



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 142/295 (48%), Gaps = 42/295 (14%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           FG A   + + D T+ V+G+++SIE E  DR  L LP  QQE + ++   +   T++VL+
Sbjct: 593 FGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPN-TVVVLV 651

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           +G  + V +   D  + AI+ A YPG+ GG A+A++LFG  NPGG+LP+T+Y        
Sbjct: 652 AGSSMAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------ 703

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
            + E+    + S +  GRTY++++G  +Y FG+G+SYT F                 R+ 
Sbjct: 704 -LDEIPAFDNYSVK--GRTYQYFEGQPLYEFGYGLSYTKF-----------------RYK 743

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLV 706
           S    +    +KV+             +V N G  DG     V+   P  G + P KQL 
Sbjct: 744 SKGVNVEQDTVKVS------------FEVSNTGKYDGDEVAQVYVKYPETGTYMPLKQLH 791

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
            F++VH+  G   +V I +         ++      P GE+   +G +   +   
Sbjct: 792 GFKRVHIKKGKTSKVTIGVPRKDLRYWYEQERKFITPKGEYTFMVGASSDDIKFR 846


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 173/447 (38%), Positives = 253/447 (56%), Gaps = 41/447 (9%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           + T +  P+    L   +R  DL+ R++L+EKV  + +G+ A+ RLGI  Y+WW+EALHG
Sbjct: 16  NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYNG------- 145
           V+  G           AT FPQ I  A++F N  ++E    +VSDEARA Y+        
Sbjct: 76  VARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGER 124

Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
            G  GLT+W+PN+NI+RDPRWGRG ET GEDP L+     + V+GLQG    +   A +C
Sbjct: 125 DGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHAC 184

Query: 205 -KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            KH+  +    WN   R  F+AK +S++D+ +T+   F+  V EGKV  VMC+YN+  G 
Sbjct: 185 AKHYAVHSGPEWN---RHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGE 241

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLD 320
           P C++  +L R +R +W  +  +VSDC ++G +Y   H  + P    A+ADA+ +G DL+
Sbjct: 242 PCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLE 301

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKD 376
           CG       E AV++GL+SE  IN ++   L  + +LGMFD +     S  PY  +  K 
Sbjct: 302 CGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESK- 359

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
               +H   ALE AR+ +VLL N+  +LPLS    R VAV+GPN++ +V +  NY G   
Sbjct: 360 ----EHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPT 414

Query: 437 GYTTPLQGIGRY---ARTIHQQGCKDV 460
              T L+GI          +++GC  V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 149/321 (46%), Gaps = 47/321 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+    +  +    D + +ADA I V GL  ++E E +          DR  + LP  Q 
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           E++  +    K P I VL SG  + + +   +  + AI+ A YPGQ GGTA+AD+LFG  
Sbjct: 641 EMLKALKKTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDY 697

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP+T+Y      +LP  E             RTYR++KG  ++PFGHG+SYT F 
Sbjct: 698 NPAGRLPLTFYASS--NDLPDFE-------DYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +  A         +D ++       +G+ + +T            + +KN G  DG   +
Sbjct: 749 YGKAK--------VDKQN-----VRAGEGMTLT------------IPLKNTGKLDGDEVI 783

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+   PA    P K L AF +V +PAG  + + I +    +      +    I  G++ 
Sbjct: 784 QVYLRNPADKEGPIKTLRAFRRVSLPAGQTENIRIELPASTFECFNPSTNRMEILPGKYE 843

Query: 749 IHIGGTKHSVSLHAATLGVIK 769
           +  GGT    +L   T+ V K
Sbjct: 844 LLYGGTSDEKALQKLTITVKK 864


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  282 bits (722), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 167/450 (37%), Positives = 238/450 (52%), Gaps = 35/450 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LPI +R++DL+ RL+ +EKV  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 86  ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ +   + V+GLQG D    K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+  V+ +D+  T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFT------STPEEAAADAIRAGLDLDCG-PFL 325
             +R  W     I+SDC ++  +++    T         E A+ADA+  G DL+CG  + 
Sbjct: 254 DILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L    A++ G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 AL--VKALKDGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVAQ 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           ALE A + +VLLKN+  +LPLS    R +AV+GPN+  +  +  NY G      T L+GI
Sbjct: 371 ALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEGI 429

Query: 446 GRY---ARTIHQQGCKDVACADDQLFGAAI 472
                    I++ GC   A    Q  G+ I
Sbjct: 430 RNKVPDTEVIYELGCNHAADFVIQDLGSHI 459



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 100/311 (32%), Positives = 141/311 (45%), Gaps = 49/311 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A     + AD  + V G+   +E E +          DR  + LP  QQE+V K   A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + VL +G  +  A    +  I AI+ A Y GQ  GTA+ADILFG  NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEEANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    I  LP   + +M+        GRTYR+     +YPFG+G+SYTNF +       
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I    AK   +TL    D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSSGKI----AKDQSVTL--TFDIANTGKMDGDEIAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AF +VHV AG  Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATLGV 767
             +L    L +
Sbjct: 857 DKALQRLELTI 867


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS  ++  A ++                   SL   +R  DL+ +L
Sbjct: 1   MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ  D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V E KV  VMC+YN++ G P C    +L + +R +W  +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +P    +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 138/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A+A++LFG  NP G+LP+T+Y    IT
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLPVTFYRN--IT 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F                  
Sbjct: 715 QLPNFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
           +G+I      + IKV        T  + V V N G++DG   + V+         P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDTQTNTMRTLAGNFDIMVGGNSKDTELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS  ++  A ++                   SL   +R  DL+ +L
Sbjct: 1   MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ  D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V E KV  VMC+YN++ G P C    +L + +R +W  +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +P    +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 137/301 (45%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y    IT
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--IT 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F                  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
           +G+I      + IKV        T  + V V N G++DG   + V+         P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 173/474 (36%), Positives = 254/474 (53%), Gaps = 51/474 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M + I + F  L++LS  ++  A ++                   SL   +R  DL+ +L
Sbjct: 1   MRNKILYLFASLVMLSGCNNQPAYKDS------------------SLSPEERAEDLLQQL 42

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           AT FPQ I  A
Sbjct: 43  TLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL----------ATVFPQPIGMA 92

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRDPRWGRGQETP 172
           +SF       I   VSDEARA     +A        GLT W+P VNI+RDPRWGRG ET 
Sbjct: 93  ASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMWTPTVNIYRDPRWGRGIETY 152

Query: 173 GEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDRFHFNAK-VSK 229
           GEDP L+     + V+GLQ  D ++   K+ A  KHF  +    WN   R  FNA+ +  
Sbjct: 153 GEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKHFAVHSGPEWN---RHEFNAENIKP 209

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+ VPF   V E KV  VMC+YN++ G P C    +L + +R +W  +G ++SDC
Sbjct: 210 RDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDC 269

Query: 290 DSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++  +Y  +   + P  E A+A A+ +G DL+CG       ESA ++GL+SE DI+ ++
Sbjct: 270 GAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSV 328

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D +P    +  +    VC+ +H  L+L+ AR+ + LL N+   LPL 
Sbjct: 329 KRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLK 387

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
               +T+AV+GPN++ +V   GNY G      T L+GI    G   + I++QGC
Sbjct: 388 R-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 135/301 (44%), Gaps = 50/301 (16%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           + AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K   I V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
             SG PI  A         AI+ A YPGQ+GG A A++LFG  NP G+LP+T+Y      
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVTFYRN--TA 714

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            LP  E            GRTYR++KG  ++PFG+G+SYT F                  
Sbjct: 715 QLPDFE-------DYNMTGRTYRYFKGDPLFPFGYGLSYTTF-----------------N 750

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
           +G+I      + IKV        T  + V V N G++DG   + V+         P K L
Sbjct: 751 YGNIKLE---QTIKVGE------TAKIIVPVTNTGNRDGEEVVQVYLKKQEDAEGPVKTL 801

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
            AF++V +PAG    V + +   K L   D ++ T R   G  +I +GG      L   T
Sbjct: 802 RAFKRVQIPAGKTVNVELEL-TPKQLEWWDAQTNTMRTIAGNFDIMVGGNSKDAELQVKT 860

Query: 765 L 765
           L
Sbjct: 861 L 861


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 158/405 (39%), Positives = 229/405 (56%), Gaps = 38/405 (9%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ +++  EK+   ++ A A+PRLG+  YEWWSE LHG++  G           AT FPQ
Sbjct: 40  LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFRDPRWG 166
            I  A+++N  L   +G V S EARA +N  GG        AGLT WSPN+NIFRDPRWG
Sbjct: 90  AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFN 224
           RG ET GEDP L+G+ A  ++ GLQG D    +  A+ KH   +     +G +  R  F+
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDDPAHPRTIATPKHLAVH-----SGPEPGRHGFD 204

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             VS  D E T+   FR  +++G+  SVMC+YN ++G P CA   ++   +RG+W   G+
Sbjct: 205 VDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRGDWGFKGF 264

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF---LGLHTESAVQRGLLSEI 341
           +VSDCD++       ++      ++A A++AG DL+CG     LG+    A  RG   E 
Sbjct: 265 VVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCGTAYRELGI----AFDRGEADEA 320

Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
            ++ +LV     + RLG       + PY  LG +D+ +  H+ LAL+AA+Q +VLLKN  
Sbjct: 321 LLDRSLVRLFAARYRLGELQPR-RNDPYARLGARDIDSAAHRALALQAAQQSLVLLKNAN 379

Query: 402 PSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            +LPL   +R   +AV+GPN+D    +  NY G +    TPLQG+
Sbjct: 380 ATLPLRPGLR---LAVLGPNADALAALEANYQGTSVQPVTPLQGL 421



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/278 (33%), Positives = 141/278 (50%), Gaps = 48/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E L          DR  L LP  QQ L+ +   + K P ++VLMSG  + + 
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLERAKASGK-PLVVVLMSGSAVALN 680

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +A+      AII A YPGQ+GGTAIA  L G  NPGG+LP+T+Y      +LP       
Sbjct: 681 WAEQ--HADAIIAAWYPGQSGGTAIAQALAGDINPGGRLPVTFY--RSTKDLP------- 729

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  S    GRTYR++KG  ++PFG+G+SYT F +   +AP +          S     +G
Sbjct: 730 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTQFAY---DAPQL----------STTTLQAG 776

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
           + ++V+              V+N G++ G   + V+   P    +P + LV F++VH+  
Sbjct: 777 QPLQVS------------TTVRNTGARAGDEVVQVYLQYPQRAQSPLRSLVGFQRVHLQP 824

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           G  + +   +   + LS VDRSG R +  G++ + +GG
Sbjct: 825 GEARTLSFALD-ARQLSDVDRSGQRAVEAGDYRLFVGG 861


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  281 bits (719), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 173/447 (38%), Positives = 253/447 (56%), Gaps = 41/447 (9%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           + T +  P+    L   +R  DL+ R++L+EKV  + +G+ A+ RLGI  Y+WW+EALHG
Sbjct: 16  NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSF-NATLWEAIGRVVSDEARAMYNG------- 145
           V+  G           AT FPQ I  A++F N  ++E    +VSDEARA Y+        
Sbjct: 76  VARAGK----------ATVFPQAIGLAATFDNQAVYETFD-IVSDEARAKYHDFQRKGER 124

Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
            G  GLT+W+PN+NI+RDPRWGRG ET GEDP L+     + V+GLQG    +   A +C
Sbjct: 125 DGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHAC 184

Query: 205 -KHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            KH+  +    WN   R  F+AK +S++D+ +T+   F+  V EGKV  VMC+YN+  G 
Sbjct: 185 AKHYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGE 241

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLD 320
           P C++  +L R +R +W  +  +VSDC ++G +Y   H  + P    A+ADA+ +G DL+
Sbjct: 242 PCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLE 301

Query: 321 CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKD 376
           CG       E AV++GL+SE  IN ++   L  + +LGMFD +     S  PY  +  K 
Sbjct: 302 CGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESK- 359

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
               +H   ALE AR+ +VLL N+  +LPLS    R VAV+GPN++ +V +  NY G   
Sbjct: 360 ----EHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVMLWANYNGFPT 414

Query: 437 GYTTPLQGIGRY---ARTIHQQGCKDV 460
              T L+GI          +++GC  V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 149/321 (46%), Gaps = 47/321 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+    +  +    D + +ADA I V GL  ++E E +          DR  + LP  Q 
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           E++  +    K P I VL SG  + + +   +  + AI+ A YPGQ GGTA+AD+LFG  
Sbjct: 641 EMLKALKKTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDY 697

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP+T+Y      +LP  E             RTYR++KG  ++PFGHG+SYT F 
Sbjct: 698 NPAGRLPLTFYASS--DDLPDFE-------DYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +  A         +D ++       +G+ + +T            + +KN G  DG   +
Sbjct: 749 YGKAK--------VDKQN-----VRAGEGMTLT------------IPLKNTGKLDGDEVI 783

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+   PA    P K L AF +V +PAG  + + I +    +      +    I  G++ 
Sbjct: 784 QVYLRNPADKEGPIKTLRAFRRVSLPAGQTENIRIELPASTFECFNPSTNRMEILPGKYE 843

Query: 749 IHIGGTKHSVSLHAATLGVIK 769
           +  GGT    +L   T+ V K
Sbjct: 844 LLYGGTSDEKALQKLTITVKK 864


>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
          Length = 853

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 171/420 (40%), Positives = 235/420 (55%), Gaps = 29/420 (6%)

Query: 50  PQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPG 109
           P  V   I ++++++K   L S A A P +G+  Y+WW+E LHG++  G           
Sbjct: 13  PDPVAAAIAKMTIEQKAAQLQSTAPADPVIGLPAYDWWNEGLHGLARDG----------Y 62

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFR 161
           AT FPQ I  A++++  L   +G  V+ EARA +N            GLT WSPN+NIFR
Sbjct: 63  ATVFPQAIGLAATWDVPLLHKVGDTVATEARAKFNAKPVTADRKIYEGLTIWSPNINIFR 122

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           DPRWGRGQET GEDP L+G  A  +++GLQG D    KV A+ KH   +         R 
Sbjct: 123 DPRWGRGQETYGEDPFLTGHLAVGFIQGLQGPDPAHPKVLATPKHLAVHSGPE---AGRD 179

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            F+   S QD+E T+   FR+ + EGK  SVMC+YN ++G P CA   +L   +R +W  
Sbjct: 180 GFDVDPSPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASGGLLNDRLRKDWGF 239

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
            G IVSDCD+V   +   H+     EA+A AI+AG+DL+CG       E AV+RGL+SE 
Sbjct: 240 TGLIVSDCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAALPE-AVKRGLVSEA 298

Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
           +++ AL   L  +  LG+  G   + P+  + P +  TP  + LALEAAR+ IVLLKN G
Sbjct: 299 EVDVALKRALDARRALGIAFG--GANPWSRIKPGERGTPAQRALALEAARKAIVLLKNDG 356

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCK 458
             LPL+      +AVIG N+D    + GNY G A    TPL GI R     R ++ QG +
Sbjct: 357 DRLPLAP--GGRIAVIGANADDLGVLEGNYHGTARDPVTPLDGIRRQFGADRVVYAQGAQ 414



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 63/292 (21%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           D  + V+GL   +E EAL          DR  + LP  Q EL+  +    K P +LVL S
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELLKALRKTGK-PLVLVLTS 638

Query: 529 GGPIDVAFAKNDPRIA-AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           G  + V     DP +A AI+ A YPG+ GGTAIA+ L G +NP G+LP+T+Y    + +L
Sbjct: 639 GSAVAV-----DPSLADAILEAWYPGEEGGTAIAETLAGKNNPSGRLPLTFYAS--VDDL 691

Query: 588 P-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           P   +  M+         RTYRF+ G  ++ FGHG+SYT F                  +
Sbjct: 692 PAFVDYGMKE--------RTYRFFTGKPLWGFGHGLSYTKFA-----------------Y 726

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-----GHWAP 701
           G++    +G    V           V   + N G++ G     V+   PA     G   P
Sbjct: 727 GNVAVKAAGIGQPVQ----------VSATLTNAGNRSGEEVAQVYVVTPAAGKPGGLTTP 776

Query: 702 --HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
              +QL  F++  +  G    +   +   + +S V R GTRR+  G + + I
Sbjct: 777 VLQRQLAGFQRTALAPGKSASLSFTLD-PRSISSVARDGTRRVLPGTYRVWI 827


>gi|222635353|gb|EEE65485.1| hypothetical protein OsJ_20901 [Oryza sativa Japonica Group]
          Length = 481

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 136/186 (73%), Positives = 155/186 (83%), Gaps = 2/186 (1%)

Query: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD--GDRLK 199
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ       RLK
Sbjct: 1   MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 60

Query: 200 VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           +AA CKHFTAYDLDNW+G DRFHFNA V++QD+EDTF+VPFR CV++G+ ASVMCSYNQV
Sbjct: 61  LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 120

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL 319
           NGVPTCAD   L+ TIR  W L GYIVSDCDSV V+Y  QH+T T E+A A  +RAGLDL
Sbjct: 121 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 180

Query: 320 DCGPFL 325
           DCGPFL
Sbjct: 181 DCGPFL 186



 Score =  265 bits (676), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 145/264 (54%), Positives = 182/264 (68%), Gaps = 2/264 (0%)

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ EL+S V+ ASKGP ILVLMSGGPID+ FA+NDP+IA I+WAGYPGQAGG AIAD+
Sbjct: 220 PGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADV 279

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           +FG  NPGGKLP+TWYPQ+Y+  +PMT MAMR + +K YPGRTYRFY GP ++PFGHG+S
Sbjct: 280 IFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLS 339

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT+F H++A+AP+ + V L   H + +A+ S  A             G     +   S +
Sbjct: 340 YTSFTHSIAHAPSQLTVRLSAHHAAASASASLNA--TARLSRAAAERGRTRRRRTRCSCN 397

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
            A   L  +   AGH AP +QLVAFEKVHV AG   RV + I VC  LSV DR+G RRIP
Sbjct: 398 AAAPALSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIP 457

Query: 744 LGEHNIHIGGTKHSVSLHAATLGV 767
           +GEH + IG   H+V++    LGV
Sbjct: 458 VGEHRLIIGELTHTVTIALEQLGV 481


>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
 gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
          Length = 884

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 168/430 (39%), Positives = 239/430 (55%), Gaps = 31/430 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F    LP+ +R+ +L+G L+L+EKV L+++ +  V RLGI  Y+WW+EALHGV+  G   
Sbjct: 32  FRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEALHGVARSGK-- 89

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYW 153
                   AT FPQ I  A+++N +  +    ++SDEARA YN        G   GL++W
Sbjct: 90  --------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEAIRNGERGRYYGLSFW 141

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +PN+NIFRDPRWGRGQET GEDP L+ +   + VRGLQG D    K  A  KHF  +   
Sbjct: 142 TPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDDPKYFKTHACAKHFAVHSGP 201

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
            WN   R  ++A  S +D+ +T+   F+  V E  V  VMC+YN   G P C    +L  
Sbjct: 202 EWN---RHSYDATASGRDLWETYLPAFKALVKEANVQEVMCAYNAYEGQPCCGSDRLLTD 258

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTS--TPEEAAADAIRAGLDLDCGPFLGLHTES 331
            +R  W   G +VSDC ++  ++   H  +      AAADA+    DL+CG       E 
Sbjct: 259 ILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAAAAADAVIHSTDLECGSAYTNLLE- 317

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAA 390
           AV++GL+S+  I+ +L   L     LGM D  P+ + P+  L  + V + +H + AL+ A
Sbjct: 318 AVRQGLISQQQIDISLRRVLRGWFELGMLD--PAERLPWSQLPYQIVASKEHVQQALKVA 375

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR--- 447
           R+ + LLKN G  LPLS    + +AVIGPN+  +V + GNY G      T LQGI     
Sbjct: 376 RESMTLLKNNGSILPLSK-SIKKIAVIGPNAADSVMLWGNYNGTPNSTVTILQGIKNKLP 434

Query: 448 YARTIHQQGC 457
           +A  I+ +GC
Sbjct: 435 HAEIIYDKGC 444



 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 161/374 (43%), Gaps = 50/374 (13%)

Query: 416 VIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-HQQGCK------DVACADDQLF 468
           V+  N  V +T  G  A     YT   +   +Y   + H+Q  K       V   D   F
Sbjct: 539 VLYVNDKVLLTKKGREASADTEYTMKAEKNKKYNIVLEHKQVGKYVNIDFSVLKKDKADF 598

Query: 469 GAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMAS 518
              I   ++ DA +   GL   +E E +          D+  + LP  Q+EL+S +    
Sbjct: 599 TNLIKRLKEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTG 658

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
           K P + VL +G  +  A  +++    A++ A Y GQ  GTA+AD+LFG  NP G+LP+T+
Sbjct: 659 K-PVVFVLCTGSSL--ALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITF 715

Query: 579 YPQ-EYITN--LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           Y     + N  L  ++ + +  ++    GRTYR+     +Y FGHG+SY+ F +      
Sbjct: 716 YKSLSQLDNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNYG----- 770

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                          A ++   +K+ +      TL + + + N+ +  G   + V+    
Sbjct: 771 --------------EAKLTSGTVKIGN------TLNISIPLTNISNNKGEEVVQVYVKRN 810

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP-LGEHNIHIGGT 754
               AP K L  F++V + AG  + +   +   +     D S     P  G + I  GG+
Sbjct: 811 GDPDAPVKSLKGFKRVAIAAGETKHLDFQL-TAEAFEFYDPSKDELGPKAGNYTIMYGGS 869

Query: 755 KHSVSLHAATLGVI 768
            +   L + ++ V+
Sbjct: 870 SNEDRLKSISINVV 883


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 164/440 (37%), Positives = 240/440 (54%), Gaps = 41/440 (9%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C    A  +  P+  + L   +R  DL  RL+L+EK KL+ + + A+PRLGI  +EWW
Sbjct: 14  FSCV---AGAQQYPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHG++  G           AT FPQ    A+S++  L   +    SDEA A  N   
Sbjct: 71  SEALHGIARNG----------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120

Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
                    G++ W+PN+NIFRDPRWGRGQET GEDP L+ +   + V GLQG    R  
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
                     K  A  KH+  +    WN   R  F+  ++ ++D+ +T+   F+  V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
            V  VMC+Y +++G P C +   L + +RGEW  NG +VSDC ++  +Y     H   TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETP 297

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
            EA+A  +RAG D++CG         AV++GL+S   I+ ++V  L  +  +G FD E  
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
             P+   GP+ + +  H+ LAL+ AR+ + LL+N+   LPLS    R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414

Query: 426 TMIGNYAGIACGYTTPLQGI 445
            + GNY G     TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 134/300 (44%), Gaps = 55/300 (18%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  + V G+   +E E +          DR  + LP  Q+E++  +  A K   + V  
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SGG   VA         A++ A Y G+AGG A+AD+LFG  NP GKLP+T+Y  +   +L
Sbjct: 670 SGGA--VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P            R  GRTYR+++G  ++PFG G+SYT+FV                   
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTSFVFGTPRYE------------ 766

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
                 +GK               + V+V N G +DGA  + V+   PA    P K L  
Sbjct: 767 ------NGK---------------LYVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           F ++ + AG ++RV I +   ++      + T R+  G H + +G +     L   ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATTNTMRVKPGNHLLMVGSSSRDADLQTISVSI 865


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 165/405 (40%), Positives = 231/405 (57%), Gaps = 28/405 (6%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q    L+ +++ QEK+   ++ A A+PRLGI  Y+WWSE LHG++  G           A
Sbjct: 32  QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G V S EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 82  TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           DPRWGRG ET GEDP L+G+ A S++RGLQG   D  +  A+ KHF  +         R 
Sbjct: 142 DPRWGRGMETYGEDPYLTGQLAVSFIRGLQGDTPDHPRTIATPKHFAVHSGPEQG---RH 198

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            F+  VS  D+E T+   FR  +++G   SVMC+YN ++G P CA   +L   +R +W  
Sbjct: 199 SFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGF 258

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
           NG++VSDCD++        F      A+A A+++G DL+CG  +  L+   A+ RG + E
Sbjct: 259 NGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLN--QAIARGDIDE 316

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ AL+   T + RLG         PY  +G K + TP H+ LAL+AA Q +VLLKN 
Sbjct: 317 STLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNS 375

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           G +LPL      T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 376 GNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 142/290 (48%), Gaps = 48/290 (16%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A   ADA +  +GL   +E E L          DR  + LP  Q+ L+  V    K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +VLMSG  + + +A++     AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y    
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
             +LP       P  S    GRTYR++KG  +YPFG+G+SYT F +    AP +    L 
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAY---EAPQLSTATLK 771

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
             +                      TL V   V+N G++ G   + ++  PP    AP +
Sbjct: 772 AGN----------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPYSPQAPLR 809

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            LV F++V +  G  + +   +   + LS V ++G R +  G +++ +GG
Sbjct: 810 SLVGFKRVTLRPGESRLLTFTLD-ARQLSGVQQTGQRSVEAGHYHLFVGG 858


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 164/440 (37%), Positives = 239/440 (54%), Gaps = 41/440 (9%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C    A  +  P+    L   +R  DL  RL+L+EK KL+ + + A+PRLGI  +EWW
Sbjct: 14  FSCV---AGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHG++  G           AT FPQ    A+S++  L   +    SDEA A  N   
Sbjct: 71  SEALHGIARNG----------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLAR 120

Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
                    G++ W+PN+NIFRDPRWGRGQET GEDP L+ +   + V GLQG    R  
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
                     K  A  KH+  +    WN   R  F+  ++ ++D+ +T+   F+  V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
            V  VMC+Y +++G P C +   L + +RGEW  NG +VSDC ++  +Y     H   TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETP 297

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
            EA+A  +RAG D++CG         AV++GL+S   I+ ++V  L  +  +G FD E  
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
             P+   GP+ + +  H+ LAL+ AR+ + LL+N+   LPLS    R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414

Query: 426 TMIGNYAGIACGYTTPLQGI 445
            + GNY G     TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434



 Score =  115 bits (289), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 85/300 (28%), Positives = 133/300 (44%), Gaps = 55/300 (18%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  + V G+   +E E +          DR  + LP  Q+E++  +  A K   + V  
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SGG   VA         A++ A Y G+AGG A+AD+LFG  NP GKLP+T+Y  +   +L
Sbjct: 670 SGGA--VALVPETEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P            R  GRTYR+++G  ++PFG G+SYT+F                G+  
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGIPLFPFGFGLSYTSFAF--------------GKPR 764

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
             N  +                    V+V N G +DGA  + V+   PA    P K L  
Sbjct: 765 YENGKL-------------------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           F ++ + AG ++RV I +   ++      + T R+  G H + +G +     L   ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATTNTMRVKPGNHLLMVGSSSRDADLQTISVSI 865


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 37/440 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LPI +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK- 85

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 86  ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ +   + V+GLQG D    K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+  V+ +D+  T+   F   V +G V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--------EEAAADAIRAGLDLDCGPF 324
             +R  W     I+SDC ++  ++  Q    TP        E A+ADA+  G DL+CG  
Sbjct: 254 DILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLECGNS 311

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                + A++ G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H  
Sbjct: 312 YKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 369

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
            ALE A + +VLLKN+  +LPLS    R +AV+GPN+  +  +  NY G      T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428

Query: 445 IGRY---ARTIHQQGCKDVA 461
           I         I++ GC   A
Sbjct: 429 IRNKVPDTEVIYELGCNHAA 448



 Score =  132 bits (331), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 99/311 (31%), Positives = 141/311 (45%), Gaps = 49/311 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A     + AD  + V G+   +E E +          DR  + +P  QQE+V K   A
Sbjct: 595 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + VL +G  +  A    D  I AI+ A Y GQ  GTA+ADILFG  NP G+LP+T
Sbjct: 654 TGKPVVYVLCTGSAL--ALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    I  LP   + +M+        GRTYR+     +YPFG+G+SYTNF +       
Sbjct: 712 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S    K+T  +   LT     D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSSG--KITKDQSVTLTF----DIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AF +VHV AG  Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATLGV 767
             +L    L +
Sbjct: 857 EKALERLELTI 867


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 37/440 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LPI +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 25  PFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 84  ---------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTF 134

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ +   + V+GLQG D    K  A  KH+  +  
Sbjct: 135 WTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDDPKYFKTHACAKHYAVHSG 194

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+  V+ +D+  T+   F   V +G V  VMC+YN+  G P C+   +L 
Sbjct: 195 PEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKLLI 251

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--------EEAAADAIRAGLDLDCGPF 324
             +R  W     I+SDC ++  ++  Q    TP        E A+ADA+  G DL+CG  
Sbjct: 252 DILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLECGNS 309

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                + A++ G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H  
Sbjct: 310 YKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 367

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
            ALE A + +VLLKN+  +LPLS    R +AV+GPN+  +  +  NY G      T L+G
Sbjct: 368 QALEMAHKSMVLLKNKNNTLPLSKTI-RKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 426

Query: 445 IGRY---ARTIHQQGCKDVA 461
           I         I++ GC   A
Sbjct: 427 IRNKVPDTEVIYELGCNHAA 446



 Score =  132 bits (332), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 99/311 (31%), Positives = 141/311 (45%), Gaps = 49/311 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A     + AD  + V G+   +E E +          DR  + +P  QQE+V K   A
Sbjct: 593 YAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKA 651

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + VL +G  +  A    D  I AI+ A Y GQ  GTA+ADILFG  NP G+LP+T
Sbjct: 652 TGKPVVYVLCTGSAL--ALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVT 709

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    I  LP   + +M+        GRTYR+     +YPFG+G+SYTNF +       
Sbjct: 710 FYKS--IDQLPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR------ 753

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S    K+T  +   LT     D+ N G  DG     ++   P 
Sbjct: 754 -------------NAKLSSG--KITKDQSVTLTF----DIANTGKMDGDEVAQIYIKNPN 794

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AF +VHV AG  Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 795 DPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILYGGSSD 854

Query: 757 SVSLHAATLGV 767
             +L    L +
Sbjct: 855 EKALERLELTI 865


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 162/439 (36%), Positives = 238/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+ + + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 146/309 (47%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG++Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 162/439 (36%), Positives = 238/439 (54%), Gaps = 35/439 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +R++DL+ RL+ +EK+  +++   A+ RLGI  Y+WW+EALHGV+  G  
Sbjct: 27  PFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG-- 84

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTY 152
                    AT FPQ I  A++F+         +VSDEARA Y+            GLT+
Sbjct: 85  --------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTF 136

Query: 153 WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL 212
           W+PN+NIFRDPRWGRG ET GEDP L+ K   +  RGLQG D +  K  A  KH+  +  
Sbjct: 137 WTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYAVHSG 196

Query: 213 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILK 272
             WN   R  F+ + + +D+ +T+   F   V EG V  VMC+YN+  G P C+   +L 
Sbjct: 197 PEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKLLI 253

Query: 273 RTIRGEWRLNGYIVSDCDSVGVYY----DTQHFTSTP--EEAAADAIRAGLDLDC-GPFL 325
             +R  W  +  I+SDC ++  ++    +T    + P  E A+ADA+  G DL+C G + 
Sbjct: 254 DILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGSYR 313

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L+   A+  G +SE D++ +L   L  +  LGMFD +    PY  +    V +P+H   
Sbjct: 314 ALN--KALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIAK 370

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ IVLLKN+   LPL     + +AV+GPN+  +  +  NY G      T ++GI
Sbjct: 371 ALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEGI 429

Query: 446 GRY---ARTIHQQGCKDVA 461
                 A  I++ GC   A
Sbjct: 430 RNKVPNAEVIYELGCNHTA 448



 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 145/309 (46%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A     + AD  + V G+   +E E +          DR  + +P  Q+E+V K  +A
Sbjct: 595 FQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMV-KALVA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           +  P + V+ +G  + + + +ND  + AI+ A Y GQ GGTA+AD+LFG  NP G+LP+T
Sbjct: 654 TGKPVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPIT 711

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     +YPFG+G+SYT F +       
Sbjct: 712 FYKS--VDQLPDFQDYSMK--------GRTYRYMTQTPLYPFGYGLSYTTFDYK------ 755

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                        NA +S   I       +  ++ +  D+ N G  DG     ++   P 
Sbjct: 756 -------------NAKLSKDKI------ASNESVTLSFDIANTGKMDGDEVAQIYIKNPN 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K + AF++V+V AG+ Q V I +    + S  D + T  +  G++ I  GG+  
Sbjct: 797 DPAGPLKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYGGSSD 856

Query: 757 SVSLHAATL 765
             +L    L
Sbjct: 857 DKTLKKIDL 865


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 175/445 (39%), Positives = 240/445 (53%), Gaps = 40/445 (8%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           + LP+    L    R  DL+ RL+L EK  L+ + + A+PRLGIK YEWW+EALHGV   
Sbjct: 20  QQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRS 79

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA---G 149
           G           AT FPQ I  A+SFN  L       VSDEARA  N     GG     G
Sbjct: 80  GV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQG 129

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFT 208
           LTYW+PNVNIFRDPRWGRGQET GEDP L+     + V+GLQG D     K+ A  KHF 
Sbjct: 130 LTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLHACAKHFA 189

Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            +    WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN+    P C  
Sbjct: 190 VHSGPEWN---RHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFEDEPCCGS 246

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGP-F 324
             +L + +R +W+ +G +VSDC ++  +Y      + P+   AAA+A+  G DL+CG  F
Sbjct: 247 NRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDLECGSDF 306

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
             L    AV+ GL+ E  I+ +L   L  +  L    GE +S     +    V +  HQ 
Sbjct: 307 RNL--PEAVKAGLIEEKRIDVSLKRLLKARFEL----GEMNSDQVWPISYSVVNSEKHQN 360

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LAL  A + IVLL+N    LPLS  +   +AV+GPN++ +V   GNY G      T L+ 
Sbjct: 361 LALRMAEESIVLLQNNNNILPLS--KKLKIAVMGPNANDSVMQWGNYNGFPAHTVTLLEA 418

Query: 445 IGRY---ARTIHQQGCK---DVACA 463
           + +    A+ I++ GC    DVA +
Sbjct: 419 MRKSFPGAQLIYEPGCDRTMDVAVS 443



 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 131/309 (42%), Gaps = 49/309 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
             A+I   + AD  +   G+  S+E E +          DR  + LP  Q+ L+  +  A
Sbjct: 585 LSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDA 644

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  +            AI+ A YPGQAGGTA+A++L G  NP G+LP+T
Sbjct: 645 GK-KVVFVNFSGSAM--GLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVT 701

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP   + +M+        GRTYR+     ++ FG+G+SYT FV   A    
Sbjct: 702 FYKN--VAQLPDFEDYSMK--------GRTYRYMTEKPLFSFGYGLSYTKFVLGTAK--- 748

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                           ++  +IK         TL + V V N G   G   L V+     
Sbjct: 749 ----------------LNKSSIKANE------TLKITVPVTNAGKVAGTEVLQVYVRKVK 786

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L  F+KV++  G   ++ I++    +           +  GE+ ++ G +  
Sbjct: 787 DVDGPAKTLRGFKKVNIEPGKTSQISIDLTSSAFEFYDWTQRKMMVTPGEYEVYYGTSSD 846

Query: 757 SVSLHAATL 765
           +  L  A++
Sbjct: 847 AKDLKKASI 855


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 165/425 (38%), Positives = 238/425 (56%), Gaps = 34/425 (8%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           T   LP+   +L   QR  DL+ RL+++EKV L+ + +  + RLGIK YEWW+EALHGV+
Sbjct: 24  TYSQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVA 83

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---YN-----GGT 147
             G           AT FPQ I  A+SF+ TL   +   +SDEARA    +N        
Sbjct: 84  RAGL----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRY 133

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKH 206
            GLT W+PN+NIFRDPRWGRGQET GEDP L+ +   + V+GLQG D  R  K+ A  KH
Sbjct: 134 QGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLHACAKH 193

Query: 207 FTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           F  +    WN   R  FNA+ +  +D+ +T+   F+  V E  V  VMC+YN+  G P C
Sbjct: 194 FAVHSGPEWN---RHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCC 250

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGP 323
               +L + +R EW   G +VSDC ++  ++ T+   + P+   A+A+A+  G DL+CG 
Sbjct: 251 GSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLECGS 310

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLG-MFDGEPSSQPYGHLGPKDVCTPDH 382
                TE A++ G++SE  IN ++   L  +  LG M +  P + PY       V +P H
Sbjct: 311 NYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPYS-----IVDSPKH 364

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           + LAL+ A + + LL+N+G  LPL   +   +A+IGPN++ +V   GNY G     +T L
Sbjct: 365 RCLALKMAHETMTLLQNKGKVLPLD--KQARIAIIGPNANDSVMQWGNYNGTPSHTSTLL 422

Query: 443 QGIGR 447
               +
Sbjct: 423 SAFRK 427



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/307 (28%), Positives = 140/307 (45%), Gaps = 49/307 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  +  D  I   G+  S+E E +          DR  +  P  Q+++++ +  A K  
Sbjct: 595 LEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGK-K 653

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            ILV  SG  +  A         AI+ A YPG+ GG AI ++LFG  NP G+LP+T+Y  
Sbjct: 654 VILVNFSGSAM--ALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFYKS 711

Query: 582 EYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
             I  LP  E  +M+        GRTYR+ +   ++PFG+G+SYT F             
Sbjct: 712 --IDQLPDFENYSMK--------GRTYRYMQEEPLFPFGYGLSYTTFA------------ 749

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
                H + N+  +G+  KVT          + + +KN+G +DG   + ++    A    
Sbjct: 750 -FGKIHINKNSLSAGE--KVT----------LHIPIKNIGDRDGVEVVQIYIQRQADKEG 796

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++V +P G  Q V I +    +      + T R   GE+NI  G +     L
Sbjct: 797 PVKTLRAFKRVEIPKGKTQEVKIELPYVAFEWFDPTTNTMRPIQGEYNILYGNSSRMKDL 856

Query: 761 HAATLGV 767
            +A + +
Sbjct: 857 QSAKIQI 863


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 218/678 (32%), Positives = 336/678 (49%), Gaps = 86/678 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  +T  S++N  L+ +I R V+ E RA   GG+A    +SP +++ RDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA--QGGSA---TYSPVLDVVRDPRWGRT 177

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L  ++A + V+GLQG   D    + A+ KHF  Y         R      +
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQGERLDSHTSLLATLKHFAGYGASEGG---RNGAPVHM 234

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +++ +   +PFR  V  G + SVM +YN+++GVP  +   +L+  +R  W  +G++++
Sbjct: 235 GLRELHEVDLLPFRKAVEAGAL-SVMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVIT 293

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNA 346
           DC ++ +     +   +  EAAA +++AG+D++  G     H   A+++GL++E D+N A
Sbjct: 294 DCGAIHMLACGHNTAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRA 353

Query: 347 LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
               L ++ RLG+FD  P   P      + +   +H  LA +AA +GIVLLKN+G  LPL
Sbjct: 354 AGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410

Query: 407 SHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGCKDVA 461
                 T+AVIGPN+      +G+Y          T L GI R    +R ++  GC+ + 
Sbjct: 411 DS-SSGTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-IQ 468

Query: 462 CADDQLFGAAIDASRQADATILVMG------------------------LDQSIE-AEAL 496
               + F  A+  + QAD  ++V+G                         +  +E  E +
Sbjct: 469 GDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESDMECGEGI 528

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR+ L L G Q EL+ ++    K P I+V ++G PI   +   D  I +I+ A YPGQ G
Sbjct: 529 DRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWI--DEHIPSIVEAWYPGQEG 585

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G+AIAD+LFG  NP G+LP++  P+E +  LP +        ++R  G+ Y        Y
Sbjct: 586 GSAIADMLFGDINPSGRLPLS-IPKE-VGQLPNS------YNARRTRGKRYLETDLAPRY 637

Query: 617 PFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           PFG G+SYT F +  +   P VV              I G+A              V++D
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVP-------------IGGEAT-------------VRID 671

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           V N G++DGA  + L  S   A    P K L  F KV + AG  Q V   I   + L ++
Sbjct: 672 VTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELI 730

Query: 735 DRSGTRRIPLGEHNIHIG 752
                  +  GE  I +G
Sbjct: 731 GLDLKPVVEPGEFRIQVG 748


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 164/440 (37%), Positives = 239/440 (54%), Gaps = 41/440 (9%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
           F+C    A  +  P+    L   +R  DL  RL+L+EK KL+ + + A+PRLGI  +EWW
Sbjct: 14  FSCV---AGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           SEALHG++  G           AT FPQ    A+S++  L   +    SDEA A  N   
Sbjct: 71  SEALHGIARNG----------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120

Query: 148 A--------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR-- 197
                    G++ W+PN+NIFRDPRWGRGQET GEDP L+ +   + V GLQG    R  
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 198 ---------LKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEG 247
                     K  A  KH+  +    WN   R  F+  ++ ++D+ +T+   F+  V EG
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 248 KVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT--QHFTSTP 305
            V  VMC+Y +++G P C +   L + +RGEW  NG +VSDC ++  +Y     H   TP
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETP 297

Query: 306 EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPS 365
            EA+A  +RAG D++CG         AV++GL+S   I+ ++V  L  +  +G FD E  
Sbjct: 298 AEASAMGVRAGTDVECGAVYAT-LPRAVEQGLISREAIDTSVVRLLKARFEVGDFDSE-K 355

Query: 366 SQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTV 425
             P+   GP+ + +  H+ LAL+ AR+ + LL+N+   LPLS    R +AV+GPN++ +V
Sbjct: 356 LVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSV 414

Query: 426 TMIGNYAGIACGYTTPLQGI 445
            + GNY G     TT L+GI
Sbjct: 415 MLWGNYTGYPISTTTILKGI 434



 Score =  115 bits (289), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/300 (28%), Positives = 133/300 (44%), Gaps = 55/300 (18%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  + V G+   +E E +          DR  + LP  Q+E++  +  A K   + V  
Sbjct: 611 ADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNC 669

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SGG   VA         A++ A Y G+AGG A+AD+LFG  NP GKLP+T+Y  +   +L
Sbjct: 670 SGGA--VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD--ADL 725

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P            R  GRTYR+++G  ++PFG G+SYT+F                G+  
Sbjct: 726 P-------DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTSFAF--------------GKPR 764

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
             N  +                    V+V N G +DGA  + V+   PA    P K L  
Sbjct: 765 YENGML-------------------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRG 805

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           F ++ + AG ++RV I +   ++      + T R+  G H + +G +     L   ++ +
Sbjct: 806 FARIDLKAGERRRVEIAMPRERFEGWDATANTMRVKPGNHLLMVGSSSRDADLQTISVSI 865


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 149/403 (36%), Positives = 230/403 (57%), Gaps = 17/403 (4%)

Query: 45  VSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFG 104
           +S PI  R+ ++I +++L+EKV  L + + ++PRL +  Y +W+E LHGV+  G      
Sbjct: 52  LSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE----- 106

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPR 164
                 T FPQ I  AS+++  L + I   +S EAR  Y     GLTYW+P +N+ RDPR
Sbjct: 107 -----VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTINMARDPR 161

Query: 165 WGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN 224
           WGR +ET GEDP L+ +   ++V+GLQG   + LK  A+ KHF A + +N    DRF  +
Sbjct: 162 WGRNEETYGEDPYLTSRLGVAFVKGLQGDHPNYLKTVATVKHFVANNQEN----DRFSSS 217

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
           +++  + + + +   +  CV E  V S+M +YN  NG+P      +L+  +R EW  +G+
Sbjct: 218 SQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWLLEDVLRKEWGFDGF 277

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDIN 344
           +VSDC ++GV        ++ EEAAA  I +G DL+CG     +  +AVQRGL+SE  I+
Sbjct: 278 VVSDCGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYRENLVAAVQRGLVSEYAID 337

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            AL   LT++ +LG FD      PY H   K +     + LA EAA + I+LLKN+   L
Sbjct: 338 RALTRVLTMRFKLGEFD-PIELVPYNHYDKKLLAGEQFRRLAYEAAVKSIILLKNEDNFL 396

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           P+     R++A++GP +D     +G Y+G      + LQG+ +
Sbjct: 397 PIDKKDVRSIAIVGPFAD--NNYLGGYSGKPVHNISLLQGVKK 437



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 133/286 (46%), Gaps = 43/286 (15%)

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTI-LVLMSGGPIDVA 535
           +AD  ++ +G D  +  E  D   + LP  Q+ L+ ++   +  P I L+L +G P+   
Sbjct: 608 RADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVN--PRIALILQTGNPLTSQ 665

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +A     + +I+ A YPGQ GG A+A ILFG  NP GKLPMT Y  E    LP       
Sbjct: 666 WAAE--HVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE--QQLPNI----- 716

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
                 + GRTY++     +Y FGHG+SY+NF +       VV V  DG           
Sbjct: 717 -LDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYADLQCNDVVHV--DG----------- 762

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG--HWAPHKQLVAFEKVHV 713
                        TL   + VKN+    G   + V+ +      +  P K+L+AF +V++
Sbjct: 763 -------------TLQCSIKVKNISDVVGEEVIQVYVSREKTPVYTFPLKKLIAFARVNL 809

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
                + V   I   + LSV  + G  ++  G++++ +GG +  +S
Sbjct: 810 KPNESKTVTFTI-TPRQLSVW-QDGEWKMLSGKYSLFVGGGQKELS 853


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 162/439 (36%), Positives = 247/439 (56%), Gaps = 35/439 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R +DL+  L+L+EKV L++    A+ RLGI+ Y WW+EALHGV+  G           A
Sbjct: 35  ERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ----------A 84

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           T FPQ +  A++F+  +   +    SDEARA ++        G   GLT W+PN+N+FRD
Sbjct: 85  TVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPNINVFRD 144

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           PRWGRG E  GEDP ++G    + V+GLQG   G   K+ A  KH+  +    WN   R 
Sbjct: 145 PRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDRSGKYDKLHACAKHYAVHSGPEWN---RH 201

Query: 222 HFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
            FNA+ +  +D+ +T+   F+  V++G V  VMC+YN+  G P C +  +L+  +R EW 
Sbjct: 202 SFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDILRNEWG 261

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLL 338
            +G +VSDC ++  +++       P+   A+ DA+ AG DL+CG       E AV++GL+
Sbjct: 262 FDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLAGTDLNCGDSYPSLVE-AVEQGLI 320

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +E  ++ +L   L  +  LG  D +   + +  +    V +P H E+ALEAAR+ + LL 
Sbjct: 321 TEEQLDISLRRLLIARFELGEMDPDEEVE-WSKIPHSVVSSPTHSEMALEAARKSMTLLM 379

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQ 454
           N+  +LPL      TVAV+GPN++ ++   GNY G     TT LQGI    G   + I++
Sbjct: 380 NKNGALPLKK-EGLTVAVMGPNANDSLMQWGNYNGTPATTTTILQGIRNALGNDDQVIYE 438

Query: 455 QGCKDVACADDQLFGAAID 473
           QG + V   DD++F +  D
Sbjct: 439 QGTQWV---DDRIFKSVFD 454



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 135/303 (44%), Gaps = 49/303 (16%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  +   G+   +E E +          DR  + LP  Q+E++  +  A K   ILV  
Sbjct: 603 ADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGK-EIILVNC 661

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  I   F +     +AI+ A YPGQAGG A+A++LFG  NP G+LP+T+Y    +  L
Sbjct: 662 SGSAI--GFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFYKS--VDQL 717

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P         Q      RTYR+++G  +YPFG+G+SYT F +   + P +    +     
Sbjct: 718 P-------DFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSY---DQPELSQTSI----- 762

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
              +T    ++KV+              V N G  DG   + ++   P     P   L  
Sbjct: 763 ---STEEEASLKVS--------------VANTGDYDGEEVVQLYLQKPDDTEGPSLTLRG 805

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL-GEHNIHIGGTKHSVSLHAATLG 766
           F++V +P G    V   +   + L   +    R  PL G++ + +GG+     L    L 
Sbjct: 806 FQRVFIPKGETVEVEFQL-TEEVLEWWNADAQRMTPLAGDYRLLVGGSSRMQDLQEVALT 864

Query: 767 VIK 769
           V++
Sbjct: 865 VVQ 867


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  278 bits (711), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 171/442 (38%), Positives = 245/442 (55%), Gaps = 39/442 (8%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT +  P+    L   +R  DL+ R++L+EK+  + +G+ A+ RLGI  Y WW+EALHGV
Sbjct: 18  ATAQNEPYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGV 77

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------G 146
           +  G           AT FPQ I  A++F+         +VSDEARA Y+         G
Sbjct: 78  ARAGK----------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDG 127

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCK 205
             GLT+W+PN+NI+RDPRWGRG ET GEDP L+     + V+GLQG   G   K  A  K
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGDGTGKYDKTHACAK 187

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           H+  +    WN   R  F+AK +S++D+ +T+   F+  V EGKV  VMC+YN+  G P 
Sbjct: 188 HYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEPC 244

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCG 322
           C++  +L R +R +W  +  +VSDC ++G +Y   H  + P    A+ADA+ +G DL+CG
Sbjct: 245 CSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECG 304

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVC 378
                  E AV++GL+SE  IN ++   L  + +LGMFD       S  PY  +  K   
Sbjct: 305 GSYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDNTLVSWSEIPYSVVESK--- 360

Query: 379 TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY 438
             +H   ALE AR+ +VLL N+   LPLS    R VAV+GPN++ +V +  NY G     
Sbjct: 361 --EHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNANDSVMLWANYNGFPTKS 417

Query: 439 TTPLQGIGRY---ARTIHQQGC 457
            T L+GI          +++GC
Sbjct: 418 VTILEGIRNKLPEGAVYYEKGC 439



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/321 (28%), Positives = 144/321 (44%), Gaps = 47/321 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+    +  +    D + +AD  I V GL  S+E E +          DR  + LP  Q+
Sbjct: 582 DIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQE 641

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           E++  +    K P + VL SG  + + +   +  + AII A YPGQ GGTA+AD+LFG  
Sbjct: 642 EMLKALKKTGK-PVVFVLCSGSTLALPWEAEN--LDAIIEAWYPGQQGGTAVADVLFGDY 698

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP+T+Y     ++LP  E             RTYR++KG  ++PFGHG+SYT F 
Sbjct: 699 NPAGRLPLTFYASS--SDLPDFE-------DYDMSNRTYRYFKGRPLFPFGHGLSYTTFD 749

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +  A A   +    +G                         L + + +KN+G   G   +
Sbjct: 750 YGKAKADKKILRAGEG-------------------------LTLTIPLKNIGKLSGDEVV 784

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+   P     P K L AF ++ + AG  + V   + V  +      +    +  G++ 
Sbjct: 785 QVYLRNPGDKEGPIKTLRAFRRISLEAGQAEDVLFELPVSTFEWFNPATNRMEVLPGKYE 844

Query: 749 IHIGGTKHSVSLHAATLGVIK 769
           +  GGT    +L   T+ + K
Sbjct: 845 LLYGGTSDEKALQRLTVTLKK 865


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 166/436 (38%), Positives = 253/436 (58%), Gaps = 42/436 (9%)

Query: 28  FACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWW 87
            AC  + A  + L +   + P  +R+ DL+ RL+++EK+ LL + +  + RL I  Y   
Sbjct: 16  MACATRTAQAQEL-YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHG 74

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           +EALHGV  V PG +F       T FPQ I  A+++N  L E +  V+SDEARA +N   
Sbjct: 75  NEALHGV--VRPG-RF-------TVFPQAIGLAATWNPVLQEQVATVISDEARARWNELD 124

Query: 148 AG----------LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
            G          LT+WSP VN+ RDPRWGR  ET GEDP LSG    ++V+GLQG+D   
Sbjct: 125 QGREQKSQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRY 184

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
           LK+ ++ KHF A + ++    +RF  N ++S++ + + +   F  CV +GK AS+M +YN
Sbjct: 185 LKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYN 240

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
            +N VP   +  +L + +R +W   GY+VSDC    +  +   +  T E AA  +I+AGL
Sbjct: 241 ALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGL 300

Query: 318 DLDCG------PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYG 370
           DL+CG      P L     SA ++ ++++ DI++A    L  +M+LG+FD GE    PY 
Sbjct: 301 DLECGDDVYDEPLL-----SAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--KNPYT 353

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            + P  + + +HQE+AL AAR+ IVLLKNQ   LPL+  + +++AV+G N+    +  G+
Sbjct: 354 KISPAVIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVGINA--GSSEFGD 411

Query: 431 YAGI-ACGYTTPLQGI 445
           Y+G+      + LQGI
Sbjct: 412 YSGLPVIAPVSVLQGI 427



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/300 (29%), Positives = 148/300 (49%), Gaps = 45/300 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + V+G+++SIE E  DR  + LP  Q+E + ++   +  P I+V+
Sbjct: 592 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVN--PNIVVV 649

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  + AI+ A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 650 LVAGS-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 706

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT+F ++                
Sbjct: 707 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYS---------------- 744

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
              N  ++    +V+          V   +KN G   G     V+   P      P K+L
Sbjct: 745 ---NLQVADGEEEVS----------VSFQLKNTGRYAGDEVAQVYVKLPEREEVMPVKEL 791

Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
             FE+V + +G  ++V I +   + +Y    +  G    P G +NI +G +   + L  A
Sbjct: 792 KGFERVSLKSGESKKVTIKLRKDLLRYWD--EAKGKFIYPSGNYNIMVGASSADIRLQKA 849


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 164/405 (40%), Positives = 230/405 (56%), Gaps = 28/405 (6%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q    L+ +++ QEK+   ++ A A+PRLGI  Y+WWSE LHG++  G           A
Sbjct: 32  QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G V S EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 82  TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           DPRWGRG ET GEDP L+ + A S++RGLQG   D  +  A+ KHF  +         R 
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGDTPDHPRTIATPKHFAVHSGPEQG---RH 198

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
            F+  VS  D+E T+   FR  +++G   SVMC+YN ++G P CA   +L   +R +W  
Sbjct: 199 SFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGF 258

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
           NG++VSDCD++        F      A+A A+++G DL+CG  +  L+   A+ RG + E
Sbjct: 259 NGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGNTYRDLN--QAIARGDIDE 316

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             ++ AL+   T + RLG         PY  +G K + TP H+ LAL+AA Q +VLLKN 
Sbjct: 317 STLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNS 375

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           G +LPL      T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 376 GNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 142/290 (48%), Gaps = 48/290 (16%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A   ADA +  +GL   +E E L          DR  + LP  Q+ L+  V    K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +VLMSG  + + +A++     AI+ A YPGQ+GGTAIA  L G  NPGG+LP+T+Y    
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY--RS 721

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
             +LP       P  S    GRTYR++KG  +YPFG+G+SYT F +    AP +    L 
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAY---EAPQLSTATLK 771

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
             +                      TL V   V+N G++ G   + ++  PP    AP +
Sbjct: 772 AGN----------------------TLTVTTHVRNTGTRAGDEVVQLYLEPPYSPQAPLR 809

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            LV F++V +  G  + +   +   + LS V ++G R +  G +++ +GG
Sbjct: 810 SLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVGG 858


>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 682

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 200/614 (32%), Positives = 303/614 (49%), Gaps = 57/614 (9%)

Query: 33  KDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLL---------ISGAAAVPRLGIKG 83
           ++   + LP+C +SL I +R+ DL+  L+L EKV ++         ++    V R+G+  
Sbjct: 63  QNDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPD 122

Query: 84  YEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY 143
           Y W  E     +N   G+    +   AT F   ++ A+SFN + W   G V   E RA+ 
Sbjct: 123 YYWLVE-----TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALM 177

Query: 144 N----------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS 193
           N          G   GLT + PN+N  RDPR+GR  E PGEDP LSG+YAA  V+G+Q  
Sbjct: 178 NVHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQER 237

Query: 194 DGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASV 252
           D +   KV A  KHFTAY  +   G D ++    +S  D+ DT+   + M +++G    V
Sbjct: 238 DANGYPKVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQGGATGV 293

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEW-RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
           MCSYN VNG+P CA+  +L + +R  W R + ++ +DC +V          +   +AAA 
Sbjct: 294 MCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNLRGKPIQAADEAQAAAM 353

Query: 312 AIRAGLDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
           A+  G D++ G  L +H   +A+  G  +E  +N A+  +       G FD +P+   + 
Sbjct: 354 ALMNGADIEMGSTLFVHNLTTAITLGYATEEAVNQAIRRSYRPHFIAGRFD-DPTLSEWF 412

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
            LG  D+ +  HQE+ LEAA QG+VLLK++   LP++      +AV+GP       ++ +
Sbjct: 413 SLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPIA--AGTKLAVLGPLGMTRSGLMSD 470

Query: 431 YAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADAT--------- 481
           Y      +      I   A +I     K+   A     G  +D+   +D           
Sbjct: 471 YESDQSCFGGGHDCIPTLAESIGFINGKEFTVA---AAGVDVDSRNTSDVERILQLAADR 527

Query: 482 ---ILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
              +L +G  ++ E E  DR    LPG+Q  L   V +  + P +LVL++GG I +    
Sbjct: 528 DLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAV-LTLRKPVVLVLVNGGQIALDGMT 586

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
             P  +AII A  P   GGTA+A  LFG  N  GKLP T YP   + +  M + +M    
Sbjct: 587 GYP--SAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYSVMQSFDMKDHSMSAP- 643

Query: 599 SKRYPGRTYRFYKG 612
               PGRTYR++ G
Sbjct: 644 ----PGRTYRYFTG 653


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 208/687 (30%), Positives = 328/687 (47%), Gaps = 94/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT+FP     AS++N  L E +G+V++ E R        G   + P ++I RDPRW R 
Sbjct: 139 GATTFPTAPGQASTWNPELIERMGKVIAAEIRLQ-----GGHICYGPVLDIVRDPRWSRT 193

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDN--WNGVDRFHFNA 225
           +E+ GED  L+ +   +YVRG    D  + + A S  KHF AY       NG        
Sbjct: 194 EESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---- 249

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            + ++++ +T+  PF   V  G   SVM +YN V+G+P  A+  +L   +RGEW  +G++
Sbjct: 250 -LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFV 307

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDI 343
           VSD  S+   ++T     +  EAA  A+RAG+D D   G F  L    A + G ++E +I
Sbjct: 308 VSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLR--EAAEAGDVAEAEI 365

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           + A+   L ++  +G+F+      PY       +V    H ELALEAARQ + LL+N+  
Sbjct: 366 DRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSG 420

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKD 459
           +LPL   R R VAVIGPN+D     +G+Y        T   G+ +     R ++ +GC  
Sbjct: 421 TLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGCT- 479

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIE-----------------------AEAL 496
           V   D     AA+ A+R  DA ++V+G   + +                        E  
Sbjct: 480 VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGF 539

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DRA L L G Q+EL+ ++  A+  P I+V ++G P+D+  A    +  A++ A YPG  G
Sbjct: 540 DRATLALLGEQEELLRRIK-ATGTPLIVVCIAGRPLDLRRASE--QADALLMAWYPGARG 596

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR-TYRFYKGPVV 615
           G A+A+ + G +NP G+LP+T         +P  E  +    +K+ P    Y       +
Sbjct: 597 GDAVAETILGHNNPAGRLPIT---------IPRAEGQIPVYYNKKRPANHDYTDLTAAPL 647

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SY+ F                  +GS+ A  SG  +           L V   
Sbjct: 648 YPFGYGLSYSTF-----------------EYGSLEARQSGDNV-----------LEVSCR 679

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N   ++G   + L  S   A    P +QL  F ++ +  G Q++V   +   + LS++
Sbjct: 680 IRNTSDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLG-DEALSLI 738

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           D  G R +  G+  I +G +   + L 
Sbjct: 739 DPQGRRVVEKGDFVIAVGSSSQDIRLQ 765


>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 858

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 174/457 (38%), Positives = 250/457 (54%), Gaps = 30/457 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT + LP+C   L   +R  DL+ RL+L+EK +L++  + A+PRLGIK + WWSEALHG 
Sbjct: 17  ATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
           +N+G          G T FP+ +  A+SFN  L   +    SDE RA Y     NGG   
Sbjct: 77  ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
               L+ W+PNVNIFRDPRWGRGQET GEDP L+     + VRGLQG +  + +   +C 
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186

Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           KH+  +    +    R   N A VS +D+ +T+   F+  V E KV  VMC+Y +++  P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
            C++  +L++ +R EW  N  +VSDC +V   Y     +S    AAA A  AG D++CG 
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303

Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                T   AV+RGL++E +++  ++  L  +  LG  D +P    +  +    + +  H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           ++LAL+ ARQ +VLL+N+G  LPL       +AVIGPN+D    M GNY G      T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGEPIAVIGPNADDGPMMWGNYNGTPNRTVTIL 421

Query: 443 QGI-GRYARTIHQQGCK--DVACADDQLFGAAIDASR 476
            GI  R+ R  + +GC   D    +  L   AID  +
Sbjct: 422 DGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458



 Score = 99.0 bits (245), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 129/299 (43%), Gaps = 54/299 (18%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + V G+  ++E E +          DR  + LP  Q++ +  +  A K   + V  SG  
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           I  A         AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y +     LP  E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717

Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
             +MR        GRTYR++  P ++ FG+G+SYT F                 R G   
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGR-- 749

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
                      HA+       + V + N G++ G   + V+    A    P K L AF +
Sbjct: 750 ----------AHAEAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           V + AG    V I +    +    + + T R   G++ +  G +     L   T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPEDLQTLTVSVVE 858


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 237/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R   L+  L+L+EK  L++ G+ +V RLGIK Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGT------AGLTYW 153
                   AT FPQ I  A+SFN  +   +   VSDEARA   Y           GLT W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG +DG   K+ A  KHF  +  
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 192

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 193 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDRLL 249

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +RGEW  +G +VSDC ++  +Y+ +   + P  E A+A A+ +G DL+CG       
Sbjct: 250 MQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKALI 309

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           ES V++GL+SE  ++ ++   +  +  LG  D EP    +  +    V +  H  LAL  
Sbjct: 310 ES-VKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLALNM 367

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+   LPL      TVAV+GPN++ +V   GNY G+     T L G+    
Sbjct: 368 ARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRNLL 426

Query: 446 GRYARTIHQQGC 457
           G   + I++QGC
Sbjct: 427 GTDDKLIYEQGC 438



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 100/344 (29%), Positives = 156/344 (45%), Gaps = 48/344 (13%)

Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA 493
           +  G +  L+    Y R+  Q    D+    D     +++  + AD  I   G+  S+E 
Sbjct: 557 VQAGKSYDLELDFEYLRSDAQLNF-DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEG 615

Query: 494 EAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           E +          DR  + LP  Q+EL+  +  A K   ILV  SG PI         + 
Sbjct: 616 EEMGVNLPGFKKGDRTDIELPAVQRELIDALHRAGK-KIILVNCSGSPI--GLEPETQKC 672

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            AI+ A YPGQ GG A+A++LFG  NP GKLP+T+Y    ++ LP  E            
Sbjct: 673 EAILQAWYPGQQGGKAVAEVLFGDYNPAGKLPVTFYRN--VSQLPDFE-------DYNMT 723

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTYR+ +   ++PFG+G+SYT F +      TV+           N   +G+++K+T  
Sbjct: 724 GRTYRYMQDVPLFPFGYGLSYTTFGY----GKTVL---------DKNELTAGQSLKLT-- 768

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                     V V N G ++G   + V+         P K L AF++V +PAG    V  
Sbjct: 769 ----------VPVTNTGKRNGEEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEF 818

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           ++   +     D+S T R+  G ++I +GG+     L   T+ +
Sbjct: 819 DLKDKELEWWDDQSNTVRVCPGNYDIMVGGSSKEEDLQRTTIAI 862


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 167/439 (38%), Positives = 241/439 (54%), Gaps = 32/439 (7%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  +  PF    LP  +RV+DL+ RL+++EKV L+   +  + RLGIK Y WW+EALHGV
Sbjct: 19  AQNKLYPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGV 78

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
           +  G           AT FPQ I  A+SF+      +   VSDEARA +N        G 
Sbjct: 79  ARAGL----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGR 128

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
             GLT W+P +NIFRDPRWGRG ET GEDP L+       V+GLQG S+G   K+ A  K
Sbjct: 129 YEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDKLHACAK 188

Query: 206 HFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  F+ A + ++D+ +T+   F   V E KV  VMC+YN+  G P 
Sbjct: 189 HFAVHSGPEWN---RHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGDPC 245

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCG 322
           C    +L++ +R +W   G +V+DC ++  ++      + P+   A+A A+ +G DLDCG
Sbjct: 246 CGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLDCG 305

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                 TE AV++GL+ E DI+ ++   L  + RLG  D + S  P+  +    V +  H
Sbjct: 306 SSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQ-SLVPWSKISYNVVASKAH 363

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
            ++AL+ AR+ I LL+N+   LPL     + +AV+GPN+  +V   GNY G      T L
Sbjct: 364 NQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGNYNGTPANTITIL 422

Query: 443 QGI----GRYARTIHQQGC 457
           +GI    G   + I++Q C
Sbjct: 423 EGIKAALGPKDKLIYEQAC 441



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 135/319 (42%), Gaps = 47/319 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+   ++     +I     AD  + V G+  S+E E +          DR  + LP  Q+
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           + V  +  A K   I +  SG PI +A    +    AI+ A YPGQAGG A+AD+LFG  
Sbjct: 644 QFVKALKEAGK-RVIFINCSGSPIGLADEMANSE--AIVQAWYPGQAGGQAVADVLFGKY 700

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP+T+Y     T LP  E       +    GRTYR+ +   ++PFG+G+SYT F 
Sbjct: 701 NPSGRLPITFYRD--TTQLPDFE-------NYDMAGRTYRYMQDKPLFPFGYGLSYTQFQ 751

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +                         G  I       N  T+ + V V N G + G   +
Sbjct: 752 Y-------------------------GNPILNQQVITNGQTIQLTVPVTNTGKRSGDEVV 786

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+         P K L  F ++   AG  Q+V   I   +     ++S   ++  G++ 
Sbjct: 787 QVYLRKKGDATGPVKTLRDFRRLSFNAGQTQQVVFKITPKQLEWWNEQSKAMQVQSGDYE 846

Query: 749 IHIGGTKHSVSLHAATLGV 767
           + +G +     L   T  +
Sbjct: 847 LLVGKSSADEDLKMVTFTI 865


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 207/687 (30%), Positives = 328/687 (47%), Gaps = 94/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT+FP     AS++N  L E +G+V++ E R        G   + P ++I RDPRW R 
Sbjct: 139 GATTFPTAPGQASTWNPELIERMGKVIAAEIRLQ-----GGHICYGPVLDIVRDPRWSRT 193

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAYDLDN--WNGVDRFHFNA 225
           +E+ GED  L+ +   +YVRG    D  + + A S  KHF AY       NG        
Sbjct: 194 EESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---- 249

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            + ++++ +T+  PF   V  G   SVM +YN V+G+P  A+  +L   +RGEW  +G++
Sbjct: 250 -LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFV 307

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC--GPFLGLHTESAVQRGLLSEIDI 343
           VSD  S+   ++T     +  EAA  A+RAG+D D   G F  L    A + G ++E +I
Sbjct: 308 VSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLR--EAAEAGDVAEAEI 365

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY-GHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           + A+   L ++  +G+F+      PY       +V    H ELALEAARQ + LL+N+  
Sbjct: 366 DRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSG 420

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA---RTIHQQGCKD 459
           +LPL   R R VAVIGPN+D     +G+Y        T   G+ +     R ++ +GC  
Sbjct: 421 TLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGCT- 479

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIE-----------------------AEAL 496
           V   D     AA+ A+R  DA ++V+G   + +                        E  
Sbjct: 480 VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGF 539

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DRA L L G Q+EL+ ++  A+  P I+V ++G P+D+  A    +  A++ A YPG  G
Sbjct: 540 DRATLALLGEQEELLRRIK-ATGTPLIVVCIAGRPLDLRRASE--QADALLMAWYPGARG 596

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGR-TYRFYKGPVV 615
           G A+A+ + G +NP G+LP+T         +P  E  +    +K+ P    Y       +
Sbjct: 597 GDAVAETILGRNNPAGRLPIT---------IPRAEGQIPVYYNKKRPANHDYTDLTAAPL 647

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SY+ F                  +GS+ A  SG  +           L V   
Sbjct: 648 YPFGYGLSYSTF-----------------EYGSLEARQSGDNV-----------LEVSCR 679

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N   ++G   + L  S   A    P +QL  F ++ +  G Q++V   +   + L+++
Sbjct: 680 IRNTSDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLG-DEALALI 738

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           D  G R +  G+  I +G +   + L 
Sbjct: 739 DPQGRRVVEKGDFVIAVGSSSQDIRLQ 765


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 156/368 (42%), Positives = 216/368 (58%), Gaps = 30/368 (8%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR   L+ ++S  EKV   ++ A A+PRL I  YEWWSE LHG++  G           A
Sbjct: 34  QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------A 83

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GG-------TAGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G VVS EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 84  TVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 143

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+G+ A  ++RGLQG D +  +  A+ KH   +     +G +  
Sbjct: 144 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHIAVH-----SGPEPG 198

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS +D+E T+   FR  +++G+  SVMC+YN ++G P CA   +L   +RG+W
Sbjct: 199 RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGRVRGDW 258

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
              G++VSDCD+V       +F      ++A A++AG DL+CG  +  L T  A++RG +
Sbjct: 259 GFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAYRELGT--AIERGEV 316

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ +LV     + RLG  +  P   PY  LG KDV    H+ LAL+AA + IVLLK
Sbjct: 317 DEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLK 375

Query: 399 NQGPSLPL 406
           N   +LPL
Sbjct: 376 NTATTLPL 383


>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
 gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
          Length = 858

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 174/457 (38%), Positives = 250/457 (54%), Gaps = 30/457 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT + LP+C   L   +R  DL+ RL+L+EK +L++  + A+PRLGIK + WWSEALHG 
Sbjct: 17  ATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
           +N+G          G T FP+ +  A+SFN  L   +    SDE RA Y     NGG   
Sbjct: 77  ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
               L+ W+PNVNIFRDPRWGRGQET GEDP L+     + VRGLQG +  + +   +C 
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186

Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           KH+  +    +    R   N A VS +D+ +T+   F+  V E KV  VMC+Y +++  P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
            C++  +L++ +R EW  N  +VSDC +V   Y     +S    AAA A  AG D++CG 
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303

Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                T   AV+RGL++E +++  ++  L  +  LG  D +P    +  +    + +  H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           ++LAL+ ARQ +VLL+N+G  LPL       +AVIGPN+D    M GNY G      T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGEPIAVIGPNADDGPMMWGNYNGTPNRTVTIL 421

Query: 443 QGIG-RYARTIHQQGCK--DVACADDQLFGAAIDASR 476
            GI  R+ R  + +GC   D    +  L   AID  +
Sbjct: 422 NGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/299 (27%), Positives = 130/299 (43%), Gaps = 54/299 (18%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + V G+  ++E E +          DR  + LP  Q++ +  +  A K   + V  SG  
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           I  A         AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y +     LP  E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717

Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
             +MR        GRTYR++  P ++ FG+G+SYT F                 R G   
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGRAR 751

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
           A  +    +++            V + N G++ G   + V+    A    P K L AF +
Sbjct: 752 AEAAEGGYRLS------------VPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           V + AG    V I +    +    + + T R   G++ +  G +     L   T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPEDLQTLTVSVVE 858


>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
 gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
          Length = 882

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 165/407 (40%), Positives = 235/407 (57%), Gaps = 32/407 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q    L+ +++ QEK+   ++ A A+PRLGI  Y+WWSE LHG++  G           A
Sbjct: 32  QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YA 81

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--GGT-------AGLTYWSPNVNIFR 161
           T FPQ I  A+S+N  L + +G V S EARA +N  GG        AGLT WSPN+NIFR
Sbjct: 82  TVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFR 141

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD-- 219
           DPRWGRG ET GEDP L+ + A S++RGLQG+  D  +  A+ KHF  +     +G +  
Sbjct: 142 DPRWGRGMETYGEDPYLTSQLAVSFIRGLQGNIPDHPRTIATPKHFAVH-----SGPEPG 196

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R  F+  VS  D+E T+   FR  +++G   SVMC+YN ++G P CA   +L   +R +W
Sbjct: 197 RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDW 256

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLL 338
             NG++VSDCD++        F      A+A A+++G DL+CG  +  L+   A+ RG +
Sbjct: 257 GFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRDLN--QAIARGDI 314

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
            E  ++ AL+   T + RLG         PY  +G K + TP H+ LAL+AA Q +VLLK
Sbjct: 315 DESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLK 373

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           N G +LPL+     T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 374 NSGNTLPLT--PGTTLAVLGPDADSLTALEANYQGTSSTPVTPLIGL 418



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 139/290 (47%), Gaps = 48/290 (16%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A   ADA +  +GL   +E E L          DR  + LP  Q+ L+  V    K P I
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +VLMSG  + + +A++     AI+ A YPGQ+GGTAIA  L G  NPGG+LPMT+Y    
Sbjct: 666 VVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFY--RS 721

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
             +LP       P  S    GRTYR++KG  +YPFG+G+SYT F +      T      D
Sbjct: 722 TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAPQLSTATLKAGD 774

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
                                    TL V   V+N G++ G   + ++  PP    AP +
Sbjct: 775 -------------------------TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQAPLR 809

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            LV F++V +  G  + +   +   + LS V ++G R +  G +++ +GG
Sbjct: 810 NLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVGG 858


>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 865

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 168/411 (40%), Positives = 242/411 (58%), Gaps = 33/411 (8%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           PI  RV+DL+ R++L+EK   + + A A+PRLGI  Y++W+EALHGV+  G         
Sbjct: 20  PIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-------- 71

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNI 159
             AT FPQ I  A++++  +  A G+ V+ E RA YN   A        GLT+WSPN+NI
Sbjct: 72  --ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTFWSPNINI 129

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
           FRDPRWGRGQET GEDP L+G  A  +V G+QG+D + LK  A+ KHF  +     +G +
Sbjct: 130 FRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDANYLKAIATPKHFAVH-----SGPE 184

Query: 220 --RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R  FN   S +D+ +T+   FR  +++G+  S+MC+YN V+    CA+  +LK T+RG
Sbjct: 185 QLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACANTMLLKDTLRG 244

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG--LHTESAVQR 335
            W   G++ SDC ++       H + T  E AA A++AG D  C  F    L    AV+ 
Sbjct: 245 AWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGCD-FKDEMLDLPRAVKA 303

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGI 394
           G L+E D++ AL    T +M+LGMFD  P+++ P+  +   +  +P H+ LAL AAR+ I
Sbjct: 304 GYLTEGDMDVALRRLFTARMKLGMFD--PAARVPFSTISIAENHSPAHRALALRAARESI 361

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           VLLKN G  LPL+    R +AV+GP +   + + GNY G   G   P+ G+
Sbjct: 362 VLLKNDG-VLPLA-AGARRIAVVGPTAASLIALEGNYNGTPVGAVLPVDGM 410



 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 74/241 (30%), Positives = 114/241 (47%), Gaps = 47/241 (19%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL+  +E E +          DR  + LP  Q +L+  +  A+  P ++VL SG  I  A
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDAL-FATGKPLVIVLQSGSAI--A 658

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
               + +  A++ A YPG+AGG AIA++L GT NP G+LP+T+Y       LP  +    
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNPSGRLPVTFYAS--TDQLPAFD---- 712

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
                R   RTYR++ G V YPFGHG+SYT F ++     T             ++  +G
Sbjct: 713 ---DYRMANRTYRYFAGRVEYPFGHGLSYTRFAYSALRPAT-------------SSVAAG 756

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
           +   V+            V V+N G   G     ++ + P    AP + L  +++VH+ A
Sbjct: 757 QGTSVS------------VAVRNTGVLAGDEVAQLYLSVPGREGAPIRSLKGYQRVHLAA 804

Query: 716 G 716
           G
Sbjct: 805 G 805


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 237/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R   L+  L+L+EK  L++ G+ +V RLGIK Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGT------AGLTYW 153
                   AT FPQ I  A+SFN  +   +   VSDEARA   Y           GLT W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG +DG   K+ A  KHF  +  
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 192

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 193 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDRLL 249

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +RGEW  +G +VSDC ++  +Y+ +   + P  E A+A A+ +G DL+CG       
Sbjct: 250 MQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKALI 309

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           ES V++GL+SE  ++ ++   +  +  LG  D EP    +  +    V +  H  LAL  
Sbjct: 310 ES-VKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLALNM 367

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+   LPL      TVAV+GPN++ +V   GNY G+     T L G+    
Sbjct: 368 ARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRNLL 426

Query: 446 GRYARTIHQQGC 457
           G   + I++QGC
Sbjct: 427 GTDDKLIYEQGC 438



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 147/319 (46%), Gaps = 47/319 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+    D     +++  + AD  I   G+  S+E E +          DR  + LP  Q+
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           EL+  +  A K   ILV  SG PI         +  AI+ A YPGQ GG A+A++LFG  
Sbjct: 641 ELIDALHRAGK-KIILVNCSGSPI--GLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP GKLP+T+Y    ++ LP  E            GRTYR+ +   ++PFG+G+SYT F 
Sbjct: 698 NPAGKLPVTFYRN--VSQLPDFE-------DYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +      TV+           N   +G+++K+T            V V N G ++G   +
Sbjct: 749 Y----GKTVL---------DKNELTAGQSLKLT------------VPVTNTGKRNGEEVV 783

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+         P K L AF++V +PAG    V  ++   +     D+S T R+  G ++
Sbjct: 784 QVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWDDQSNTVRVCPGNYD 843

Query: 749 IHIGGTKHSVSLHAATLGV 767
           I +GG+     L   T+ +
Sbjct: 844 IMVGGSSKEEDLQRTTIAI 862


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 224/683 (32%), Positives = 344/683 (50%), Gaps = 93/683 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  +T  S++N  L+ ++ R V+ E R+   GG A    +SP +++ RDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNPELFRSMCRAVAAETRS--QGGAA---TYSPVLDVVRDPRWGRT 177

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRL----KVAASCKHFTAYDLDNWNGVDRFHFN 224
           +ET GEDP L  ++A + V+GLQG   DRL     + A+ KHF  Y         R    
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG---DRLDAEDSLLATLKHFAGYGASEGG---RNGAP 231

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +  +++ +   +PFR  V E    SVM +YN+++GVP  +   +L   +R  W  +G+
Sbjct: 232 VHMGLRELHEIDLLPFRKAV-EAGAQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGF 290

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
           +++DC ++ +     +  ++ EEAAA A+ AG+D++  G    ++   A+++G ++E D+
Sbjct: 291 VITDCGAIDMLKSGHNTAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDL 350

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           N A+   L ++ RLG+FD  P + P      K +   +H ELA   A +GIVLLKN+G  
Sbjct: 351 NTAVGRVLAMKFRLGLFD-RPYTDP--ERAEKVIGCEEHIELARRVAAEGIVLLKNEGNV 407

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY------ARTIHQQ 455
           LPL+  +   +AVIGPN++     +G+Y          T L+GI R+       R ++  
Sbjct: 408 LPLNP-KTGKIAVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAP 466

Query: 456 GCK-----------DVAC-ADDQLFGAAIDASRQAD---ATI-------LVMGLDQS-IE 492
           GC+            +AC A+  +   AI  S   D    TI       +V GL QS +E
Sbjct: 467 GCRIQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDME 526

Query: 493 -AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
             E +DR+ L L G Q EL+ ++    K P ++V ++G PI   +   D  I AI+ A Y
Sbjct: 527 CGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPWI--DEHIPAILEAWY 583

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQ GG+AIADILFG  NP G+L +T  P+E +  LP+         +KR  G+ Y    
Sbjct: 584 PGQEGGSAIADILFGDVNPSGRLTLT-IPKE-VGQLPIN------YNAKRTRGKRYLETD 635

Query: 612 GPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
               YPFG+G+SYT+F +  ++  P V  +P DG                        + 
Sbjct: 636 LEPRYPFGYGLSYTDFHYGNLSVEPAV--IPADG------------------------SA 669

Query: 671 GVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V++ V N G +DGA  + L  S   A    P K L AF KV + AG  + V   +   +
Sbjct: 670 AVRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVG-PE 728

Query: 730 YLSVVDRSGTRRIPLGEHNIHIG 752
            L ++       +  GE  I +G
Sbjct: 729 QLELIGPDMKAVVEPGEFRIRVG 751


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 162/416 (38%), Positives = 248/416 (59%), Gaps = 41/416 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           PI +R+ DL+ RL+++EK+ LL + +  + RL I  Y   +EALHGV  V PG +F    
Sbjct: 35  PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF---- 87

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 88  ---TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG+D   LK+ ++ KHF A + ++   
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAANNEEH--- 201

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F  CV EGK AS+M +YN +N VP   +  +L + +R 
Sbjct: 202 -NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLTKVLRE 260

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      P L     +
Sbjct: 261 DWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL-----N 315

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           A ++ ++++ DI++A    L  +M+LG+FD GE  + PY  + PK + + +HQ++AL+AA
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--NNPYTKISPKVIGSKEHQKVALDAA 373

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
           R+ IVLLKNQ   LPL   + +++AV+G N+    +  G+Y+G+      + LQGI
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGI 427



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/301 (27%), Positives = 146/301 (48%), Gaps = 45/301 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A    R+ +  + V+G++++IE E  DR+ + LP  Q+E + ++   +    ++VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AII A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS--LDE 706

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT+F                 ++
Sbjct: 707 LPPFD------DYDITKGRTYQYFKGNVLYPFGYGLSYTSF-----------------KY 743

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
             +  T   + + V+              +KNVG   G     ++   P      P K+L
Sbjct: 744 SDLQVTDGNQEVNVSFC------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKEL 791

Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
             FE++ +  G  ++V I +   + +Y            P G++ I IG +   + L   
Sbjct: 792 KGFERISLKRGESRKVTIRLKKDLLRYWDEEKECFVH--PSGDYTIMIGASSADIRLQKV 849

Query: 764 T 764
           +
Sbjct: 850 S 850


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 162/416 (38%), Positives = 248/416 (59%), Gaps = 41/416 (9%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDF 107
           PI +R+ DL+ RL+++EK+ LL + +  + RL I  Y   +EALHGV  V PG +F    
Sbjct: 35  PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF---- 87

Query: 108 PGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNV 157
              T FPQ I  A+++N  L + +  V+SDEARA +N    G          LT+WSP V
Sbjct: 88  ---TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR  ET GEDP LSG    ++V+GLQG+D   LK+ ++ KHF A + ++   
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAANNEEH--- 201

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +RF  N ++S++ + + +   F  CV EGK AS+M +YN +N VP   +  +L + +R 
Sbjct: 202 -NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLTKVLRE 260

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTES 331
           +W   GY+VSDC    +  +   +  T E AA  +I+AGLDL+CG      P L     +
Sbjct: 261 DWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL-----N 315

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           A ++ ++++ DI++A    L  +M+LG+FD GE  + PY  + PK + + +HQ++AL+AA
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSGE--NNPYTKISPKVIGSKEHQKVALDAA 373

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
           R+ IVLLKNQ   LPL   + +++AV+G N+    +  G+Y+G+      + LQGI
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGI 427



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 83/301 (27%), Positives = 148/301 (49%), Gaps = 45/301 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A    R+ +  + V+G++++IE E  DR+ + LP  Q+E + ++   +    ++VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  I AII A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 706

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT+F                 ++
Sbjct: 707 LPPFD------DYDITKGRTYQYFKGNVLYPFGYGLSYTSF-----------------KY 743

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
             +  T   + + V+              +KNVG   G     ++   P      P K+L
Sbjct: 744 SDLQVTEGNQEVNVSFC------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKEL 791

Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
             FE++ +  G  ++V I +   + +Y    +  G    P G++ I +G +   + L   
Sbjct: 792 KGFERISLKRGGSRKVTIRLKKDLLRYWD--EEKGCFVHPSGDYTIMVGASSADIRLQKV 849

Query: 764 T 764
           +
Sbjct: 850 S 850


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 215/694 (30%), Positives = 335/694 (48%), Gaps = 108/694 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  +++++ TL E +G+ ++ E R+      A ++Y  P +++ RDPRW R 
Sbjct: 161 GTTVFPTGIGMSATWSPTLIEEVGKAIAKEIRSQ----GAHISY-GPVLDLSRDPRWSRV 215

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA-ASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG+  A+ V GL   D  R     A+ KHF AY +        +   A V
Sbjct: 216 EETFGEDPVLSGRLGAAMVTGLGSGDLSREHATIATLKHFLAYAVPEGGQNGNY---ASV 272

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+ + F  PFR  +  G + SVM SYN ++G+P  A+  +L + +R EW+  G++VS
Sbjct: 273 GARDLHENFLPPFREAIEAGAL-SVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVS 331

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++    ST EEAA  A+ AG+D+D G    ++   AV+ G L E  IN A+
Sbjct: 332 DLYSIEGIHESHFVASTMEEAAVQALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAV 391

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P      K V   +H +LA + A+  +VLL+N+   LPLS
Sbjct: 392 DRILRMKFEMGLFE-HPYVNP--KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLS 448

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
             + + VAV+GPN+D    M+G+Y          T L G+      +R  + +GC  +D 
Sbjct: 449 K-KIKRVAVVGPNADNRYNMLGDYTAPQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRDT 507

Query: 461 ACADDQLFGAAIDASRQADATILVMG-------------------LDQSIE----AEALD 497
              +      A++A+ +++  I V+G                    ++SI      E  D
Sbjct: 508 TVNE---IAEAVEAAHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFD 564

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           RA L L G+QQ+L++ +    K P I+V + G P+D  +A       A++ A YPGQAGG
Sbjct: 565 RATLTLLGKQQDLLNALKTTGK-PLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAGG 621

Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
            AIAD+LFG  NP G+LP++          +Y ++   N    EMA  P           
Sbjct: 622 DAIADVLFGDYNPAGRLPVSVPRSVGQIPVYYNKKAPRNHDYVEMAASP----------- 670

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
                  +Y FG+G+SYT F ++                   +  I+ K      + C+ 
Sbjct: 671 -------LYGFGYGLSYTTFEYS-------------------DLQITQK------SPCH- 697

Query: 668 LTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
               V   VKN G+ DG     L      A    P KQL  FE+  +  G ++ +   + 
Sbjct: 698 --FEVSFKVKNTGNYDGEEVAQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTL- 754

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             K LS++DRS  R +  G+  I IG +   + L
Sbjct: 755 TEKDLSIIDRSMKRVVETGDFRIMIGASSDDIRL 788


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 223 EETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 165/448 (36%), Positives = 242/448 (54%), Gaps = 54/448 (12%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A  + LP+    L   +R NDL+GRL+L+EK  L+ + + A+PRLGIK Y+WW+EALHGV
Sbjct: 20  AMGQQLPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV 79

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GG 146
              G           AT FPQ I  A+SF+  L   +   VSDEARA Y           
Sbjct: 80  GRAGI----------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKR 129

Query: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
             GLT+W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRGLQG  D    K+ A  K
Sbjct: 130 YQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYDKLHACAK 189

Query: 206 HFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           HF  +    WN   R  FNA+ ++ +D+ +T+   F+  V +  V  VMC+YN++ G P 
Sbjct: 190 HFAVHSGPEWN---RHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPC 246

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEE--AAADAIRAGLDLDCG 322
           C +  +L   +R EW   G +VSDC ++  ++      + P++  A+A A+ +G DL+CG
Sbjct: 247 CGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLECG 306

Query: 323 PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVC---- 378
                  E AV+ GL++E  ++ ++   L  +  LG  D             KDVC    
Sbjct: 307 SNYKSLPE-AVKAGLIAESQLDISVKRLLKARFELGEMD-------------KDVCWDTI 352

Query: 379 ------TPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
                    H++LAL  AR+ IVLL+N+   LPL   +   +A++GPN++ ++   GNY 
Sbjct: 353 PYSVVDCQAHKDLALRMARESIVLLQNRNNILPLR--KDMKIALVGPNANDSIMHWGNYN 410

Query: 433 GIACGYTTPLQGIGRY---ARTIHQQGC 457
           G      T  + + +    ++ I++ GC
Sbjct: 411 GFPSHTETLYEALKKRLPASQLIYEFGC 438



 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 136/309 (44%), Gaps = 49/309 (15%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           A  D  + AD  +   G+  ++E E +          DR  + LP  Q++LV ++    K
Sbjct: 593 ATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKLGK 652

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P + +  SG  +    A        +I A YPGQAGGTAIAD+LFG  NP GKLP+T+Y
Sbjct: 653 -PIVFINYSGSAM--GLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVTFY 709

Query: 580 PQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVV 638
                  LP   + AM+        GRTYR+     ++ FGHG+SYT F +         
Sbjct: 710 RN--TEQLPDFEDYAMK--------GRTYRYMTETPLFRFGHGLSYTTFDY--------- 750

Query: 639 AVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
                           GKA    +      TL + + V N G++DG  T+ V+   P   
Sbjct: 751 ----------------GKARLSQNTFSKGETLTLTIPVSNTGTRDGEETVQVYLRRPGDA 794

Query: 699 WAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
            AP   L AF++V+VP G  + +   +    +L     +    +  GE+ +  GGT  + 
Sbjct: 795 DAPSHTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYGGTSDTN 854

Query: 759 SLHAATLGV 767
            L    L +
Sbjct: 855 QLKKIRLMI 863


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803


>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 858

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 173/457 (37%), Positives = 250/457 (54%), Gaps = 30/457 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT + LP+C  +L   +R  DL+ RL+L+EK +L++  + A+PRLGIK + WWSEALHG 
Sbjct: 17  ATAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGA 76

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA- 148
           +N+G          G T FP+ +  A+SFN  L   +    SDE RA Y     NGG   
Sbjct: 77  ANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDE 126

Query: 149 ---GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC- 204
               L+ W+PNVNIFRDPRWGRGQET GEDP L+     + VRGLQG +  + +   +C 
Sbjct: 127 KFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKLWACA 186

Query: 205 KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
           KH+  +    +    R   N A VS +D+ +T+   F+  V E KV  VMC+Y +++  P
Sbjct: 187 KHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDP 243

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
            C++  +L++ +R EW  N  +VSDC +V   Y     +S    AAA A  AG D++CG 
Sbjct: 244 CCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGF 303

Query: 324 FLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDH 382
                T   AV+RGL++E +++  ++  L  +  LG  D +P    +  +    + +  H
Sbjct: 304 GYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAH 362

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           ++LAL+ ARQ +VLL+N+G  LPL       + VIGPN+D    M GNY G      T L
Sbjct: 363 RQLALDMARQSLVLLQNKGGVLPLK-AGGDPITVIGPNADDGPMMWGNYNGTPNRTVTIL 421

Query: 443 QGI-GRYARTIHQQGCK--DVACADDQLFGAAIDASR 476
            GI  R+ R  + +GC   D    +  L   AID  +
Sbjct: 422 DGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRK 458



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/299 (27%), Positives = 130/299 (43%), Gaps = 54/299 (18%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           + V G+  ++E E +          DR  + LP  Q++ +  +  A K   + V  SG  
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           I  A         AI+ A Y GQ GGTA++D+LFGT NP GKLP+T+Y +     LP  E
Sbjct: 662 I--ALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKR--TDQLPDYE 717

Query: 592 -MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
             +MR        GRTYR++  P ++ FG+G+SYT F                 R G   
Sbjct: 718 DYSMR--------GRTYRYFSDP-LFAFGYGLSYTTF-----------------RFGRAR 751

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
           A  +    +++            V + N G++ G   + V+    A    P K L AF +
Sbjct: 752 AEAAEGGYRLS------------VPLTNTGTRPGEEVVQVYIRRVADTNGPLKSLRAFRR 799

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           V + AG    V I +    +    + + T R   G++ +  G +     L   T+ V++
Sbjct: 800 VALKAGESTTVEIPLSRKSFECFDESTNTMRTLPGDYELMYGNSSRPKDLQTLTVSVVE 858


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCT 511

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 627

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 680

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 681 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 715

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 774

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 775 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 803


>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
          Length = 852

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 164/426 (38%), Positives = 239/426 (56%), Gaps = 39/426 (9%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           F  ++ P  +R+ DL+ RL+++EK+ LL++ A  +PRL I  Y   +EALHG+  V PG 
Sbjct: 30  FRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGI--VRPGE 87

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
                    T FPQ I  A+++N  L   +   +SDEAR  +           G +  LT
Sbjct: 88  --------FTVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLT 139

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP L+G     +V+GLQG     LK  ++ KHF A +
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDHPRYLKTVSTPKHFAANN 199

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +R   NA++S++D+ + +   F  C+++ K  S+M +YN VNGVP   +  ++
Sbjct: 200 EEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCTVNTYLI 255

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFL 325
           K  +RG+W  NGYIVSDC +         +    + AA  AI+AGLDL+CG      P L
Sbjct: 256 KNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRVYTAPLL 315

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             + ES V     S+ DI++A    L  +M LG+FD +PS  PY  + P  +    HQEL
Sbjct: 316 KAYNESMV-----SKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQEL 369

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY-TTPLQG 444
           ALE ARQ +VLLKNQ   LPL+  + +++AV+G N+       G+Y+GI      + L G
Sbjct: 370 ALETARQSMVLLKNQKNFLPLNLKKVKSIAVVGINA--GHCEFGDYSGIPKNAPVSVLDG 427

Query: 445 IGRYAR 450
           I +YA 
Sbjct: 428 IRKYAE 433



 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 91/296 (30%), Positives = 145/296 (48%), Gaps = 42/296 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           LFG A  A+++ D T+ V+G+++SIE E  DR  L LP  QQE + ++   +   T++VL
Sbjct: 593 LFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPN-TVVVL 651

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           ++G  + + +   D  + AI+ A YPG+ GGTAIA++LFG  NPGG+LP+T+Y    +  
Sbjct: 652 VAGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYYNS--LDE 707

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +       +     RTY+++KG  +Y FG+G+SYT F                  +
Sbjct: 708 LPSFD-------NYSVQNRTYQYFKGKPLYEFGYGLSYTKF-----------------NY 743

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
              N +I+   I +T              V N G  DG     V+   P  G + P KQL
Sbjct: 744 KKKNVSIANDTIDITFK------------VSNAGKYDGDEVAQVYVQYPETGTYMPLKQL 791

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             F +VH+  G    V I++   +     +++     P G++   IG +   + L 
Sbjct: 792 RGFSRVHIKKGKSADVTISVPKKELRYWDEKTRQFVTPEGKYVFLIGSSSEDIKLQ 847


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 207/691 (29%), Positives = 347/691 (50%), Gaps = 100/691 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           AT+FPQ I  AS+++  L   +   +  +A+ +   GT      SP +++ RDPRWGR +
Sbjct: 101 ATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI---GTN--QCLSPVLDVCRDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GED  L      +YV+GLQG +    ++ A+ KHF A+   +    +   H    V 
Sbjct: 156 ETYGEDQYLVASIGLAYVKGLQGEN----ELIATVKHFAAHGFPEGGRNIAPVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ + F  PF + +  GK  SVM +Y++++G+P  ++  +L + +R EW   G +VSD
Sbjct: 208 NRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLD-----LDCGPFLGLHTESAVQRGLLSEIDI 343
            D++         +   +EAA  A+ AG+D     +DC    G     AV+ GL+SE  I
Sbjct: 268 YDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLISESII 324

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+   L ++ +LG+F+    ++   +  P+ +     +ELAL+ AR+ IVLLKN    
Sbjct: 325 DRAVERVLRIKEKLGLFNDHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKNDN-I 380

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNY-------AGIACGYTTPLQGIGRYA----RTI 452
           LPL+     T+AVIGPN++    ++G+Y       A +     T L+GI R        +
Sbjct: 381 LPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGIMRKVSNNTNVL 439

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEA 493
           + +GC D+A    + F  AI+ +++ D  I VMG                     Q++  
Sbjct: 440 YAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQAVTG 498

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DR  L LPG Q+EL+ ++    K P ILVL++G P+ ++   N+  + AII A +PG
Sbjct: 499 EGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAWFPG 555

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYK 611
           + GG AIAD++FG  NP G+LP+++         P+ +    P    R P   R Y   K
Sbjct: 556 EEGGNAIADVIFGDYNPSGRLPISF---------PI-DTGQIPIYYNRKPSSLRPYVMMK 605

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
              ++PFG+G+SYT F ++           L+     +N+  SGK             + 
Sbjct: 606 SKPLFPFGYGLSYTEFKYS----------NLEVTPKEVNS--SGK-------------IK 640

Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           + ++V+NVG ++G  T+ L  S   +G   P K+L  F KV++    ++++  ++ + + 
Sbjct: 641 ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITFSLPL-EA 699

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L+  D+     I  G++ I IG +   + L 
Sbjct: 700 LAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730


>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
 gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
 gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
          Length = 885

 Score =  275 bits (704), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 166/443 (37%), Positives = 241/443 (54%), Gaps = 39/443 (8%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           D T+  LP+   +L   +R  DL  RL+L+EK  L+   + A+PRLGIK + WWSEALHG
Sbjct: 42  DKTSTLLPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHG 101

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA 148
            +N+G            T+FP+ I  ASSFN TL +++    SDE RA Y     NGG  
Sbjct: 102 AANMG----------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGED 151

Query: 149 ----GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
                L+ W+PNVNIFRDPRWGRGQET GEDP L+     + V GLQG +  + +   +C
Sbjct: 152 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYRKLWAC 211

Query: 205 -KHFTAYDLDNWNGVDRFHFNA---KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            KHF  +     +G +     A    +S +D+ +T+   F+  V +G V  VMC+Y +++
Sbjct: 212 AKHFAVH-----SGPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCAYQRLD 266

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
             P C++  +L++ +R EW     +VSDC +V   + +   +S    A+  A  AG D++
Sbjct: 267 DEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATLAGTDVE 326

Query: 321 CGP-FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG----EPSSQPYGHLGPK 375
           CG  +       AV+RGLL+E +I+  ++  L  +  LG  D     E S  PY  +  K
Sbjct: 327 CGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVEWSKIPYSIMSCK 386

Query: 376 DVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA 435
                 H +LAL+ ARQ IVLL+N+G  LPL   ++  +AVIGPN+D    M GNY G  
Sbjct: 387 -----AHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGTP 441

Query: 436 CGYTTPLQGIGR-YARTIHQQGC 457
               + L+GI + Y   ++   C
Sbjct: 442 NHTVSILEGIRKQYKNVVYLPAC 464



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 134/304 (44%), Gaps = 53/304 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A I   +  D  I V G+  S+E E +          DR  + +P  Q++ +  ++ A
Sbjct: 616 YAANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEA 675

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   ILV  SG  I  A      R  AII A YPGQ GGTA+ADIL G  NP GKLP+T
Sbjct: 676 GK-QIILVNCSGSAI--ALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVT 732

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP   + +M+         RTYR+++   +YPFG+G+SYT+F    A   T
Sbjct: 733 FYKS--TQQLPDFEDYSMK--------NRTYRYFED-ALYPFGYGLSYTSFEIGTAKLQT 781

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
           +                            N +TL  Q+ V N G ++G   + V+     
Sbjct: 782 LTN--------------------------NSITL--QIPVTNTGKREGTELVQVYLRRDD 813

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L +F  + + AG  ++  + ++  ++      + T R+  G++ I  G +  
Sbjct: 814 DVEGPSKTLRSFAHITLKAGETKKAILKLNRNQFECWDASTNTMRVIPGKYTIFYGNSSK 873

Query: 757 SVSL 760
              L
Sbjct: 874 KEDL 877


>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 908

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 153/405 (37%), Positives = 232/405 (57%), Gaps = 26/405 (6%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           QR  DL+GR++L+EK   +++GAAA+PRL +  Y++W+E LHGV+  G           A
Sbjct: 33  QRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSG----------YA 82

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A++++A L + IG V++ EARA  N            GLT+WSPN+NIFRD
Sbjct: 83  TMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFWSPNINIFRD 142

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFH 222
           PRWGRGQET GEDP L+ +   +++ GLQG+D    KV A+ KHF  +         R  
Sbjct: 143 PRWGRGQETYGEDPHLTTQLGVNFIEGLQGTDPKFYKVIATPKHFAVHSGPEEG---RHK 199

Query: 223 FNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
           F+ + +  D+ DT+   FR  +++ K  S+MC+YN+++G P C    +L   +R +W+  
Sbjct: 200 FDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACGSKLLLVDILRNDWKFQ 259

Query: 283 GYIVSDCDSVGVYYDTQHFTSTPEEAAAD--AIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
           G++ SDC ++  ++      + P+   AD  A+ AG D +CG         AV+ GL+ E
Sbjct: 260 GFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGSTY-RKLGDAVKSGLIKE 318

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
            DI+ +L      ++RLG+FD    S PY  +    V +P +  +A  AA + +VLLKN 
Sbjct: 319 SDIDVSLRRLFEARVRLGLFD-PAGSVPYAQIPFSQVNSPANAAVAKRAAEESMVLLKND 377

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           G  LPL   +++T+AVIGPN     ++ GNY G+A     P+  +
Sbjct: 378 G-ILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDAL 421



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 153/304 (50%), Gaps = 55/304 (18%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSM 516
           L   A++A+ ++D  + ++GL   +E E +          DR  + LP  QQ L+  + +
Sbjct: 620 LLPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGL-I 678

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
           A+  PTI+VL++G  + +  A  D +  AI+ + YPG+AG TA+AD L G +NP G+LP+
Sbjct: 679 ATGKPTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLPI 736

Query: 577 TWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP 635
           T+Y  E  ++LP   + +M+         RTYR++KG  +Y FG G+SYT F ++     
Sbjct: 737 TFYKSE--SDLPGFEDYSMQ--------NRTYRYFKGAPLYGFGFGLSYTKFAYS----- 781

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNR-LTLGVQVDVKNVGSKDGAHTLLVFSTP 694
                                 +K+  AK N   TL  +V VKN G   G     ++  P
Sbjct: 782 ---------------------GLKLAKAKLNAGDTLTAEVTVKNTGKVAGEEVAELYLLP 820

Query: 695 PA---GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
           PA      +P +QL  F++V +  G  +++   +   + LS VD  GTR I  G + I I
Sbjct: 821 PAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTL-TPRQLSEVDAKGTRAIQPGTYAIAI 879

Query: 752 GGTK 755
           GG +
Sbjct: 880 GGAQ 883


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 170/454 (37%), Positives = 256/454 (56%), Gaps = 46/454 (10%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   +R  DL+ R++L+EK   + +    + RLG++ Y+WW+EALHG++  G  
Sbjct: 25  PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNEALHGIARAGK- 83

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---------MYNGGTAGLT 151
                    AT FPQ I  A++F+ T    +  +VSDE RA         MYNG   GLT
Sbjct: 84  ---------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQRKGMYNG-YKGLT 133

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAY 210
           +W+PN+NIFRDPRWGRG ET GEDP L+ K   + V+GLQG    +   A +C KH+  +
Sbjct: 134 FWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGDGTQKYDKAHACAKHYAVH 193

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  +NA+ +S +D+ +T+   F+  V EGKV  VMC+YN+  G P C++  
Sbjct: 194 SGPEWN---RHSYNAENISIRDLRETYLPAFKALVTEGKVKEVMCAYNRFEGEPCCSNKT 250

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDT-QHFT-STPEEAAADAIRAGLDLDC-GPFLG 326
           +L   ++ EW  +  IVSDC ++  +Y   +H T ++  +A+ADA+ +G DL+C G +  
Sbjct: 251 LLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASADAVISGTDLECGGSYWA 310

Query: 327 LHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP----SSQPYGHLGPKDVCTPDH 382
           L  + A+++GL++E  IN ++   L  +  LGMFD +     SS PY       VC   H
Sbjct: 311 L--DEALEKGLITETKINESVFRLLRARFELGMFDDDSLVSWSSIPYSV-----VCCDKH 363

Query: 383 QELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
           +  ALE AR+ +VLL N+  +LPLS    + VAV+GPN++ +V +  NY G      T L
Sbjct: 364 KAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPNANDSVMLWANYNGTPDRSVTIL 422

Query: 443 QGIGRY---ARTIHQQGCKDVACADDQLFGAAID 473
           +GI         I+++GC  V   D ++F +  D
Sbjct: 423 EGIKAKLPEGSVIYEKGCDYV---DTEVFFSYFD 453



 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 149/320 (46%), Gaps = 46/320 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           DV  +    + A  +  + ADA I V G+  S+E E +          DR  + LP  Q+
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
            ++  +    K P I VL SG  +  A +  D  + AI+ A YPGQ GGTA+AD+LFG  
Sbjct: 643 NMMKALKETGK-PVIFVLCSGSTM--ALSWEDKNMDAILQAWYPGQEGGTAVADVLFGDY 699

Query: 569 NPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           NP G+LP+T+Y      +LP  E   M   Q     GRTYR++KG  +YPFGHG+SYT F
Sbjct: 700 NPAGRLPLTFYASS--DDLPDFENYNMSEGQ-----GRTYRYFKGKPLYPFGHGLSYTGF 752

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
            ++ A         L+ +  S+N ++                  + +++KN G +DG   
Sbjct: 753 SYSKAK--------LNKKSMSVNDSVF-----------------LSLNLKNTGLRDGDEV 787

Query: 688 LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEH 747
           + V+         P K L  +++V V AG    V I++    +      +    +  G++
Sbjct: 788 VQVYIRNLQDPEGPSKSLRGYKRVSVKAGQTVPVKIDLPASSFEFFNPVTEKMEVRPGKY 847

Query: 748 NIHIGGTKHSVSLHAATLGV 767
            I  GGT  +  L    L +
Sbjct: 848 EILYGGTSDNKQLKKLGLEI 867


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 222/689 (32%), Positives = 329/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AAQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGLFTIMVGGSSEDIACRQA 794


>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
 gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
           SJ95]
          Length = 777

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 202/697 (28%), Positives = 354/697 (50%), Gaps = 102/697 (14%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FPQ+I  AS++   L E +   + ++ +A+  G   GL   SP V++ RDPRWGR 
Sbjct: 110 GATIFPQMIGAASTWEPPLIERMTTSIRNQMKAL--GIHQGL---SPVVDVTRDPRWGRT 164

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFN---A 225
           +ET GEDP L  K   +YV+GLQ SD  +  + A+ KHF  Y      GV     N   A
Sbjct: 165 EETFGEDPYLIAKMGVAYVKGLQ-SDDLKNGIVATLKHFVGY------GVSEGGMNWAPA 217

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            + ++++++TF  PF   + EGKV SVM +Y++++G+P  A   +L+R +R EW  +G +
Sbjct: 218 HIPERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIV 277

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEID 342
           VSD  ++    +        EEAA  A++AG+D++  P    + E   +A++ G  SE  
Sbjct: 278 VSDYFAINSLMEYHKIALNKEEAAIKALKAGIDVEL-PSFDCYKEPLKNAIENGEFSEAF 336

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLG--PKDVCTPDHQELALEAARQGIVLLKNQ 400
           I+ ++ N L ++  +G+F+      PY  L   P ++ TP+ ++LA E A++ IVLLKN 
Sbjct: 337 IDKSVRNILRLKFEMGLFE-----NPYVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND 391

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI---------ACGYTTPLQGIGRYA-- 449
           G      + + + VAVIGPN++    + G+Y  +         A G T+ ++GI      
Sbjct: 392 GIVPLKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFG-TSAMEGITFSESE 450

Query: 450 ---RTIHQ----------------QGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              +TI++                +GC ++   + ++   A++ +  +D  +LV+G D+S
Sbjct: 451 LPIKTIYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLG-DKS 508

Query: 491 ------IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
                    E+ D + L+LPG Q +L+  V + +  P I+VL++G P  + +   +  ++
Sbjct: 509 GLTLDCTTGESRDSSTLILPGVQLDLLKSV-INTGTPVIVVLVNGRPYSLDWVSKN--VS 565

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI  A  PG+ GG A+ADI+ G  +P GKLP++ +P+ ++  +P+     +PS  +    
Sbjct: 566 AIFEAWLPGEEGGNALADIILGDESPSGKLPIS-FPR-HVGQIPVY-YNHKPSGGRSQWW 622

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
             Y       +YPFGHG+SYT F                  +G++    + + +K++   
Sbjct: 623 GDYTDSPAKPLYPFGHGLSYTQF-----------------EYGNLQIENNDRIVKIS--- 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    +DVKN+G + G   + L  +   A    P K+L  F++V +    ++R+  
Sbjct: 663 ---------MDVKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEKKRIIF 713

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           N+ + + L++ +      +  G   + +G +   + L
Sbjct: 714 NLPI-ETLALYNEKMEFLVEKGYFKVMVGSSSEDIRL 749


>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 838

 Score =  275 bits (703), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 173/452 (38%), Positives = 249/452 (55%), Gaps = 37/452 (8%)

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           +L++++K   L++ A A P L +  Y+WWSE LHG++  GP          AT FPQ + 
Sbjct: 24  KLTVEQKAAQLVNEAPAEPALDLPAYDWWSEGLHGIARNGP----------ATVFPQAMG 73

Query: 119 TASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRDPRWGRGQE 170
            A++++  L E +G VV+ EARA +N        G   GLT WSPN+NIFRDPRWGRGQE
Sbjct: 74  MAATWDPALIEKVGDVVAVEARAKFNTKPINADRGLFEGLTIWSPNINIFRDPRWGRGQE 133

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVS 228
           T GEDP L+ +   ++VRGLQG D    KV A+ KH   +     +G +  R  F+  VS
Sbjct: 134 TYGEDPFLTSRIGIAFVRGLQGPDLKHPKVIATVKHLAVH-----SGPEGGRDSFDVMVS 188

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +D+E T+   FR  V EGK  S+MC+YN ++G P CA+P ++   +R +W   G  VSD
Sbjct: 189 PRDLEATYLPAFRATVTEGKALSLMCAYNAIHGTPVCANPMLMTERLRTDWGFKGLTVSD 248

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALV 348
           CD+VG  +   H       AAA A+RAG DL+CG       E A++RGL++E +I+ AL 
Sbjct: 249 CDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCGNTYRALPE-ALKRGLITEGEIDTALA 307

Query: 349 NTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
             L V+  L +        P+  + P  + TP ++ LALE AR+GI+LL N    LPL  
Sbjct: 308 RALAVRRMLTV------DSPWNRIKPSQLGTPANRALALETARKGIILLDNPNDLLPL-- 359

Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGCKDVACADDQ 466
            + + +AV G ++D    + GNY G A    TPL GI  ++ R + + QG      A   
Sbjct: 360 -KGKKLAVFGADADDLGVLEGNYHGSAIDPVTPLDGIRAKFGRDVRYAQGSVLAEGAFIN 418

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDR 498
           +   A+ A+ +    +  MG +    A   DR
Sbjct: 419 VPETALSANGKPGLAVETMGANGQWIAAGQDR 450



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 137/307 (44%), Gaps = 64/307 (20%)

Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
           + L   A+  +  AD  + ++GL   +E EAL          DR  ++LP  Q+ L++ +
Sbjct: 550 EALQAEALKVAGTADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAAL 609

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
               K P + V++SG     A +  D + AA + A YPG  GGTA+A+IL G  NP G+L
Sbjct: 610 QATGK-PVVAVIVSGS----AVSLGDIKPAATLAAFYPGAEGGTALAEILSGDVNPSGRL 664

Query: 575 PMTWYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           P+T Y      +LP   +  M+         RTYR++ G   + FGHG+SYT F +   +
Sbjct: 665 PVTIY--HSAADLPAFADYGMKE--------RTYRYFTGKPAWGFGHGLSYTKFDYGQPS 714

Query: 634 APTVVAV--PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
               VAV  P+D                            V V ++N+G + G      +
Sbjct: 715 VTASVAVGQPVD----------------------------VAVQLRNIGQRGGEEVAQAY 746

Query: 692 STPPAGHWAP-------HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPL 744
             PPA    P        +QLV F +V +  G        I   + +S V R GTRRI  
Sbjct: 747 LVPPATDEKPVLTDPILQRQLVGFTRVALKPGQTGTTRFTID-PRMMSQVWRDGTRRILP 805

Query: 745 GEHNIHI 751
           G + ++I
Sbjct: 806 GTYKLYI 812


>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
 gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
          Length = 849

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 166/384 (43%), Positives = 219/384 (57%), Gaps = 26/384 (6%)

Query: 81  IKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR 140
           +  Y++W+EALHG++  G           AT FPQ I  A++++A L E IG VVS EAR
Sbjct: 46  LPAYDYWNEALHGLARNGV----------ATVFPQAIGLAATWDAPLLERIGTVVSTEAR 95

Query: 141 AMYNG-------GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS 193
           A YN           GLT WSPN+NIFRDPRWGRGQET GEDPVL+G  A +YVRGLQG 
Sbjct: 96  AKYNALPGKDRRRYQGLTIWSPNINIFRDPRWGRGQETYGEDPVLTGTLATAYVRGLQGP 155

Query: 194 DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVM 253
           D D  +V A+ KH  A+         R  FN + S  D+E T+   FR  + EGK  SVM
Sbjct: 156 DLDHPRVIATPKHLVAHSGPE---AGRDSFNVQSSAYDMEATYLPAFRRALTEGKALSVM 212

Query: 254 CSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAI 313
           CSYN V+GVP C    +L + +R +W  +G +VSDCD++G     Q +  T   A+A AI
Sbjct: 213 CSYNSVHGVPVCGADWLLNQRVRKDWGFDGLVVSDCDAIGNINHYQRYRQTNAAASAAAI 272

Query: 314 RAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
            AG+DL+CG       E+  Q GL +   ++ AL  T   + +LG  D   ++ P+  + 
Sbjct: 273 NAGMDLNCGRTYAALPEALAQ-GLTTREVVDRALARTFAARRKLG--DAFGATSPWATIP 329

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
              V T +H+ LALEAAR+ +VLLKN G  LPL   +   +AV+GPN+D   T+  NY G
Sbjct: 330 ANTVDTAEHRALALEAARKSLVLLKNNG-VLPLR--KGARIAVVGPNADSLDTLEANYHG 386

Query: 434 IACGYTTPLQGIGRYARTIHQQGC 457
            A    TPL G     R  + QG 
Sbjct: 387 TAAQPVTPLDGFAARYRMSYAQGA 410



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 88/288 (30%), Positives = 140/288 (48%), Gaps = 59/288 (20%)

Query: 485 MGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDV 534
           +GL  ++E EAL          DR  + LP  QQ+L+  +    K P ++VL+SG  + +
Sbjct: 579 VGLSPAVEGEALQIEVPGFSGGDRTDIALPRAQQDLLETLKATGK-PLVVVLLSGSAVAM 637

Query: 535 AFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMA 593
            + K +    A++ A YPGQ+GGTAIAD++ GT NP G+LP+T+Y +    +LP   +  
Sbjct: 638 PWVKEN--ADAVVAAWYPGQSGGTAIADLVDGTLNPSGRLPVTFYAR--TRDLPAFVDYN 693

Query: 594 MRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATI 653
           MR         RTYR++ G  ++ FG G+SYT+F +  A+AP  +               
Sbjct: 694 MRE--------RTYRYFHGTPLWSFGEGLSYTSFAYGKASAPARIK-------------- 731

Query: 654 SGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA-----GHWAP---HKQL 705
           +G+            TL   V + NVG+++G      +  PP      G +        L
Sbjct: 732 AGE------------TLTATVSLANVGARNGEEVAQAYLVPPEHLRTIGEFNDPVLRHSL 779

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           VA+ +  +  G   R+   +   + LS VDR+G R +  G + + IGG
Sbjct: 780 VAYRRAALAKGETTRLSFTLD-PRSLSTVDRNGVRAVRPGTYRLFIGG 826


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 154/417 (36%), Positives = 228/417 (54%), Gaps = 21/417 (5%)

Query: 44  QVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
            +S PI  RV +LI +++L+EKV  L +   ++PRL +  Y +W+E LHGV+  G     
Sbjct: 51  NLSQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE---- 106

Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDP 163
                  T FPQ I  AS+++  L + +   +S EAR  Y     GLTYWSP +N+ RDP
Sbjct: 107 ------VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDP 160

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
           RWGR +ET GEDP L+ +   ++V+GLQG   D LK  A+ KHF A + +N    DRF  
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPDYLKTVATIKHFVANNQEN----DRFSS 216

Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
           ++++  + + + +   +  CV E    SVM +YN  NGV       +L   +R EW  +G
Sbjct: 217 SSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLRKEWGFDG 276

Query: 284 YIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDI 343
           ++VSDC ++GV        ++ EEAAA  I +G DL+CG        +AV+ GL+SE  I
Sbjct: 277 FVVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREKLVAAVKMGLVSEQAI 336

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + AL   LT + +LG FD      PY H   K +      +LA EAA + IVLLKN    
Sbjct: 337 DKALTRVLTARFKLGEFD-PIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVLLKNDNDF 395

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQG 456
           LP+   + R+VA++GP +D     +G Y+G      + LQG    +G+  +  + +G
Sbjct: 396 LPVDKKKIRSVAIVGPFAD--NNYLGGYSGKPVHNVSLLQGVKDLVGKKVKVSYMEG 450



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 135/286 (47%), Gaps = 45/286 (15%)

Query: 478 ADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFA 537
           AD  ++ +G D+ +  E  D   + LP  Q+ L+ ++   +   T L+L +G P+   +A
Sbjct: 609 ADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKVNP-RTALILHTGNPLTSKWA 667

Query: 538 KNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPS 597
             +  + AI+ A YPGQ GG A+A ILFG+ NP GKLPMT Y  E        E      
Sbjct: 668 AEN--VPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIYESE--------EQLPDIL 717

Query: 598 QSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKA 657
               + GRTY++     +Y FGHG+SY+NF +T   +  VV    DG             
Sbjct: 718 DYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYTHLQSDDVVRP--DG------------- 762

Query: 658 IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF----STPPAGHWAPHKQLVAFEKVHV 713
                      TL   +++KN+    G   + V+    +TP   +  P K+LVAF +V +
Sbjct: 763 -----------TLQCSIEIKNISDVAGEEVVQVYISRENTPV--YTFPLKKLVAFARVDL 809

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
             G  + V   I   + LS+      + +P G++++ +G  +  +S
Sbjct: 810 KPGESKTVTFTI-APRQLSIWQEGIWKMLP-GKYSLFVGSGQEGLS 853


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 172/463 (37%), Positives = 259/463 (55%), Gaps = 55/463 (11%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M +     F GL+ L A +    A+E +    KD             P+ +R+ DL+ RL
Sbjct: 1   MKTLKKIVFTGLLALIACAGVAQAQELY----KDEKA----------PMHERIMDLLSRL 46

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +++EK+ LL + +  + RL I  Y   +EALHGV  V PG +F       T FPQ I  A
Sbjct: 47  TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 96

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
           +++N  L   +  V+SDEARA +N    G          LT+WSP VN+ RDPRWGR  E
Sbjct: 97  ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 156

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           T GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++    +RF  N ++S++
Sbjct: 157 TYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 212

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
            + + +   F  CV +GK AS+M +YN +N VP   +  +L + +R +W   GY+VSDC 
Sbjct: 213 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCG 272

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
              +  +   +  T E AAA +I+AGLDL+CG      P L     SA ++ ++++ DI+
Sbjct: 273 GPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL-----SAYRQYMVTDADID 327

Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           +A    L  +M LG+FD GE    PY  + P  + + +HQE+AL AAR+ IVLLKNQ   
Sbjct: 328 SAAYRVLRARMELGLFDSGE--QNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKM 385

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
           LPL+  + +++AV+G N+    +  G+Y+G+      + LQGI
Sbjct: 386 LPLNAKKVKSIAVVGINA--GSSEFGDYSGLPVIAPISVLQGI 426



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + V+G+++SIE E  DR  + LP  QQE + ++   +  P I+V+
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVN--PNIVVV 648

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 649 LVAGS-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 705

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT F ++            DG  
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 752

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                  + V   +KN G   G     V+   P      P K+L
Sbjct: 753 ----------------------EINVSFQLKNSGKYAGDEVAQVYVKLPERDEVMPIKEL 790

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
             FE+V + +G  ++V + +     L   D +  + + P G++ I +G +   + L 
Sbjct: 791 KGFERVTLKSGENKKVTLKLR-KDLLRYWDEAKDKFVCPSGDYTIMVGASSADIRLQ 846


>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
 gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
          Length = 805

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 221/689 (32%), Positives = 328/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R AD  ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCWVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDITCRQA 794


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/424 (38%), Positives = 238/424 (56%), Gaps = 32/424 (7%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
            P+   SL   +R +DL+ RL+L+EK  L+++G+ A+PRL IK Y WW+EALHG++  G 
Sbjct: 27  FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA----MYNGGT----AGLT 151
                     AT FPQ I   +SF+ +L   +   VSDEARA    + + G       LT
Sbjct: 87  ----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALT 136

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+ +   + V GLQG D  R  K+ A  KH+  +
Sbjct: 137 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHYAVH 196

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  FNA+ +S +D+ +T+   F+  V E KV  VMC+YN+  G P C    
Sbjct: 197 SGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGL 327
           +L + +R EW  +G +VSDC +V  ++  +   + P    A+ADA+  G D++CG     
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSYK- 312

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
               AV+ GL++E  I+ ++   L  +  LG  D       +  +    V +P H++LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQLAL 368

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           + AR+ + LL+N    LPLS  +   +A+IGPN++ +V   GNY G+     T L+G+ R
Sbjct: 369 QMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426

Query: 448 YART 451
           Y  T
Sbjct: 427 YLPT 430



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 133/299 (44%), Gaps = 47/299 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  +  D  I   G+  ++E E +          DR  + LP  Q+ +V  +  A K  
Sbjct: 595 LERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGK-R 653

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGG A+A++LFG  NP GKLP+T+Y  
Sbjct: 654 IVFVNFSGAAI--ALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFYRN 711

Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
             +  +P  E            GRTYR+ K   ++PFGHG+SYT F              
Sbjct: 712 --LAQIPDFE-------DYNMTGRTYRYMKETPLFPFGHGLSYTTF-------------- 748

Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
              ++G +         K+   +     L + + V N GS+DG   + V+         P
Sbjct: 749 ---KYGKLKMNDD----KIAAGQ----NLNLVIPVTNTGSRDGDEVVQVYLKKMDDTEGP 797

Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            K L AF++V +PAG    V  ++   +     ++S T R+  G + + IGGT    SL
Sbjct: 798 VKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIGGTSCEKSL 856


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 221/689 (32%), Positives = 328/689 (47%), Gaps = 87/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 159 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 213

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 214 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 270

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 271 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 329

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 330 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 387 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 443

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 444 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 502

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 503 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 561

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K P +LVL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 562 YDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 618

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 619 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYIEE-----PGTPR 671

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 672 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCRVDVTVT 706

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ A   + V   +   K L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTLD-KKSLALY 765

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 766 MQEGEWVVEPGRFTIMVGGSSEDIACRQA 794


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/424 (38%), Positives = 238/424 (56%), Gaps = 32/424 (7%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
            P+   SL   +R +DL+ RL+L+EK  L+++G+ A+PRL IK Y WW+EALHG++  G 
Sbjct: 27  FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA----MYNGGT----AGLT 151
                     AT FPQ I   +SF+ +L   +   VSDEARA    + + G       LT
Sbjct: 87  ----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALT 136

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+ +   + V GLQG D  R  K+ A  KH+  +
Sbjct: 137 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHYAVH 196

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               WN   R  FNA+ +S +D+ +T+   F+  V E KV  VMC+YN+  G P C    
Sbjct: 197 SGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGL 327
           +L + +R EW  +G +VSDC +V  ++  +   + P    A+ADA+  G D++CG     
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSYK- 312

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
               AV+ GL++E  I+ ++   L  +  LG  D       +  +    V +P H++LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQLAL 368

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           + AR+ + LL+N    LPLS  +   +A+IGPN++ +V   GNY G+     T L+G+ R
Sbjct: 369 QMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426

Query: 448 YART 451
           Y  T
Sbjct: 427 YLPT 430



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 133/299 (44%), Gaps = 47/299 (15%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGP 521
           ++  +  D  I   G+  ++E E +          DR  + LP  Q+ +V  +  A K  
Sbjct: 595 LERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGK-R 653

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ 581
            + V  SG  I  A         AI+ A YPGQAGG A+A++LFG  NP GKLP+T+Y  
Sbjct: 654 IVFVNFSGAAI--ALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTFYRN 711

Query: 582 EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP 641
             +  +P  E            GRTYR+ K   ++PFGHG+SYT F              
Sbjct: 712 --LAQIPDFE-------DYNMTGRTYRYMKETPLFPFGHGLSYTTF-------------- 748

Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
              ++G +         K+   +     L + + V N GS+DG   + V+         P
Sbjct: 749 ---KYGKLKMNDD----KIAAGQ----NLNLAIPVTNTGSRDGDEVVQVYLKKMDDTEGP 797

Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            K L AF++V +PAG    V  ++   +     ++S T R+  G + + IGGT    SL
Sbjct: 798 VKTLRAFKRVRIPAGKTVEVKFSLDDTQLEWWDEQSNTMRVCPGNYTVMIGGTSCEKSL 856


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 239/437 (54%), Gaps = 39/437 (8%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
           LP+   +L   QR +DL+ RL+L EKV L++  + A+PRLGI  ++WW+EALHG+   G 
Sbjct: 23  LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNG- 81

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLT 151
                     AT FP  +  A+S++  L   +   VSDEAR                 L+
Sbjct: 82  ---------FATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLS 132

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ--GSDGDRL------KVAAS 203
           +W+PN+NIFRDPRWGRGQET GEDP L+ K   + VRGLQ  G +G+ L      K+ A 
Sbjct: 133 FWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLLAC 192

Query: 204 CKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            KHF  +    WN   R  FN + + ++D+ +T+   F+  V EGKVA VMC+Y +++G 
Sbjct: 193 AKHFAVHSGPEWN---RHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRIDGQ 249

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC 321
             CA     ++ +R EW  +G I SDC ++  +    H  S    EA+A A+ AG D++C
Sbjct: 250 ACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDVEC 309

Query: 322 GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
           G     H   AV+RG + E DI+ +L   L  +  LG  D +     +  +    V +  
Sbjct: 310 GSEYK-HLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVASQA 367

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRH------RTVAVIGPNSDVTVTMIGNYAGIA 435
           H++LAL+ A + IVLL+N+   LPL +  +      + + V+GPN++ +V M GNYAG  
Sbjct: 368 HKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAGYP 427

Query: 436 CGYTTPLQGIGRYARTI 452
               T L GI R A+T+
Sbjct: 428 THTVTALDGITRMAKTL 444



 Score = 89.0 bits (219), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 57/297 (19%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           I V G+  ++E E +          DR  + LP  Q++L++ +  A K   I V  SG  
Sbjct: 624 IFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKAGK-KVIFVNCSGSA 682

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MT 590
           +  A A       AI+   Y G+ GG A+A  LFG   P GKLP+T+Y  +    LP   
Sbjct: 683 M--ALAPELETCDAILQWWYGGEQGGAALATTLFGMVAPSGKLPVTFY--KSTDELPDFL 738

Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
           +  M+         RTYR+Y+G  ++PFG G+ YT F                    +I+
Sbjct: 739 DYTMK--------NRTYRYYEGEPLFPFGFGLGYTTF--------------------NID 770

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEK 710
             I          K N+    VQV VKN+G+  G  T+ V+    A    P K L A+++
Sbjct: 771 KPIY---------KNNK----VQVRVKNLGTTAGTETVQVYIRHLADKEGPKKSLRAYQQ 817

Query: 711 VHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           V + A   + + I +    +     ++ T R+  G++ + +G +     L    + +
Sbjct: 818 VTLNAAEAKTISIELPRKSFEGWDVKTNTMRVVPGKYEVMVGNSSADKDLKKIIVNI 874


>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
 gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
          Length = 730

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 222/714 (31%), Positives = 338/714 (47%), Gaps = 91/714 (12%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           AT    PF    LP  +R++DLI  ++L+EKV L+      +PRL +K Y   SE  HGV
Sbjct: 22  ATDYDYPFQDPDLPNEERIDDLITCMTLEEKVDLM-GFVPGIPRLDVK-YTRISEGYHGV 79

Query: 95  SNVGPGTKFGGDFPGATS-FPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT---AGL 150
           +  GP + +G   P  T+ FPQ    A++++  L   +    + E R +Y       +GL
Sbjct: 80  AQGGP-SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGL 138

Query: 151 TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
              +PN ++ RDPRWGR +E  GEDP L+G  AA++  GL G     LK  +  KHF A 
Sbjct: 139 VVMAPNADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDHPRYLKATSLLKHFLA- 197

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
              N N  DRF  ++   ++   + +  PF M + +G   S+M +YN +NG P    P +
Sbjct: 198 ---NSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHVHP-M 253

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           L+  + GEW L+G I +D   +    +          A A  I+AG++L    FL  HT+
Sbjct: 254 LRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPTATAACIKAGINL----FLDNHTQ 309

Query: 331 S---AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG------PKDVCTPD 381
           +   AV++ L++E +I++ +   + + + LG+ D  P   PY ++G      P ++  P+
Sbjct: 310 AALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGHEPGLEPWEL--PE 366

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
                 E  R+ IVLLKN+   LPL   +  +VA++GP ++   T++  Y+G       P
Sbjct: 367 THAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLAN--TTLLDWYSGTPPYAIPP 424

Query: 442 LQGIGRYAR-----TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA--- 493
             GI  YA      +  + G   VA   D     A++ +   D  I+V+G      A   
Sbjct: 425 RDGIEGYANSGPFPSPAKFGSNWVADMSD----TALEVAASRDVAIVVVGNHPESNAGWG 480

Query: 494 ---------EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF-AKNDPRI 543
                    EA+DR  ++L   Q+E + KV  A+   TI+VL+S  P  + + A+N P I
Sbjct: 481 VVTSPSEGKEAVDRQEIILQPDQEEFIQKVYAANPN-TIVVLVSNFPYAMPWAAENAPAI 539

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
             I  A    Q  G A+AD+LFG  NPGGK   TW P+      PM +  +R        
Sbjct: 540 VHITHA---SQEQGNALADVLFGDYNPGGKTVQTW-PKSLDQLPPMMDYDIR-------R 588

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
           GRTY + +    YPFG+G+SYT F  +                            K    
Sbjct: 589 GRTYMYSQHEPQYPFGYGLSYTTFELSKL--------------------------KAPKK 622

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAG 716
                T  ++V V N G +DG   + ++   P      P KQL  F++V VPAG
Sbjct: 623 LKADATATIKVRVANTGERDGDEVVQLYVRYPNSKVERPSKQLKGFQRVTVPAG 676


>gi|337749758|ref|YP_004643920.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           KNP414]
 gi|336300947|gb|AEI44050.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           mucilaginosus KNP414]
          Length = 937

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/430 (38%), Positives = 244/430 (56%), Gaps = 32/430 (7%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           T   +PF    LP+ +RV DL+ RL+L+EK+ L+      +PRLGI  Y+  +E  HGV+
Sbjct: 3   TETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVA 62

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
            +G           AT FPQ +  A ++N  L   IG V+ DEAR  Y  N    GLT W
Sbjct: 63  WLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIW 112

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +P V++ RDPRWGR +E  GEDP L+G+ +   V+G+QG     LK AA+ KHF    L 
Sbjct: 113 APTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAATLKHF----LG 168

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           N N +DR   +A +  +++ + +   F     EG   S+M +YN VNG P   +P++   
Sbjct: 169 NNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDV-NA 227

Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLDL---DCGPFLGLHT 329
            ++GEW ++G++VSD  D +G   + ++F S  E  AA ++R+G+D    D G  L    
Sbjct: 228 IVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAA-SVRSGIDSITDDAGITL-RAV 285

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
             A++ GLL+E D++ A+ N   V++RLG FDG P   PY H+    +C P+H  L+L+A
Sbjct: 286 RDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDG-PEENPYAHVPEAKLCAPEHAALSLQA 344

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
           AR+G VLLKN+G  LPL+  +  ++AVIGP +DV  T    Y+G      TPL+G+    
Sbjct: 345 AREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERM 399

Query: 449 --ARTIHQQG 456
                IH+ G
Sbjct: 400 APGTVIHRTG 409



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 85/299 (28%), Positives = 143/299 (47%), Gaps = 48/299 (16%)

Query: 470 AAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           AA++A+RQA+  I+ +G    I   E +DR  L L   Q+ L+  V +A+   T+ VL+ 
Sbjct: 546 AAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAQEALLQAV-LAANPRTVAVLVG 604

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
             P  V +A+    + AI++  + GQ  G A AD+LFG  +P G+L MTWY      +LP
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWYKS--AADLP 660

Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRH 646
            + +  +   +      RTY+++ G V+YPFGHG+SYT F +  V  AP+         +
Sbjct: 661 DLLDYDIIKGK------RTYQYFDGEVLYPFGHGLSYTEFRYLEVQAAPS--------EN 706

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH--KQ 704
           G+ N T++                     V+N GS+ G   + ++       W P   K 
Sbjct: 707 GAFNVTVT---------------------VQNAGSRAGDEVVQLY-VKAGESWVPRPLKT 744

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSLHA 762
           L  F ++H+  G  ++V   +   + L+  D +  +  +  G +    GG+  ++ L A
Sbjct: 745 LAGFRRIHLLPGQTEQVQFTVEASQ-LAFWDVTREKYCVEEGTYRFMAGGSSGNLPLAA 802


>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
 gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 499

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 166/441 (37%), Positives = 248/441 (56%), Gaps = 52/441 (11%)

Query: 9   FLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKL 68
           F GL+L+ A    + A+E +  +  DA            P+ +R+ DL+ RL+++EKV L
Sbjct: 9   FAGLLLIIAYVIPMQAQELYKNE--DA------------PLHERIMDLLSRLTVEEKVSL 54

Query: 69  LISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLW 128
           L + +  + RL I  Y   +EALHGV  V PG +F       T FPQ I  A+++N  L 
Sbjct: 55  LRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLAATWNPELQ 104

Query: 129 EAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
             +  V+SDEARA +N    G          LT+WSP VN+ RDPRWGR  ET GEDP L
Sbjct: 105 YQVATVISDEARARWNELDQGKLQKGQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYL 164

Query: 179 SGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 238
           SG    ++VRGLQG D   LKV ++ KHF A + ++    +RF  N ++S++ + + +  
Sbjct: 165 SGTMGTAFVRGLQGDDARYLKVVSTPKHFAANNEEH----NRFECNPQISEKQLREYYLP 220

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
            F  C+ +GK AS+M +YN +N VP   +  +L + +R +W   GY+VSDC    +  + 
Sbjct: 221 AFEACIKDGKAASIMSAYNAINNVPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNA 280

Query: 299 QHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDINNALVNTLT 352
             +  T E AA  +I+AGLDL+CG      P L     +A ++ ++S+ DI++   + L 
Sbjct: 281 HKYVKTKEAAATLSIKAGLDLECGDDVYYEPLL-----NAYKQYMVSDADIDSTAYHVLK 335

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
            +MRLG+FD    + PY  + P  + +  HQ +ALEAARQ IVLLKN    LPL   + +
Sbjct: 336 ARMRLGLFDNG-KNNPYTKISPSIIGSKLHQRVALEAARQCIVLLKNHNWVLPLDTKKLK 394

Query: 413 TVAVIGPNSDVTVTMIGNYAG 433
           ++AV+G N+       G+Y+G
Sbjct: 395 SIAVVGINAG--NCEFGDYSG 413


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 165/462 (35%), Positives = 246/462 (53%), Gaps = 30/462 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A+ + LP+   +L    R  DL+ RL+L+EKV L+   + AVPRLGIK + WWSEALHG 
Sbjct: 17  ASAQQLPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGY 76

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA------ 148
           +N GP           T FP+ +  A+SF+      +   VSDEARA  N          
Sbjct: 77  ANQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQR 126

Query: 149 --GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCK 205
              L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   S V+GLQG +D    K+ A  K
Sbjct: 127 FHDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAK 186

Query: 206 HFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPT 264
           H+  +    W+   R   N   V+ +D+ +T+   F+  V +  V  VMC+Y +++  P 
Sbjct: 187 HYAVHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEPC 243

Query: 265 CADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF 324
           C +  +L + +R +W     +VSDC ++  +Y++ H +S    A+A A+ +G D++C  +
Sbjct: 244 CGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVECVGY 303

Query: 325 LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
                  AV RGL+ E DIN ++V  +T +  LG  D +    P+  +    V + DHQ+
Sbjct: 304 AFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD-ELVPWTKIPLSVVNSEDHQK 362

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LAL+ AR+ + LL+N    LPLS    + +AVIGPN++ +  + GNY G        L+G
Sbjct: 363 LALDMARETMTLLQNNNNILPLSKSIGK-LAVIGPNANDSQMLSGNYNGTPLRTINILEG 421

Query: 445 IGRYA---RTIHQQGCK--DVACADDQLFGAAIDASRQADAT 481
           I         I+  GC   D    +  +   +ID  +   AT
Sbjct: 422 IKTKLGADHVIYDAGCDLTDDKITESLIGNCSIDGKKGIRAT 463



 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/311 (26%), Positives = 130/311 (41%), Gaps = 49/311 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +   I+  + AD  + V G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 589 YDGIIEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKA 648

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A Y G++GG A+AD+LFG  NP G LP+T
Sbjct: 649 GK-KIVFVNCSGSAI--AMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVT 705

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y    +  LP  ++ +M+        GRTYR+ K   ++PFG G+SYT F   +  A  
Sbjct: 706 FYRN--VQQLPDFSDYSMK--------GRTYRYLKSAPLFPFGFGLSYTTF--NIGEAKL 753

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                      + N    G+AI+            ++V V N G  DG   L V+     
Sbjct: 754 -----------TKNNITKGEAIQ------------LRVPVANAGKTDGTELLQVYIRKVD 790

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
                 K L  F+++ V AG  + V +++    +          R+  G + +  G +  
Sbjct: 791 DPDGASKTLRGFKRIPVSAGKTEMVTLDLPPKTFEFFDPTDAVVRVSPGNYQLLYGESSD 850

Query: 757 SVSLHAATLGV 767
              L +  L +
Sbjct: 851 PKDLKSIQLTI 861


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 153/412 (37%), Positives = 229/412 (55%), Gaps = 21/412 (5%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +T PF   SL   +R  +L+  L+L+EK+  +     A+PRL IKGY +W+EA+HGV+  
Sbjct: 38  QTHPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARS 97

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
           G           ATSFP     +S+++  L        SDEAR   N    GL YW P +
Sbjct: 98  GL----------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTI 147

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR +E  GEDP L+GK A  Y++G+QG D    K  A+ KHF A + +    
Sbjct: 148 NMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDDPKYYKTIATAKHFAANNYEK--- 204

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
             R   ++ +  +++ + +   F M V EG V SVM +YN +NG+P  A+  +L   +R 
Sbjct: 205 -GRHSTSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHELLIDILRT 263

Query: 278 EWRLNGYIVSDCDSVGVYYDT--QHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQR 335
           EW  NG++ SDC +V   Y +   HF +T  EA+A +I  G DL+CG     + + A+++
Sbjct: 264 EWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDYCKEAIEK 323

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           G + E D++ ALV     +  +G FD   S+ P+  +    +   +H++LA +AA++ IV
Sbjct: 324 GYMQEADLDTALVRVFEARFSVGEFDN-ASNVPWRSISDDVLDCEEHRQLAYKAAQEAIV 382

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR 447
           LLKN    LPL   + ++VAVIGP  + T+T+ G Y+G     TTP  GI  
Sbjct: 383 LLKNDNNILPLD--KTKSVAVIGPFGN-TITL-GGYSGSPTALTTPFGGIAE 430



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 141/308 (45%), Gaps = 39/308 (12%)

Query: 455 QGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKV 514
           +GC     A+  L   A + + +AD  I   G D ++  E+ DR  L LPG QQ+L+  V
Sbjct: 591 KGCAVTGTAETNL-ERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAV 649

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
             A+    IL+L +   + + +AK    + AII A Y GQA G AIAD+L+G  NP GKL
Sbjct: 650 YSANPN-VILLLQTCSSVTINWAK--EHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKL 706

Query: 575 PMTWYPQEYITNLP--MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVA 632
             TWY    +++LP  M    +R ++       TY ++    +YPFG+GMSYT F +   
Sbjct: 707 TSTWYNA--LSDLPNGMLNYDIRDAK------YTYMYHDKTPLYPFGYGMSYTTFEYQKL 758

Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFS 692
           N                   IS   +           L V  D+ N G   GA  + +++
Sbjct: 759 N-------------------ISKSRLAAGEE------LIVSADITNTGKYAGAEIVQLYA 793

Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              +    P KQLV F +V +  G  + V + +   +     +++ T  +  G  +I++ 
Sbjct: 794 HVNSSIERPLKQLVGFARVELEPGETKTVTMPLKHEQLSYFNEQTNTFDVEEGSVDIYVA 853

Query: 753 GTKHSVSL 760
            +   V L
Sbjct: 854 ASSADVRL 861


>gi|430749766|ref|YP_007212674.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
           KWC4]
 gi|430733731|gb|AGA57676.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
           KWC4]
          Length = 971

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 170/410 (41%), Positives = 231/410 (56%), Gaps = 25/410 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+  RVNDLIGRL+ +EK+ L+     AVPRLGI  Y+  +EA HG++ +GP 
Sbjct: 8   PFRDPDLPLETRVNDLIGRLTTEEKINLMAQYQDAVPRLGIAPYKHGTEAAHGIAWLGP- 66

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYWSPNVN 158
                    AT+FPQ I  A +++A L   +G V+ DEAR  Y  N    GLT W+P V+
Sbjct: 67  ---------ATTFPQPIGLACTWDADLLRRVGSVIGDEARVWYRRNPAVNGLTLWAPTVD 117

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           + RDPRWGR +E  GEDP L GK A++ +RG+QG     LK  A+ KHF    L N N  
Sbjct: 118 LERDPRWGRTEEAYGEDPFLVGKLASALIRGIQGDHPFYLKAVATLKHF----LGNNNEA 173

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
            R   +  +  +++ + +   F     EG   S+M +YN VNGVP    P ++   ++ E
Sbjct: 174 GRGDKSVSIDPRNMREYYLKVFETAFKEGGALSMMTAYNAVNGVPANLLP-MITSVVKQE 232

Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
           W +NG++VSD  D  G   D  +  S P   AA +IRAG+D   D    +      A++ 
Sbjct: 233 WGMNGFVVSDAFDVTGTVRDHGYMASLPAAVAA-SIRAGIDSITDDAKVVKQAIREALEM 291

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GLL+E D++ AL NT  V+ RLG FD +    PY  +G   +  P+H+ L+LEAAR+ IV
Sbjct: 292 GLLTENDLDVALRNTFRVRFRLGEFDPD-DRNPYAEIGESHMMRPEHEALSLEAARKSIV 350

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           LLKN G  LPLS  R R +AVIGP +DV       Y+G      TPL GI
Sbjct: 351 LLKNDG-VLPLSADRIRKIAVIGPLADVVYR--DWYSGTLPYAVTPLAGI 397



 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 105/234 (44%), Gaps = 38/234 (16%)

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF-AKNDPRIAAIIWAGYP 552
           E +DR  + LPG Q+ L+ +V +A    TI+V++   P  + + A++ P   AI++  + 
Sbjct: 605 ETVDRPDIALPGPQERLIREV-LAVNPNTIVVVIGSYPFALGWTAEHAP---AILYLAHA 660

Query: 553 GQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKG 612
           GQ  G A+A  LFG  NP G+L MTWY       LP   M     Q K    RTY ++ G
Sbjct: 661 GQELGRAVAGTLFGRFNPAGRLNMTWY--RSADQLP-DFMDYDIIQGK----RTYLYFDG 713

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             +YPFG+G+SY+ F +                    + TIS  AI        R+ +G 
Sbjct: 714 EPLYPFGYGLSYSRFEYA-------------------DLTISADAI----GYGERVKIGF 750

Query: 673 QVDVKNVGSKDGAHT-LLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINI 725
              V NV    G     L      +    P +QL  F ++ +  G  +RV   +
Sbjct: 751 T--VTNVSDVYGEEVPQLYIRCEDSRVQRPLRQLAGFARIGLAPGESKRVTFEL 802


>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
 gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
 gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.14.25]
 gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.27]
          Length = 755

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 207/691 (29%), Positives = 346/691 (50%), Gaps = 100/691 (14%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           AT+FPQ I  AS+++  L   +   +  +A+ +   GT      SP +++ RDPRWGR +
Sbjct: 101 ATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI---GTN--QCLSPVLDVCRDPRWGRCE 155

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL-DNWNGVDRFHFNAKVS 228
           ET GED  L      +YV+GLQG +    ++ A+ KHF A+   +    +   H    V 
Sbjct: 156 ETYGEDQYLVASIGLAYVKGLQGEN----ELIATVKHFAAHGFPEGGRNIAPVH----VG 207

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++ + F  PF + +  GK  SVM +Y++++G+P  ++  +L + +R EW   G +VSD
Sbjct: 208 NRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGIVVSD 267

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLD-----LDCGPFLGLHTESAVQRGLLSEIDI 343
            D++         +   +EAA  A+ AG+D     +DC    G     AV+ GL+SE  I
Sbjct: 268 YDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLISESII 324

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+   L ++ +LG+F+    ++   +  P+ +     +ELAL+ AR+ IVLLKN    
Sbjct: 325 DRAVERVLRIKEKLGLFNNHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKNDN-I 380

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNY-------AGIACGYTTPLQGIGRYA----RTI 452
           LPL+     T+AVIGPN++    ++G+Y       A       T L+GI R        +
Sbjct: 381 LPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGIMRKVSNNTNVL 439

Query: 453 HQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-------------------QSIEA 493
           + +GC D+A    + F  AI+ +++ D  I VMG                     Q++  
Sbjct: 440 YAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQAVTG 498

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DR  L LPG Q+EL+ ++    K P ILVL++G P+ ++   N+  + AII A +PG
Sbjct: 499 EGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAWFPG 555

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYK 611
           + GG AIAD++FG  NP G+LP+++         P+ +    P    R P   R Y   K
Sbjct: 556 EEGGNAIADVIFGDYNPSGRLPISF---------PI-DTGQIPIYYNRKPSSLRPYVMMK 605

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
              ++PFG+G+SYT F ++           L+     +N+  SGK             + 
Sbjct: 606 SKPLFPFGYGLSYTEFKYS----------NLEVTPKEVNS--SGK-------------IK 640

Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           + ++V+NVG ++G  T+ L  S   +G   P K+L  F KV++    ++++  ++ + + 
Sbjct: 641 ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYLKPNEKRKITFSLPL-EA 699

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L+  D+     I  G++ I IG +   + L 
Sbjct: 700 LAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730


>gi|270340289|ref|ZP_06007688.2| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270331970|gb|EFA42756.1| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 867

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 168/469 (35%), Positives = 249/469 (53%), Gaps = 37/469 (7%)

Query: 20  SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRL 79
           S L A  PF   P     + LP+   +L   +R  DL GRL+L+EK  L++  + A+PRL
Sbjct: 9   SLLMAVLPFCNMP----AQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRL 64

Query: 80  GIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEA 139
           GIK ++WWSEALHGV+N+G            T FPQ I  A+SFN  L   +    SDE 
Sbjct: 65  GIKRFQWWSEALHGVANMG----------DVTVFPQPIGMAASFNDNLVYQVFNATSDEM 114

Query: 140 RAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGL 190
           RA +N   A         GL+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRGL
Sbjct: 115 RAKWNELQASGKEVTRFHGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGL 174

Query: 191 QGSDGDRLKVAASC-KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGK 248
           QG +  + +   +C KH+  +    W    R   N   VS +D+ +T+   F+  V +  
Sbjct: 175 QGPETAKYRKLWACAKHYAIHSGPEWA---RHTDNVTDVSPRDLWETYMPAFKSLVQDAN 231

Query: 249 VASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA 308
           V  VMC+Y + +  P C +  +L+R +R EW     +VSDC +V  ++ +   +S    A
Sbjct: 232 VREVMCAYQRWDDDPCCGNNRLLQRILRDEWGFKYLVVSDCGAVSDFWTSHKSSSNARNA 291

Query: 309 AADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ 367
           A  A+ AG D++CG  ++      AV+ G ++E +++  +V  L  +  LG  D + S  
Sbjct: 292 ATKAVLAGTDVECGYNYVYKSIPEAVKYGAMTEAEMDEHVVRLLEGRFDLGEMD-DNSIV 350

Query: 368 PYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
           P+  +    +C+  H++L+L+ + Q + LL+N    LPL+  + + +AVIGPN D    M
Sbjct: 351 PWSSIPSSALCSKKHRQLSLDMSLQTMTLLQNSNDVLPLNK-KEKKIAVIGPNVDNEPMM 409

Query: 428 IGNYAGIACGYTTPLQGIG---RYARTIHQQGCKDVACADDQLFGAAID 473
            GNY G      T L GI    R  + +  +GC  V   +DQ   +  D
Sbjct: 410 WGNYNGTPRSTVTILDGIKSRLRKNQLVTFRGCDLV---NDQTLNSYYD 455



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 134/310 (43%), Gaps = 52/310 (16%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + +AI   +  +  + V G+   +E E +          DR  + LP  Q+E +  +  A
Sbjct: 594 YTSAIKQLKDCETVVFVGGISPQLEGEEMPINVDGFKGGDRTDIELPKVQREFLKALKQA 653

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A YPGQ GG A+A +LFG  NP GKLP+T
Sbjct: 654 GKS-VVFVCCSGSAI--ALTPETKTCDAILQAWYPGQEGGEAVARVLFGEYNPSGKLPIT 710

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP   + +M+         RTYR Y    ++PFG+G+SYT+F         
Sbjct: 711 FYKN--TEQLPDFKDYSMK--------NRTYR-YMNDALFPFGYGLSYTDF--------- 750

Query: 637 VVAVPLDGRHGSI-NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPP 695
                      SI +AT+SGK +          T+ ++V VKN+G +DG   + ++   P
Sbjct: 751 -----------SIGDATLSGKTLTPG------ATITMKVPVKNIGKRDGTEVIQIYVKDP 793

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
                P K L  F++V V AG      I +    +      S T R   G + ++ G + 
Sbjct: 794 TDSEGPLKSLKGFQRVPVKAGQTAEATITLDSRTFELFDAGSNTVRAKAGNYEVYYGSSS 853

Query: 756 HSVSLHAATL 765
               L    +
Sbjct: 854 ADKDLKKVNV 863


>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
 gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 864

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 163/435 (37%), Positives = 238/435 (54%), Gaps = 35/435 (8%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+   SL   +R  DLI RL+L+EK  L+   + A+PRLGIK + WWSEALHG +N    
Sbjct: 23  PYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHGYANND-- 80

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG---------LT 151
                     T FP+ I  A+SF+  L   +   VSDE RA YN              L+
Sbjct: 81  --------NVTVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNENKRFLSLS 132

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+ +   S V+GLQG +D    K+ A  KHF  +
Sbjct: 133 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAKHFAVH 192

Query: 211 DLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               W+   R   N   V  +++ +T+   F+  V E  V  VMC+Y +++  P C++  
Sbjct: 193 SGPEWS---RHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQRLDDEPCCSNTR 249

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC----GPFL 325
           +L+R +R EW     +VSDC +V  +Y T   +S    AA+ A+ AG D++C     PF 
Sbjct: 250 LLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGTDVECVWDKYPFK 309

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
            L    AV++ L+ E +IN +L+  L  +  LG  D + S  P+  +    + + +HQ+L
Sbjct: 310 KL--PEAVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPASVLNSKEHQQL 366

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           ALE A++ + LL+N+   LPL+   ++ VAVIGPN+D    + GNY G      T  +GI
Sbjct: 367 ALEMAQKSMTLLQNKNNILPLNKNVNK-VAVIGPNADNEPMLWGNYNGTPNKTITIKKGI 425

Query: 446 -GRYA--RTIHQQGC 457
            G+ A  + ++ + C
Sbjct: 426 EGKLAANKVLYDKSC 440



 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/304 (26%), Positives = 125/304 (41%), Gaps = 48/304 (15%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A I   +  D  I   GL   +E E +          DR  + LP  Q++ + ++  A
Sbjct: 592 FKALIQKLKGIDTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKELKAA 651

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   I V  SG  I  A         AI+ A YPG++GG A+AD+LFG  NP GKLP++
Sbjct: 652 GK-KVIFVNCSGSAI--ALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPAGKLPIS 708

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y              +   +     GRTYR Y   V++PFG G+SY+ F     N    
Sbjct: 709 FYK---------NSDKLGDFEDYSLKGRTYR-YTTDVLFPFGFGLSYSKFDIQAGN---- 754

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
                          +S   IK            ++  +KN   + G   + V+      
Sbjct: 755 ---------------LSKNKIKPNE------DTQLKFSIKNSSKRSGTEIVQVYVRKLND 793

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
              P K L AF+++ + AG +Q V I++    +     ++    I  GE+ ++ G +  +
Sbjct: 794 TEGPLKTLKAFKRIELKAGEKQNVTIDLPASSFEFYDAKTRGMNIASGEYEVYYGNSSDT 853

Query: 758 VSLH 761
             L 
Sbjct: 854 KDLK 857


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 218/678 (32%), Positives = 334/678 (49%), Gaps = 86/678 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  +T  S++N  L+ +I R V+ E RA   GG A    +SP +++ RDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA--QGGAA---TYSPVLDVVRDPRWGRT 177

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L  ++A + V+GLQG   D    + A+ KHF  Y         R      +
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQGERLDSHTSLLATLKHFAGYGASEGG---RNGAPVHM 234

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +++ +   +PFR  V  G + S+M +YN+++GVP  +   +L+  +R  W  +G++++
Sbjct: 235 GLRELHEVDLLPFRKAVESGAL-SIMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVIT 293

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNA 346
           DC ++ +     +   +  EAA  +++AG+D++  G     H + A+++GL++E D+N A
Sbjct: 294 DCGAIHMLACGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRA 353

Query: 347 LVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPL 406
               L ++ RLG+FD  P   P      + +   +H  LA +AA +GIVLLKN+G  LPL
Sbjct: 354 AGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410

Query: 407 SHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGCKDVA 461
                 T+AVIGPN+      +G+Y          T L GI R    +R ++  GC+ + 
Sbjct: 411 DS-SSGTIAVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-IQ 468

Query: 462 CADDQLFGAAIDASRQADATILVMGLDQSIE-------------------------AEAL 496
               + F  A+  + QAD  ++V+G   + +                          E +
Sbjct: 469 GDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGI 528

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR+ L L G Q EL+ ++    K P I+V ++G PI   +   D  I AII A YPGQ G
Sbjct: 529 DRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DEFIPAIIEAWYPGQEG 585

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G AIAD+LFG  NP G+LP++  P+E +  LP++  A      +R  G+ Y        Y
Sbjct: 586 GGAIADMLFGDINPSGRLPLS-IPKE-VGQLPISYNA------RRTRGKRYLETDLAPRY 637

Query: 617 PFGHGMSYTNFVH-TVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           PFG G+SYT F +  +   P VV              I G+A              V++D
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVP-------------IGGEAT-------------VRID 671

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           V N G++DGA  + L  S   A    P K L  F KV + AG  Q V   I   + L ++
Sbjct: 672 VTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-SEQLELI 730

Query: 735 DRSGTRRIPLGEHNIHIG 752
                  +  GE  I +G
Sbjct: 731 GLDLKPVVEPGEFRIQVG 748


>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
 gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
          Length = 871

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 158/436 (36%), Positives = 238/436 (54%), Gaps = 32/436 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           + LP+   +L   +R  DLI RL+L EK  L+   + AVPRLGIK + WWSEALHG +N 
Sbjct: 28  QQLPYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFANN 87

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------- 148
           G            T FP+ +  A+SF+  L   I   VSDEARA YN  T          
Sbjct: 88  G----------NVTVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFL 137

Query: 149 GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHF 207
            L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   S V+GLQG +D    K+ A  KHF
Sbjct: 138 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAKHF 197

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    W+   R   N   +  +D+ +T+   F+  V +  V  VMC+Y +++  P C 
Sbjct: 198 AVHSGPEWS---RHSLNINNLDPRDLWETYLPAFKSLVQKSDVRQVMCAYQRLDDEPCCG 254

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG 326
           +  +L+R +R EW     +V+DC +V  +Y +   +S    AA+  + AG D++C     
Sbjct: 255 NTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSSDAVHAASKGVWAGTDVECQWDNH 314

Query: 327 LHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQE 384
           ++ +   AV +GL++E +IN  L+N L  +  LG  D + +  P+  +    V   +H++
Sbjct: 315 IYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMDDD-ALVPWSKIPMSVVNNEEHRK 373

Query: 385 LALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQG 444
           LAL+ A + + LL+N+   LPLS  + + +AV+GPN++    + GNY G      T L G
Sbjct: 374 LALDMALKSMTLLQNKNNILPLS--KSKKIAVVGPNANDKPMLWGNYNGTPVRTITILDG 431

Query: 445 IGRYA---RTIHQQGC 457
           I       + ++++GC
Sbjct: 432 ITSKLSANKVLYEKGC 447



 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 113/272 (41%), Gaps = 38/272 (13%)

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  + LP  Q+  +  +  A K   I V  SG  I  A         AI+ A Y G++G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGK-QVIFVNCSGSAI--ALIPETESCDAILQAWYGGESG 694

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           G A+AD+LFG  NP GKLP+T+Y          +   +   +     GRTYR+   P ++
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFYK---------SVKQLSDFEDYSMKGRTYRYMSDP-LF 744

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG G+SYT F                      NA +S   IK         ++ + + V
Sbjct: 745 PFGFGLSYTTFAIG-------------------NAKLSNPEIKSNQ------SVELTIPV 779

Query: 677 KNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
            N G + G   + V+         P K L  F++V V AG   +  IN+    +      
Sbjct: 780 SNKGKRSGTEIVQVYVRMVNDTDGPLKTLKGFQRVEVAAGKTAQAVINLPYNSFEFFNRN 839

Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHAATLGVI 768
           +    +  GE+ +  G +  +  L  A + ++
Sbjct: 840 TVKMEVVPGEYEVLYGNSSATKDLKTAKVRIL 871


>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 779

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 208/690 (30%), Positives = 328/690 (47%), Gaps = 97/690 (14%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  +  A++++  + E  G +++ E R        G   + P +++  +PRW R 
Sbjct: 147 GATVFPTGLGMAATWSTDVIEQAGVIIAKEIRLQ-----GGHISYGPVLDLAHEPRWSRV 201

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDL--DNWNGVDRFHFNA 225
           +ET GEDPVLSG  A + V+GL   D  +     A+ KHF AY +     NG        
Sbjct: 202 EETMGEDPVLSGTIAVAQVKGLGAGDITKPFATIATLKHFIAYGIPESGQNGAPSI---- 257

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            +  +D+ D F  PFR  +  G + SVM SYN ++G+P  ++ ++L   +R +W   G++
Sbjct: 258 -IGTRDLLDNFLPPFRRAIDAGAL-SVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFV 315

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINN 345
           VSD  S+   Y T H  S+ +EA  +A+RAG+D+D G         AV++G +SE  I+ 
Sbjct: 316 VSDLYSIDGIYGTHHTVSSLQEAGIEALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDE 375

Query: 346 ALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD----VCTPDHQELALEAARQGIVLLKNQG 401
           A++  L +++ +G+F+      PY  + PK     V T ++ ++A   A + I LLKN  
Sbjct: 376 AVLRILRMKIEMGLFE-----HPY--VNPKTAKTGVRTAENIQVAKRVAEESITLLKNSN 428

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQG 456
             LPLS  ++  +AVIGPN+D    M+G+Y          T L GI      ++  + +G
Sbjct: 429 KLLPLS--KNIKIAVIGPNADNRYNMLGDYTAPQQDSNVKTILDGIRSKLSPSQITYVKG 486

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG-----------------------LDQSIEA 493
           C       +++ G A+ A+R+AD  ++ +G                       +      
Sbjct: 487 CSIRDTVFNEI-GEAVRAAREADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESG 545

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DRA L L G Q  L+  +    K P +++ + G P+D  +A    +  A++ A YPG
Sbjct: 546 EGFDRASLSLMGIQSRLLQSLKETGK-PMVVIYIEGRPLDKTWASE--QADALLTAYYPG 602

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           Q GG AIA++LFG  NP G+LP+T  P+  +  LP+     RP          Y      
Sbjct: 603 QEGGNAIANVLFGDYNPAGRLPIT-VPRS-VGQLPVYYNKKRPVV------HNYVEMAST 654

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
            +YPFG+G+SYT+F               D  H +I              K +     V 
Sbjct: 655 PLYPFGYGLSYTSF---------------DYSHLNIT-------------KKSEEEYEVS 686

Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
            D++N G +DG     L  S   A    P KQL  F ++H+  G  +R+ + I     LS
Sbjct: 687 FDIRNSGERDGDEVAQLYISDKVASVVQPVKQLKGFARIHLKKGETKRITL-ILKKDDLS 745

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
           + DR+  R +  G+  I IG +   + L A
Sbjct: 746 ITDRNMERVVEAGDFEIQIGSSSEDIRLKA 775


>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
          Length = 862

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 167/432 (38%), Positives = 242/432 (56%), Gaps = 31/432 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   QR  DL+GRL+L+EKV+L+ + +  V RLGI  Y+WW+EALHGV+  G  
Sbjct: 23  PYKNPDLSPEQRAEDLLGRLTLEEKVRLMQNESFPVERLGIAHYDWWNEALHGVARAGI- 81

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGG---------TAGLT 151
                    AT FP  +  AS+F+  L E +  VVSDEARA Y+             GLT
Sbjct: 82  ---------ATVFPITMGMASTFDDKLVEDVFTVVSDEARAKYHNAHREGRRGVRCEGLT 132

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   + V GLQG +D    K+ A  KH+  +
Sbjct: 133 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGPADAKYDKLHACAKHYAVH 192

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
                    R +FNA+ +S +D+ +T+   F+  V E  V  VMC+YN+  G P C    
Sbjct: 193 SGPE---AKRHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPCCGSNR 249

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGL 327
           +L + +R EW     +VSDC ++   +Y  +H T     +A+A A+ +G DL+CG     
Sbjct: 250 LLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECGIEYA- 308

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
           H E AV++GL+SE  IN +L   L  +  LG  D + +  P+  +    V    H+++AL
Sbjct: 309 HLEEAVKKGLISEERINTSLRRLLKARFELGEMDDD-ALVPWSKISIDTVDCETHKQMAL 367

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
           +  R+ +VLL N G  LPL+    R +AV+GPN+  +V   GNY G     +T L+GI  
Sbjct: 368 DVTRKSMVLLHNNG-VLPLAKTGTR-IAVMGPNAVDSVMQWGNYKGTPSHTSTILEGIRN 425

Query: 447 RYARTIHQQGCK 458
           +     +++GC+
Sbjct: 426 KIGNVPYEKGCE 437



 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 125/292 (42%), Gaps = 48/292 (16%)

Query: 478 ADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           AD  I V G+   +E E             DR  + LP  Q+ ++  +  A K   I V 
Sbjct: 598 ADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKALKQAGK-KVIFVN 656

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
            SG    VA         AI+ A YPGQAGGTA+AD+LFG  NP GKLP+T+Y       
Sbjct: 657 CSGSA--VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFYKN--TDQ 712

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  E       +    GRTYR+     +YPFG+G+SYT F  +       V        
Sbjct: 713 LPDFE-------NYDMKGRTYRYMTETPLYPFGYGLSYTTFDISKGRLSKNVI------- 758

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV 706
            S N T++ KA                 +VKN G ++G   + V+         P K L 
Sbjct: 759 -STNQTVTFKA-----------------NVKNTGKREGTEVVQVYVRKVGDKEGPVKTLR 800

Query: 707 AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           AF  + + AG    V I++    +      + T RI  G++ I  G +  ++
Sbjct: 801 AFRSIPLKAGKSSVVSIDLPPRTFEFFDPETNTMRILPGDYEIMYGNSSETL 852


>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
           thermohalophila DSM 12881]
          Length = 749

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 224/728 (30%), Positives = 337/728 (46%), Gaps = 100/728 (13%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           +   + PF    L    R++DL+ R++L EKV  L S   +VPRLG+KG     E  HGV
Sbjct: 47  SAQESYPFQNPELDSEARIDDLLSRMTLDEKVSAL-STDPSVPRLGVKGAPH-IEGYHGV 104

Query: 95  SNVGPGT--KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG---GTAG 149
           +  GP      G +    T+FPQ     +++N  L    G + S EAR ++        G
Sbjct: 105 AMGGPANWAPKGDEAVPTTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGG 164

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTA 209
           L   +PN ++ RDPRWGR +E  GEDP L G  A ++ +GLQG D    + A+  KHF A
Sbjct: 165 LVVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGDDDQYWRTASLLKHFLA 224

Query: 210 YDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               N N   R   ++    Q   + +   FR   +EG   + M +YN +NGVP     +
Sbjct: 225 ----NSNENGRESSSSDFDMQLYHEYYGASFRRAFIEGGSNAYMAAYNAINGVPAHVH-D 279

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGV------YYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
           + K      W ++G   +D     +      YYD  +       AA   I+AGL+     
Sbjct: 280 MHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYDDLYL------AAEGVIKAGLN----Q 329

Query: 324 FLGLHTE---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG----PKD 376
           FL  + E    A+  G ++E DI+  L     V ++LG  D +    PY  +G    P  
Sbjct: 330 FLDNYREGVYGALAHGYITEADIDEVLRGVYRVMIKLGQLDPQ-EKVPYSAIGRDGKPAP 388

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
             T  H++ AL  AR+ IVLLKN   +LPL+  +   VAVIG  +D    ++  Y+G+  
Sbjct: 389 WTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLAD--TVLLDWYSGLPP 446

Query: 437 GYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL--------- 487
              TPL+GI    R       K V  A D  + AA++A+ +AD  I+++G          
Sbjct: 447 YRITPLEGI----REKLGNDSK-VLYAPDNDYNAAVEAASEADVAIVILGNYPTCNSEIW 501

Query: 488 ----DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
               D  +  EA+DR  L L     E + K+ M +   TI VL S  P  + +++ +  +
Sbjct: 502 ADCPDPGMGREAIDRKTLRL---TDEYLVKLVMEANPNTIFVLQSSFPYAINWSQQN--V 556

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAMRPSQSKRY 602
            AI+   + GQ  G+A+AD+LFG  NPGGKL  TW   E    LP M E  +R       
Sbjct: 557 PAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSE--DQLPDMMEYDIR------- 607

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFV--HTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            G TY +++   +YPFGHG+SYT F       N P V A                     
Sbjct: 608 KGHTYMYFEDKPLYPFGHGLSYTTFAWEDISINKPVVSA--------------------- 646

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST-PPAGHWAPHKQLVAFEKVHVPAGAQQ 719
                +   + + V +KN G   G   + ++++ P +    P K L  F++V +  G ++
Sbjct: 647 -----DDEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVTLEPGEKK 701

Query: 720 RVGINIHV 727
           ++ I I +
Sbjct: 702 KIEIPIKL 709


>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 721

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 231/755 (30%), Positives = 347/755 (45%), Gaps = 96/755 (12%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF   +L   QR++DL+ R++LQEK++ L      VPRLGI G     E LHG +  GP 
Sbjct: 25  PFQNPALSPDQRIDDLLSRMTLQEKIQAL-GDDPGVPRLGIPG-ALTEEGLHGAAIGGPA 82

Query: 101 --TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG-GTAGLTYWSPNV 157
                G      T FPQ      +++  L +    V + E R   N     GL   +PN 
Sbjct: 83  HWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAPNA 142

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           N+ RDPRWGR +E+ GEDP L G  A ++++GLQG++    + AA  KHF AY     N 
Sbjct: 143 NLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNNPRYWETAALMKHFDAYS----NE 198

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            +R   ++   K+   + + VPFRM + +G   + M SYN  NG+P  A+P +LK  +  
Sbjct: 199 ANRDGSSSNFGKRLFYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMTANP-VLKSVVMK 257

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLH---TESAVQ 334
           +W  NG I +D  ++       H+  T  EAAA A+ AG++     FL  +    E A+Q
Sbjct: 258 KWGFNGIICTDAGALSNMVTHFHYYKTMPEAAAGAVHAGINQ----FLDRYQQPVEEALQ 313

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLG----------PKDVCTPDHQ 383
           + LL+E  I+  L     V +RLG+ D  PSS  PY  +G          P D   P H 
Sbjct: 314 QKLLTEQQIDQDLKGVYRVVLRLGLMD--PSSMSPYSMIGLTNDNPAKGDPWD--WPSHI 369

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQ 443
            L  +   + IVLLKNQ  +LPL   +  ++AVIGP +++    +  Y+G      TP++
Sbjct: 370 ALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDWYSGTPPFGVTPVE 427

Query: 444 GIGRYARTIHQQGCKDVACA--DDQLFGAAIDASRQADATILVMGLDQSIEA-------- 493
           G       I Q+   DV     D     AA   ++Q+D  I+++G   + +A        
Sbjct: 428 G-------IRQRVGPDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWGKCALP 480

Query: 494 ----EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
               EA DR  L LP    E ++K   A+   T++VL +  P    + +    I AI+  
Sbjct: 481 SEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDWTQ--AHIPAILEM 535

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYR 608
            +  +  GTA+AD+LFG  +P G+L  TW     I  L PM +  +R        GRTY 
Sbjct: 536 AHNSEEQGTALADVLFGDYDPAGRLAQTWVAS--IGQLPPMMDYNIR-------DGRTYM 586

Query: 609 FYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
           + K   +YPFG G+SYT F ++                   N  +S      +H      
Sbjct: 587 YLKSKPLYPFGFGLSYTTFKYS-------------------NLRLS------SHTLPAGG 621

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHV 727
            L V VDV N G  +G   + ++        + P + L  F++V +P G  + V + +  
Sbjct: 622 QLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPVGQTRTVTLPLKA 681

Query: 728 CKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHA 762
                    +   R+      I IG +   +   A
Sbjct: 682 SALAYWDKNTHAFRVEPDHVQIRIGASSADIRQKA 716


>gi|379722647|ref|YP_005314778.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           3016]
 gi|378571319|gb|AFC31629.1| glycoside hydrolase family 3 domain-containing protein
           [Paenibacillus mucilaginosus 3016]
          Length = 937

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 165/430 (38%), Positives = 243/430 (56%), Gaps = 32/430 (7%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           T   +PF    LP+ +RV DL+ RL+L+EK+ L+      +PRLGI  Y+  +E  HGV+
Sbjct: 3   TETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVA 62

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
            +G           AT FPQ +  A ++N  L   IG V+ DEAR  Y  N    GLT W
Sbjct: 63  WLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIW 112

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +P V++ RDPRWGR +E  GEDP L+G+ +   V+G+QG     LK AA+ KHF    L 
Sbjct: 113 APTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAATLKHF----LG 168

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           N N +DR   +A +  +++ + +   F     EG   S+M +YN VNG P   +P++   
Sbjct: 169 NNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDV-NA 227

Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLDL---DCGPFLGLHT 329
            ++GEW ++G++VSD  D +G   + ++F S  E  AA ++R+G+D    D G  L    
Sbjct: 228 IVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAA-SVRSGIDSITDDVGITL-RAI 285

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
             A++ GLL+E D++ A+ N   V++RLG FD  P   PY H+    +C P+H  L+L+A
Sbjct: 286 RDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDS-PEENPYAHVPEAKLCAPEHAALSLQA 344

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY- 448
           AR+G VLLKN+G  LPL+  +  ++AVIGP +DV  T    Y+G      TPL+G+    
Sbjct: 345 AREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERM 399

Query: 449 --ARTIHQQG 456
                IH+ G
Sbjct: 400 APGTVIHRTG 409



 Score = 99.4 bits (246), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 143/298 (47%), Gaps = 46/298 (15%)

Query: 470 AAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           AA++A+RQA+  I+ +G    I   E +DR  L L   ++ L+  V +A+   T+ VL+ 
Sbjct: 546 AAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAKEALLQAV-LAANPRTVAVLVG 604

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
             P  V +A+    + AI++  + GQ  G A AD+LFG  +P G+L MTWY  +   +LP
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWY--KSAADLP 660

Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVH-TVANAPTVVAVPLDGRH 646
            + +  +   +      RTY+++ G V+YPFGHG+SYT F +  V  AP+         +
Sbjct: 661 DLLDYDIIKGK------RTYQYFDGEVLYPFGHGLSYTEFRYLEVQAAPS--------EN 706

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQL 705
           G+ N T++                     V+N GS+ G   + ++          P K L
Sbjct: 707 GAFNVTVT---------------------VQNAGSRAGDEVVQLYVKAGESRVPRPLKTL 745

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSLHA 762
             F ++H+  G  ++V   +   + L+  D +  +  +  G +    GG+  ++ L A
Sbjct: 746 AGFRRIHLLPGQTEQVEFTVEAPQ-LAFWDVTREKYCVEEGTYRFMAGGSSGNLPLAA 802


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 169/450 (37%), Positives = 250/450 (55%), Gaps = 54/450 (12%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M +     F GL  L A +    A+E +    KD             P+ +R+ DL+ RL
Sbjct: 1   MRTLKKIVFTGLFALIACAGVAQAQELY----KDEKA----------PMHERIMDLLSRL 46

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +++EK+ LL + +  +PRL I  Y   +EALHGV  V PG +F       T FPQ I  A
Sbjct: 47  TVEEKISLLRATSPGIPRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 96

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
           +++N  L   +  V+SDEARA +N    G          LT+WSP VN+ RDPRWGR  E
Sbjct: 97  ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 156

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           T GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++    +RF  N ++S++
Sbjct: 157 TYGEDPYLSGVMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 212

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
            + + +   F  CV +GK AS+M +YN +N VP   +  +L + +R +W   GY+VSDC 
Sbjct: 213 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCG 272

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
              +  +   +  T E AA  +I+AGLDL+CG      P L     SA ++ +++  DI+
Sbjct: 273 GPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEPLL-----SAYRQYMVTNADID 327

Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           +A    L  +M+LG+FD GE    PY  + P  V +  HQE+AL AAR+ IVLLKNQ   
Sbjct: 328 SAAYRVLRARMQLGLFDSGE--KNPYTKISPAVVGSAKHQEVALNAARECIVLLKNQKKM 385

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
           LPL+  + +++AV+G N+       G+Y+G
Sbjct: 386 LPLNAKKVKSIAVVGINA--GNCEFGDYSG 413



 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 142/298 (47%), Gaps = 45/298 (15%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + V+G+++SIE E  DR  + LP  Q E + ++   +  P I+V+
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVN--PNIVVV 648

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  + AI+ A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 649 LVAGS-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 705

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT F ++            DG  
Sbjct: 706 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 752

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                  + V   +KN G   G     V+   P      P K+L
Sbjct: 753 ----------------------EINVSFQLKNAGKYAGDEVAQVYVKLPERDEVMPVKEL 790

Query: 706 VAFEKVHVPAGAQQRVGINIH--VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
             FE+V + +G  +++ + +   + +Y    +  G    P G++ I +G +   + L 
Sbjct: 791 KGFERVALKSGENKKMTLKLRKDLLRYWD--EAKGKFVYPSGDYTIMVGASSADIRLQ 846


>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 796

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 219/717 (30%), Positives = 336/717 (46%), Gaps = 111/717 (15%)

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
            EA+HG   VG            T FP  I  AS++N  L + +  V++ E RA  +   
Sbjct: 135 EEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAKEIRAQGSN-- 181

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLK----VAAS 203
              T + P ++I R+PRW R +ET GEDP L  +   S V G QGS    LK    VAA+
Sbjct: 182 ---TAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGSHESDLKSNEHVAAT 238

Query: 204 CKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
            KHF AY      GV     N     + ++D+   +  P +  V  G V SVM +Y+ ++
Sbjct: 239 LKHFAAY------GVSEGGHNGAAVHIGQRDLFQNYMYPVKEAVDNG-VMSVMTAYSSID 291

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVP+ A  N+L   ++ +W   G+++SD  S+       H   T E+AAA A+ AG+D+D
Sbjct: 292 GVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHIVDTEEDAAAMAMNAGVDVD 351

Query: 321 CGPFLGLHTE--SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH--LGPKD 376
            G   G       AV  G ++E  I+ A+   LTV+ +LG+F+      PY +     K 
Sbjct: 352 LGGN-GYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGLFE-----NPYANEKQAEKI 405

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
           V   +H ELA E ARQ I +LKN+   LPL+    + +AVIG N+D+    +G+Y     
Sbjct: 406 VRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNADMQYNQLGDYTAPQS 464

Query: 437 --GYTTPLQGIGRYARTIHQQGCKDVACADDQLFG--AAIDASRQADATILVMG------ 486
                T L+GI       + +  K  A  D       AA++A++ A+  I+V+G      
Sbjct: 465 EENIITVLEGIQHKMPNANIEYVKGTAVRDTTQTNIPAAVEAAKNAEVAIVVLGGSSARD 524

Query: 487 --------------------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
                               L      E  DR+ L L G+Q EL+  V +A+  PT+LVL
Sbjct: 525 FKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAV-VATGTPTVLVL 583

Query: 527 MSGGPIDVAF-AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
           + G P+ + + A+N P    I+ A YPGQ GG+AIAD++FG  NP G+LP++  P+  + 
Sbjct: 584 IKGRPLLLNWPAENVP---VILDAWYPGQEGGSAIADVIFGDFNPAGRLPVS-VPKS-LG 638

Query: 586 NLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
            +P+      P++      R Y       +YPFG+G+SY+ F                 +
Sbjct: 639 QIPVYYNYWFPNR------RDYVETDAKPLYPFGYGLSYSEF-----------------K 675

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQ 704
           +  +    SGK             + + + + N    DG   + L      +   +P KQ
Sbjct: 676 YSDLKVATSGKG--------RNTKIEISLKISNTSKVDGDEVIQLYIRDMVSTVLSPVKQ 727

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
           L AFE+V + AG  + V   + + K LS+ D    +++  GE  + IG +   + L 
Sbjct: 728 LRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKVQAGEFKLMIGASSEDIRLE 783


>gi|294462719|gb|ADE76904.1| unknown [Picea sitchensis]
          Length = 304

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 131/297 (44%), Positives = 195/297 (65%), Gaps = 5/297 (1%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           +FG A++ +++ D  I+V+GLDQ+ E E  DR  L LPG+Q+ LV +VS A+K P + V+
Sbjct: 1   MFGEAVETAKKVDEVIVVVGLDQTQEKEERDRIKLTLPGQQENLVYQVSRAAKRPVVFVI 60

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +SGGP+DV+FA NDP+I++IIWAGYPGQAGG A+A+I+FG  NPGG+LPMTWYPQ+++  
Sbjct: 61  LSGGPVDVSFAVNDPQISSIIWAGYPGQAGGQALAEIIFGDYNPGGRLPMTWYPQDFV-K 119

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           +PMT+M MRP+    YPGRTYRFY G  V+ FG G+SY+ + +  ++  T+  + L+   
Sbjct: 120 IPMTDMNMRPNHDTGYPGRTYRFYTGKKVFEFGQGLSYSAYSYNFSST-TIQKIDLNVTM 178

Query: 647 GSINATIS-GKA-IKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-APHK 703
               A  + GK  ++V +  C +L     + V+N    DG H +L++S  PA H  AP K
Sbjct: 179 EHFEALGNRGKGHVRVENTPCRKLKFRSSIFVRNHDKMDGRHAVLLYSKSPATHKGAPQK 238

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           QL+ F  VHV      +V   +  C + S V+ +G R + +G H++ +G T++ VSL
Sbjct: 239 QLIGFRSVHVQGKHTAKVTFVVKPCDHFSTVEENGQRLLAIGSHSLIVGDTQYPVSL 295


>gi|156933295|ref|YP_001437211.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
 gi|156531549|gb|ABU76375.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
          Length = 757

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 110 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 163

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 164 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 218

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 219 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 275

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G ++  +
Sbjct: 276 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 333

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 334 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 388

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 389 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGHKA 446

Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
           + ++ +G       DD+                      +   A++ ++Q+D  + V+G 
Sbjct: 447 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 503

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
            Q +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+
Sbjct: 504 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAIL 560

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
              + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +
Sbjct: 561 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 618

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           Y  R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++       
Sbjct: 619 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 666

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
                         V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q 
Sbjct: 667 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 712

Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           V   I+I   K+ +   R      P G+ N+ IG
Sbjct: 713 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 743


>gi|429087308|ref|ZP_19150040.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
 gi|426507111|emb|CCK15152.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
          Length = 765

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 213/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +    ++ G +S  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKSGKVSMEE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A   AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 674

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q V  
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 151/413 (36%), Positives = 236/413 (57%), Gaps = 26/413 (6%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R+NDLI  ++ +EK+K L + A ++PRLG++ Y +W+E+LHGV              GA
Sbjct: 115 ERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLA-----------EGA 163

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
           TSFPQ I   ++++  L   +   VSDEARA+      GLTYWSP +NI RDPRWGR +E
Sbjct: 164 TSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPRWGRNEE 223

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF-NAKVSK 229
           +  EDP L  +   ++++G+QG     LK  A+ KHF A      N  +R H  ++ V  
Sbjct: 224 SYSEDPYLLSRMGVAFIKGMQGDHPYYLKTVATPKHFIAN-----NEEERRHTGSSDVDM 278

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +++ + +   F+  ++E +  S+M +YN++N VP+ A+  ++   +R +W   GY+VSDC
Sbjct: 279 RNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRRQWGFEGYVVSDC 338

Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVN 349
            ++        F  T  EA A +I AG DL+CG       + A+  GLL E DI++AL  
Sbjct: 339 GAIHDMLYGHKFFKTGAEAVARSILAGCDLNCGQAYREFIKDALDEGLLREKDIDSALFR 398

Query: 350 TLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHI 409
            L+ + RLG FD  P   PY  +G   + + +++ LAL+AAR+ IVLLKN    LP+   
Sbjct: 399 VLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARKSIVLLKNND-ILPIDKS 456

Query: 410 RHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTI-----HQQGC 457
           + +++AVIGPN+      +G Y+G      +PL+GI   A ++     + +GC
Sbjct: 457 KIKSIAVIGPNA--REAQLGIYSGFPNVLISPLEGIKNKADSLDIRVGYVKGC 507



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 140/320 (43%), Gaps = 50/320 (15%)

Query: 445 IGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
           +G  AR     G KD        F  A   + + D  ILV+G+   I  E LDR  + LP
Sbjct: 643 LGATARLTWDLGQKD--------FEKAKKIAAENDLVILVLGITPGISQEELDRKEIELP 694

Query: 505 GRQQELVSKVSMASKGPTI-LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
             Q+ELV +   A   P I +VL++GGP  VA A  +    AI+   Y G+ GG A+AD+
Sbjct: 695 SVQRELVKQT--AEVNPNIVIVLVNGGP--VALAGAEKYAKAIVENWYNGEFGGQALADV 750

Query: 564 LFGTSNPGGKLPMTWYPQ-EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGM 622
           LFG  NPGGKLP T+Y   E +  +   ++   P        RTY +     ++PFGHG+
Sbjct: 751 LFGDYNPGGKLPQTFYASTEQLPPMSDYDIINNP--------RTYMYLNEQALFPFGHGL 802

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGS 681
           SYT F +                           ++K+     N   TL +Q  + NVG+
Sbjct: 803 SYTTFKY--------------------------DSLKIVSNTLNETDTLSLQFRLTNVGN 836

Query: 682 KDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           ++G   + ++ S   A    P KQL  F ++ +  G  + +   I V +           
Sbjct: 837 RNGDEVVQIYASCKDAKFKVPRKQLKRFRRLTLQTGESKVLEFKIPVDELAFYSTYENDF 896

Query: 741 RIPLGEHNIHIGGTKHSVSL 760
            +  G   I IG +   + L
Sbjct: 897 VVEKGAWEILIGSSSEDIRL 916


>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 862

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 158/431 (36%), Positives = 233/431 (54%), Gaps = 30/431 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L    R  DL  RL+L+EK  L+   + A+PRLGIK + WWSEALHG +N G  
Sbjct: 24  PYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALHGYANNG-- 81

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLT 151
                     T FP+ I  A+SFN T+   +    SDEAR  YN   A          L+
Sbjct: 82  --------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKEDTRFHSLS 133

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+       V+GLQG +  + +   +C KHF  +
Sbjct: 134 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPESTKYRKLYACAKHFAVH 193

Query: 211 DLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               +    R   N A +S +D+ +T+   F+  V +  V  VMC+Y +++  P C +  
Sbjct: 194 SGPEYT---RHTANLADISPRDLWETYLPAFKATVQQAGVREVMCAYQRLDDEPCCGNSR 250

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLH 328
           +L++ +R EW     +VSDC ++  +Y   H +S    AAA    AG D++CG  +  + 
Sbjct: 251 LLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVHAAAKGTLAGTDVECGFGYAYMK 310

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALE 388
              AV+RGL+SE +++  ++  L  +  LG+ D +P    +  + PK V +  H++LAL 
Sbjct: 311 LPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD-DPKLVSWTKISPKVVDSDAHRQLALN 369

Query: 389 AARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GR 447
            ARQ + LL+N+   LPL+  +   +AV+GPN+     + GNY G     TT L+GI  +
Sbjct: 370 MARQTMTLLQNRNNVLPLA--KGEKIAVVGPNAADGPMLWGNYNGTPSRTTTILEGIRAK 427

Query: 448 YARTI-HQQGC 457
             + I + QGC
Sbjct: 428 AGKDIPYLQGC 438



 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/311 (27%), Positives = 133/311 (42%), Gaps = 50/311 (16%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +  AI   R     + V G+   +E E +          DR  + LP  Q++ +  +  A
Sbjct: 590 YATAIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAA 649

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A Y G+ GG A+AD+L+G  NPGGKLP+T
Sbjct: 650 GK-TVVFVNCSGSAI--ALTPEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVT 706

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y     T LP   + +M+        GRTYR++    ++PFG+G+SYT F         
Sbjct: 707 FY--RSTTQLPAFDDYSMK--------GRTYRYFSD-ALFPFGYGLSYTRFAIG------ 749

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                     GS++A       KVT          + V V NVG + G   + V+     
Sbjct: 750 ---------KGSLSAPAMKADGKVT----------LTVPVSNVGKRTGDEVVQVYVRDVN 790

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AF +V + AG  ++V I +    +      S T R   G++ ++ G +  
Sbjct: 791 DADGPLKSLKAFRRVSLKAGESRKVTIPLTAETFSLFDSASNTVRTKPGKYVVYYGTSSR 850

Query: 757 SVSLHAATLGV 767
              L   ++ +
Sbjct: 851 DSDLKQLSMTI 861


>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
 gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
          Length = 858

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 177/490 (36%), Positives = 253/490 (51%), Gaps = 39/490 (7%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           + I    L  + +  +SS LA  +P        T    P+   +L    R  DL+ RL+L
Sbjct: 2   NKIKTILLSFVTVMTASSALA--QP------SLTGMAYPYQNPNLSALTRAQDLLSRLTL 53

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           +EK  L++  + A+PRLGIK + WWSEALHG +N+G            T FP+ I  A+S
Sbjct: 54  EEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG----------NVTVFPEPIAMAAS 103

Query: 123 FNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWGRGQETPG 173
           FN  L   +    SDE RA Y     NGG       L+ W+PNVNIFRDPRWGRGQET G
Sbjct: 104 FNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYG 163

Query: 174 EDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQD 231
           EDP L+     + VRGLQG  D    K+ A  KH+  +    +    R   N   VS +D
Sbjct: 164 EDPYLTAVMGTAVVRGLQGPEDSKYRKLWACAKHYAVHSGPEYT---RHTANLNNVSPRD 220

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           + +T+   F+  V E KV  VMC+Y  ++  P C +  +L++ +R EW     +VSDC +
Sbjct: 221 LWETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGA 280

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNT 350
           V   +     +S    A A A  AG D++CG  +       AVQRGL+SE +++  ++  
Sbjct: 281 VSDIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRL 340

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
           L  +  LG  D +P+  P+  +    + +  H++L+L+ ARQ IVLL+N+   LPL    
Sbjct: 341 LEGRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN 399

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK--DVACADDQL 467
            R +AVIGPN+D    M GNY G      T L GI  ++    + +GC   D     D L
Sbjct: 400 ER-IAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCDLTDTMVV-DPL 457

Query: 468 FGAAIDASRQ 477
           F A + A ++
Sbjct: 458 FDACMIAGKR 467



 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/261 (32%), Positives = 117/261 (44%), Gaps = 52/261 (19%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +   I A +  +  + V G+  ++E E +          DR  + LP  Q++ +  +  A
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A Y GQ GGTA+AD+LFG  NP GKLP+T
Sbjct: 658 GK-QLVYVNCSGSAI--ALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y       LP  E       +    GRTYR++  P+ + FGHG+SYT F     N  T 
Sbjct: 715 FYKNS--NQLPDYE-------NYSMKGRTYRYFSDPL-FAFGHGLSYTTF-----NMGT- 758

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
                        A I  KA  +           V++ V+NVGSKDG  T+L++      
Sbjct: 759 -------------AEIIKKADSIV----------VRIPVENVGSKDGTETVLLYIKNHQD 795

Query: 698 HWAPHKQLVAFEKVHVPAGAQ 718
              P K L  F +V V AG Q
Sbjct: 796 PNGPIKSLRGFSRVFVKAGHQ 816


>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
 gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
          Length = 858

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 177/490 (36%), Positives = 253/490 (51%), Gaps = 39/490 (7%)

Query: 3   STIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSL 62
           + I    L  + +  +SS LA  +P        T    P+   +L    R  DL+ RL+L
Sbjct: 2   NKIKTILLSFVTVMTASSALA--QP------SLTGMAYPYQNPNLSALTRAQDLLSRLTL 53

Query: 63  QEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASS 122
           +EK  L++  + A+PRLGIK + WWSEALHG +N+G            T FP+ I  A+S
Sbjct: 54  EEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG----------NVTVFPEPIAMAAS 103

Query: 123 FNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWGRGQETPG 173
           FN  L   +    SDE RA Y     NGG       L+ W+PNVNIFRDPRWGRGQET G
Sbjct: 104 FNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYG 163

Query: 174 EDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSKQD 231
           EDP L+     + VRGLQG  D    K+ A  KH+  +    +    R   N   VS +D
Sbjct: 164 EDPYLTAVMGTAVVRGLQGPEDSKYRKLWACAKHYAVHSGPEYT---RHTANLNNVSPRD 220

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           + +T+   F+  V E KV  VMC+Y  ++  P C +  +L++ +R EW     +VSDC +
Sbjct: 221 LWETYLPAFKTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGA 280

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNT 350
           V   +     +S    A A A  AG D++CG  +       AVQRGL+SE +++  ++  
Sbjct: 281 VSDIWQNHKTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRL 340

Query: 351 LTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIR 410
           L  +  LG  D +P+  P+  +    + +  H++L+L+ ARQ IVLL+N+   LPL    
Sbjct: 341 LEGRFDLGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN 399

Query: 411 HRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK--DVACADDQL 467
            R +AVIGPN+D    M GNY G      T L GI  ++    + +GC   D     D L
Sbjct: 400 ER-IAVIGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGCDLTDTMVV-DPL 457

Query: 468 FGAAIDASRQ 477
           F A + A ++
Sbjct: 458 FDACMIAGKR 467



 Score =  100 bits (248), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 83/259 (32%), Positives = 116/259 (44%), Gaps = 52/259 (20%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +   I A +  +  + V G+  ++E E +          DR  + LP  Q++ +  +  A
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A Y GQ GGTA+AD+LFG  NP GKLP+T
Sbjct: 658 GK-QLVYVNCSGSAI--ALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714

Query: 578 WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           +Y       LP  E       +    GRTYR++  P+ + FGHG+SYT F     N  T 
Sbjct: 715 FYKNS--NQLPDYE-------NYSMKGRTYRYFSDPL-FAFGHGLSYTTF-----NMGT- 758

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
                        A I  KA  +           V++ V+NVGSKDG  T+L++      
Sbjct: 759 -------------AEIIKKADSIV----------VRIPVENVGSKDGTETVLLYIKNHQD 795

Query: 698 HWAPHKQLVAFEKVHVPAG 716
              P K L  F +V V AG
Sbjct: 796 PNGPIKSLRGFSRVFVKAG 814


>gi|260598593|ref|YP_003211164.1| beta-D-glucoside glucohydrolase [Cronobacter turicensis z3032]
 gi|260217770|emb|CBA32207.1| Periplasmic beta-glucosidase [Cronobacter turicensis z3032]
          Length = 765

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 214/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G +S  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A   AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + +  +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L LP  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDLTLPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 674

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 158/425 (37%), Positives = 231/425 (54%), Gaps = 18/425 (4%)

Query: 32  PKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEAL 91
           P   +T+   +   S    +R  DLI RL+L+EK  LL +  AA+PRLGIK    WSEAL
Sbjct: 23  PAKVSTKKPIYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEAL 82

Query: 92  HGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT 151
           HG+  +G   +  G   G TSFP  +   S+++  L +     ++DEARA+   GT GLT
Sbjct: 83  HGI--LGGANQSVG-ISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLT 139

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           YWSP V   RDPRWGR  E+ GEDP L+ + A  +VRG+ G+D   LK     KH+ A  
Sbjct: 140 YWSPVVEPIRDPRWGRTGESYGEDPFLAAEIAGGFVRGMVGNDPTYLKSVPCAKHYFA-- 197

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             N +  DR   ++ +  +D+ + +  P++  + +  + S+M SYN VNGVPT A    L
Sbjct: 198 --NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYL 255

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTES 331
               R  + L GYI  DC ++   Y   ++  T EEA A  ++AG+D DCG     +  +
Sbjct: 256 DTIARRTYGLKGYITGDCAAIEDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIYQRYAIA 315

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAAR 391
           A+++GL++  DI+ AL+N   V+MR G FD  P+   Y    P  V +P ++ LA E A 
Sbjct: 316 ALKKGLITMADIDRALLNIFIVRMRTGEFD-PPAKVLYAQFQPNIVNSPANKALAKEIAT 374

Query: 392 QGIVLLKN------QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQ 443
           +  VLLKN         +LPL+    + +A+IGP++D     +G Y+G        TP  
Sbjct: 375 KTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHAD--KVELGPYSGRPAQENMITPFA 432

Query: 444 GIGRY 448
           GI +Y
Sbjct: 433 GIKKY 437



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 108/297 (36%), Positives = 148/297 (49%), Gaps = 36/297 (12%)

Query: 461 ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG 520
           A AD ++    I  +  AD  ++ +G D+    E  DR  LLLPG Q EL+  V+ A   
Sbjct: 594 APADAKIDEQTIKTAAAADVAVVFVGTDEKTATEEADRLTLLLPGNQVELIKAVA-AVNP 652

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
            TI+V+ + G ++V   KN   I  IIW GY GQA G AIA +LFG  NPGGKL  TWY 
Sbjct: 653 NTIVVMQTLGCVEVEEFKNLQNIPGIIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYK 712

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
              + +LP +T+  +R    K   GRT+ ++   V Y FG GMSYT F ++         
Sbjct: 713 S--VKDLPEITDYTLRGGNGKN--GRTFWYFDKDVSYEFGFGMSYTTFEYS--------- 759

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STP--PA 696
                     N  IS  +I + H K     + V VDVKN G  +G   + V+  TP  PA
Sbjct: 760 ----------NFRISKNSI-IPHDK-----ITVSVDVKNTGKVEGDEVIQVYMKTPDSPA 803

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR-SGTRRIPLGEHNIHIG 752
               P K+L  F++V +PAG  + V I+I+ C  L   D    T     G++   IG
Sbjct: 804 SLQRPIKRLKGFKRVTLPAGQTKTVNIDIN-CADLWFWDMDKNTTTFDQGKYVFEIG 859


>gi|429101023|ref|ZP_19162997.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
 gi|426287672|emb|CCJ89110.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
          Length = 759

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 214/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 277

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G +S  +
Sbjct: 278 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEE 335

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A   AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 390

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 448

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + +  +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 449 KILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 565

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT---------- 668

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 717

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 718 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 745


>gi|429105352|ref|ZP_19167221.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
 gi|426292075|emb|CCJ93334.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
          Length = 765

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 674

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q V  
Sbjct: 675 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|429110963|ref|ZP_19172733.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
 gi|426312120|emb|CCJ98846.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
          Length = 759

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 277

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G ++  +
Sbjct: 278 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 335

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 390

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKKVAGDKA 448

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDMVVAVVGEAQG 508

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 565

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 668

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q V  
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 717

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 718 PIDIDALKFWN--QRMKYDAEP-GKFNVFIG 745


>gi|389840350|ref|YP_006342434.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
 gi|417792062|ref|ZP_12439471.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
 gi|429121663|ref|ZP_19182278.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
 gi|333953864|gb|EGL71757.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
 gi|387850826|gb|AFJ98923.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
 gi|426323856|emb|CCK13015.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
          Length = 765

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454

Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
           + ++ +G       DD+                      +   A++ ++Q+D  + V+G 
Sbjct: 455 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 511

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
            Q +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+
Sbjct: 512 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAIL 568

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
              + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +
Sbjct: 569 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 626

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           Y  R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++       
Sbjct: 627 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 674

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
                         V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q 
Sbjct: 675 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 720

Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           V   I+I   K+ +   R      P G+ N+ IG
Sbjct: 721 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|429084349|ref|ZP_19147356.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
 gi|426546667|emb|CCJ73397.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
          Length = 759

 Score =  269 bits (687), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 210/689 (30%), Positives = 333/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RGEW   G 
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDSWLLKDLLRGEWGFKGI 277

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T++ PE+A   AI++G+D+     +   +    ++ G ++  +
Sbjct: 278 TISDHGAIKEL--IKHGTASGPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKNGSVTMAE 335

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 390

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 391 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSITLLQGMKNVAGDKA 448

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K + +  AI+   
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKENQQADAILETW 565

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  LP+      T     P +  +Y  
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQLPVYYSHLNTGRPYNPEKPNKYTS 623

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSSPTMKRDGKVTASVT---------- 668

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                      V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q V  
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQTVSF 717

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V        R      P G+ N+ IG
Sbjct: 718 PIDVDALKFWNQRMKYDAEP-GKFNVFIG 745


>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 812

 Score =  268 bits (685), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 220/689 (31%), Positives = 327/689 (47%), Gaps = 89/689 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G    + VRG QG    D   V A+ KHF +Y    W         A +
Sbjct: 223 EETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY---GWTEGGHNGGTAHI 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  +VG   +         EAA  A+ AG+D D G    ++ E   +AV+RG ++   I+
Sbjct: 339 DLYAVGGLRE-HGVAGNDYEAAIKAVNAGVDSDLGT--NVYAEQLVAAVKRGDVAVATID 395

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
            A+   L+++ ++G+FD     +       + V + +H  LA E ARQ IVLLKN+   L
Sbjct: 396 KAVRRILSLKFQMGLFDDPFVDEKQ---AVQLVASSEHTGLAREVARQSIVLLKNKDKLL 452

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT--TPLQGI----GRYARTIHQQGCK 458
           PL     RT+AVIGPN+D    M+G+Y       T  T L GI     +  R ++ +GC 
Sbjct: 453 PLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA 511

Query: 459 DVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------AEA 495
            V  +    F  AI+ +R ADA ++VMG     D S E                    E 
Sbjct: 512 -VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEG 570

Query: 496 LDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQA 555
            DRA L L GRQ EL+ ++S   K    +VL+ G P+ +  A  +    AI+ A YPG  
Sbjct: 571 YDRATLHLMGRQLELLEEISRLGKP---VVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQ 625

Query: 556 GGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVV 615
           GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G   
Sbjct: 626 GGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTRRKGNRSRYVEE-----PGTPR 678

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVD 675
           YPFG+G+SYT F +T                           ++VT    +   + V V 
Sbjct: 679 YPFGYGLSYTTFSYT------------------------DMKVQVTEG-SDDCWVDVTVT 713

Query: 676 VKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVV 734
           ++N G+ DG     L F    +    P KQL AF ++H+ AG  + V   +   K L++ 
Sbjct: 714 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLD-KKSLALY 772

Query: 735 DRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            + G   +  G   I +GG+   ++   A
Sbjct: 773 MQEGEWVVEPGRFTIMVGGSSEDITCRQA 801


>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
 gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
          Length = 954

 Score =  268 bits (685), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 362/760 (47%), Gaps = 105/760 (13%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--AVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   GVP      +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 385

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  D++N     L    R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 446 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
           L+N+   LPLS    RT+AV+GP +D      G+Y      G      T ++G +G+  +
Sbjct: 505 LENKDNLLPLSKTL-RTIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 561

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
            +++QGC D    D+     A+ A+ Q+D  I+V+G   + EA         E  D A L
Sbjct: 562 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 620

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           +LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+A
Sbjct: 621 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 677

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG G
Sbjct: 678 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 730

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT+F ++                   N  I  KA            + VQ  VKNVGS
Sbjct: 731 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 762

Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           + G     ++ T     +A  K    +L  F ++H+  G  + V   +     +S+++  
Sbjct: 763 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 818

Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
             R +  GE  I +GG           KHSV       GV
Sbjct: 819 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGV 858


>gi|449307585|ref|YP_007439941.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
 gi|449097618|gb|AGE85652.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
          Length = 765

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 212/694 (30%), Positives = 337/694 (48%), Gaps = 112/694 (16%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +RG+W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   AI++G+D+     +   +  + ++ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454

Query: 450 RTIHQQGCKDVACADDQ----------------------LFGAAIDASRQADATILVMGL 487
           + ++ +G       DD+                      +   A++ ++Q+D  + V+G 
Sbjct: 455 KILYAKGAN---VTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGE 511

Query: 488 DQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAII 547
            Q +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+
Sbjct: 512 AQGMAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDLQADAIL 568

Query: 548 WAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKR 601
              + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +
Sbjct: 569 ETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNK 626

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
           Y  R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++       
Sbjct: 627 YTSRYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTMKRDGKVTASVT------- 674

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
                         V N G ++GA  + ++     A    P KQL  FEK+ +  G  Q 
Sbjct: 675 --------------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKITLKPGESQT 720

Query: 721 VG--INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           V   I+I   K+ +   R      P G+ N+ IG
Sbjct: 721 VSFPIDIDALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 950

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 362/760 (47%), Gaps = 105/760 (13%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAA--AVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   GVP      +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 381

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  D++N     L    R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 442 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
           L+N+   LPLS    RT+AV+GP +D      G+Y      G      T ++G +G+  +
Sbjct: 501 LENKDNLLPLSKTL-RTIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 557

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
            +++QGC D    D+     A+ A+ Q+D  I+V+G   + EA         E  D A L
Sbjct: 558 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 616

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           +LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+A
Sbjct: 617 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 673

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG G
Sbjct: 674 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 726

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT+F ++                   N  I  KA            + VQ  VKNVGS
Sbjct: 727 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 758

Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           + G     ++ T     +A  K    +L  F ++H+  G  + V   +     +S+++  
Sbjct: 759 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 814

Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
             R +  GE  I +GG           KHSV       GV
Sbjct: 815 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGV 854


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 233/809 (28%), Positives = 364/809 (44%), Gaps = 146/809 (18%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEK-------------------VKLLISGAAA 75
           +    L +   S PIP+RV  L+G ++L+EK                   +KL  +  A 
Sbjct: 13  SEVEMLLYKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQ 72

Query: 76  VPRLGI-------KGYEWW-------------SEALHGVSN-------------VGPGTK 102
           V   G+       +   W              +EA++ +               +G    
Sbjct: 73  VKNGGVGSLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECS 132

Query: 103 FGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRD 162
            G    GAT FP  ++  S++N  L+  + R V+ E RA       G   +SP +++ RD
Sbjct: 133 HGHMAIGATVFPVPLSLGSTWNVELYREMCRAVARETRAQ-----GGAVTYSPVLDVVRD 187

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGS--DGDRLKVAASCKHFTAYDLDNWNGVDR 220
           PRWGR +E  GED  L  + A + V GLQG   DG+   VAA+ KHF  Y      G   
Sbjct: 188 PRWGRTEECFGEDAYLISEMAVASVEGLQGESLDGED-SVAATLKHFVGY------GSSE 240

Query: 221 FHFNA---KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
              NA    + ++++ +   +PFR  V E   AS+M +YN+++GVP   +  +L   +RG
Sbjct: 241 GGRNAGPVHMGRRELLEVDLLPFRKAV-EAGAASIMPAYNEIDGVPCTTNEELLDGVLRG 299

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRG 336
           EW  +G +++DC ++ +            +AA  AIRAG+D++  G   G H   AV+ G
Sbjct: 300 EWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRSG 359

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
            L E  ++ A+   LT++ RLG+F+  P + P      + + + +H ELA + A +G+VL
Sbjct: 360 QLEEEVLDRAVRRVLTLKFRLGLFE-RPYADP--ERAERVIGSAEHVELARQLASEGVVL 416

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY-----A 449
           LKN+   LPLS     T+AVIGPN+D     +G+Y         TT L GI         
Sbjct: 417 LKNKDGVLPLS-ADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPE 475

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIE----------------- 492
           R ++  GC+ +     + F  A+  + +AD  ++V+G   + +                 
Sbjct: 476 RVLYAPGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTD 534

Query: 493 --------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
                    E +DR  L L G Q EL+ ++    K P ++V ++G PI   +   D    
Sbjct: 535 NAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWI--DEHAD 591

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           AI+ A YPGQ GG AIADILFG  NP G+L ++  P+ ++  +P+          KR  G
Sbjct: 592 AILEAWYPGQEGGHAIADILFGDVNPSGRLTIS-IPK-HVGQVPVY------YHGKRSRG 643

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           + Y        YPFG+G+SYT F +            L     +IN   S K        
Sbjct: 644 KRYLEGDSQPRYPFGYGLSYTEFTYN----------NLKLESDTINKDGSTK-------- 685

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGI 723
                  V V+V NVG + GA  + ++ T  A     P K+L  F K+ +  G  Q V  
Sbjct: 686 -------VTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQPGETQTVEF 738

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            +   + L  + ++    +  GE  +H+G
Sbjct: 739 TVG-PEQLQYIGQNYKPVVEPGEFRVHVG 766


>gi|167517889|ref|XP_001743285.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778384|gb|EDQ91999.1| predicted protein [Monosiga brevicollis MX1]
          Length = 521

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 192/535 (35%), Positives = 270/535 (50%), Gaps = 33/535 (6%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYN--------GGTAGLTYWSPNVNIFRD 162
           ++FP  +  A+SFNA+LW A G V++ E RA  N              T + PN+NI RD
Sbjct: 1   STFPGPLGMAASFNASLWFAKGDVLATELRAFSNTNWHRSNVDNQIQYTGFGPNINIARD 60

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRF 221
           PR+GR  E PGEDP LSG YA   VRG+  +D     K+ A  KHFTAY  +     +R 
Sbjct: 61  PRFGRTSELPGEDPYLSGTYATHMVRGMMQADAAGHPKMLAYLKHFTAYSTE----TNRQ 116

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
           H +  VS  D+ DT+   + M     + A  MCSYN VNG P+CA+  IL+  +R +W+ 
Sbjct: 117 HSDFNVSSHDLWDTYLPQYEMAFRSAQPAGAMCSYNAVNGRPSCANGYILRDVLRNQWQQ 176

Query: 282 -NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSE 340
            N +I SDC ++        F      AAA A+  G DL+ G  +      AV R L S 
Sbjct: 177 PNAHITSDCGAISSLRGAPVFAPDDATAAAVALNNGTDLEMGSQVYASLAEAVARNLTSS 236

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
             +  A      +  R G FD  P++  +   G +D+ +   Q L  EA  Q +VLL+N+
Sbjct: 237 TLVEEAFRRAARILFRGGRFD-PPATVEWNAYGVQDINSSATQALVHEATAQSLVLLQNR 295

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGIGRYARTIHQQGCK 458
              LPL+    + VAV+GP  +    ++ +YA + C  G    +  +G      ++ G  
Sbjct: 296 HGILPLAP--GQRVAVVGPLVERGDALLSDYASLVCYDGTYDCIPTLGASVTAANKGGAT 353

Query: 459 ------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVS 512
                 DV   +     AA+ A++ AD  +  +G D++IE E LDR  L LP   Q L+ 
Sbjct: 354 TVVPGVDVNSNNSSGLAAAVAAAQAADVVVAFLGTDKTIEREGLDRVNLTLP-GLQGLLL 412

Query: 513 KVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG 572
              +A+  P +L+L +GGP+ +    N  R AA++     GQ G T +A  LFG  N  G
Sbjct: 413 DQLLATGTPVVLLLNNGGPLAIESYLN--RTAAVMETFNAGQFGATVMAKALFGQVNNFG 470

Query: 573 KLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           KLP T YP  Y+     TE AM       YPGRTYR+     V+PFG G+SYT F
Sbjct: 471 KLPYTVYPAGYV-----TEQAMNNYDMALYPGRTYRYLVQAPVFPFGFGLSYTTF 520


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 157/420 (37%), Positives = 234/420 (55%), Gaps = 29/420 (6%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +  P+    L   +R  DL  RL+L+EK K++ +G+ A+PRLGI  +EWWSEALHG+   
Sbjct: 21  QQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALHGIGRN 80

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM----YNGGT----AG 149
           G           AT FP  +  ASS++  L + +   VSDE R         GT     G
Sbjct: 81  G----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTIKRYQG 130

Query: 150 LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFT 208
           L++W+PN+NIFRDPRWGRGQET GEDP L+ +   + VRGLQG SD    K+ A  KHF 
Sbjct: 131 LSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSDSKYRKLLACAKHFA 190

Query: 209 AYDLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
            +    WN   R  FN + + ++D+ +T+   F+  V +G VA VMC+Y +++G P C +
Sbjct: 191 VHSGPEWN---RHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRIDGQPCCGN 247

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFL 325
              LK  +R EW   G +VSDC +V  ++   H   +P+   A+A A+ +G D++CG   
Sbjct: 248 NRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTDVECGSDY 307

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
             +   AV+ G++ E D++ ++   L  +  LG FD +    P+  +    V +  H++L
Sbjct: 308 S-NLPEAVRAGIIKEADVDVSVRRLLEARFALGDFDPD-ELVPWTKISESVVASKAHKQL 365

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           AL+ AR+ +VLL+N    LPL     + V V+G N+  +  M GNY+G      T LQG+
Sbjct: 366 ALDMARKSMVLLQNND-ILPLKRSGQKIV-VVGANAIDSTMMWGNYSGYPTQTVTILQGL 423



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 131/300 (43%), Gaps = 55/300 (18%)

Query: 478 ADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLM 527
           AD  I V G+   +E E +          DR  + LP  Q+E++  +S A +   + V  
Sbjct: 599 ADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR-IVFVNC 657

Query: 528 SGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL 587
           SG  I  A      R+ AI+ A YPG+ GGTA+AD+LFG  NP GKLP+T+Y  +    L
Sbjct: 658 SGSAI--ALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNPSGKLPVTFYKND--AQL 713

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
           P            R  GRTYR++K   ++PFG+G+SYT F                    
Sbjct: 714 P-------DFLDYRMAGRTYRYFKETPLFPFGYGLSYTQF-------------------- 746

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
                        T  +   +   VQV V N G +DG   + V+         P K L  
Sbjct: 747 -------------TIGQPRYINNQVQVSVSNTGKRDGDEVVQVYIRRTDDAAGPIKTLRG 793

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           F++V +  G  ++V +++    +      S T R+  G + + +G +  + +L    + +
Sbjct: 794 FQRVSLKVGETKQVSVSLPRESFEWWDASSNTMRVIPGNYEVMVGSSSMAKNLKTIMVNL 853


>gi|402299776|ref|ZP_10819350.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
 gi|401725066|gb|EJS98379.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
          Length = 937

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 155/425 (36%), Positives = 240/425 (56%), Gaps = 29/425 (6%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGP 99
            PF    L +  RV DL+ R +L EK++L++     + RLGIK ++  +EA HG++ +G 
Sbjct: 7   FPFQNTELLVEDRVKDLVNRFTLDEKIELMMQHQPEIKRLGIKAHKQGTEAAHGIAWLGE 66

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNV 157
                     AT FPQ I  +S++N  L   IG V+ DEAR  Y    +  GLT W+P V
Sbjct: 67  ----------ATVFPQNIGLSSTWNKDLMNKIGTVIGDEARVFYQKDPSKHGLTLWAPTV 116

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           ++ RDPRWGR +E  GEDP L+G+     V+G+QG+D D LK  A+ KHF    L N N 
Sbjct: 117 DMERDPRWGRTEEAYGEDPELTGQLTTELVKGMQGNDPDYLKSVATLKHF----LGNNNE 172

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
           ++R   +A +  +++ + +   F+  ++EGK  S+M +YN +NG P    P + +  ++ 
Sbjct: 173 INRDKCSASIDPRNMHEYYLKAFKPAIVEGKAKSIMTAYNSINGTPALLHPYV-EEVVKQ 231

Query: 278 EWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQ 334
           EW ++G+IVSD  D VG+  D +++  T  EA AD I++G+D   D           A+ 
Sbjct: 232 EWGMDGFIVSDAGDVVGIVNDHKYY-ETYSEALADTIKSGIDSITDDAEISLQAMRDALS 290

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGI 394
           +GLL E D++ AL+NT  V+ RLG FD E +  PY  +    +   +H++L+ +AA++ +
Sbjct: 291 KGLLVEADLDKALINTFKVRFRLGEFDEERN--PYAFVSESKLLAKEHKQLSYQAAQEQV 348

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIG---RYART 451
           VLLKN G  LPL+     ++AVIGP SD    ++  Y+G      T L+ I    +  + 
Sbjct: 349 VLLKNDGL-LPLNEGTTESIAVIGPLSD--DVLVDWYSGTPSYRITALEAIEHKLQNKKV 405

Query: 452 IHQQG 456
           IH+ G
Sbjct: 406 IHKSG 410



 Score =  105 bits (263), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 79/292 (27%), Positives = 141/292 (48%), Gaps = 39/292 (13%)

Query: 471 AIDASRQADATILVMGLDQSIE-AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
           A+ A+ + D  ++V+G +  I   E +DR  ++LP  Q++L+  V   +   TI VL+S 
Sbjct: 547 AVKAAAEQDYAVVVLGNNPFINGKECIDREDIVLPPEQEKLLQAVKQVNPN-TIAVLVSS 605

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
            P  + +A  +  ++ I++  + G   G A+AD+LFG  NP G+L MTWY    +  LP 
Sbjct: 606 YPYALNWANEN--LSGIMYTSHAGPELGKAVADVLFGDYNPAGRLSMTWYRS--VDQLPD 661

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
             M     + K    RTY++++G V+YPFGHG+SYT F ++          P D      
Sbjct: 662 I-MDYDIIKGK----RTYQYFEGEVLYPFGHGLSYTTFNYSNIQVNETTLSPDD------ 710

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAF 708
                   +KVT            + ++N G   G   + ++          P K L+ F
Sbjct: 711 -------QVKVT------------ISIENTGRFAGDEVVQLYGRAERSRVKRPLKTLLDF 751

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVS 759
            +VH+ +G ++++   + + K L   D +  R  +  G +++ IG +  +++
Sbjct: 752 TRVHLQSGEKKQIEFTVPISK-LEFWDVTQERYCVEKGVYSLLIGSSSKNIA 802


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 217/739 (29%), Positives = 361/739 (48%), Gaps = 101/739 (13%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGPGTKF 103
           +LP+  RV DL+  +++++K++LL  G     +P LG+       EA+HG S       +
Sbjct: 174 NLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFS-------Y 225

Query: 104 GGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDP 163
           G    GAT FPQ I   +++N  L EA    + DE  +        +  WSP +++ +D 
Sbjct: 226 GS---GATIFPQSIGMGATWNKRLIEAAAMAIGDETVS-----ANAVQAWSPVLDVAQDA 277

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF 223
           RWGR +ET GEDPVL  +   ++++G Q S G    +  + KHF A+      G D    
Sbjct: 278 RWGRCEETYGEDPVLVTEIGGAWIKGYQ-SKG----LMTTPKHFAAHGAP-LGGRDSHDI 331

Query: 224 NAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNG 283
              +S++++ +   VPFR    + K  S+M SY+   GVP      +LK  +R EW  +G
Sbjct: 332 G--LSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKELLKGILRDEWGFDG 389

Query: 284 YIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEI 341
           +IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +      +A +RG L+  
Sbjct: 390 FIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPDVIAAAKRGELNMD 449

Query: 342 DINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           D++      L    R G+F+  P     +  + P    +P+HQ LA + A++ IVLL+N+
Sbjct: 450 DLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYP-GWNSPEHQALARKTAQESIVLLENK 508

Query: 401 GPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYARTIH 453
           G  LPLS    +T+AVIGP +D        +    G    +  G       +    + ++
Sbjct: 509 GNILPLSK-SLKTIAVIGPGADNLQPGDYTSKPQPGQLKSVLTGIKA---AVNSSTKVLY 564

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGLLLP 504
           ++GC+ +      +   A+ A+  AD  +LV+G   + EA         E  D A L+LP
Sbjct: 565 EEGCRFIGTEGTDI-AKAVKAAENADVAVLVLGDCSTSEALKGITNTSGENHDLATLILP 623

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           G QQ+L+  V    K P +L+L +G P ++++A  + +   + W   PGQ GG A AD+L
Sbjct: 624 GEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNW--LPGQEGGYATADVL 680

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGM 622
           FG  NP G+LPMT +P++    LP+         + +  GR Y +   P   +Y FG+G+
Sbjct: 681 FGDYNPAGRLPMT-FPRD-AAQLPLY-------YNFKTSGRVYDYVDMPYYPLYQFGYGL 731

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SYT+F ++  N    +++  +G + S+NAT++                       N G  
Sbjct: 732 SYTSFNYSDLN----ISLEKNG-NVSVNATVT-----------------------NTGKV 763

Query: 683 DGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
            G   + ++ T     +A  K    +L  F++V++  G  ++V   +     LS+++   
Sbjct: 764 AGDEVVQLYIT---DMYASVKTRVMELKDFDRVYLNPGESKKVSF-VLTPYQLSLLNDEM 819

Query: 739 TRRIPLGEHNIHIGGTKHS 757
            R +  G   I +GG   S
Sbjct: 820 DRVVEKGLFKIMVGGKSPS 838


>gi|330995911|ref|ZP_08319806.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329574250|gb|EGG55825.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 865

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 176/472 (37%), Positives = 256/472 (54%), Gaps = 48/472 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M +T+   +LGL   + +  G A  EP+  +P D + R            +R  DL+ RL
Sbjct: 1   MKNTVFVLWLGL---AVADIGWAQNEPYK-NP-DLSPR------------ERAEDLLKRL 43

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EKV L+I+ + AV RLGIK Y WWSEALHGV+  G           AT +P  +  A
Sbjct: 44  TLKEKVSLMINSSPAVERLGIKPYNWWSEALHGVARSGI----------ATVYPITMGMA 93

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
           S F+    EA+   VSDEARA Y+             GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94  SVFDDEAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153

Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
            GEDP L+ +   + VRGLQG +D    K+ A  KH+  +         R +F+  ++  
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+   F++ V E  V  VMC+Y +  G P C    +L + +R EW     +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSKRLLHQILRDEWGFKYVVVSDC 270

Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++G +++     + P+   A+A A+ +G DL+CG    +  E+AV RGL++E  I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D + S  P+  +G   V    H+++AL+ AR+ +VLL N G  LPL 
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
             R   V V+GPN+  +V   GNY G      T L GI  R  R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439



 Score =  108 bits (271), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 130/294 (44%), Gaps = 50/294 (17%)

Query: 476 RQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKGPTIL 524
           + A   + V G+   +E E             DR  + LP  Q++++  +  A K   + 
Sbjct: 599 KDAGTVVFVGGISPDLEGEEKNHVNCPGFAGGDRTSIELPQVQRDILKALKAAGK-KVVF 657

Query: 525 VLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYI 584
           V  SG  +  A         AI+ A YPGQAGG A+AD+LFG  NP GKLP+T+Y     
Sbjct: 658 VNCSGSAM--ALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKN--T 713

Query: 585 TNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
             LP   + +M+        GRTYR+     ++PFG+G+SYT F  +             
Sbjct: 714 EQLPDFEDYSMK--------GRTYRYMTDKPLFPFGYGLSYTTFAIS------------K 753

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
           GR  S ++  +G+ +K T              VKN G +DGA  + V+         P K
Sbjct: 754 GRL-SKSSVEAGEGVKFT------------AQVKNTGKRDGAEVVQVYVRKVGDTGGPLK 800

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
            L  F +V + AG  +RV I +    +      + T R+  GE+ I  G +  +
Sbjct: 801 SLRGFRRVELKAGESRRVSIELSPEAFEFFDTSTNTMRVVPGEYEIMYGNSSDT 854


>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 954

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 224/761 (29%), Positives = 358/761 (47%), Gaps = 107/761 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
           +  VSLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   GVP      +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 385

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  D++N     L+   R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 446 INMEDLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
           L+N+   LPLS    RT+AV+GP +D           + G    +  G  +    +G+  
Sbjct: 505 LENKENLLPLSKTL-RTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIKS---AVGKQT 560

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
           + +++QGC D    D      A+  + Q+D  I+V+G   + EA         E  D A 
Sbjct: 561 KVLYEQGC-DFTNPDATNIPKAVKTASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWAT 619

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG 
Sbjct: 677 ADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT+F ++                   N  I  KA            + VQ  VKNVG
Sbjct: 730 GLSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVG 761

Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           S  G     ++ T     +A  K    +L  F ++H+  G  + V   +     +S+++ 
Sbjct: 762 SCAGDEVAQLYVT---DMYASVKTRVMELKDFTRIHLQPGESKTVSFEMTPYD-ISLLND 817

Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
              R +  GE  I IGG           KHSV       GV
Sbjct: 818 RMDRVVEKGEFKIMIGGMSPDYVAKNEIKHSVGYSDNKKGV 858


>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
 gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
          Length = 874

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 152/393 (38%), Positives = 218/393 (55%), Gaps = 33/393 (8%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ RL+L EKV  L++ A A+PRLGI  Y WW+E+LHG     P           T+FP+
Sbjct: 40  IVDRLTLDEKVAQLLNVAPAIPRLGIPAYNWWTESLHGAIGAVP----------TTNFPE 89

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
            I  A++F+A L + +   +  E RA++  G           GL  WSPN+NIFRDPRWG
Sbjct: 90  PIGLAATFDAPLIQQVAAAIGTEVRALHTLGRRTGHLGRIGTGLDTWSPNINIFRDPRWG 149

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+ +   ++V+G+QG+D D   V A+ KHF  +     +   R   N  
Sbjct: 150 RGQETYGEDPYLTARIGVAFVQGMQGNDPDLPLVVATPKHFAVHSGPEPS---RHTDNIF 206

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
            + +D+EDT+   FR  ++EGK  S+MC+YN+V+G P C    +LK  +RG W   GY+V
Sbjct: 207 ATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSHMLLKDYLRGAWGFKGYVV 266

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
           SDCD+V   Y+   +        A A+R G+D +C        P LG   + ++ RG ++
Sbjct: 267 SDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGGRPDLGDRYKESLARGHIA 326

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             DI+ ALV   + + R G   G  S +P   + P  + TPDHQ LAL  A + +VLLKN
Sbjct: 327 MGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPDHQALALSTAEKSLVLLKN 385

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
            G  LPL       +A+ GP +D T  + GNY+
Sbjct: 386 DG-ILPLRP--GARIALAGPLADATRVLRGNYS 415



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/278 (33%), Positives = 142/278 (51%), Gaps = 50/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +E E +          DR  L LP  Q+  + K     K P ++V M+G  ID++
Sbjct: 611 GLTSDLEGEEMPVKVEGFDGGDRTTLDLPADQRAFLEKARALGK-PLVIVAMNGSAIDLS 669

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAM 594
           +AK +   AAI+ A YPGQ+GG A+ ++L G ++PGG+LP+T+Y    + +L P T+  M
Sbjct: 670 WAKEN--AAAIVEAWYPGQSGGLAVGNVLSGRADPGGRLPVTFY--RNVNDLPPFTDYGM 725

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
                    GRTYR+++G  VYPFGHG+SYT+F +    AP  V  P+D   GS+     
Sbjct: 726 E--------GRTYRYFRGTPVYPFGHGLSYTSFRY----APLTVE-PVD---GSVE---- 765

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
            K +KV  A            + N G++ G     ++ TPP    AP   L  F++V + 
Sbjct: 766 -KGLKVRTA------------ITNTGARAGDDVAQLYITPPRFEGAPRLALRGFQRVTLK 812

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            G  + V   +   + LS V  +G R +  G++ + IG
Sbjct: 813 PGETRNVEFTLS-PRDLSFVTMAGERGLIPGDYGLSIG 849


>gi|357046682|ref|ZP_09108302.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
 gi|355530484|gb|EHG99896.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
          Length = 677

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 174/472 (36%), Positives = 254/472 (53%), Gaps = 48/472 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M +T+   +LGL++   +  G A  EP+  +P              L   +R  DL+ RL
Sbjct: 1   MKNTVFVLWLGLVV---ADIGWAQNEPYK-NP-------------DLSPRERAEDLLKRL 43

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EK  L+I+ + AV RLGIK Y WWSEALHGV+  G           AT +P  +  A
Sbjct: 44  TLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV----------ATVYPITMGMA 93

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
           S F+    EA+   VSDEARA Y+             GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94  SVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153

Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
            GEDP L+ +   + VRGLQG +D    K+ A  KH+  +         R +F+  ++  
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+   F++ V E  V  VMC+Y +  G P C    +L + +R EW     +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSERLLHQILRDEWGFKYVVVSDC 270

Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++G +++     + P+   A+A A+ +G DL+CG    +  E+AV RGL++E  I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D + S  P+  +G   V    H+++AL+ AR+ +VLL N G  LPL 
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
             R   V V+GPN+  +V   GNY G      T L GI  R  R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 165/424 (38%), Positives = 233/424 (54%), Gaps = 33/424 (7%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           +R  DL+ +L+L+EKV L++  +  V RLGIK Y WW+EALHGV+  G           A
Sbjct: 33  ERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG----------WA 82

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYWSPNVNIFRD 162
           T FPQ I  A+SF+          VSDEARA     +A        GLT W+P VNI+RD
Sbjct: 83  TVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIWTPTVNIYRD 142

Query: 163 PRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRL--KVAASCKHFTAYDLDNWNGVDR 220
           PRWGRG ET GEDP L+     S V+GLQ  D +    KV A  KHF  +    WN   R
Sbjct: 143 PRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDKVHACAKHFAVHSGPEWN---R 199

Query: 221 FHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
             FNA+ +S +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L   +R EW
Sbjct: 200 HSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGEPCCGSNRLLNHILRREW 259

Query: 280 RLNGYIVSDCDSVGVYY-DTQHFT-STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGL 337
             +G +V+DC ++  ++ D  H T +    A++ A+ +G DL+CG      TE  V++G 
Sbjct: 260 GYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVLSGTDLECGSNYRSLTE-GVKKGF 318

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           + E DI+ ++   L  +  LG  D EP    +  +    VC+  H  L+L+ AR+ + LL
Sbjct: 319 IDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQIPYSVVCSDKHDSLSLDMARKSMTLL 377

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIH 453
            N+  +LPL      T+AV+GPN++ +V   GNY G+     T L GI    G+  + I+
Sbjct: 378 LNKNNALPLER-GGTTIAVMGPNANDSVMQWGNYNGLPKRTITILDGIRSAMGKDDKLIY 436

Query: 454 QQGC 457
           +QGC
Sbjct: 437 EQGC 440



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 91/321 (28%), Positives = 142/321 (44%), Gaps = 50/321 (15%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+   ++     ++   + AD  I   G+   +E E +          DR  + LP  Q+
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           E++  +  A K   I V  SG PI  A         AI+ A YPGQ+GG A+A++LFG  
Sbjct: 643 EMIKALHDAGK-KVIFVNCSGSPI--AMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP T+Y    +  LP  E            G TYRF+ G  ++PFG+G+SYT F 
Sbjct: 700 NPAGRLPATFYRN--LAQLPDFE-------DYNMAGHTYRFFNGEPLFPFGYGLSYTTF- 749

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
                           ++G I    S +  +         T+ + V V N GS++G   +
Sbjct: 750 ----------------KYGKIQLKSSAQTDE---------TVKITVPVTNTGSRNGEEVV 784

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR-SGTRRIPLGEH 747
            V+         P K L AF++V++PAG   +V + +   K L   D  + T R      
Sbjct: 785 QVYLKKQGETDGPVKTLRAFKRVYIPAGKTVKVELEL-TPKQLEWWDSATNTMRTMPNTF 843

Query: 748 NIHIGGTKHSVSLHAATLGVI 768
           ++ +GG+     L   TL ++
Sbjct: 844 DVMVGGSSKDSDLQKKTLKLL 864


>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
          Length = 925

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 199/686 (29%), Positives = 329/686 (47%), Gaps = 82/686 (11%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L    GR+V  EAR +      G T  ++P +++ RD RWGR 
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +   +   G+Q       +VA++ KHF AY  +          + ++ 
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQTD----YQVASTAKHFAAYSNNKGAREGMSRVDPQMP 289

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E+   +PFR  +    +  VM SYN  +GVP       L   +RGE    GY+VSD
Sbjct: 290 PREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEMGFRGYVVSD 349

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAV-------QRGLLSEI 341
             SV   ++  H      +A   +I AGL++ C  +   H E+ V       + GL++E 
Sbjct: 350 SGSVEYLHNKHHTAVNQLDAVRQSIEAGLNVRCNFW---HPETYVMPLRQLLREGLITEE 406

Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGH---LGPKDVCTPDHQELALEAARQGIVLLK 398
            +++ + + L V+  +G+FD     +PY        ++V  P+H E+AL+A+R+ IVLLK
Sbjct: 407 LLDSRVRDVLRVKFLVGLFD-----RPYQTDLAAADREVDGPEHNEVALQASRESIVLLK 461

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQ 454
           N+  +LPL   + R +AV+GPN+D     +G+Y  +A   T+ L G+ R        +++
Sbjct: 462 NENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEIVYE 521

Query: 455 QGCK--DVACADDQLFGA------------AIDASRQADATILVMGLDQSIEAEALDRAG 500
           +GC+  D A    ++F              A +A+ ++D  ++V+G       E   R+ 
Sbjct: 522 KGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENCSRSS 581

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L LPGRQ+EL+  V    K PT+LV+++G P  + +A  D  + AI+ A YPG  GG A+
Sbjct: 582 LDLPGRQEELLRAVEATGK-PTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHGGQAV 638

Query: 561 ADILFGTSNPGGKLPMTWYPQ---EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            ++LFG  NPGGKL +T +P+   +   N P    A         PG       G  +Y 
Sbjct: 639 YEVLFGEYNPGGKLTVT-FPRHVGQIPFNFPYKPAANTDGGLTPGPGGNQTRING-ALYD 696

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT F +                    +  I  + I+            V  DV 
Sbjct: 697 FGYGLSYTTFEYA-------------------DLRIEPQTIRQDEP------FRVSFDVT 731

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           N G +DG   + ++          + K L  F++VH+ AG  +RV + +   + LS+++ 
Sbjct: 732 NTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQVR-PQDLSLLNE 790

Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHA 762
              R +  G+ ++ IG +   + L A
Sbjct: 791 RMERVVEPGDFDVLIGASSTDIRLKA 816


>gi|332877556|ref|ZP_08445303.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
           oral taxon 329 str. F0087]
 gi|332684662|gb|EGJ57512.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
          Length = 676

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 174/472 (36%), Positives = 254/472 (53%), Gaps = 48/472 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           M +T+   +LGL++   +  G A  EP+  +P              L   +R  DL+ RL
Sbjct: 1   MKNTVFVLWLGLVV---ADIGWAQNEPYK-NP-------------DLSPRERAEDLLKRL 43

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +L+EK  L+I+ + AV RLGIK Y WWSEALHGV+  G           AT +P  +  A
Sbjct: 44  TLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV----------ATVYPITMGMA 93

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQET 171
           S F+    EA+   VSDEARA Y+             GLT+W+PNVNIFRDPRWGRGQET
Sbjct: 94  SVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTFWTPNVNIFRDPRWGRGQET 153

Query: 172 PGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA-KVSK 229
            GEDP L+ +   + VRGLQG +D    K+ A  KH+  +         R +F+  ++  
Sbjct: 154 WGEDPYLTSRMGVAVVRGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHYFDVEQLDP 210

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +D+ +T+   F++ V E  V  VMC+Y +  G P C    +L + +R EW     +VSDC
Sbjct: 211 RDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSERLLHQILRDEWGFKYVVVSDC 270

Query: 290 DSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
            ++G +++     + P+   A+A A+ +G DL+CG    +  E+AV RGL++E  I+ +L
Sbjct: 271 GAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDYMQLEAAVDRGLITEHRIDTSL 330

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L  +  LG  D + S  P+  +G   V    H+++AL+ AR+ +VLL N G  LPL 
Sbjct: 331 CRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMALDIARKSLVLLHNDG-VLPLD 388

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTI-HQQGC 457
             R   V V+GPN+  +V   GNY G      T L GI  R  R + +++GC
Sbjct: 389 KTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIRERLGRDVRYEKGC 439


>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 164/432 (37%), Positives = 238/432 (55%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R   L+  L+L+EK +L++  + +V RLGIK Y WW+EALHGV+  G   
Sbjct: 22  YKNTSLSPEERAELLVKELTLEEKARLMMDASQSVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+SF+  +   +   VSDE RA     T+        GLT W
Sbjct: 80  --------ATVFPQPIGMAASFDPEMVYEVFNAVSDEVRAKNTYYTSRDSRERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG +DG   K+ A  KHF  +  
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ ++ +D+ +T+  PF   V EG+V  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGSDRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
            + +RGEW  +G +V+DC ++  +Y+ +   + P    A+A A+ +G DLDCG       
Sbjct: 249 MQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSSYKALV 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           ES V++GL++E  ++ ++   L  +  LG  D EP    +  +    V +  H  LALE 
Sbjct: 309 ES-VKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDSLALEI 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+   LPL      TVAV+GPN++ +V   GNY G+     T L G+    
Sbjct: 367 ARKSMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGVRNIL 425

Query: 446 GRYARTIHQQGC 457
           G   + I++QGC
Sbjct: 426 GAEDKLIYEQGC 437



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/319 (28%), Positives = 143/319 (44%), Gaps = 47/319 (14%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           D+    D     +++  + AD  I   G+  S+E E +          DR  + LP  Q+
Sbjct: 580 DLGFKKDVDIRKSVERVKDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 639

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           EL+  +  A K   ILV  SG PI         +  A++ A YPGQ GGTA+A++LFGT 
Sbjct: 640 ELIDALHRAGK-KIILVNCSGSPI--GLEPETKKCEALLQAWYPGQQGGTAVAEVLFGTY 696

Query: 569 NPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFV 628
           NP G+LP+T+Y   Y   LP  E            GRTYR+ +   ++PFG+G+SYT F 
Sbjct: 697 NPAGRLPVTFYRNIY--QLPDFE-------DYNMTGRTYRYMQDIPLFPFGYGLSYTTFN 747

Query: 629 HTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTL 688
           +                       +S    ++T  +  +LT    V V N G + G   +
Sbjct: 748 Y---------------------GKVSLNKGEITDGQTVKLT----VSVTNTGERGGEEVV 782

Query: 689 LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHN 748
            V+         P K L AF+++ +P+G    V   +   +     ++S T R+  G ++
Sbjct: 783 QVYLKKQGDTKGPVKTLRAFKRISIPSGQTVNVEFELKDKELEWWDEQSNTVRVCPGNYD 842

Query: 749 IHIGGTKHSVSLHAATLGV 767
           I +GG+     L   ++ V
Sbjct: 843 IMVGGSSRDGDLQQISIIV 861


>gi|261340566|ref|ZP_05968424.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
 gi|288316974|gb|EFC55912.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
          Length = 765

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 212/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYHADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R GS+ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPTMTRDGSVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    VDV N G +DGA  + ++     A    P KQL  FEKV +  G  + V  
Sbjct: 673 ---------VDVTNSGKRDGATVIQMYVQDVTASMSRPVKQLRGFEKVDLKPGETKTVSF 723

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++        G+ N+ IG
Sbjct: 724 PIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751


>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 230/773 (29%), Positives = 363/773 (46%), Gaps = 101/773 (13%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
           A D KD      P      PI +RVNDL+ R++L+EKV  L                   
Sbjct: 21  AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76

Query: 75  AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
            VP  +G   Y   + AL           S +G    FG D      T +P  +  A S+
Sbjct: 77  KVPAEIGSLIYFETNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           N  L E    V + EAR       +G+ + +SP +++ RDPRWGR  E  GEDP  +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190

Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
            A+ V+G QG D     ++AA  KH+  Y         R +   ++SKQ + DT+ +P+ 
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
           M V  G  A++M S+N ++GVP  A+P I+   ++  W  +G+IVSD  ++      Q  
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305

Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
            +T +EAA  A  AGL++D        H +  V+ G +S   ++ A+   L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365

Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           +     +PY       +    P   ++A   A + +VLLKN+  +LPL+    + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418

Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
           P +     ++G++ G      +A  Y          A   +  GC      + + F  A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNREGFAEAL 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
           +A+R +D  +L +G   +   E   R+ + LP  Q+EL +++  A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           ++   + +P   AI+    PG  G   +A IL G  NP GKL MT          P +  
Sbjct: 537 EL--NRLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585

Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
            +    ++R  GR ++ FYK      +YPFGHG+SYT F +                   
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY------------------- 626

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
              T++  A KV   + +RL+  V+V V NVG++DGA T+  F S P      P K+L  
Sbjct: 627 --GTVTPSATKVK--RGDRLS--VEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           FEK  + AG  +    +I + +    V+  G R +  GE++I + G    + L
Sbjct: 681 FEKQLIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 217/687 (31%), Positives = 330/687 (48%), Gaps = 91/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 156 GTTVFPTSIGQASTWNPELIRQMGRVIATEASA--QGAHIG---YGPVLDLARDPRWSRV 210

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G   A+ VRG QG     R  V A+ KHF +Y    W         A +
Sbjct: 211 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 267

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W+  G++VS
Sbjct: 268 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVS 326

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  ++G   +      +  EAA  A+ AG+D D G    ++ E   +AV++G ++   ++
Sbjct: 327 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 383

Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
            A+   L ++  +G+FD       +P      + V +P+H  LA E ARQ IVLLKN+  
Sbjct: 384 KAVRRILFLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 438

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
            LPL     RT+AVIGPN+D    M+G+Y          T L+GI     +  R ++ +G
Sbjct: 439 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKG 497

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
           C  V  +    F  AI+A+R AD  ++V+G     D S E                    
Sbjct: 498 CA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 556

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DRA L L GRQ EL+ +V    K P +LVL+ G P+ +     +    AI+ A YPG
Sbjct: 557 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 613

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
             GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G 
Sbjct: 614 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 666

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
             YPFG+G+SYT F +T                        G  ++V+  + N   + V 
Sbjct: 667 PRYPFGYGLSYTTFSYT------------------------GMKVRVSE-ESNHCRVDVS 701

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V V+N G+ DG   + ++     G +  P +QL AF +V + AG  + +   +   K L+
Sbjct: 702 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLD-KKSLA 760

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +  R G   +  G   +  GG+   ++
Sbjct: 761 LYMRDGEWAVEPGRFTVMAGGSSEDIA 787


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 223/730 (30%), Positives = 345/730 (47%), Gaps = 109/730 (14%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLGI    +  E LHG + +G            TSFPQ I   ++F+  L E++  + + 
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDG-- 195
           EARA   G    LT   P V++ R+PRWGR +ET GEDP L  +   + VRG QG     
Sbjct: 152 EARA--RGTHQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDATFR 206

Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
           D+ +V A+ KHF A+     +G +    N  VS + + +TF  PF+  + +G   SVM S
Sbjct: 207 DKTRVIATLKHFAAHGQPE-SGTNCAPVN--VSMRVLRETFLFPFKEALDKGCAISVMAS 263

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--------- 306
           YN+++GVP+ A   +L+  +R EW   G++VSD      YY     +  PE         
Sbjct: 264 YNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSD------YYAIYELSYRPESHGHFVAKD 317

Query: 307 --EAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE 363
             EA A A++AG++++   P   LH    V +G+L E  ++  +   L  + ++G+FD +
Sbjct: 318 KREACALAVQAGVNIELPEPDCYLHLVDLVHKGVLQESQLDELVEPMLRWKFQMGLFD-D 376

Query: 364 PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDV 423
           P   P      +      H+ELA++AAR+ I LLKN GP +PL     +T+AVIGPN++ 
Sbjct: 377 PYVDPAE--AERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIGPNAN- 433

Query: 424 TVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDV--------------ACADD 465
             +++G Y+G+     T L GI    G  A+ ++ +GCK                   D 
Sbjct: 434 -RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSDPAEDR 492

Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSMASK 519
           +    A+  +++AD  +L +G ++    EA       DR  L L GRQ+ELV +  +A+ 
Sbjct: 493 RQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV-RAMVATG 551

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P I  L +G PI + +      + AI    Y GQ  G A+A++LFG +NPGGKLP+T  
Sbjct: 552 KPVIAFLFNGRPISINYLAQS--VPAIFECWYLGQETGRAVAEVLFGDTNPGGKLPIT-I 608

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           P+    +LP      +PS       R Y F +   +Y FG+G+SYT F            
Sbjct: 609 PRS-AGHLPAF-YNHKPSAR-----RGYLFDEVGPLYAFGYGLSYTTFAF---------- 651

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRL-TLGVQVDVKNVGSKDGAHTLLVFSTPPAGH 698
                           + +++   K +R  T  V VDV N G+++G   + ++       
Sbjct: 652 ----------------QNLRLAKKKMHRESTARVLVDVTNTGAREGREVVQLYIRDLVSS 695

Query: 699 WA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
              P K+L  F K+ +  G  Q V   I     L+  +      +  G+  I +G +   
Sbjct: 696 VTRPIKELKGFRKITLQPGQTQTVEFEI-TPDLLAFYNVDMKFVVEPGDFEIMVGSSSRD 754

Query: 758 VSLHAATLGV 767
             L    L V
Sbjct: 755 ADLQKVILRV 764


>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 876

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 154/419 (36%), Positives = 224/419 (53%), Gaps = 29/419 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF   SL   +RV DLI RL+++EK  LL   + A+PRLGI+ + WWSEALHG +N    
Sbjct: 34  PFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYANND-- 91

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---------NGGTAGLT 151
                     T FPQ +  A+SFN  L   I    SDE RA Y         N     L+
Sbjct: 92  --------SVTVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSLS 143

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
            W+PNVNIFRDPRWGRGQET GEDP L+ +     V+GLQG  D    K+ A  KH+T +
Sbjct: 144 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYRKLLACAKHYTVH 203

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
               W+   R   N   VS ++  +T+   F+  V +  V  VMC+Y++++  P C++  
Sbjct: 204 SGPEWS---RHELNVNDVSPREFYETYMPAFKALVQDADVRQVMCAYHRLDDEPCCSNTR 260

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHT 329
           IL+R +R EW     + +DC ++  ++ T   +STP  AAA  + AG DL+C  +   H 
Sbjct: 261 ILQRILRDEWGYEHLVAADCGAITDFHTTHGISSTPVHAAATGLLAGTDLEC-IWDNYHY 319

Query: 330 E---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELA 386
           +    A+++ L+ E  I+++L   L  +  LG  D + +  P+       +   +H+ELA
Sbjct: 320 KMLPEALEKDLIKEEHIDSSLFRVLMGRFELGEMDDD-AIVPWAQTPASVLNNKEHRELA 378

Query: 387 LEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
            + A+Q + LL+N+   LPL     + +AVIGPN+D    + GNY G      T L G+
Sbjct: 379 YKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGPNADDEEVLWGNYNGTPIRTITILDGM 437



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/258 (31%), Positives = 113/258 (43%), Gaps = 50/258 (19%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           +  +   GL  S+E E +          DR  + LP  Q+  +  +  A K   I V  S
Sbjct: 615 ETVVFAGGLSGSLEGEEMPVSYPGFKGGDRTHIELPSVQRNALKALKEAGK-TVIFVNFS 673

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL- 587
           G  I  A         AI+ A Y G++GG AIAD+LFG  NP GKLP+T+Y      NL 
Sbjct: 674 GSAI--ALEPETESCDAILQAWYAGESGGQAIADVLFGDYNPSGKLPLTFYRNS--DNLG 729

Query: 588 PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
              + +M         GRTYR Y    ++PFG G+SYTNF   + NA             
Sbjct: 730 DFEDYSME--------GRTYR-YTNDHLFPFGFGLSYTNF--QIGNAKLS---------- 768

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
              ATI+              T+ + + VKN G +DG   + V+         P K L  
Sbjct: 769 --KATITPDE-----------TISINIPVKNTGKRDGTEIIQVYVRKVNDIDGPIKTLKG 815

Query: 708 FEKVHVPAGAQQRVGINI 725
           FE+V VPAG   +  I++
Sbjct: 816 FERVDVPAGKTTQANISL 833


>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 875

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 233/462 (50%), Gaps = 43/462 (9%)

Query: 1   MASTIAFFF--LGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIG 58
           M  T AFF+  +G +LL              C    A     PF    L    RVNDL+ 
Sbjct: 1   MKITRAFFYPLIGCLLL--------------CIHLQAQNSKFPFQNYRLSFEDRVNDLVS 46

Query: 59  RLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVIT 118
           RL+L+EKV  +++ A  +PRL I  Y+WW+E LHGV+     T +       T FPQ I 
Sbjct: 47  RLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLHGVAR----TPYN-----VTVFPQAIA 97

Query: 119 TASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQ 169
            A++++      +    + E R ++N   A         GLTYW+PN+NIFRDPRWGRGQ
Sbjct: 98  MAATWDTAALYRMADCSALEGRVIHNKAIAAGKEKDRYLGLTYWTPNINIFRDPRWGRGQ 157

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD--RFHFNAKV 227
           ET GEDP L+   A ++VRGLQG+D   LK AA  KH+  +     +G +  R  F+  V
Sbjct: 158 ETYGEDPYLTAALADAFVRGLQGNDPKYLKAAACAKHYAVH-----SGPEPSRHVFDVDV 212

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
           +  D+ DT+   F+  V    VA VMC+YN     P CA   ++   +R +W   GY+ S
Sbjct: 213 TPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPCCASDVLMTDILRNQWSFKGYVTS 272

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           DC ++  +Y           A+ADA+  G D+DCG         AV+   ++E  I+ ++
Sbjct: 273 DCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNEAYRALVQAVKENKITEKQIDISV 332

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
                ++ RLGMFD  PS   Y      ++ +  H + AL  A + IVLLKN   +LPL 
Sbjct: 333 KRLFMIRFRLGMFD-PPSMVKYAQTPATELESAAHAKHALLMAHESIVLLKNANNTLPLK 391

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
               + V V+GPN+   +  +GNY+G      T  QGI   A
Sbjct: 392 KGLKKIV-VLGPNATNVIAPLGNYSGTPSKLITLFQGIKEKA 432



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 132/300 (44%), Gaps = 52/300 (17%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           F A +     ADA I   G+   +E E +          DR  +LLP  Q EL+ K   A
Sbjct: 597 FNALVKKYSDADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELM-KALQA 655

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           S  P + V+M+G  +   +   +  I AI+ A Y GQA GTA+AD+LFG  NP G+LP+T
Sbjct: 656 SGKPVVFVMMTGSALATPWESEN--IPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVT 713

Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y  +   +LP  E  +M+         RTYR++ G  +Y FG+G+SYT F +     P 
Sbjct: 714 FYGSD--NDLPSFEDYSMK--------NRTYRYFTGKPLYGFGYGLSYTTFRYDQLTMPV 763

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPP 695
                            +GK +KVT            V V N G   G     ++     
Sbjct: 764 TAQ--------------NGKPVKVT------------VRVTNTGKTTGDEVAQIYVVNEN 797

Query: 696 AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTK 755
                  K L  F+++ +     + V   +     L+ VD  G R+   G+  I +GG++
Sbjct: 798 TSIQTALKTLKGFQRISLRPAESKMVSFVLQ-SDDLTYVDADGQRKPLTGKIQICVGGSQ 856


>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 950

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 225/761 (29%), Positives = 362/761 (47%), Gaps = 105/761 (13%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   GVP      +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEW 381

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  D++N     L    R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 442 INMEDLDNVCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA-----GIACGYTTPLQG-IGRYAR 450
           L+N+   LPLS     T+AV+GP +D      G+Y      G      T ++G +G+  +
Sbjct: 501 LENKENLLPLSKTLC-TIAVLGPGAD--DLQPGDYTPKLLPGQLKSVLTGIKGAVGKQTK 557

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAGL 501
            +++QGC D    D+     A+ A+ Q+D  I+V+G   + EA         E  D A L
Sbjct: 558 VLYEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATL 616

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
           +LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+A
Sbjct: 617 ILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAMA 673

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           D+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG G
Sbjct: 674 DVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGFG 726

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT+F ++                   N  I  KA            + VQ  VKNVGS
Sbjct: 727 LSYTSFEYS-------------------NLKIQEKA---------NGNVEVQATVKNVGS 758

Query: 682 KDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           + G     ++ T     +A  K    +L  F ++H+  G  + V   +     +S+++  
Sbjct: 759 RAGDEVAQLYVT---DMYASVKTRVMELKDFARIHLQPGESKTVSFEMTPYD-ISLLNDR 814

Query: 738 GTRRIPLGEHNIHIGGT----------KHSVSLHAATLGVI 768
             R +  GE  I +GG           KHSV       GV+
Sbjct: 815 MDRVVEKGEFKIMVGGMSPDYVAKNEIKHSVGYSDNKKGVM 855


>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 765

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 206/683 (30%), Positives = 325/683 (47%), Gaps = 97/683 (14%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  ++  S++N  L+  + R V+ E R+       G   +SP +++ RDPRWGR 
Sbjct: 122 GATVFPVPLSLGSTWNTELYREMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGD-RLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +E  GEDP L G++AA+ V GLQG   D    VAA+ KHF  Y      G      NA  
Sbjct: 177 EECFGEDPYLIGEFAAASVEGLQGESLDGEASVAATLKHFVGY------GSSEGGRNAGP 230

Query: 228 SKQDIEDTFDV---PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
                 +  +V   PF+  V E   AS+M +YN+++GVP   +  +L   +R EW  +G 
Sbjct: 231 VHMGTRELMEVDMYPFKKAV-EAGAASIMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGM 289

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
           +++DC ++ +            +AA  AI AG+D++  G   G++ E AVQ   L    +
Sbjct: 290 VITDCGAINMLAAGHDTAEDGMDAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVL 349

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+   LT++ +LG+F+  P + P      + +    H+E+A + A +GIVLLKN+G +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYADP--ARAEQVIGCSRHREMARQLAAEGIVLLKNEGST 406

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI-----GRYARTIHQQG 456
           LPLS      +AVIGPN+D     +G+Y          T L+GI     G   R ++  G
Sbjct: 407 LPLSK-EDGVIAVIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPG 465

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIE------------------------ 492
           C+ +     + F  A+  + QAD  +LV+G   + +                        
Sbjct: 466 CR-INGDSREGFELALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMD 524

Query: 493 -AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
             E +DR  L L G Q EL  ++    K   ++V ++G PI   +   D    AI+ A Y
Sbjct: 525 CGEGIDRMTLQLSGVQLELAREIHKLGK-RLVVVYINGRPIAEPWI--DRHADAILEAWY 581

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQ GG A+ADILFG  NP GKL ++  P+ ++  LP+          KR  G+ Y    
Sbjct: 582 PGQEGGHAVADILFGDVNPSGKLTIS-IPK-HVGQLPVY------YNGKRSRGKRYLEED 633

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
               YPFG+G+SYT F ++                   +  ++ + I+         T  
Sbjct: 634 SQPQYPFGYGLSYTEFRYS-------------------DLQVTPQTIRTGE------TAV 668

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI----- 725
           V V+V+N GS  GA  + ++    A  +  P K+L  F K+++  G +QR+   +     
Sbjct: 669 VTVNVENSGSVAGAEVVQLYINDAASRFTRPAKELKGFRKIYLEPGEKQRIEFTVGPEQL 728

Query: 726 -HVCKYLSVVDRSGTRRIPLGEH 747
            ++ +    V   G  R+ +G+H
Sbjct: 729 QYIGQNYQPVVEPGLFRVMVGKH 751


>gi|224583333|ref|YP_002637131.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467860|gb|ACN45690.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi C strain RKS4594]
          Length = 765

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 206/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + +H +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
 gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
          Length = 954

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 216/761 (28%), Positives = 362/761 (47%), Gaps = 107/761 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   G+P      +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 385

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  +++N     L+   R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 446 INMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
           L+N+   LPL+    RT+AV+GP +D           + G    +  G     + +G+  
Sbjct: 505 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 560

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
           + +++QGC D    D+     A+ A+ Q+D  ++V+G   + EA         E  D A 
Sbjct: 561 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 619

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NPGG+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG 
Sbjct: 677 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT+F ++                           +K+       +T  VQ  VKN+G
Sbjct: 730 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 761

Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           S+ G     ++ T     +A  K    +L  F+++++  G  + V   +     +S+++ 
Sbjct: 762 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 817

Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
              R +  GE  I +GG           KHSV       GV
Sbjct: 818 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDKKKGV 858


>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 877

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 166/450 (36%), Positives = 243/450 (54%), Gaps = 42/450 (9%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           AC   +A     PF    L + +RV++L+  L+LQEKV ++I+ + A+ RLGI  Y WW 
Sbjct: 8   AC--SEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWG 65

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
           EA HG+          G   G T FPQ I  A++F+ +       +VSDEARA YN    
Sbjct: 66  EACHGL--------IAG---GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPL 114

Query: 145 --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
                        LT+W+PN+NIFRDPRWGRGQET GEDP L  +   + V G+QG D  
Sbjct: 115 DGDIGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEH 174

Query: 197 RLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
             K  A  KH+  +     +G +  R  FNA VS +D+ +T+   F   V++G V  VMC
Sbjct: 175 YYKTHACAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMC 229

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADA 312
           +Y+   G P CA   +L   +R  W  +G +VSDCD++  +Y      + P    A+ADA
Sbjct: 230 AYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADA 289

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGH 371
           +  G DL+CG       E AV++G++ E D++ +L   LT + RLG+ D  P+   PY  
Sbjct: 290 VLTGTDLECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYST 346

Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           +    +   +H++ AL+ A +  VLLKN+G  LPL     +++AV+GPN + ++ M GNY
Sbjct: 347 IPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAVVGPNINDSIMMRGNY 405

Query: 432 AGIACGYTTPLQGIGRY---ARTIHQQGCK 458
           +G      T LQG+       R I ++G +
Sbjct: 406 SGSPTHCITILQGLKNKLPNTRIISERGSE 435



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 136/303 (44%), Gaps = 41/303 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           +F    +  +  +A I V G+  + E E  +RA + LP  Q+  +  +    K P I V 
Sbjct: 609 VFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYVN 667

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
            SG  I  A A  D    A++ A YPGQ GGTA+AD+LFG  NP GKLP+T+Y       
Sbjct: 668 CSGSAI--ALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKS--TEQ 723

Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           LP  T+ +M          RTYR++KG   Y FG+G+SYT+F                  
Sbjct: 724 LPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG--------------- 760

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                A +S  +IK          + + + + NVG  DGA  + V+        AP K L
Sbjct: 761 ----EALLSSSSIKAGEK------VEITIPLTNVGKMDGAEVVQVYVKSLTNPDAPIKSL 810

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSV-VDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
             + +  + AG  ++V I +    +     D  G +  P G++ I  G +   + L +  
Sbjct: 811 KGYVRQEIKAGKSEKVRITLEPESFAYYNADVDGLKVFP-GKYQILYGNSSRDMDLSSLD 869

Query: 765 LGV 767
           L V
Sbjct: 870 LEV 872


>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
 gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 169/465 (36%), Positives = 250/465 (53%), Gaps = 45/465 (9%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVK 67
             + L L +A+  G+A  EP+  +P             +L   +R  DL+GRL+L+EKV 
Sbjct: 5   LIITLCLSTATLVGMAQNEPYR-NP-------------NLSPEERAEDLLGRLTLKEKVT 50

Query: 68  LLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATL 127
           L+ + +  + RLGI  Y+WW+EALHGV+  G           AT FP  +  ASSF+   
Sbjct: 51  LMQNESFPIERLGIAHYDWWNEALHGVARAGI----------ATVFPITMGMASSFDDKA 100

Query: 128 WEAIGRVVSDEARAMYNGG---------TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 178
            E I   VSDE RA Y+             GLT+W+PN+NIFRDPRWGRGQET GEDP L
Sbjct: 101 IEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLTFWTPNINIFRDPRWGRGQETYGEDPYL 160

Query: 179 SGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIEDTF 236
           + +   + V+GLQG +D    K+ A  KH+  +         R  FNA+ +S +D+ +T+
Sbjct: 161 TSRMGVAVVKGLQGPADAKYDKLHACAKHYAVHSGPE---AKRHSFNAENISPRDLWETY 217

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VY 295
              F+  V E  V  VMC+YN+  G P C    +L + +R +W     +VSDC ++   +
Sbjct: 218 LPAFKTLVQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDDWGYKHVVVSDCGAISDFF 277

Query: 296 YDTQHFT-STPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
           Y  +H T     +A+A A+ +G DL+CG     H + AV+ GL+SE  IN +L   L  +
Sbjct: 278 YSDRHATHKNAADASAAAVLSGTDLECGIEYA-HLDKAVKEGLISEERINTSLFRLLKAR 336

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
             LG  D + +  P+  +    V    H+++AL+  R+ +VLL N G  LPL     R +
Sbjct: 337 FELGEMDDD-ALVPWSKISIDTVDCQMHKQMALDITRKSMVLLHNNG-VLPLGKTGAR-I 393

Query: 415 AVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-GRYARTIHQQGCK 458
           AV+GPN++ +V   GNY G      T L+GI  +    I+++GC+
Sbjct: 394 AVMGPNANDSVMQWGNYEGTPSHTVTVLEGIRNKIGNVIYEKGCE 438



 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 135/308 (43%), Gaps = 50/308 (16%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMAS 518
           A  +  ++AD  I V G+  ++E E             DR  + LP  Q+  + K    +
Sbjct: 591 AVAERVKEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRN-ILKALKKA 649

Query: 519 KGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTW 578
               I V  SG  +  A         AI+ A YPGQAGGTA+ADI+FG  NP GKLP+T+
Sbjct: 650 GKKVIFVNCSGSAM--ALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTF 707

Query: 579 YPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV 637
           Y       LP   + +M+        GRTYR+     ++PFG+G+SYT F  + A     
Sbjct: 708 YKN--TEQLPDFEDYSMK--------GRTYRYMTESPLFPFGYGLSYTTFQFSKAG---- 753

Query: 638 VAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAG 697
               L+ R  +IN  +  K                 V+VKN G +DG   + ++      
Sbjct: 754 ----LNKRVATINQPVQFK-----------------VNVKNTGKRDGTEVVQIYIRKADD 792

Query: 698 HWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHS 757
              P K L AF  V + AG    V I +    +      + T RI  G++ I  G + ++
Sbjct: 793 EEGPVKSLRAFRPVTLKAGKSATVNITLSPDAFEFFDPETNTMRIVPGDYEIMYGNSSNT 852

Query: 758 VSLHAATL 765
           +  +  +L
Sbjct: 853 LPENKLSL 860


>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 210/699 (30%), Positives = 325/699 (46%), Gaps = 108/699 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++ N+L + +R EW+  G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVS 318

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
              ++ VAVIGPN+D    M+G+Y          T L GI      +R  + +GC  +D 
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
              + +    AI+A+R+++                      A +   G    +E  E  D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           RA L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608

Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
            AIAD+LFG  NP G+LP++          +Y Q+   N    E++  P           
Sbjct: 609 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 657

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
                  +Y FG+GMSYT F +                        S   +    A+C  
Sbjct: 658 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 684

Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
               V   VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   + 
Sbjct: 685 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 742

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
              +  +V+ +  + +  G  ++ IG   + + L    L
Sbjct: 743 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780


>gi|194449077|ref|YP_002046219.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL476]
 gi|205359117|ref|ZP_02667180.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|386591978|ref|YP_006088378.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|417384271|ref|ZP_12149694.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|194407381|gb|ACF67600.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|205338504|gb|EDZ25268.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|353609401|gb|EHC62716.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|383799022|gb|AFH46104.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
          Length = 755

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 246/763 (32%), Positives = 348/763 (45%), Gaps = 126/763 (16%)

Query: 16  SASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA 75
           +A  +GLAA E  +C   D T     +   SLP   R+  L   + L++ +  L++ A A
Sbjct: 52  TACVAGLAAYE--SCPGLDGT-----YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPA 104

Query: 76  VPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVV 135
           V  + +  Y W ++  HGV     GT        AT +P   +  +S++  L   +G  +
Sbjct: 105 VDAVDLPAYNWLNDNEHGVK----GTAH------ATVYPMGASLGASWSVDLAWRVGAAI 154

Query: 136 SDEARAMYNG-----GTA--------------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
            +E+RA +NG     G A              G+T ++PNVN+ RDPRWGR +E  GEDP
Sbjct: 155 GNESRATHNGLADKSGNACGSTSTGEVVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDP 214

Query: 177 VLSGKYAASYVRGLQGS--------DGDRLKVAASCKHFTAYDLDNWNG---VDRFHFNA 225
            L+ + A   V GLQG+         G  L   A CKHF A+     N     DR   +A
Sbjct: 215 HLTAELAVGMVTGLQGNAEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPADRMVLDA 274

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            VS +D+ +T+    + CV+  K          VNG PTCA P +L   +R  W  +G++
Sbjct: 275 NVSSRDLWETYLPVMKACVVRAKA-------THVNGKPTCAHPELLNDVLRESWGFDGFV 327

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-----PFLGLHTESAVQRGLLSE 340
           VSD D+      T  + ST EEAAA  I AG+D + G     P   L    AV+ G ++ 
Sbjct: 328 VSDYDAWSNLVTTHKYVSTWEEAAAAGINAGMDQEGGFGDYSPVDAL--PDAVRNGTVAA 385

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL--EAARQGIVLLK 398
             +  +    + V++RLGMFD   S+  YG     D       +LAL  EAAR+GIVL K
Sbjct: 386 ATVRRSFERLMRVRLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFK 445

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIG--NYA---GIACGYTTPLQGIGRYARTIH 453
           N G +LPL+  +   +A++GP  D    ++G  NYA   G      T  +G+   A    
Sbjct: 446 NAGGALPLA--KGARIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANVSV 503

Query: 454 QQGCKDVACADDQLFGAAI--------------DASRQADATILVMGL-DQSIEAEALDR 498
             GC  VACA       A               D+    D   L  G  D   E+E+ DR
Sbjct: 504 AAGCDSVACAALVDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDR 563

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
           A + LPG Q  LV+ +  AS    + VL+ GG + +  A +D      +W   PGQ GG 
Sbjct: 564 ATIELPGEQVALVAALRAASS-RLVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGA 620

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLP----MTEMAMRPSQSKRYPGRTYRFYKGPV 614
           A+AD+LFG  +P G+ P+T Y     ++LP      E A   S      G TYR+Y GP 
Sbjct: 621 ALADVLFGDYSPAGRSPITMYAA--TSDLPPMGVFDEYAGESSN-----GTTYRYYAGPA 673

Query: 615 -VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
             Y FG G+SY +F +  A AP                         T   C  + L  +
Sbjct: 674 PTYAFGDGLSYASFSYAWAAAP-----------------------PTTVDACGAIRL--R 708

Query: 674 VDVKNVGSKDGAHTLLVFS-TPPAGHWAPHKQLVAFEKVHVPA 715
           V V N GS      + V++  P A   AP  +LVAF++V   A
Sbjct: 709 VAVTNTGSVASDEVVQVYARVPDATVPAPAIRLVAFDRVRAIA 751


>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
 gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 888

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 165/450 (36%), Positives = 243/450 (54%), Gaps = 42/450 (9%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWS 88
           AC   +A     PF    L + +RV++L+  L+LQEKV ++I+ + A+ RLGI  Y WW 
Sbjct: 19  AC--SEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWG 76

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN---- 144
           EA HG+          G   G T FPQ I  A++F+ +       +VSDEARA YN    
Sbjct: 77  EACHGL--------IAG---GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPL 125

Query: 145 --------GGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
                        LT+W+PN+NIFRDPRWGRGQET GEDP L  +   + V G+QG D  
Sbjct: 126 DGDIGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEH 185

Query: 197 RLKVAASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMC 254
             K  A  KH+  +     +G +  R  FNA VS +D+ +T+   F   V++G V  VMC
Sbjct: 186 YYKTHACAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMC 240

Query: 255 SYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADA 312
           +Y+   G P CA   +L   +R  W  +G +VSDCD++  +Y      + P    A+ADA
Sbjct: 241 AYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADA 300

Query: 313 IRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGH 371
           +  G DL+CG       E AV++G++ E D++ +L   LT + RLG+ D  P+   PY  
Sbjct: 301 VLTGTDLECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYST 357

Query: 372 LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           +    +   +H++ AL+ A +  VLLKN+G  LPL     +++A++GPN + ++ M GNY
Sbjct: 358 IPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAIVGPNINDSIMMRGNY 416

Query: 432 AGIACGYTTPLQGIGRY---ARTIHQQGCK 458
           +G      T LQG+       R I ++G +
Sbjct: 417 SGSPTHCITILQGLKNKLPNTRIISERGSE 446



 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 136/303 (44%), Gaps = 41/303 (13%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           +F    +  +  +A I V G+  + E E  +RA + LP  Q+  +  +    K P I V 
Sbjct: 620 VFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYVN 678

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
            SG  I  A A  D    A++ A YPGQ GGTA+AD+LFG  NP GKLP+T+Y       
Sbjct: 679 CSGSAI--ALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKS--TEQ 734

Query: 587 LP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGR 645
           LP  T+ +M          RTYR++KG   Y FG+G+SYT+F                  
Sbjct: 735 LPEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG--------------- 771

Query: 646 HGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQL 705
                A +S  +IK          + + + + NVG  DGA  + V+        AP K L
Sbjct: 772 ----EALLSSSSIKAGEK------VEITIPLTNVGKMDGAEVVQVYVKSLTNPDAPIKSL 821

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSV-VDRSGTRRIPLGEHNIHIGGTKHSVSLHAAT 764
             + +  + AG  ++V I +    +     D  G +  P G++ I  G +   + L +  
Sbjct: 822 KGYVRQEIKAGKSEKVRITLEPESFAYYNADVDGLKVFP-GKYQILYGNSSRDMDLRSLD 880

Query: 765 LGV 767
           L V
Sbjct: 881 LEV 883


>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
          Length = 954

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 216/761 (28%), Positives = 361/761 (47%), Gaps = 107/761 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 224

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 225 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 327

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   G+P      +L++ +R EW
Sbjct: 328 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 385

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 386 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 445

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
           +   +++N     L+   R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 446 IDMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 504

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
           L+N+   LPL+    RT+AV+GP +D           + G    +  G     + +G+  
Sbjct: 505 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 560

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
           + +++QGC D    D+     A+ A+ Q+D  ++V+G   + EA         E  D A 
Sbjct: 561 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 619

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+
Sbjct: 620 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 676

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NPGG+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG 
Sbjct: 677 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 729

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT+F ++                           +K+       +T  VQ  VKN+G
Sbjct: 730 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 761

Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           S+ G     ++ T     +A  K    +L  F+++++  G  + V   +     +S+++ 
Sbjct: 762 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 817

Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
              R +  GE  I +GG           KHSV       GV
Sbjct: 818 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDKKKGV 858


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 201/679 (29%), Positives = 314/679 (46%), Gaps = 82/679 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG V + EA A+      G T  +SP ++I +DPRWGR 
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP L G+     +  LQ     +  + A+ KHF  Y +       +   +  V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFRM   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V    +      T E+  A A+ AGL++         F+ L    AV  G +S+  +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDNGKISQETL 385

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +  +   L ++ RLG+FD      PY   G    + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFRLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
              LPLS    R++AVIGPN++    +I  Y        T  QGI     +   I+++GC
Sbjct: 441 TNLLPLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGC 499

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +                 + QL   AI A++QA+  ++V+G ++    E   R  L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+  V    K P ILV++ G    + +A     I AI+ A +PG+  G A+A+ 
Sbjct: 560 PGRQEELLKAVCATGK-PIILVMLDGRASSINYAA--AHIPAILHAWFPGEFCGQAVAEA 616

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y       +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT F ++  +    ++    G  G I            H  C          +KN G   
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703

Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G   + ++          + K L  FE++ + AG +Q V   +   + L + D++   R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762

Query: 743 PLGEHNIHIGGTKHSVSLH 761
            LG   + +G +   + LH
Sbjct: 763 ELGSFKVMLGASSTDIRLH 781


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 207/681 (30%), Positives = 333/681 (48%), Gaps = 74/681 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRGLQ +     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E+    PF+  + E  +  VM SYN  +G+P       L   +RGE    GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
            D+V   Y T+H T+   +EA   ++ AGL++ C       F+ L     V+ G LSE  
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           IN+ + + L V+  +G+FD    +   G    ++V   +++ +AL+A+R+ IVLLKN G 
Sbjct: 401 INDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASRESIVLLKNAGE 458

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
            LPL     + +AV GPN++     + +Y  +A   TT L+GI       A  ++ +GC 
Sbjct: 459 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCD 518

Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
                       D    DD+      A++ +RQAD  I+V+G  Q    E   R  L LP
Sbjct: 519 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLP 578

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ +L+  +    K P +L+L++G P+ + +A  D  + AI+ A YPG  GGTA+ADIL
Sbjct: 579 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 635

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
           FG  NPGGKL +T +P+  +  +P      +PS      + PG T    +    +YPFG+
Sbjct: 636 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT F ++  +    V  P +      +AT+    +KVT+         VQ+ +++V 
Sbjct: 693 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 743

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           S    +                K L  F+++H+  G  Q +   I   K+L ++D     
Sbjct: 744 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 787

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
            +  G+  +  G +   + L+
Sbjct: 788 VVEPGDFVLMAGASSEDIRLN 808


>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
 gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
          Length = 769

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 223/734 (30%), Positives = 343/734 (46%), Gaps = 119/734 (16%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLGI    +  E LHG++              ATS+P  I  A++FN  L E I   +++
Sbjct: 110 RLGIPVI-FHEECLHGLAA-----------KDATSYPVPIGLAATFNPELIEKIFSAIAE 157

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SDG 195
           +AR+   G    LT   P V++ RDPRWGR +ET GED  L  +   + V+GLQG  S  
Sbjct: 158 DARS--RGAHQALT---PVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQGDGSLN 212

Query: 196 DRLKVAASCKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKVASV 252
           +  KV A+ KHF A+      G      N   A  S++ + DTF +PF+  + +  V SV
Sbjct: 213 NNNKVIATLKHFAAH------GQPESGTNCAPANFSERFLRDTFLMPFKEAIDKAGVISV 266

Query: 253 MCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTS-----TPEE 307
           M SYN+++G+P+ A+  +L++ +R EW   G++VSD  ++   +  +   S        E
Sbjct: 267 MASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAANKVE 326

Query: 308 AAADAIRAGLDL-----DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDG 362
           AA  A+ AG+++     DC P L   TE  V+ GL  E DI+  ++  L  +  LG+FD 
Sbjct: 327 AAKLALEAGVNIEFPNPDCYPNL---TE-MVKGGLADESDIDALVLPMLKYKFELGLFD- 381

Query: 363 EPSSQPYGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPN 420
                PY    P        Q  ELAL+AAR+ I LLKN+G  LPL   +   +AVIGPN
Sbjct: 382 ----NPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLKDFKK--IAVIGPN 435

Query: 421 SDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCKDVA--------------C 462
           +D   T++G Y G    YT+  QGI    G+     + +GCK                  
Sbjct: 436 AD--RTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDPA 493

Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSM 516
            D++L   A+  ++++D  +LV+G ++    EA       DR  L L GRQ +LV ++  
Sbjct: 494 EDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEILK 553

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P +++L +G P  + F K++  + AI+   Y GQ  G A+AD+LFG  NP GKLP+
Sbjct: 554 TGK-PVVVLLFNGRPNSIGFIKDN--VPAILECWYLGQETGRAVADVLFGDYNPSGKLPV 610

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPG--RTYRFYKGPVVYPFGHGMSYTNFVHTVANA 634
           +         +P +   + P+     P   R Y F     ++ FG+G+SYT F       
Sbjct: 611 S---------IPRSAGHI-PAHYSHKPSARRGYLFDDVSPLFAFGYGLSYTKFSFD---- 656

Query: 635 PTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTP 694
                          N  +S   I           + V ++VKN G+  G   + ++   
Sbjct: 657 ---------------NLRLSKDTISADE------KVSVSIEVKNEGAIAGEEVVQLYIRD 695

Query: 695 PAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
                  P K+L  F K+ +  G    V   + + ++L+  +      +  GE  I +G 
Sbjct: 696 KVSSVTRPVKELKGFRKITLAPGQTSTVVFEL-LPEHLAFTNVDMKFTVEPGEFEIMVGN 754

Query: 754 TKHSVSLHAATLGV 767
           +     L    L V
Sbjct: 755 SSRDEDLIKTILTV 768


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 199/678 (29%), Positives = 321/678 (47%), Gaps = 82/678 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG   + EA  +      G T  +SP ++I +DPRWGR 
Sbjct: 171 ATYFPAQCGQGATWNKELIAQIGEAEAREASVL------GYTNIYSPILDIAQDPRWGRC 224

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP  +G+     +  LQ +     K+ ++ KHF  Y +       +   +  V+
Sbjct: 225 VETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVA 279

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFR+   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 280 PREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 339

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V          +  E+A A A+ AGL++         F+ L   SAV++G +S+  +
Sbjct: 340 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-LPLRSAVKKGKISQETL 398

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           N  +   L V+  LG+FD      PY        + V +P+HQ+LALEAARQ +VLLKN+
Sbjct: 399 NQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQLALEAARQSLVLLKNE 453

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
             +LPLS    R+VAVIGPN+D    +I  Y       TT  +GI +    A  ++++GC
Sbjct: 454 HQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGC 512

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +              A  + Q+   AI+A++ A+ T++V+G ++    E   R  L L
Sbjct: 513 DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDL 572

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+ K+    K P +LV++ G    + FA     + AII A +PG+ GG AIA+ 
Sbjct: 573 PGRQEELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAIIHAWFPGEFGGQAIAEA 629

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y       +YPFGHG+S
Sbjct: 630 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLS 681

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT F ++      V++    G  G+I+                     +   +KN+G ++
Sbjct: 682 YTTFQYS----DLVISPSKQGVQGNIS---------------------ISCTIKNIGQRE 716

Query: 684 GAHTLLVFSTPPAGHWAPHKQLV-AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G   + ++          + Q++  FE++ +   A   V   +   + L + D+     +
Sbjct: 717 GDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFEL-TPQELGIWDKQMNFTV 775

Query: 743 PLGEHNIHIGGTKHSVSL 760
             G   + IG +   + L
Sbjct: 776 EPGMFKVMIGSSSKDIRL 793


>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 950

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 216/761 (28%), Positives = 362/761 (47%), Gaps = 107/761 (14%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNVGP 99
           +   SLP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS---- 220

Query: 100 GTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNI 159
              +G    GAT FPQ +   +++N  L E +  V+ DE  A  N   A    WSP +++
Sbjct: 221 ---YGS---GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            +D RWGR +ET GEDPVL  +   ++++G Q        +  + KHF  +      G D
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG-----LFTTPKHFGGHGAP-LGGRD 323

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
                  +S++++ +   VPFR  +      S+M +Y+   G+P      +L++ +R EW
Sbjct: 324 SHDIG--LSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEW 381

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQRGL 337
             NG+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + G 
Sbjct: 382 GFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGR 441

Query: 338 LSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAARQGIVL 396
           ++  +++N     L+   R  +F+  P     +  + P    +  H+E+A +AAR+ IV+
Sbjct: 442 INMENLDNVCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVM 500

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGRYA 449
           L+N+   LPL+    RT+AV+GP +D           + G    +  G     + +G+  
Sbjct: 501 LENKENLLPLTK-NLRTIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---EAVGKQT 556

Query: 450 RTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDRAG 500
           + +++QGC D    D+     A+ A+ Q+D  ++V+G   + EA         E  D A 
Sbjct: 557 KVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWAT 615

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L+LPG+QQEL+  V    K P IL+L +G P D+  A    +   + W   PGQ GG A+
Sbjct: 616 LILPGKQQELLEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNW--LPGQEGGPAM 672

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGH 620
           AD+LFG  NPGG+LPMT +P+ ++  LP+       +  +RY      +Y    +Y FG 
Sbjct: 673 ADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYVDMEYYP---LYRFGF 725

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT+F ++                           +K+       +T  VQ  VKN+G
Sbjct: 726 GLSYTSFEYS--------------------------DLKIQEKPNGNVT--VQATVKNIG 757

Query: 681 SKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           S+ G     ++ T     +A  K    +L  F+++++  G  + V   +     +S+++ 
Sbjct: 758 SRAGDEVAQLYVT---DMYASVKTRVMELKDFDRIYLQPGESKTVSFELTPYD-ISLLND 813

Query: 737 SGTRRIPLGEHNIHIGGT----------KHSVSLHAATLGV 767
              R +  GE  I +GG           KHSV       GV
Sbjct: 814 HMDRVVEKGEFKICVGGMSPDYVAKNEIKHSVGYSDNKKGV 854


>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
 gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
          Length = 814

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 217/687 (31%), Positives = 330/687 (48%), Gaps = 91/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 168 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 222

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G   A+ VRG QG     R  V A+ KHF +Y    W         A +
Sbjct: 223 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 279

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   ++  W   G++VS
Sbjct: 280 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVS 338

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  ++G   +      +  EAA  A+ AG+D D G    ++ E   +AV++G ++   ++
Sbjct: 339 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 395

Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
            A+   L+++  +G+FD       +P      + V +P+H  LA E ARQ IVLLKN+  
Sbjct: 396 KAVRRILSLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 450

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
            LPL     RT+AVIGPN+D    M+G+Y          T L+GI     +  R ++ +G
Sbjct: 451 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKG 509

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
           C  V  +    F  AI+A+R AD  ++V+G     D S E                    
Sbjct: 510 CA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 568

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DRA L L GRQ EL+ +V    K P +LVL+ G P+ +     +    AI+ A YPG
Sbjct: 569 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 625

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
             GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G 
Sbjct: 626 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 678

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
             YPFG+G+SYT F +T                        G  ++V+  + N   + V 
Sbjct: 679 PRYPFGYGLSYTMFSYT------------------------GMKVRVSE-ESNHCRVDVS 713

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V V+N G+ DG   + ++     G +  P +QL AF +V + AG  + +   +   K L+
Sbjct: 714 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLD-KKSLA 772

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +  R G   +  G   +  GG+   ++
Sbjct: 773 LYMRDGEWAVEPGRFTVMAGGSSEDIA 799


>gi|161613146|ref|YP_001587111.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|416529744|ref|ZP_11744511.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|417466542|ref|ZP_12165045.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|161362510|gb|ABX66278.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|353629977|gb|EHC77663.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|363552147|gb|EHL36453.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
          Length = 755

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|375001990|ref|ZP_09726330.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|353076678|gb|EHB42438.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
          Length = 751

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 104 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 157

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 158 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 212

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 213 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 269

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 270 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 327

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 328 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 382

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 383 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 440

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 441 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 500

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 501 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 557

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 558 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 615

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 616 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 658

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 659 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 709

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 710 PIDIEALKF 718


>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 855

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 162/421 (38%), Positives = 245/421 (58%), Gaps = 35/421 (8%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  ++ PI  RV DL+ R++++EKV L+I  A  +PRL I  Y   +EALHG+  V PG 
Sbjct: 32  YKDMNAPIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYYHGNEALHGI--VRPG- 88

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN----------GGTAGLT 151
           KF       T FPQ I  A+S+N  L   I   +SDEAR  +N          G +  L+
Sbjct: 89  KF-------TVFPQAIGMAASWNPELIYKISTAISDEARGKWNALGLGKKQLDGSSDLLS 141

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYD 211
           +WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK  A+ KHF A +
Sbjct: 142 FWSPTVNMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQGNHPKYLKAVATPKHFAANN 201

Query: 212 LDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
            ++    +R H NA +S++D+ + +   F  C++EGK  S+M +YN VNG+P   +  ++
Sbjct: 202 EEH----NRAHCNAVISERDLREYYLPSFEKCIVEGKAQSIMTAYNAVNGIPCTVNTYLI 257

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCGPFLGLHTE 330
           K+ +R +W   GY+V+DC S   +  TQH +    E AA    +AG D++C     ++T+
Sbjct: 258 KKVLREDWGFQGYVVTDC-SAPAWMVTQHKYVKDYETAAVLMAKAGSDMECAD--NVYTQ 314

Query: 331 ---SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
              +A     +S+ DI++   + L  +M LG+FD +P   PY  + P+ V   +HQELAL
Sbjct: 315 PLLNAYYNYRVSDADIDSIAYHLLRGRMLLGLFD-DPEKNPYNKISPEKVGCKEHQELAL 373

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT-TPLQGIG 446
           E ARQ +VLLKN+   LP++  + +++AV+G N+D      G+Y+G     + T L+GI 
Sbjct: 374 ETARQSLVLLKNENNFLPINPKKIKSIAVVGINAD--RCEFGDYSGTPVNESVTVLEGIK 431

Query: 447 R 447
           R
Sbjct: 432 R 432



 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/300 (33%), Positives = 151/300 (50%), Gaps = 47/300 (15%)

Query: 466 QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILV 525
            LFG A  A+++ D T+ V+G+D+SIE E  DR  L LP  QQE + ++   +   T++V
Sbjct: 594 NLFGEAGKAAKECDITVAVLGIDKSIEREGQDRYTLELPADQQEFIREIYKINPK-TVVV 652

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
           L++G  I + +   D  I AII A YPG+ GGTA+A+ LFG  NPGG+LP+T+Y    + 
Sbjct: 653 LVAGSSIAINWI--DENIPAIIDAWYPGEQGGTAVAEALFGKYNPGGRLPLTFYNS--MD 708

Query: 586 NL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
            L P  + A++        GRTY+++ G  +Y FG+G+SYT F +   N    +A   D 
Sbjct: 709 ELPPFDDYAVK-------KGRTYQYFTGKPLYEFGYGLSYTKFNYRKLN----IASKQD- 756

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHK 703
                                   T+ +Q  + N G  DG     V+   P  G + P K
Sbjct: 757 ------------------------TINIQFSISNTGKYDGDEVAQVYVQYPETGTYMPIK 792

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI--PLGEHNIHIGGTKHSVSLH 761
           QL  F++VH+  G  Q V I+I   K L   D   TR+   P G +   +G +   ++L 
Sbjct: 793 QLKGFKRVHIKKGQTQNVSISIPK-KELRYWDEK-TRKFVTPSGNYIFQVGSSSQRINLQ 850


>gi|416539671|ref|ZP_11750078.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|416551719|ref|ZP_11756625.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|416566632|ref|ZP_11763924.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|417327454|ref|ZP_12112866.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|418868407|ref|ZP_13422850.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
 gi|353570549|gb|EHC34783.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|363559751|gb|EHL43903.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|363566134|gb|EHL50153.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|363579468|gb|EHL63250.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|392838313|gb|EJA93877.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
          Length = 765

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 863

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 173/454 (38%), Positives = 245/454 (53%), Gaps = 35/454 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP  +R   L+  L+L+EKV L++ GA  V RLGIK Y WW+EALHGV+  G   
Sbjct: 23  YKTTSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA  +   +        GLT W
Sbjct: 81  --------ATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGSYERYQGLTMW 132

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG +DG   K+ A  KHF  +  
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGRYDKLHACAKHFAVHSG 192

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 193 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSNRLL 249

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y+ +   + P  E A+A A+ +G DL+CG       
Sbjct: 250 MQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYKALV 309

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV+ G + E  ++ A+   LT +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 310 E-AVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTKIPFSVVASAKHDSLALDI 367

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+   LPL      TVAV+GPN++ +V   GNY G+     T L+G+    
Sbjct: 368 ARKSMTLLMNKNDILPLKQ-GGLTVAVMGPNANDSVMQWGNYNGMPSHTVTILEGVRKAL 426

Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQAD 479
           G   + I++QGC      D  L  +A +  + AD
Sbjct: 427 GADDKLIYEQGC---GWVDRSLIQSAFNLCKSAD 457



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 137/289 (47%), Gaps = 47/289 (16%)

Query: 482 ILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGP 531
           I V G+  S+E E +          DR  + LP  Q+E+++ +  + K   I V  SG P
Sbjct: 604 IFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQREMIAALHRSGK-KVIFVNCSGSP 662

Query: 532 IDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE 591
           I            AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y    ++ LP  E
Sbjct: 663 I--GLEPETKECEAILQAWYPGQAGGTAVAEVLFGKYNPAGRLPVTFYRN--VSQLPDFE 718

Query: 592 MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINA 651
                       GRTYR+     ++PFG+G+SYT F +          + LD     INA
Sbjct: 719 -------DYDMSGRTYRYMTQEPLFPFGYGLSYTAFEYG--------QMVLD--QNKINA 761

Query: 652 TISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKV 711
              G+ +++T            V V N G +DG   + V+         P+K L AF++V
Sbjct: 762 ---GQTLELT------------VPVTNAGKRDGEEVVQVYLRKRDDTDGPNKTLRAFKRV 806

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           ++PAG    V   +   +       + T R+  G+++I +GG+  +  L
Sbjct: 807 NIPAGKTVNVDFELKDKELEWWDTATNTMRVCAGDYDIMVGGSSQTEDL 855


>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
 gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
          Length = 246

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 126/229 (55%), Positives = 162/229 (70%), Gaps = 15/229 (6%)

Query: 12  LILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLIS 71
           LI + A S   A + PF+CD  + +T++  FCQ SLPI QR  DL+ RL+L EK+  L+ 
Sbjct: 18  LIFIFADS---APQPPFSCDYSNPSTKSYLFCQTSLPISQRAKDLVSRLTLDEKISQLVD 74

Query: 72  GAAAVPRLGIKGYEWWSEALHGVSNV-----GPGTKFGGDFPGATSFPQVITTASSFNAT 126
            A ++PRLG+  Y WWSEALHGV+ +       G  F G    ATSFPQVI TA+SF+  
Sbjct: 75  TAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIRSATSFPQVIITAASFDTK 134

Query: 127 LWEAIGRVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           LW  IG+V   EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+++GKYA S
Sbjct: 135 LWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPMVTGKYAVS 194

Query: 186 YVRGLQGSD------GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +VRG+QG        G++L+ +A CKHFTAYDL+ WNG +RF FNA+VS
Sbjct: 195 FVRGVQGDSFEGGVLGEKLQASACCKHFTAYDLEKWNGTNRFIFNAQVS 243


>gi|419728314|ref|ZP_14255280.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419736497|ref|ZP_14263337.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419741060|ref|ZP_14267772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419745173|ref|ZP_14271816.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419746816|ref|ZP_14273391.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|421574444|ref|ZP_16020065.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|421579948|ref|ZP_16025510.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|421585722|ref|ZP_16031214.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|381291806|gb|EIC33035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381293004|gb|EIC34177.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381299851|gb|EIC40919.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381306597|gb|EIC47470.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|381320690|gb|EIC61232.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|402521423|gb|EJW28761.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|402524644|gb|EJW31941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|402529572|gb|EJW36805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
          Length = 765

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|334124834|ref|ZP_08498829.1| periplasmic beta-glucosidase [Enterobacter hormaechei ATCC 49162]
 gi|333387905|gb|EGK59096.1| periplasmic beta-glucosidase [Enterobacter hormaechei ATCC 49162]
          Length = 771

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 212/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 124 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 177

Query: 170 ETPGEDPVLSGKYAASYVRGLQ-GSDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +Q  S  DR  V  S KHF AY        +N VD     
Sbjct: 178 EGFGEDTFLTATMGKTMVEAMQRKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 232

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 233 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 289

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 290 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 347

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 348 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPTDTNAESRLHRKEAREVARESLV 402

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 403 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 460

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 461 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 520

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 521 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 577

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 578 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 635

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R GS+ A+            
Sbjct: 636 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPTMKRDGSVTAS------------ 678

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    VDV N G ++GA  + ++     A    P KQL  FEKV++  G  + V  
Sbjct: 679 ---------VDVTNSGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETRTVSF 729

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++        G+ N+ IG
Sbjct: 730 PIDVNA-LKFWNQQMKYDAEPGKFNVFIG 757


>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
           17393]
 gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 157/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +TLP+    L   +R  DL+ RL+L+EK  L+   + A+PRLGIK + WWSEALHGV+N 
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
           G            T FP+ +  A+SFN  L   I   VSDE RA +N             
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
            L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + V+GLQG + ++ +   +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    W+   R   N   VS +D+ +T+   F+  V +  V  VMC+Y +++  P C 
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
           +  +L++ +R EW     +VSDC ++  ++ +   +S    AA     AG D++CG  + 
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 AV RGL++E ++N  ++  +  +  LG  D +PS   +  +    V    H++L
Sbjct: 306 YQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +L  +RQ + LL+N+   LPLS    R +AVIGPN+D    + GNY G      T L G 
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423

Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
               +  + ++ +GC  V   +DQ   + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 129/315 (40%), Gaps = 62/315 (19%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +  +I   +  D  + V G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  +  A         AI+ A Y G+ GG A+AD+LFG  NP GKLP+T
Sbjct: 650 GK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706

Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP  E  +M+        GRTYR+   P ++PFG G+SYT+F         
Sbjct: 707 FYKN--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDFA-------- 747

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRL------TLGVQVDVKNVGSKDGAHTLLV 690
                                  +  A CN+       +L + V V N G + G   + V
Sbjct: 748 -----------------------IGTASCNKTQLRTDESLTLTVPVSNTGKRSGTEIVQV 784

Query: 691 FSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
           +         P K L A+ +V +  GAQQ V I +    +      + T R+  GE+ + 
Sbjct: 785 YIRKTDDADGPLKSLKAYARVELATGAQQDVKIELPSESFECFDPSTNTMRVAPGEYELF 844

Query: 751 IGGTKHSVSLHAATL 765
            G +  +  L +  +
Sbjct: 845 YGTSSAAQDLQSVKV 859


>gi|378984729|ref|YP_005247884.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
 gi|312913157|dbj|BAJ37131.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
          Length = 765

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQE 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|198245676|ref|YP_002216250.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|375119733|ref|ZP_09764900.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
 gi|197940192|gb|ACH77525.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326624000|gb|EGE30345.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
          Length = 755

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADMVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|194736410|ref|YP_002115255.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. CVM19633]
 gi|197300833|ref|ZP_02661390.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204929066|ref|ZP_03220209.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205360240|ref|ZP_02681470.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|416491454|ref|ZP_11727088.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|416496191|ref|ZP_11729048.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|416586610|ref|ZP_11775622.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|416674556|ref|ZP_11821234.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|417359373|ref|ZP_12133763.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|417391848|ref|ZP_12154881.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|417476332|ref|ZP_12170882.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|417511967|ref|ZP_12176427.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
 gi|194711912|gb|ACF91133.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|197290594|gb|EDY29949.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204321610|gb|EDZ06809.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205350836|gb|EDZ37467.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|322649375|gb|EFY45811.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|322656495|gb|EFY52784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|322667165|gb|EFY63332.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|323222876|gb|EGA07232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|353589745|gb|EHC48461.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|353614040|gb|EHC65987.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|353641236|gb|EHC86018.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|353641325|gb|EHC86084.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
          Length = 755

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
          Length = 735

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 220/754 (29%), Positives = 352/754 (46%), Gaps = 97/754 (12%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLLI--------------SGAAAVP-RLGIKGYEWWSEALH 92
           PI +RV+DL+ R++L+EKV  L                    VP  +G   Y   +  L 
Sbjct: 36  PIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFETNPELR 95

Query: 93  G--------VSNVGPGTKFGGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAM 142
                     S +G    FG D      T +P  +  A S+N  L E    V + EAR  
Sbjct: 96  NNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSAQEAR-- 153

Query: 143 YNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKV 200
                +G+ + +SP +++ RDPRWGR  E  GEDP  +G + A+ VRG QG +     +V
Sbjct: 154 ----MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQGDNMSAENRV 209

Query: 201 AASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVN 260
           AA  KH+  Y         R +   ++SKQ + DT+ +P++M V  G  A++M S+N ++
Sbjct: 210 AACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYKMGVKAG-AATLMSSFNDIS 265

Query: 261 GVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD 320
           GVP  A+P  +   ++  WR +G+IVSD  ++      Q   +T +EAA  A  AGL++D
Sbjct: 266 GVPGSANPYTMTEILKNRWRHDGFIVSDWGAIE-QLKNQGLAATKKEAARHAFTAGLEMD 324

Query: 321 CGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--V 377
                   H +  V+ G +S   ++ A+   L ++ RLG+F+     +PY  +  +    
Sbjct: 325 MMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLFE-----RPYTPVTTEKERF 379

Query: 378 CTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG---- 433
             P   ++A   A + +VLLKN+   LPL+    + +AVIGP +     ++G++ G    
Sbjct: 380 LRPQSMDIAARLAAESMVLLKNENNVLPLAD--KKKIAVIGPMAKNGWDLLGSWRGHGKD 437

Query: 434 --IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSI 491
             +   Y          A   +  GC +    + + F  A+ A+R +D  +L +G   + 
Sbjct: 438 TDVVMLYDGLAAEFAGKAELRYALGC-NTKGDNREGFAEALGAARWSDVVVLCLGEMMTW 496

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
             E   R+ + LP  Q+EL  ++    K P +L+L++G P+++   + +P   AI+    
Sbjct: 497 SGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPLEL--NRLEPVSDAILEIWQ 553

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FY 610
           PG  G   +A IL G  NP GKL MT          P +   +    ++R  GR ++ FY
Sbjct: 554 PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFY 604

Query: 611 KGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
           K      +YPFGHG+SYT F +                      T++  A KV   +   
Sbjct: 605 KDMTSDPLYPFGHGLSYTEFKY---------------------GTVTPSATKVKRGE--- 640

Query: 668 LTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
             L  +V V N+G++DGA T+  F S P      P K+L  FEK  + AG  +    +I 
Sbjct: 641 -KLSAEVTVTNIGARDGAETVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDID 699

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           + +    V+  G R +  GE+NIH+ G    + L
Sbjct: 700 LERDFGFVNEDGKRFLETGEYNIHVLGQTVKIEL 733


>gi|16765496|ref|NP_461111.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|378445601|ref|YP_005233233.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|378450850|ref|YP_005238209.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|378700080|ref|YP_005182037.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. SL1344]
 gi|378989553|ref|YP_005252717.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|379701394|ref|YP_005243122.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|422026439|ref|ZP_16372832.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422031463|ref|ZP_16377632.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427551058|ref|ZP_18928136.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427567333|ref|ZP_18932851.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427587543|ref|ZP_18937641.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427611175|ref|ZP_18942507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427634848|ref|ZP_18947401.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427656473|ref|ZP_18952166.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427661625|ref|ZP_18957079.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427670100|ref|ZP_18961878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|20141276|sp|Q56078.2|BGLX_SALTY RecName: Full=Periplasmic beta-glucosidase; AltName:
           Full=Beta-D-glucoside glucohydrolase; AltName:
           Full=Cellobiase; AltName: Full=Gentiobiase; AltName:
           Full=T-cell inhibitor; Flags: Precursor
 gi|16420702|gb|AAL21070.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|261247380|emb|CBG25205.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. D23580]
 gi|267994228|gb|ACY89113.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|301158728|emb|CBW18240.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|323130493|gb|ADX17923.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|332989100|gb|AEF08083.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|414017758|gb|EKT01456.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414018535|gb|EKT02182.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414020221|gb|EKT03810.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414032204|gb|EKT15215.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414033625|gb|EKT16574.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414036785|gb|EKT19597.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414046773|gb|EKT29088.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414047776|gb|EKT30042.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414052520|gb|EKT34556.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414059327|gb|EKT40912.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
          Length = 765

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|417519247|ref|ZP_12181440.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
 gi|353647113|gb|EHC90324.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
          Length = 755

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEVNGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|416424053|ref|ZP_11691321.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|416434178|ref|ZP_11697512.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|416438422|ref|ZP_11699509.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|416447728|ref|ZP_11705981.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|416450525|ref|ZP_11707600.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|416460485|ref|ZP_11714793.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|416470222|ref|ZP_11718747.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|416476726|ref|ZP_11721214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|416508101|ref|ZP_11735884.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|416513698|ref|ZP_11738019.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|416540687|ref|ZP_11750492.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|416561671|ref|ZP_11761668.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|416575337|ref|ZP_11768369.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|416594582|ref|ZP_11780414.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|416602160|ref|ZP_11785217.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|416607231|ref|ZP_11788413.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|416611818|ref|ZP_11791047.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|416623896|ref|ZP_11797678.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|416630098|ref|ZP_11800505.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|416636573|ref|ZP_11803136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|416652084|ref|ZP_11811486.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|416655624|ref|ZP_11812629.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|416669115|ref|ZP_11819140.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|416700997|ref|ZP_11829262.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|416705382|ref|ZP_11830863.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|416713638|ref|ZP_11837193.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|416720123|ref|ZP_11841889.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|416721559|ref|ZP_11842724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|416729877|ref|ZP_11848338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|416738151|ref|ZP_11853179.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|416744242|ref|ZP_11856524.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|416759732|ref|ZP_11864557.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|416760387|ref|ZP_11864780.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|416767537|ref|ZP_11869997.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|418484908|ref|ZP_13053898.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|418492113|ref|ZP_13058613.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|418496941|ref|ZP_13063366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|418500305|ref|ZP_13066703.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|418503319|ref|ZP_13069684.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|418509304|ref|ZP_13075600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|418512924|ref|ZP_13079159.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|418527981|ref|ZP_13093934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|452119629|ref|YP_007469877.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
 gi|322615194|gb|EFY12116.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|322617785|gb|EFY14681.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|322624621|gb|EFY21452.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|322626928|gb|EFY23724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|322634114|gb|EFY30850.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|322635695|gb|EFY32405.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|322640170|gb|EFY36834.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|322646407|gb|EFY42919.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322661471|gb|EFY57695.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|322665641|gb|EFY61825.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|322671171|gb|EFY67299.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|322675188|gb|EFY71265.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|322680797|gb|EFY76832.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|322686972|gb|EFY82949.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|323192736|gb|EFZ77963.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|323198832|gb|EFZ83931.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|323205158|gb|EFZ90136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|323213417|gb|EFZ98214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|323215774|gb|EGA00517.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|323223860|gb|EGA08163.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|323231220|gb|EGA15335.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|323233654|gb|EGA17746.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|323237724|gb|EGA21784.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|323245707|gb|EGA29701.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|323248929|gb|EGA32853.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|323253000|gb|EGA36833.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|323258676|gb|EGA42338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|323259508|gb|EGA43143.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|323268291|gb|EGA51766.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|323271920|gb|EGA55336.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|363552716|gb|EHL36999.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|363567278|gb|EHL51278.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|363573864|gb|EHL57738.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|366055944|gb|EHN20277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|366057212|gb|EHN21516.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|366058491|gb|EHN22779.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|366069158|gb|EHN33284.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|366073161|gb|EHN37236.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|366078494|gb|EHN42495.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|366082861|gb|EHN46791.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|366827184|gb|EHN54093.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|372204105|gb|EHP17636.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|451908633|gb|AGF80439.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
          Length = 765

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|421883572|ref|ZP_16314801.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
 gi|379986798|emb|CCF87074.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
          Length = 765

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
 gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
          Length = 774

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 218/693 (31%), Positives = 342/693 (49%), Gaps = 85/693 (12%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQE 170
           TSFPQ I   ++FN  L E +  + + E R    GG   LT   P V++ RD RWGR +E
Sbjct: 133 TSFPQPIGLGATFNPALVEKLYAMTAHETRV--RGGHQALT---PVVDVARDARWGRVEE 187

Query: 171 TPGEDPVLSGKYAASYVRGLQG--SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           T GEDP L+ +   + VRG QG  S  D+  V A+ KHF A+     +G +    N  VS
Sbjct: 188 TYGEDPFLNTQLGIAAVRGFQGDASFKDKKHVIATLKHFAAHGQPE-SGQNCAPVN--VS 244

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
           ++ + +TF  PFR C+ +G   SVM SYN+++GVP+ A   +L+  +R EW   G++VSD
Sbjct: 245 ERLLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSD 304

Query: 289 CDSVGVYYDT-----QHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLSEID 342
             ++            H  +  +EA   A++AG++++   P    H    V++ +L E +
Sbjct: 305 YYAIWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDCYRHLVELVRKKVLHETE 364

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           ++  +   L  + ++G+FD +P   P      + V    H+ELA EAAR+ I LLKN+  
Sbjct: 365 LDELIAPMLLWKFKMGLFD-DPYVDP--EEAARVVGCEVHRELASEAARETITLLKNEND 421

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK 458
            LPL+  + +TVAVIGPN++   +++G Y+G+     T L GI    G   + +H +GCK
Sbjct: 422 LLPLNPAKLKTVAVIGPNAN--RSLLGGYSGVPAHNVTVLDGIKARLGGAVKVVHAEGCK 479

Query: 459 ---------DVACADD-----QLFGAAIDASRQADATILVMGLDQSIEAEAL------DR 498
                    D   A D     +    A+  +  AD  I+ +G ++    EA       DR
Sbjct: 480 ITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKHMGDR 539

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA-FAKNDPRIAAIIWAGYPGQAGG 557
             L L G Q EL+ +  +A+  P + ++ +G P+ +   A+N P   AI+   Y GQ  G
Sbjct: 540 TSLDLIGHQDELI-RALLATGKPVVALVFNGRPLAINHVAQNVP---AILECWYLGQECG 595

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
           +A+A +LFG  NPGGKLP++  P+  +  LP+     +PS       R + + +   ++P
Sbjct: 596 SAVAAVLFGDHNPGGKLPIS-IPRS-VGQLPVF-YNHKPSAR-----RGFLWDEATPLFP 647

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG G+SYT F  T  N    +A  +  R GS            TH         V VDV 
Sbjct: 648 FGFGLSYTKF--TFKN--VRLAKKIISRTGS------------TH---------VSVDVT 682

Query: 678 NVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           N G + G   + V+          P K+L  F+K+ +  G  + V +++   + L+  D 
Sbjct: 683 NAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSLDL-TPESLAFYDV 741

Query: 737 SGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIK 769
           +    +  GE  I +G +   V L    L V+K
Sbjct: 742 NMKYVVEPGEFEIMVGNSSRDVDLQKVVLRVVK 774


>gi|378954476|ref|YP_005211963.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Gallinarum/pullorum str. RKS5078]
 gi|357205087|gb|AET53133.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum/pullorum str. RKS5078]
          Length = 755

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 444

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               +H++  K    +   +   A+ A++QA+  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLHEEAVKIDPRSPQAMIDEAVQAAKQANVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|205357644|ref|ZP_02572188.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|374981198|ref|ZP_09722528.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|383496875|ref|YP_005397564.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 798]
 gi|205330573|gb|EDZ17337.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|321224818|gb|EFX49881.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|380463696|gb|AFD59099.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
          Length = 755

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|238912609|ref|ZP_04656446.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Tennessee str. CDC07-0191]
          Length = 765

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|445146604|ref|ZP_21387818.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|445154484|ref|ZP_21391811.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
 gi|444845501|gb|ELX70711.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|444850729|gb|ELX75826.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
          Length = 765

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADMVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|194446334|ref|YP_002041437.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Newport str. SL254]
 gi|194469038|ref|ZP_03075022.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195874002|ref|ZP_02700028.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205358223|ref|ZP_02654575.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205359498|ref|ZP_02830128.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|417374418|ref|ZP_12144175.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|418765082|ref|ZP_13321175.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|418772136|ref|ZP_13328140.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|418825678|ref|ZP_13380951.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
 gi|194404997|gb|ACF65219.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|194455402|gb|EDX44241.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195631424|gb|EDX49984.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205335699|gb|EDZ22463.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205344851|gb|EDZ31615.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|353599534|gb|EHC55680.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|392732250|gb|EIZ89461.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|392742031|gb|EIZ99126.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|392812566|gb|EJA68550.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
          Length = 755

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|197248682|ref|YP_002147137.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Agona
           str. SL483]
 gi|440763077|ref|ZP_20942125.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440769230|ref|ZP_20948190.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440771374|ref|ZP_20950293.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|197212385|gb|ACH49782.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|436415218|gb|ELP13139.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436421310|gb|ELP19157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|436422409|gb|ELP20247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
          Length = 765

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 200/679 (29%), Positives = 323/679 (47%), Gaps = 84/679 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG   + EA  +      G T  +SP ++I +DPRWGR 
Sbjct: 168 ATYFPAQCGQGATWNKELIAQIGEAEAREASVL------GYTNIYSPILDIAQDPRWGRC 221

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP  +G+     +  LQ +     K+ ++ KHF  Y +       +   +  V+
Sbjct: 222 VETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVA 276

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFR+   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 277 PREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 336

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V          +  E+A A A+ AGL++         F+ L   SAV++G +S+  +
Sbjct: 337 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-LPLRSAVKKGKISQETL 395

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           N  +   L V+  LG+FD      PY        + V +P+HQ+LALEAARQ +VLLKN+
Sbjct: 396 NQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQLALEAARQSLVLLKNE 450

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGC 457
             +LPLS    R+VAVIGPN+D    +I  Y       TT  +GI +    A  ++++GC
Sbjct: 451 HQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGC 509

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +              A  + Q+   AI+A++ A+ T++V+G ++    E   R  L L
Sbjct: 510 DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDL 569

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+ K+    K P +LV++ G    + FA     + AII A +PG+ GG AIA+ 
Sbjct: 570 PGRQKELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAIIHAWFPGEFGGQAIAEA 626

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y       +YPFGHG+S
Sbjct: 627 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSETSVY-----GALYPFGHGLS 678

Query: 624 YTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           YT F ++ +A +P+       G  G+I+                     +   +KN+G +
Sbjct: 679 YTTFQYSDLAISPSK-----QGVQGNIS---------------------ISCTIKNIGQR 712

Query: 683 DGAHTLLVFSTPPAGHWAPHKQLV-AFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
           +G   + ++          + Q++  FE++ +   A   V   +   + L + D+     
Sbjct: 713 EGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFEL-TPQELGIWDKQMNFT 771

Query: 742 IPLGEHNIHIGGTKHSVSL 760
           +  G   + IG +   + L
Sbjct: 772 VEPGMFKVMIGSSSKDIRL 790


>gi|429092960|ref|ZP_19155568.1| Periplasmic beta-glucosidase [Cronobacter dublinensis 1210]
 gi|426742256|emb|CCJ81681.1| Periplasmic beta-glucosidase [Cronobacter dublinensis 1210]
          Length = 759

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 201/664 (30%), Positives = 323/664 (48%), Gaps = 101/664 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 112 TIFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGS 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 166 EGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 220

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 221 --MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASDGWLLKDLLRDDWGFKGI 277

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T++ PE+A   AI++G+D+     +   +    ++ G ++  +
Sbjct: 278 TISDHGAIKEL--IKHGTASGPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKSGKVTMAE 335

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A   AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPKDTNAESRLHRDDARRVARESLV 390

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    M+G++  AG+A    T LQG+    G  A
Sbjct: 391 LLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVADQAVTLLQGMKNVAGDKA 448

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A++ ++Q+D  + V+G  Q 
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 508

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L +P  Q++L+S +    K P +LVLM+G P+  A  K D +  A++   
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAMLETW 565

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 623

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  +V++    ++ P   R G + A+++          
Sbjct: 624 RYFDEANGP-LYPFGYGLSYTTF--SVSDVK--LSAPTLKRDGKVTASVT---------- 668

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                      V N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 669 -----------VTNTGKREGATAVQMYLQDVTASMSRPVKQLRGFEKVTLKPGESQTVSF 717

Query: 724 NIHV 727
            I V
Sbjct: 718 PIDV 721


>gi|62180752|ref|YP_217169.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|375115081|ref|ZP_09760251.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
 gi|62128385|gb|AAX66088.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|322715227|gb|EFZ06798.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
          Length = 765

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNQDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +   +AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGAIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + +H +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|437730260|ref|ZP_20831186.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
 gi|435289521|gb|ELO66481.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
          Length = 764

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
 gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 209/699 (29%), Positives = 325/699 (46%), Gaps = 108/699 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 203 EETFGEDPVLSGTLGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++  +L + +R EW+  G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
            + ++ VAVIGPN+D    M+G+Y          T L GI      +R  + +GC  +D 
Sbjct: 436 KMINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIEA-EALD 497
              + +    AI+A+R+++                      A +   G    +E  E  D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           RA L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608

Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
            AIAD+LFG  NP G+LP++          +Y Q+   N    E++  P           
Sbjct: 609 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 657

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
                  +Y FG+GMSYT F +                        S   +    A+C  
Sbjct: 658 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 684

Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
               V   VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   + 
Sbjct: 685 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 742

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
              +  +V+ +  + +  G  ++ IG   + + L    L
Sbjct: 743 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780


>gi|409250776|ref|YP_006886584.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|418760521|ref|ZP_13316675.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|418775352|ref|ZP_13331310.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|418781475|ref|ZP_13337358.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|418783741|ref|ZP_13339586.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|418789285|ref|ZP_13345072.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|418794086|ref|ZP_13349808.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|418796683|ref|ZP_13352374.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|418803316|ref|ZP_13358937.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|418809719|ref|ZP_13365271.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|418813873|ref|ZP_13369394.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|418815139|ref|ZP_13370647.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|418822928|ref|ZP_13378339.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|418830026|ref|ZP_13384989.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|418836408|ref|ZP_13391292.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|418840973|ref|ZP_13395796.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|418844732|ref|ZP_13399518.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|418850848|ref|ZP_13405564.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|418857905|ref|ZP_13412528.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|418862200|ref|ZP_13416744.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
 gi|419789701|ref|ZP_14315381.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419792237|ref|ZP_14317879.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|320086604|emb|CBY96375.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|392615486|gb|EIW97925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392618961|gb|EIX01347.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392742601|gb|EIZ99688.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|392747243|gb|EJA04244.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|392748425|gb|EJA05411.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|392756753|gb|EJA13648.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|392760526|gb|EJA17361.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|392762081|gb|EJA18897.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|392770056|gb|EJA26784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|392772845|gb|EJA29542.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|392773804|gb|EJA30500.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|392775099|gb|EJA31794.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|392787185|gb|EJA43733.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|392793518|gb|EJA49962.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|392800769|gb|EJA56999.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|392802259|gb|EJA58473.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|392809639|gb|EJA65673.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|392813541|gb|EJA69505.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|392818698|gb|EJA74582.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|392834583|gb|EJA90187.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|392836652|gb|EJA92232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
          Length = 765

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|197263061|ref|ZP_03163135.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|207857601|ref|YP_002244252.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. P125109]
 gi|421359420|ref|ZP_15809713.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421364010|ref|ZP_15814248.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421367050|ref|ZP_15817252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421373030|ref|ZP_15823175.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421375395|ref|ZP_15825508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421382177|ref|ZP_15832228.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421386891|ref|ZP_15836897.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421391230|ref|ZP_15841201.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421395600|ref|ZP_15845536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421397961|ref|ZP_15847870.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421402484|ref|ZP_15852342.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421409068|ref|ZP_15858863.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421411752|ref|ZP_15861516.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418166|ref|ZP_15867872.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421422724|ref|ZP_15872392.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421424918|ref|ZP_15874555.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421433210|ref|ZP_15882778.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421434152|ref|ZP_15883702.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421441895|ref|ZP_15891355.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421443218|ref|ZP_15892660.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|436600421|ref|ZP_20512957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436686654|ref|ZP_20517957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436802328|ref|ZP_20525319.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436809476|ref|ZP_20528856.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436814772|ref|ZP_20532323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436844194|ref|ZP_20537952.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436854476|ref|ZP_20544110.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436855800|ref|ZP_20544925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436864301|ref|ZP_20550268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436870232|ref|ZP_20554038.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436877723|ref|ZP_20558651.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436886662|ref|ZP_20563082.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436893559|ref|ZP_20567466.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436901307|ref|ZP_20572217.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436912656|ref|ZP_20578485.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436920338|ref|ZP_20582934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436926675|ref|ZP_20586501.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436936605|ref|ZP_20592045.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436940618|ref|ZP_20594562.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436950717|ref|ZP_20599772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436961959|ref|ZP_20605333.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436969147|ref|ZP_20608268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436976823|ref|ZP_20612073.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436989589|ref|ZP_20616596.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437001001|ref|ZP_20620797.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437022365|ref|ZP_20628334.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437036112|ref|ZP_20633844.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437044148|ref|ZP_20637101.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437052068|ref|ZP_20641628.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437059125|ref|ZP_20645972.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437064832|ref|ZP_20648606.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437077112|ref|ZP_20655320.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437083638|ref|ZP_20659292.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437092016|ref|ZP_20663616.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437113855|ref|ZP_20669057.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437122048|ref|ZP_20672085.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437128670|ref|ZP_20675357.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437139171|ref|ZP_20681653.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437144157|ref|ZP_20684771.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437151251|ref|ZP_20689128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437162727|ref|ZP_20696289.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437169975|ref|ZP_20700070.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437179039|ref|ZP_20705157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437181201|ref|ZP_20706372.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437239027|ref|ZP_20714239.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437261953|ref|ZP_20718699.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437267405|ref|ZP_20721157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437280676|ref|ZP_20728053.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437289916|ref|ZP_20731294.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437311898|ref|ZP_20736006.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437330435|ref|ZP_20741599.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437346849|ref|ZP_20747003.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437411031|ref|ZP_20752807.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437442646|ref|ZP_20757878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437461006|ref|ZP_20761959.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437479748|ref|ZP_20768095.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437494385|ref|ZP_20772414.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437505366|ref|ZP_20775420.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437535206|ref|ZP_20781440.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437559607|ref|ZP_20785823.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437572273|ref|ZP_20789035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437583575|ref|ZP_20792569.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437599978|ref|ZP_20797137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437616848|ref|ZP_20802600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437631373|ref|ZP_20806367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437662149|ref|ZP_20813366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437676329|ref|ZP_20816941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437695929|ref|ZP_20822252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437711846|ref|ZP_20826864.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437813102|ref|ZP_20841687.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|438082703|ref|ZP_20857889.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438099351|ref|ZP_20863367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438109484|ref|ZP_20867447.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|445165173|ref|ZP_21394056.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445182314|ref|ZP_21398520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445229380|ref|ZP_21405024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445333762|ref|ZP_21414961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445344304|ref|ZP_21417576.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445361577|ref|ZP_21423869.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|197241316|gb|EDY23936.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|206709404|emb|CAR33745.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Enteritidis str. P125109]
 gi|395985043|gb|EJH94216.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395985485|gb|EJH94655.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395989705|gb|EJH98839.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395998658|gb|EJI07685.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|395999277|gb|EJI08299.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396005381|gb|EJI14360.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396011532|gb|EJI20442.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396012239|gb|EJI21137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396012640|gb|EJI21536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396025999|gb|EJI34772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396032021|gb|EJI40746.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396032157|gb|EJI40881.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396039333|gb|EJI47961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396042039|gb|EJI50662.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396045253|gb|EJI53847.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396049426|gb|EJI57969.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396058514|gb|EJI66975.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396060758|gb|EJI69199.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396062572|gb|EJI70983.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396071002|gb|EJI79329.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|434957421|gb|ELL51060.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434967293|gb|ELL60128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434972888|gb|ELL65276.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434975403|gb|ELL67694.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434978780|gb|ELL70772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434983279|gb|ELL75087.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434992014|gb|ELL83484.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434995336|gb|ELL86652.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|435002560|gb|ELL93625.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435003859|gb|ELL94860.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435008607|gb|ELL99430.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435011905|gb|ELM02608.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435018555|gb|ELM09017.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435020741|gb|ELM11130.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435026901|gb|ELM17032.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435027857|gb|ELM17949.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435036516|gb|ELM26335.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435039443|gb|ELM29224.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435045523|gb|ELM35151.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435051098|gb|ELM40602.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435051184|gb|ELM40686.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435059248|gb|ELM48538.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435071156|gb|ELM60106.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435071299|gb|ELM60247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435074446|gb|ELM63278.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435075555|gb|ELM64369.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076993|gb|ELM65767.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435081352|gb|ELM69994.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435091470|gb|ELM79861.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435094938|gb|ELM83277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435100574|gb|ELM88742.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435103950|gb|ELM92024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435107381|gb|ELM95366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435112922|gb|ELN00787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435116153|gb|ELN03904.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435124194|gb|ELN11661.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435128186|gb|ELN15537.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435132693|gb|ELN19891.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435139148|gb|ELN26152.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435142647|gb|ELN29534.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435145134|gb|ELN31963.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435147755|gb|ELN34507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435152456|gb|ELN39086.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435162324|gb|ELN48508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435166749|gb|ELN52715.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435170164|gb|ELN55920.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435175921|gb|ELN61323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435182180|gb|ELN67212.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435182531|gb|ELN67539.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435183030|gb|ELN68005.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435189277|gb|ELN73922.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435189619|gb|ELN74243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435204236|gb|ELN87933.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435211340|gb|ELN94479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435219425|gb|ELO01787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435220698|gb|ELO02980.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435227283|gb|ELO08792.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435235423|gb|ELO16226.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435238918|gb|ELO19527.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435240567|gb|ELO20958.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435247913|gb|ELO27842.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435261576|gb|ELO40730.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435262896|gb|ELO41978.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435263473|gb|ELO42520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435267795|gb|ELO46460.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435276722|gb|ELO54719.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435277229|gb|ELO55183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435283446|gb|ELO61011.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435288066|gb|ELO65157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435297184|gb|ELO73479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435320361|gb|ELO93000.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435325905|gb|ELO97749.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435332633|gb|ELP03544.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|444865473|gb|ELX90243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444867011|gb|ELX91716.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444871369|gb|ELX95805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444875449|gb|ELX99648.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444880476|gb|ELY04551.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444884583|gb|ELY08407.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
          Length = 765

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 156/451 (34%), Positives = 239/451 (52%), Gaps = 33/451 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +TLP+    L   +R  DL+ RL+L+EK  L+   + A+PRLGIK + WWSEALHGV+N 
Sbjct: 19  QTLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
           G            T FP+ +  A+SFN  L   I    SDE RA +N             
Sbjct: 79  G----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFH 128

Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
            L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + V+GLQG + ++ +   +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPEDEKYRKLLACAKHY 188

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    W+   R   N   +S +D+ +T+   F+  V +  V  VMC+Y +++  P C 
Sbjct: 189 AVHSGPEWS---RHSANLNNISPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
              +L++ +R EW     +VSDC ++  ++ +   +S    AA     AG D++CG  + 
Sbjct: 246 STRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 AV RGL++E +I+  ++  L  +  LG  D +PS   +  +    V +  H++L
Sbjct: 306 YQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMD-DPSLVKWSQIPMSVVNSKAHKDL 364

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +L  +RQ + LL+N+   LPLS    R +AVIGPN+D    + GNY G      T L G 
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423

Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
               +  + I+ +GC  V   +D+   + +D
Sbjct: 424 KTKLKKNQIIYMKGCDLV---NDKTLESYLD 451



 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 132/312 (42%), Gaps = 50/312 (16%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +  +I   +  D  + V G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 590 YDESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEA 649

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
           SK   + V  SG  +  A         AI+ A Y G+ GG A+AD+LFG  NP GKLP+T
Sbjct: 650 SK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVT 706

Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP  E  +M+        GRTYR+   P ++PFG G+SYT+F  TV     
Sbjct: 707 FYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDF--TVG---- 749

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                         A  S   ++   A      L + V + N G + G   + V+     
Sbjct: 750 -------------TAQCSKTQLRTEEA------LTLTVPISNTGKRSGTEVIQVYIRKTD 790

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L A+ +  + AGA Q + I +    +      + T R+  GE+ +  G +  
Sbjct: 791 DTGGPLKSLKAYARAELAAGATQDIEIQLPAESFECFDPSTNTMRVAPGEYELFYGTSSA 850

Query: 757 SVSLHAATLGVI 768
           +  L +  + ++
Sbjct: 851 AKDLQSVKVTLL 862


>gi|421449803|ref|ZP_15899183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|396068619|gb|EJI76965.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
          Length = 765

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|437849166|ref|ZP_20847216.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
 gi|435338617|gb|ELP07830.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
          Length = 792

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 203/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 145 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 198

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 199 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 253

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 254 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 310

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 311 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 368

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 369 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 423

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 424 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 481

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QAD  + V+G  Q 
Sbjct: 482 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 541

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 542 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 598

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 599 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 656

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 657 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 699

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 700 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 750

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 751 PIDIEALKF 759


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 207/681 (30%), Positives = 332/681 (48%), Gaps = 74/681 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 221 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 274

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRGLQ +     +VAA+ KHF AY  +          + ++S
Sbjct: 275 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 330

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E+    PF+  + E  +  VM SYN  +G+P       L   +RGE    GY+VSD
Sbjct: 331 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 390

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
            D+V   Y T+H T+   +EA   ++ AGL++ C       F+ L     V+ G LSE  
Sbjct: 391 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 448

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           IN+ + + L V+  +G+FD  P          ++V   +++ +AL+A+R+ IVLLKN G 
Sbjct: 449 INDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGE 506

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
            LPL     + +AV GPN++     + +Y  +A   TT L+GI       A  ++ +GC 
Sbjct: 507 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCD 566

Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
                       D    DD+      A++ +RQAD  I+V+G  Q    E   R  L LP
Sbjct: 567 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLP 626

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ +L+  +    K P +L+L++G P+ + +A  D  + AI+ A YPG  GGTA+ADIL
Sbjct: 627 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 683

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
           FG  NPGGKL +T +P+  +  +P      +PS      + PG T    +    +YPFG+
Sbjct: 684 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 740

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT F ++  +    V  P +      +AT+    +KVT+         VQ+ +++V 
Sbjct: 741 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 791

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           S    +                K L  F+++H+  G  Q +   I   K+L ++D     
Sbjct: 792 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 835

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
            +  G+  +  G +   + L+
Sbjct: 836 VVEPGDFVLMAGASSEDIRLN 856


>gi|417366389|ref|ZP_12138693.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
 gi|353592073|gb|EHC50188.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
          Length = 755

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 663 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 208/684 (30%), Positives = 333/684 (48%), Gaps = 80/684 (11%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRGLQ +     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E+    PF+  + E  +  VM SYN  +G+P       L   +RGE    GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
            D+V   Y T+H T+   +EA   ++ AGL++ C       F+ L     V+ G LSE  
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGH---LGPKDVCTPDHQELALEAARQGIVLLKN 399
           IN+ + + L V+  +G+FD      PY        ++V   +++ +AL+A+R+ IVLLKN
Sbjct: 401 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 455

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQ 455
            G  LPL     + +AV GPN++     + +Y  +A   TT L+GI       A  ++ +
Sbjct: 456 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 515

Query: 456 GCK------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGL 501
           GC             D    DD+      A++ +RQAD  I+V+G  Q    E   R  L
Sbjct: 516 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 575

Query: 502 LLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIA 561
            LPGRQ +L+  +    K P +L+L++G P+ + +A  D  + AI+ A YPG  GGTA+A
Sbjct: 576 DLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 632

Query: 562 DILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYP 617
           DILFG  NPGGKL +T +P+  +  +P      +PS      + PG T    +    +YP
Sbjct: 633 DILFGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYP 689

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT F ++  +    V  P +      +AT+    +KVT+         VQ+ ++
Sbjct: 690 FGYGLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIR 740

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           +V S    +                K L  F+++H+  G  Q +   I   K+L ++D  
Sbjct: 741 DVLSSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDAD 784

Query: 738 GTRRIPLGEHNIHIGGTKHSVSLH 761
               +  G+  +  G +   + L+
Sbjct: 785 MKWVVEPGDFVLMAGASSEDIRLN 808


>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 735

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 228/773 (29%), Positives = 355/773 (45%), Gaps = 101/773 (13%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
           A D KD      P      PI +RVNDL+ R++L+EKV  L                   
Sbjct: 21  AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76

Query: 75  AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
            VP  +G   Y   + AL           S +G    FG D      T +P  +  A S+
Sbjct: 77  KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           N  L E    V + EAR       +G+ + +SP +++ RDPRWGR  E  GEDP  +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190

Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
            A+ V+G QG D     ++AA  KH+  Y         R +   ++SKQ + DT+ +P+ 
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
           M V  G  A++M S+N ++GVP  A+P I+   ++  W  +G+IVSD  ++      Q  
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305

Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
            +T +EAA  A  AGL++D        H +  V+ G +S   ++ A+   L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365

Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           +     +PY       +    P   ++A   A + +VLLKN+  +LPL+    + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418

Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
           P +     ++G++ G      +A  Y          A   +  GC      + + F  A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
           +A+R +D  +L +G   +   E   R+ + LP  Q+EL +++  A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           ++   +        IW   PG  G   +A IL G  NP GKL MT          P +  
Sbjct: 537 ELNRLELISDAILEIWQ--PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585

Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
            +    ++R  GR ++ FYK      +YPFGHG+SYT F +     P+V  V    R   
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
                                L V+V V NVG++DGA T+  F S P      P K+L  
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           FEK  + AG  +    +I + +    V+  G R +  GE++I + G    + L
Sbjct: 681 FEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733


>gi|401676349|ref|ZP_10808334.1| glucosidase [Enterobacter sp. SST3]
 gi|400216388|gb|EJO47289.1| glucosidase [Enterobacter sp. SST3]
          Length = 765

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTFLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGENA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           R +    GP +YPFG+G+SYT F V  V  +APT+       R G + A+          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEKV++  G  + +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTI 721

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              I V   L   ++        G+ N+ IG
Sbjct: 722 SFPIDV-DALKFWNQQMKYDAEPGKFNVFIG 751


>gi|296103779|ref|YP_003613925.1| putative periplasmic beta-glucosidase [Enterobacter cloacae subsp.
           cloacae ATCC 13047]
 gi|295058238|gb|ADF62976.1| putative periplasmic beta-glucosidase precursor [Enterobacter
           cloacae subsp. cloacae ATCC 13047]
          Length = 765

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 209/689 (30%), Positives = 336/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F   V++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPSLKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    VDV N G ++GA  + ++     A    P KQL  FEKV++  G  + +  
Sbjct: 673 ---------VDVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTISF 723

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++        G+ N++IG
Sbjct: 724 PIDV-DALKFWNQQMKYDAEPGKFNVYIG 751


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 200/679 (29%), Positives = 314/679 (46%), Gaps = 82/679 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG V + EA A+      G T  +SP ++I +DPRWGR 
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP L G+     +  LQ     +  + A+ KHF  Y +       +   +  V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFRM   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V    +      T E+  A A+ AGL++         F+ L    AV  G +S+  +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDNGKISQETL 385

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +  +   L ++ RLG+FD      PY   G    + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFRLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
              LPLS    R++AVIGPN++    +I  Y        T  QGI     +A  I+++GC
Sbjct: 441 TNLLPLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGC 499

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +                 + QL   AI A++QA+  ++V+G ++    E   R  L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+  V    K P ILV++ G    + +A     + AI+ A +PG+  G A+A+ 
Sbjct: 560 PGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y       +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT F ++  +    ++    G  G I            H  C          +KN G   
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703

Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G   + ++          + K L  FE++ + AG +Q V   +   + L + D++   R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762

Query: 743 PLGEHNIHIGGTKHSVSLH 761
             G   + +G +   + LH
Sbjct: 763 EPGSFKVMLGASSTDIRLH 781


>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
 gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
          Length = 735

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 228/773 (29%), Positives = 357/773 (46%), Gaps = 101/773 (13%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
           A D KD      P      PI +RVNDL+ R++L+EKV  L                   
Sbjct: 21  AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVK 76

Query: 75  AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
            VP  +G   Y   + AL           S +G    FG D      T +P  +  A S+
Sbjct: 77  KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           N  L E    V + EAR       +G+ + +SP +++ RDPRWGR  E  GEDP  +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190

Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
            A+ V+G QG D     ++AA  KH+  Y         R +   ++SKQ + DT+ +P+ 
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
           M V  G  A++M S+N ++GVP  A+  I+   ++  W  +G+IVSD  ++      Q  
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305

Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
            +T +EAA  A  AGL++D        H +  V+ G +S   ++ A+   L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365

Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           +     +PY       +    P   ++A   A + +VLLKN+  +LPL+    + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418

Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
           P +     ++G++ G      +A  Y          A   +  GC      + + F  A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
           +A+R +D  +L +G   +   E   R+ + LP  Q+EL +++  A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           ++   + +P   AI+    PG  G   +A IL G  NP GKL MT          P +  
Sbjct: 537 EL--NRLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585

Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
            +    ++R  GR ++ FYK      +YPFGHG+SYT F +     P+V  V    R   
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
                                L V+V V NVG++DGA T+  F S P      P K+L  
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           FEK  + AG  +    +I + +    V+  G R +  GE++I + G    + L
Sbjct: 681 FEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733


>gi|205357092|ref|ZP_02344874.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|213650030|ref|ZP_03380083.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|289824263|ref|ZP_06543858.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-3139]
 gi|378958841|ref|YP_005216327.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Typhi
           str. P-stx-12]
 gi|205324021|gb|EDZ11860.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|374352713|gb|AEZ44474.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. P-stx-12]
          Length = 755

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 619

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 620 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 662

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++G   + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 663 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 713

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 714 PIDIEALKF 722


>gi|379996382|gb|AFD23677.1| glucosidase [uncultured organism]
          Length = 765

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 338/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           R +    GP +YPFG+G+SYT+F V  V  +APT+       R G + A+          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTSFKVSDVKMSAPTL------KRDGKVTAS---------- 672

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEKV++  G  Q +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETQTI 721

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              I V + L   ++        G+ N+ IG
Sbjct: 722 SFPIDV-EALKFWNQQMKYDAEPGKFNVFIG 751


>gi|200387658|ref|ZP_03214270.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
 gi|199604756|gb|EDZ03301.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
          Length = 765

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 205/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKLTLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|161502686|ref|YP_001569798.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160864033|gb|ABX20656.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 771

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 206/689 (29%), Positives = 335/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 124 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 177

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG +  DR  V  S KHF AY        +N VD     
Sbjct: 178 EGFGEDTYLTSIMGETMVKAMQGKNPADRYSVMTSVKHFAAYGAVEGGKEYNSVD----- 232

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 233 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKEVLRDEWGFKGI 289

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A+++G+D+     +   +    ++ G ++  +
Sbjct: 290 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 347

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 348 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 402

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 403 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVADQSVTVLAGIQNAVGDGA 460

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QAD  + V+G  Q 
Sbjct: 461 KILYAKGANITNNKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 520

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 521 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 577

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSK---RYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP  ++   +Y  
Sbjct: 578 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYDTEKPNKYTS 635

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 636 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGNVTAS------------ 678

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 679 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 729

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I + + L   ++        G+ NI IG
Sbjct: 730 PIDI-EALKFWNQQMKYDAEPGKFNIFIG 757


>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 862

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 156/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +TLP+    L   +R  DL+ RL+L+EK  L+   + A+PRLGIK + WWSEALHGV+N 
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
           G            T FP+ +  A+SFN  L   I   VSDE RA +N             
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
            L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + V+GLQG + ++ +   +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    W+   R   N   VS +D+ +T+   F+  V +  V  VMC+Y +++  P C 
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP-FL 325
           +  +L++ +R EW     +VSDC ++  ++ +   +S    AA     AG D++CG  + 
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 AV RGL++E +++  ++  +  +  LG  D +PS   +  +    V    H++L
Sbjct: 306 YQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +L  +RQ + LL+N+   LPLS    R +AVIGPN+D    + GNY G      T L G 
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423

Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
               +  + ++ +GC  V   +DQ   + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451



 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 118/279 (42%), Gaps = 52/279 (18%)

Query: 497 DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAG 556
           DR  + LP  Q+  +  +  A K   + V  SG  +  A         AI+ A Y G+ G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELG 685

Query: 557 GTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVV 615
           G A+AD+LFG  NP GKLP+T+Y       LP  E  +M+        GRTYR+   P +
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-L 734

Query: 616 YPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL------T 669
           +PFG G+SYT+F                                V  A CN+       +
Sbjct: 735 FPFGFGLSYTDFA-------------------------------VGTASCNKTQLRTDES 763

Query: 670 LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
           L + V V N G + G   + V+         P K L A+ +V + AGA+Q V I +    
Sbjct: 764 LTLTVPVSNTGKRSGTEVVQVYIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPSES 823

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVI 768
           +      + T R+  G++ +  G +     L +  + ++
Sbjct: 824 FECFDPSTNTMRVAPGKYELFYGTSSADRDLQSVKVTLL 862


>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
 gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
          Length = 782

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 209/692 (30%), Positives = 327/692 (47%), Gaps = 94/692 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++  +L + +R EW+  G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDSGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 319 DLYSIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
              ++ VAVIGPN+D    M+G+Y          T L GI      +R  + +GC  +D 
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
              + +    AI+A+R+++                      A +   G    +E  E  D
Sbjct: 495 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           RA L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 608

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP-GRTYRFYKGPVVY 616
            AIAD+LFG  NP G+LP++         +P +   +    +K+ P    Y       +Y
Sbjct: 609 NAIADVLFGDYNPSGRLPIS---------VPRSVGQIPVYYNKKAPRNHDYVEVSSSPLY 659

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH--AKCNRLTLGVQV 674
            FG+GMSYT F ++                          A++V    A+C      V  
Sbjct: 660 SFGYGMSYTTFEYS--------------------------ALQVVQKSARC----FEVSF 689

Query: 675 DVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSV 733
            VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   +    +  +
Sbjct: 690 KVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEEDFF-L 748

Query: 734 VDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
           V+ +  + +  G  ++ IG   + + L    L
Sbjct: 749 VNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780


>gi|213852437|ref|ZP_03381969.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
          Length = 721

 Score =  261 bits (668), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 74  TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 127

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 128 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 182

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 183 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 239

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 240 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 297

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 298 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 352

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 353 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 410

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 411 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 470

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 471 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 527

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 528 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 585

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 586 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 628

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++G   + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 629 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 679

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 680 PIDIEALKF 688


>gi|16761109|ref|NP_456726.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29141195|ref|NP_804537.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213428232|ref|ZP_03360982.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
 gi|25289432|pir||AD0778 beta-glucosidase (EC 3.2.1.21) - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16503407|emb|CAD02546.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi]
 gi|29136821|gb|AAO68386.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. Ty2]
          Length = 765

 Score =  261 bits (667), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++G   + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|289805550|ref|ZP_06536179.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. AG3]
          Length = 657

 Score =  261 bits (667), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 326/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 10  TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 63

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 64  EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 118

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 119 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 175

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 176 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 233

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 234 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 288

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 289 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 346

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 347 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 406

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 407 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 463

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 464 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 521

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 522 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 564

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++G   + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 565 ---------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 615

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 616 PIDIEALKF 624


>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 736

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 208/699 (29%), Positives = 324/699 (46%), Gaps = 108/699 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 102 GTTVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 156

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 157 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 213

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++  +L + +R EW+  G++VS
Sbjct: 214 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVS 272

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 273 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 332

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 333 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRY---ARTIHQQGC--KDV 460
            + ++ VAVIGPN+D    M+G+Y          T L GI      +R  + +GC  +D 
Sbjct: 390 KMINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 448

Query: 461 ACADDQLFGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALD 497
              + +    AI+A+R+++                      A +   G    +E  E  D
Sbjct: 449 TVNEIE---QAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
           RA L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGG 562

Query: 558 TAIADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTY 607
            AIAD+LFG  NP G+LP++          +Y Q+   N    E++  P           
Sbjct: 563 NAIADVLFGDYNPSGRLPISVPRSVGQIPVYYNQKAPRNHDYVEVSSSP----------- 611

Query: 608 RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNR 667
                  +Y FG+GMSYT F +                        S   +    A+C  
Sbjct: 612 -------LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC-- 638

Query: 668 LTLGVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIH 726
               V   VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   + 
Sbjct: 639 --FEVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLT 696

Query: 727 VCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
              +  +V+ +  + +  G  ++ IG   + + L    L
Sbjct: 697 EEDFF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 734


>gi|392979938|ref|YP_006478526.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           dissolvens SDM]
 gi|392325871|gb|AFM60824.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           dissolvens SDM]
          Length = 765

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 210/689 (30%), Positives = 335/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPADTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGENA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + I+ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVIYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F   V++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPSLKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    V+V N G ++GA  L ++     A    P KQL  FEKV++  G  + +  
Sbjct: 673 ---------VEVTNTGKREGATVLQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTISF 723

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++        G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQQMKYDAEPGKFNVFIG 751


>gi|56412920|ref|YP_149995.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|197361851|ref|YP_002141487.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. AKU_12601]
 gi|56127177|gb|AAV76683.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi A str. ATCC 9150]
 gi|197093327|emb|CAR58775.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi A str. AKU_12601]
          Length = 768

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 205/671 (30%), Positives = 324/671 (48%), Gaps = 104/671 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFR--MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
             +S Q + + +  P++  +    G V   + S N +NG P  +D  +LK  +R EW   
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGSGAVMVALNSLNSLNGTPATSDSWLLKDVLRDEWGFK 284

Query: 283 GYIVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSE 340
           G  VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++ 
Sbjct: 285 GITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTM 342

Query: 341 IDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQG 393
            ++++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ 
Sbjct: 343 AELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARES 397

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GR 447
           +VLLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G 
Sbjct: 398 VVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGD 455

Query: 448 YARTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLD 488
            A+ ++ +G     D    D                  +   A+ A++QAD  + V+G  
Sbjct: 456 GAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGES 515

Query: 489 QSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
           Q +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+ 
Sbjct: 516 QGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILE 572

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRY 602
             + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y
Sbjct: 573 TWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKY 630

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH 662
             R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+          
Sbjct: 631 TSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS---------- 675

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V
Sbjct: 676 -----------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTV 724

Query: 722 G--INIHVCKY 730
              I+I   K+
Sbjct: 725 SFPIDIEALKF 735


>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 735

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 227/773 (29%), Positives = 355/773 (45%), Gaps = 101/773 (13%)

Query: 29  ACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLI--------------SGAA 74
           A D KD      P      PI +RVNDL+ R++L+EK+  L                   
Sbjct: 21  AKDKKDVALYKDP----KAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVK 76

Query: 75  AVP-RLGIKGYEWWSEALHGV--------SNVGPGTKFGGDFPGA--TSFPQVITTASSF 123
            VP  +G   Y   + AL           S +G    FG D      T +P  +  A S+
Sbjct: 77  KVPAEIGSLIYFETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSW 136

Query: 124 NATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKY 182
           N  L E    V + EAR       +G+ + +SP +++ RDPRWGR  E  GEDP  +G +
Sbjct: 137 NPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVF 190

Query: 183 AASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFR 241
            A+ V+G QG D     ++AA  KH+  Y         R +   ++SKQ + DT+ +P+ 
Sbjct: 191 GAASVKGYQGDDLSAENRMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYE 247

Query: 242 MCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHF 301
           M V  G  A++M S+N ++GVP  A+P I+   ++  W  +G+IVSD  ++      Q  
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIE-QLKNQGL 305

Query: 302 TSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
            +T +EAA  A  AGL++D        H +  V+ G +S   ++ A+   L ++ RLG+F
Sbjct: 306 AATKKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF 365

Query: 361 DGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIG 418
           +     +PY       +    P   ++A   A + +VLLKN+  +LPL+    + +AVIG
Sbjct: 366 E-----RPYTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTD--KKKIAVIG 418

Query: 419 PNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAI 472
           P +     ++G++ G      +A  Y          A   +  GC      + + F  A+
Sbjct: 419 PMAKNGWDLLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEAL 477

Query: 473 DASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPI 532
           +A+R +D  +L +G   +   E   R+ + LP  Q+EL +++  A K P +LVL++G P+
Sbjct: 478 EAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPL 536

Query: 533 DVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEM 592
           ++   +        IW   PG  G   +A IL G  NP GKL MT          P +  
Sbjct: 537 ELNRLELISDAILEIWQ--PGVNGALPMAGILSGRINPSGKLAMT---------FPYSTG 585

Query: 593 AMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
            +    ++R  GR ++ FYK      +YPFGHG+SYT F +     P+V  V    R   
Sbjct: 586 QIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGTVT-PSVTKVKRGDR--- 641

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVA 707
                                L V+V V NVG++DGA T+  F S P      P K+L  
Sbjct: 642 ---------------------LSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKH 680

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           FEK  + AG  +    +I + +    V+  G R +  GE++I + G    + L
Sbjct: 681 FEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILVQGQTVKIEL 733


>gi|205353304|ref|YP_002227105.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|375124143|ref|ZP_09769307.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. SG9]
 gi|445128447|ref|ZP_21380240.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|205273085|emb|CAR38038.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 287/91]
 gi|326628393|gb|EGE34736.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. SG9]
 gi|444854909|gb|ELX79964.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
          Length = 765

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 202/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVDDGA 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++QA+  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQANVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGERKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
          Length = 954

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 217/736 (29%), Positives = 353/736 (47%), Gaps = 91/736 (12%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSNV 97
           L +   +LP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S  
Sbjct: 168 LRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS-- 224

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNV 157
                +G    GAT FPQ +   +++N  L E I   V DE  A    GT  +  WSP +
Sbjct: 225 -----YGS---GATIFPQALAMGATWNKKLTEEIAMAVGDETLA---AGT--MQAWSPVL 271

Query: 158 NIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNG 217
           ++ +D RWGR +ET GEDPVL  +   ++++G Q S G    +  + KHF  +      G
Sbjct: 272 DVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LGG 325

Query: 218 VDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRG 277
            D       +S++++ +   VPFR  +      S+M +Y+   GVP      +L   +R 
Sbjct: 326 RDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSKELLHNILRE 383

Query: 278 EWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQR 335
           EW  +G+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A + 
Sbjct: 384 EWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKD 443

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQGI 394
           G L+  +++N     L +  R  +F+  P+    +  + P    + +H+E+A +AAR+ I
Sbjct: 444 GRLNMENLDNVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDNHKEMARQAARESI 502

Query: 395 VLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIGR 447
           V+L+N+   LPL     R++AV+GP +D           + G    +  G     Q +G+
Sbjct: 503 VMLENKENILPLDK-GIRSIAVLGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVGK 558

Query: 448 YARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALDR 498
             + I++QGC D     +     A+ A+ Q+D  ++V+G   + EA         E  D 
Sbjct: 559 QTKVIYEQGC-DFTNLSETNIPKAVKAASQSDVVVMVLGDCSTSEATTDVYKTSGENHDY 617

Query: 499 AGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGT 558
           A L+LPG+QQEL+  V    K P ILVL +G P ++   K      AII    PGQ GG 
Sbjct: 618 ATLILPGKQQELLEAVCATGK-PVILVLQAGRPYNL--TKASKLCKAIIVNWLPGQEGGP 674

Query: 559 AIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPF 618
           A AD+LFG  NP G+LPMT +PQ ++  LP+       +  +RY      +Y  P+ Y F
Sbjct: 675 ATADVLFGDYNPAGRLPMT-FPQ-HVGQLPL--YYNFKTSGRRYEYSDLEYY--PLYY-F 727

Query: 619 GHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKN 678
           G+G+SYT+F ++                           +KV       +T  VQ  VKN
Sbjct: 728 GYGLSYTSFEYS--------------------------GLKVQEKDNGNIT--VQATVKN 759

Query: 679 VGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRS 737
           VG + G   + ++ T           +L  F ++++  G  + V   +     LS+++  
Sbjct: 760 VGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPGESKTVSFELTPYD-LSLLNDH 818

Query: 738 GTRRIPLGEHNIHIGG 753
             R +  GE  I +GG
Sbjct: 819 MDRVVEKGEFKILVGG 834


>gi|419958177|ref|ZP_14474242.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           cloacae GS1]
 gi|388606860|gb|EIM36065.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           cloacae GS1]
          Length = 765

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 213/691 (30%), Positives = 336/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDARTPKEMLDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           R +    GP +YPFG+G+SYT F V  V  +APT+       R G + A+          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEKV++  G  + V
Sbjct: 673 -----------VEVTNSGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETRTV 721

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              I V   L   ++        G+ N+ IG
Sbjct: 722 SFPIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751


>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 336

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 139/325 (42%), Positives = 200/325 (61%), Gaps = 13/325 (4%)

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYAR-TIHQQGCKDVACADDQLFGAAIDASRQADATIL 483
           +T++GNYAG  C   TPLQG+  Y + T + +GC  VAC+   +   A+  +R+ D  +L
Sbjct: 1   MTLVGNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVACSKATI-REAVQIAREVDQVVL 59

Query: 484 VMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRI 543
           VM LDQ+ EAE +D   L LPG QQ+L+  V+ A+  P +LVL+ GG +DV+FA  +P+I
Sbjct: 60  VMELDQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKI 119

Query: 544 AAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP 603
            +I+WAGYPG+AGGTA+A+I+FG  NPGGKLP+TWYPQ+Y T +PMT++ MRP  +  YP
Sbjct: 120 GSILWAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQY-TKIPMTDVRMRPQIASGYP 178

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAP---TVVAVP--LDGRHGSINATISGKAI 658
           GR+YRFY+G  V+ FG+G+SY+N  + +A+ P     +  P  + G   S    +S    
Sbjct: 179 GRSYRFYEGKKVFEFGYGLSYSNCSYEIASIPQDKIFLRSPSSIKGVKTSSYTLVS---- 234

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGA 717
           ++    C R    V V VKN G   G H +LVF   P  G   P K+LVAF+ V + AG 
Sbjct: 235 ELGKELCERSKFSVTVKVKNEGKIIGKHPVLVFLRQPKPGSGRPVKKLVAFQTVRLNAGQ 294

Query: 718 QQRVGINIHVCKYLSVVDRSGTRRI 742
              +   +  C+ L+  +  G+  I
Sbjct: 295 NAEIQRKLSPCEPLTRANEDGSMVI 319


>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
 gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
          Length = 812

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 216/687 (31%), Positives = 327/687 (47%), Gaps = 91/687 (13%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  AS++N  L   +GRV++ EA A   G   G   + P +++ RDPRW R 
Sbjct: 166 GTTVFPTSIGQASTWNPELIRQMGRVIAIEASA--QGAHIG---YGPVLDLARDPRWSRV 220

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDP L+G   A+ VRG QG     R  V A+ KHF +Y    W         A +
Sbjct: 221 EETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY---GWTEGGHNGGTAHL 277

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
            ++++E+    PFR  V  G + SVM SYN+++G P      +L   +   W   G++VS
Sbjct: 278 GERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDILEDRWLFKGFVVS 336

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEIDIN 344
           D  ++G   +      +  EAA  A+ AG+D D G    ++ E   +AV++G ++   ++
Sbjct: 337 DLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGT--NVYAEQLVAAVRKGDVAMETVD 393

Query: 345 NALVNTLTVQMRLGMFDGE--PSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
            A+   L+++  +G+FD       +P      + V +P+H  LA E ARQ IVLLKN+  
Sbjct: 394 KAVRRILSLKFHMGLFDAPFVDDKRP-----AQLVASPEHIGLAREVARQSIVLLKNEDK 448

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQG 456
            LPL     RT+AVIGPN+D    M+G+Y          T L+GI     +  R  + +G
Sbjct: 449 LLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVFYAKG 507

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMG----LDQSIE-------------------A 493
           C  V  +    F  AI+++R AD  ++V+G     D S E                    
Sbjct: 508 CA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 566

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DRA L L GRQ EL+ +V    K P +LVL+ G P+ +     +    AI+ A YPG
Sbjct: 567 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEA--DAILDAWYPG 623

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
             GG A+AD+LFG  NP G+L ++  P+  +  LP+     R     RY         G 
Sbjct: 624 MQGGNAVADVLFGDYNPAGRLTLS-VPRS-VGQLPVYYNTKRKGNRSRYIEEA-----GT 676

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
             YPFG+G+SYT F +T                        G  ++V+  + N   + V 
Sbjct: 677 PRYPFGYGLSYTTFSYT------------------------GMKVRVSE-ESNHCRVDVS 711

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHW-APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
           V V+N G+ DG   + ++     G +  P +QL AF +V + AG    +   +   K L+
Sbjct: 712 VTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKAGETWEITFTLD-KKSLA 770

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVS 759
           +  R G   +  G   +  GG+   ++
Sbjct: 771 LYMRDGEWAVEPGRFTVMAGGSSEDIA 797


>gi|324497317|gb|ADY39467.1| putative periplasmic beta-glucosidase precursor [bacterium
           enrichment culture clone P69-9E]
          Length = 765

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 337/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           R +    GP +YPFG+G+SYT F V  V  +APT+       R G + A+          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVIAS---------- 672

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEKV++  G  Q +
Sbjct: 673 -----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETQTI 721

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              I V + L   ++        G+ N+ IG
Sbjct: 722 SFPIDV-EALKFWNQQMKYDAEPGKFNVFIG 751


>gi|393719789|ref|ZP_10339716.1| glycosyl hydrolase family protein [Sphingomonas echinoides ATCC
           14820]
          Length = 896

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 168/451 (37%), Positives = 246/451 (54%), Gaps = 47/451 (10%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRL 60
           MAST++F    L+L+SAS + +A++   A D   A   ++   +      +    L+ ++
Sbjct: 1   MASTVSFASRALLLVSASVATIASQ---AQDLHGAAQPSVARAEAD----RLAATLVAQM 53

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           + +EK+  L++ A A+PRL I  Y WW+E+LHG   +GP        P  T+FP+ I  A
Sbjct: 54  TTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGA--LGP-------IP-TTNFPEPIGLA 103

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQET 171
           +SF+A L   +   +S E RA++  G           GL  WSPN+NIFRDPRWGRGQET
Sbjct: 104 ASFDAPLVHQVADAISTEVRALHTQGRQTGKLGRIGTGLDTWSPNINIFRDPRWGRGQET 163

Query: 172 PGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQD 231
            GEDP L+ +   ++V G+QG + D  +V ++ KHF A+         R   N  VS  D
Sbjct: 164 YGEDPYLAARMGVAFVTGMQGPNPDLPRVISTPKHFAAHSGPEST---RHAANVYVSPHD 220

Query: 232 IEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDS 291
           +EDT+   FR  ++EG+  S+MC+YN++ G P CA   +LK  +RG W   GY+VSDCD+
Sbjct: 221 LEDTYLPAFRAAIVEGRAGSIMCAYNRIEGQPACASDLLLKDHLRGAWGFTGYVVSDCDA 280

Query: 292 VGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLSEIDIN 344
           V    D   +      A A A++AG+D +C     G   GL      A++R L+S+ DI+
Sbjct: 281 VKDIADNHKYAPDQATAVAAAMKAGVDNECNTQTIGDIGGLPDRFGDALKRNLISQGDID 340

Query: 345 NALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
             LV   + ++R G      G P+  P        + TPDH  LAL+AA + +VLLKN G
Sbjct: 341 RTLVRLFSARLRNGDLPGVAGAPAVVPVSA-----ILTPDHIALALDAAEKSLVLLKNDG 395

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
             LPL       +AV+GP  D T  + GNY+
Sbjct: 396 -VLPLRP--GARIAVVGPLGDATRVLRGNYS 423



 Score =  119 bits (297), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 136/295 (46%), Gaps = 48/295 (16%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +EAE            D+  L LP  QQ L+       K P I+V M+G PI++A
Sbjct: 633 GLTSDMEAEETGTDIPGFKGGDKTSLDLPAEQQALLEHAKATGK-PLIVVAMNGSPINLA 691

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK++   AAI+ A YPGQ+GG A+A++L G +NP G+LP+T+Y    + +LP       
Sbjct: 692 WAKDN--AAAIVEAWYPGQSGGLAVANVLTGKTNPAGRLPLTFY--RSVADLP------- 740

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P       GRTYR++ G  VYPFG+G+S+T F +    AP  V     G    +  T   
Sbjct: 741 PFDDYAMTGRTYRYFTGTPVYPFGYGLSFTRFAY----APLTVTPAAGGAETGLRVT--- 793

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
                              +V N G + G     ++   P     P   L  ++++ +  
Sbjct: 794 ------------------TEVSNTGQRSGDEVAQLYLNFPDQPGTPRIALRGYQRITLKP 835

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGVIKS 770
           G  + +  ++   + LS V   GTR++  G++ + +G  +    +  +T     S
Sbjct: 836 GEHRVISFDLS-PRDLSAVAVDGTRQVMAGQYRVSVGSGQPDTGVPTSTASFTTS 889


>gi|339999930|ref|YP_004730813.1| periplasmic beta-glucosidase [Salmonella bongori NCTC 12419]
 gi|339513291|emb|CCC31041.1| periplasmic beta-glucosidase precursor [Salmonella bongori NCTC
           12419]
          Length = 765

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 207/691 (29%), Positives = 335/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  AS+FN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASTFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMIALNSLNGTPATSDAWLLKDVLRHQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+T PE+A   A+++G+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTATDPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPLS  ++ T+AV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 397 LLKNRLNTLPLS--KNATIAVVGPLADSKRDIMGSWSAAGVADQSVTVLTGIKNALGERG 454

Query: 450 RTIHQQGC---------------KDVACADDQ----LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G                ++V   D +    +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKDIVDFLNLYEEVVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDDNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNPEKPDKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SY  F      +   ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYNTF----KVSDVTLSAPTMKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKIILKPGETKTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I+  K+ +   R      P G+ N+ IG
Sbjct: 724 PIDINALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 688

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 198/657 (30%), Positives = 319/657 (48%), Gaps = 72/657 (10%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ RDPRWGR  
Sbjct: 77  TVYPISLAQACSWNPDLVEQACAVSAQEAR------MSGVDWTFSPMIDVARDPRWGRVA 130

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           E  GEDP  +G + A+ VRG QG +     +VAA  KH+  Y         R +   ++S
Sbjct: 131 EGYGEDPYANGVFGAASVRGYQGDNMSAENRVAACLKHYVGYGASE---AGRDYVYTEIS 187

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
           +Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P  +   ++  WR +G+IVSD
Sbjct: 188 QQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSD 246

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDINNAL 347
             ++      Q   +T +EAA  A  AGL++D        H +  V+ G +S   ++ A+
Sbjct: 247 WGAIE-QLKNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAV 305

Query: 348 VNTLTVQMRLGMFDGEPSSQPY--GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLP 405
              L ++ RLG+F+     +PY       +    P   ++A   A + +VLLKN+   LP
Sbjct: 306 RRVLLLKFRLGLFE-----RPYTPATTEKERFFRPKSMDIAARLAAESMVLLKNENNVLP 360

Query: 406 LSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYARTIHQQGCKD 459
           L+    + +AVIGP +     ++G++ G      +A  Y          A   +  GC +
Sbjct: 361 LTD--KKKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGKAELRYALGC-N 417

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
               + + F  A++A+R +D  +L +G   +   E   R+ + LP  Q+EL  ++  A K
Sbjct: 418 TQGDNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKAGK 477

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVL++G P+++   + +P   AI+    PG  G   +A IL G  NP GKL MT  
Sbjct: 478 -PVVLVLVNGRPLEL--NRLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMT-- 532

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTNFVHTVANAP 635
                   P +   +    ++R  GR ++ FYK      +YPFGHG+SYT F +      
Sbjct: 533 -------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY------ 579

Query: 636 TVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STP 694
                           T++  A KV   +     L  +V V N+G++DGA T+  F S P
Sbjct: 580 ---------------GTVTPSATKVKRGE----KLSAEVTVTNIGARDGAETVHWFISDP 620

Query: 695 PAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
                 P K+L  FEK  + AG  +    +I + +    V+  G R +  GE+NIH+
Sbjct: 621 YCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 677


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 150/424 (35%), Positives = 227/424 (53%), Gaps = 32/424 (7%)

Query: 37  TRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSN 96
           ++  P+   +L   +R  DL+ RL+L+EK  L+   + A+PRLGIK + WWSEALHG +N
Sbjct: 9   SQQKPYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFAN 68

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY---------NGGT 147
                         T FP+ +  A+SF+  L   +    SDE RA Y         N   
Sbjct: 69  ND----------DVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRF 118

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKH 206
             L+ W+PNVNIFRDPRWGRGQET GEDP L+ +     V+GLQG  D    K+ A  KH
Sbjct: 119 LSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYKKLLACAKH 178

Query: 207 FTAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
           +  +    W+   R   N   VS++D+ +T+   F++ V +  V  VMC+Y +++  P C
Sbjct: 179 YAVHSGPEWS---RHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPCC 235

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFL 325
               +L++ +R +W     +VSDC ++  +Y + + +S    AAA A+ AG D++C    
Sbjct: 236 GSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVECQ--W 293

Query: 326 GLHT----ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
             H       AV++GL+ E DI+ ++   L  +  LG  D +    PY  +    +   +
Sbjct: 294 DKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPD-EIVPYAQIPASVINNEE 352

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
           H++LAL+ AR+ + LL+N+   LPLS  + R +AVIGPN+D    + GNY G      + 
Sbjct: 353 HRQLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVRTISI 411

Query: 442 LQGI 445
           L GI
Sbjct: 412 LDGI 415



 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 125/300 (41%), Gaps = 49/300 (16%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           +  I V GL   +E E +          DR  + LP  Q+  +  +  A K   I V  S
Sbjct: 593 ETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGK-KVIFVNNS 651

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           G  I            AI+ A Y G++GG A+AD+LFG  NP GKLP+T+Y     T LP
Sbjct: 652 GSAI--GLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKD--TTQLP 707

Query: 589 -MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHG 647
              + +M         GRTYRF K   ++PFGHG+SYTNF         +    LD    
Sbjct: 708 DFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNF--------KIGEAQLDKSEI 751

Query: 648 SINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVA 707
             +++++                 + + + N G  +G   + V+         P K L  
Sbjct: 752 DTSSSVN-----------------ITISISNEGKTEGVEIIQVYVHKQGLEEGPIKTLKG 794

Query: 708 FEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           F++V++     + V IN+    +     ++ + ++  G + I  G +     L    L +
Sbjct: 795 FKRVNLKPNEMKNVTINLPSNSFEFYDKKARSMKVMPGNYEIFYGNSSDDKQLKKTELKI 854


>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 903

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/399 (38%), Positives = 219/399 (54%), Gaps = 35/399 (8%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ +L+ +EK+  L++ A A+PRLG+  Y WW+E+LHG     P           T+F +
Sbjct: 54  LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMY-----NGGTA----GLTYWSPNVNIFRDPRWG 166
            I  A++F+  L + +G+V++ E R ++      G T     GL  WSPN+NIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+ +   +YV+G+QG D     + A+ KH+  +         R   N  
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYDIIATPKHYAVHSGPEST---RHSANVY 220

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           VS+ D+EDT+   FR  ++EG   SVMC+YN+V+G P CA+  +LK  +RG W   GY+V
Sbjct: 221 VSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACANDLLLKDYLRGAWDFKGYVV 280

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
           SDCD+V       H+      A A A+R+G+D +C          L    + A++RGL+S
Sbjct: 281 SDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVDMAGLARPYQEALERGLIS 340

Query: 340 EIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
             D++ ALV     + R G   G  P S       P DV  P+H  LALE A + +VLLK
Sbjct: 341 MADVDRALVRLFAARYRNGDLPGLRPLSTETA--SPADVGKPEHAALALEVAEKSLVLLK 398

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACG 437
           N G  LPL       +AV+GP +D T  + GNY+    G
Sbjct: 399 NSG-VLPLRP--QAKLAVVGPLADATRVLRGNYSSPQSG 434



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 149/300 (49%), Gaps = 50/300 (16%)

Query: 465 DQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKV 514
           D   G    A+ Q+D  + V+GL   +EAE            D+  L L   QQ L+ + 
Sbjct: 619 DDPIGQMRAAAAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALL-EA 677

Query: 515 SMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKL 574
           + A+  P ++V+M+G P+++A+A+     AAII A YPGQ+GG AI ++L G +NP G+L
Sbjct: 678 ARATGKPLVVVVMNGSPVNLAWARQ--HAAAIIEAWYPGQSGGLAIGNVLAGHANPAGRL 735

Query: 575 PMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           P+T+Y    + +L P  +  MR        GRTYR+++G  VYPFGHG+SYT+F +    
Sbjct: 736 PLTFY--RSVDDLPPFDDYDMR--------GRTYRYFEGKPVYPFGHGLSYTSFGY---- 781

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
            P  +     G H         + +KVT             +++N G + G     ++  
Sbjct: 782 GPLKLEPAAGGAH---------EGLKVT------------TELRNTGKRAGDEVAQLYLD 820

Query: 694 PPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            P     P   L  F++VH+  G ++ +   +   + LS V   G R++  G + + +GG
Sbjct: 821 FPDVDGLPRIALRGFQRVHLAPGERRVLEFQLD-PRDLSAVTPEGQRQVFAGNYRVFVGG 879


>gi|418855797|ref|ZP_13410448.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|392820838|gb|EJA76679.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
          Length = 765

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 325/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGC-----KDVAC--------------ADDQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+                +   +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +Y FG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYSFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 954

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 213/737 (28%), Positives = 357/737 (48%), Gaps = 91/737 (12%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
           +L +   +LP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S 
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
                 +G    GAT FPQ +   +++N  L E +   V DE  A    GT  +  WSP 
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKKLTEDVAMAVGDETLA---AGT--MQAWSPV 270

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +++ +D RWGR +ET GEDPVL  +   ++++G Q S G    +  + KHF  +      
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           G D       +S++++ +   VPFR  +      SVM +Y+   GVP      +L   +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILR 382

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
            EW  +G+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A +
Sbjct: 383 EEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
            G ++  +++      L +  R  +F+  P+    +  + P    +  H+E+A +AAR+ 
Sbjct: 443 DGRINMENLDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARES 501

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIG 446
           IV+L+N+   LPL+    RT+AV+GP +D           + G    +  G     Q +G
Sbjct: 502 IVMLENKDNILPLAK-DMRTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVG 557

Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
           +  + +++QGC D   ++      A+ A+ Q+D  +LV+G   + E+         E  D
Sbjct: 558 KQTKVVYEQGC-DFTSSNGTNIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHD 616

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
            A L+LPG+QQEL+  V    K P IL+L +G P +++ A    +   + W   PGQ GG
Sbjct: 617 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 673

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            A AD+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      FY  P+ Y 
Sbjct: 674 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEFY--PLYY- 726

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT+F +                        SG  I+    + +   + +Q  VK
Sbjct: 727 FGYGLSYTSFEY------------------------SGLKIQ----EKDNGNVAIQATVK 758

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           NVG + G   + ++ T           +L  F +VH+  G  + V   +   + LS+++ 
Sbjct: 759 NVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPGESKIVSFELTPYE-LSLLND 817

Query: 737 SGTRRIPLGEHNIHIGG 753
              R +  GE  I +GG
Sbjct: 818 RMDRVVEKGEFKILVGG 834


>gi|423140736|ref|ZP_17128374.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
 gi|379053290|gb|EHY71181.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
          Length = 765

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 207/669 (30%), Positives = 327/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTWLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A+++G+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVADQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + LP  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITLPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSKRYPGRTY 607
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP  S++    T 
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYVSEKPNKYTS 629

Query: 608 RFY---KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R++    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEK+ +  G  + V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKIALKPGESKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIDALKF 732


>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 725

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 217/712 (30%), Positives = 341/712 (47%), Gaps = 90/712 (12%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS---NV 97
           PF    +   +RV++L+  +++ EKV  L S    VPRLG+KG     E LHG++     
Sbjct: 31  PFQNPKIATEKRVDNLLSLMTIDEKVNAL-STNPEVPRLGVKG-TGHVEGLHGLALGGPA 88

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-AMYNGGTAGLTYWSPN 156
           G G K     P  T+FPQ      +++  L + I ++   EAR A+   G  GL   +PN
Sbjct: 89  GWGGKGKEPLP-TTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRGGLVIRAPN 147

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
            ++ RDPRWGR +E+ GED   +GK   ++V+GLQGSD    + A+  KHF A    N N
Sbjct: 148 ADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQGSDKTYWQTASLMKHFLA----NSN 203

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
              R + ++   ++   + + +PF+M V+EG   + M +YN+VNG+P    P +LK    
Sbjct: 204 EDGRTYTSSDFDERLWREYYALPFKMGVVEGGSRAYMAAYNKVNGIPAMVHP-MLKDITV 262

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAV 333
            EW  NG I +D  +  +      +       AA  I+AG++     FL   TE    A+
Sbjct: 263 DEWGQNGIICTDGGAYKLLLSDHKYYKDKYLGAAATIKAGIN----QFLDDFTEGVYGAL 318

Query: 334 QRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPK-DVCTP----DHQELALE 388
             G L+E D++  L     V ++LGM D   ++ PY  +G + D   P     H++LALE
Sbjct: 319 ANGYLTEADLDEVLRGNYRVMIKLGMLDSS-ANNPYAKIGAEADSMDPWELEAHKKLALE 377

Query: 389 AARQGIVLLKNQGPS--LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI- 445
           A  + IVLLKN      LPL   + + +A+IG  +D    ++  Y+G      +PLQGI 
Sbjct: 378 ATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYAD--AVLLDWYSGTPPYTISPLQGIK 435

Query: 446 GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA------------ 493
            +    +      +V  A +   G A++ ++ AD  I+ +G   +  A            
Sbjct: 436 NKVGENV------EVLFAKNNADGKAVEIAKNADVAIVFIGNHPTCNAGWAQCPVPSNGK 489

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           EA+DR  L     + E + K+   +   T++ L+S  P  + + + +  I AI       
Sbjct: 490 EAVDRQAL---NSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQEN--IPAIFHVTQNS 544

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQSKRYPGRTYRFYKG 612
           Q  GTAIA++LFG  NP G+L  TW     I++L P+ +  +R        GRTY ++KG
Sbjct: 545 QELGTAIANVLFGAYNPAGRLTQTWVKD--ISDLPPLMDYNIR-------NGRTYMYFKG 595

Query: 613 PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGV 672
             +Y FGHG+SYT F +     P                    K IK          + V
Sbjct: 596 KPLYAFGHGLSYTTFKYKDMEIP--------------------KQIKENEE------VSV 629

Query: 673 QVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
           +V++ N G  DG   + L      +    P K+L +F+++H+ AG  + V +
Sbjct: 630 KVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAGETKTVSL 681


>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
 gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 762

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 218/779 (27%), Positives = 352/779 (45%), Gaps = 123/779 (15%)

Query: 48  PIPQRVNDLIGRLSLQEKVKLL---------ISGAAAVPRLG--IKGYEWWS-------- 88
           P+  RV DL+ R++L+EK+  +         + G     ++   +KG  + S        
Sbjct: 34  PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93

Query: 89  ----EALH------------GVSNVGPGTKFGGDF-PGATSFPQVITTASSFNATLWEAI 131
               E++             G+  +G      G    GAT FPQ I  +S+FN  +   +
Sbjct: 94  EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153

Query: 132 GRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQ 191
             V++ EA+A     T      SP +++ R+ RWGR +ET GEDP L G+   +YV    
Sbjct: 154 ATVIAQEAKA-----TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF- 207

Query: 192 GSDGDRLKVAASCKHFTAY-------DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCV 244
               ++  V  + KHF A+       +L +  G +R          D+   +  PF+  +
Sbjct: 208 ----NKEGVMTTLKHFLAHGSPTGGLNLASVTGCER----------DLRSLYLKPFQDVM 253

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
            E    SVM SYN    VP  A   IL   +RGE    GYI SD  SV +     H    
Sbjct: 254 REAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLHHTAKD 313

Query: 305 PEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEP 364
             +AA  A+ AG+D++         +S V+ G+L E +I+  +   LT +  +G+FD + 
Sbjct: 314 KADAACQAVIAGVDVEVDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAMGLFDKDY 373

Query: 365 SSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVT 424
           + +   +L  + V TP+  ELAL AAR+  +L+KN+   LPL   + R+VAVIGPN+   
Sbjct: 374 TKR--ANLS-QTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGPNA--A 428

Query: 425 VTMIGNYAGIACGY--TTPLQGIGRY----ARTIHQQGCKDVACADDQLFGAAIDASRQA 478
               G+Y          TPLQGI        +  + +GC ++   D   F  A+ A+R +
Sbjct: 429 QVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTAARNS 487

Query: 479 DATILVMG---------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
           D  +L +G            S+  E+ D + + LPG Q+ L+  V    K PTI+VL++G
Sbjct: 488 DVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAVKATGK-PTIVVLVAG 546

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------Y 583
            P  + + K++     + W  Y G+  G AIA+ILFG  NP G+L ++ +PQ       +
Sbjct: 547 KPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVS-FPQSTGHLPVF 603

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
               P  +       +   PGR Y F     V+ FGHG+SYT F +              
Sbjct: 604 YNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY-------------- 649

Query: 644 GRHGSINATISGKAIKVTHAK-CNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-P 701
                       K++++++ +  +  T  + V+V N G +DG   + ++          P
Sbjct: 650 ------------KSMQISNKEFTDDDTCEITVEVANTGKRDGKEVVQLYVNDIVSSVVTP 697

Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            K+L  FEKV +PAG  + V  N+ + K L++ +      +  G+  + +G     + L
Sbjct: 698 VKELRRFEKVFIPAGETRTVKFNLPI-KELALWNTDMKEVVEPGDFELQVGAASDDIRL 755


>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 885

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 240/451 (53%), Gaps = 33/451 (7%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNV 97
           +TLP+    L   +R  DL+ RL+L+EK  L+   + A+PRLGIK + WWSEALHGV+N 
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 98  GPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAG-------- 149
           G            T FP+ +  A+SFN  L   I   VSDE RA +N             
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 150 -LTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC-KHF 207
            L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + V+GLQG + ++ +   +C KH+
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYRKLLACAKHY 188

Query: 208 TAYDLDNWNGVDRFHFNA-KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCA 266
             +    W+   R   N   VS +D+ +T+   F+  V +  V  VMC+Y +++  P C 
Sbjct: 189 AVHSGPEWS---RHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCG 245

Query: 267 DPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFL 325
           +  +L++ +R EW     +VSDC ++  ++ +   +S    AA     AG D++CG  + 
Sbjct: 246 NTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYA 305

Query: 326 GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQEL 385
                 AV +GL++E +++  ++  +  +  LG  D +PS   +  +    V    H++L
Sbjct: 306 YQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDL 364

Query: 386 ALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 445
           +L  +RQ + LL+N+   LPLS    R +AVIGPN+D    + GNY G      T L G 
Sbjct: 365 SLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGF 423

Query: 446 G---RYARTIHQQGCKDVACADDQLFGAAID 473
               +  + ++ +GC  V   +DQ   + +D
Sbjct: 424 KSKLKKNQIVYMKGCDLV---NDQTLESYLD 451



 Score = 99.4 bits (246), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 132/320 (41%), Gaps = 62/320 (19%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           +  +I   +  D  + V G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  +  A         AI+ A Y G+ GG A+AD+LFG  NP GKLP+T
Sbjct: 650 GK-QVVFVNCSGSSM--ALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706

Query: 578 WYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP  E  +M+        GRTYR+   P ++PFG G+SYT+F         
Sbjct: 707 FYKS--TKQLPDYEDYSMK--------GRTYRYMSDP-LFPFGFGLSYTDFA-------- 747

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRL------TLGVQVDVKNVGSKDGAHTLLV 690
                                  V  A CN+       +L + V V N G + G   + V
Sbjct: 748 -----------------------VGTASCNKTQLHTDESLTLTVPVSNTGKRSGTEVVQV 784

Query: 691 FSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
           +         P K L A+ +V + AGA+Q V I +    +      + T R+  GE+ + 
Sbjct: 785 YIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPSESFECFDPSTNTMRVAPGEYELF 844

Query: 751 IGGTKHSVSLHAATLGVIKS 770
            G +  +  L +  + +  S
Sbjct: 845 YGTSSAARDLQSVKVTLRDS 864


>gi|365971332|ref|YP_004952893.1| beta-glucosidase [Enterobacter cloacae EcWSU1]
 gi|365750245|gb|AEW74472.1| Periplasmic beta-glucosidase [Enterobacter cloacae EcWSU1]
          Length = 765

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 211/691 (30%), Positives = 336/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGRG 
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYAVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAESRLHRKEARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVAEQSVTVLTGIKNAVGENA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVNTAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           R +    GP +YPFG+G+SYT F V  V  +APT+       R G + A+          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL------KRDGKVTAS---------- 672

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRV 721
                      V+V N G ++GA  + ++     A    P KQL  FEKV++  G  + +
Sbjct: 673 -----------VEVSNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVNLKPGETKTI 721

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              I V   L   ++        G+ N+ IG
Sbjct: 722 SFPIDV-DALKFWNQQMKYDAEPGKFNVFIG 751


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 217/690 (31%), Positives = 337/690 (48%), Gaps = 96/690 (13%)

Query: 89  EALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA 148
           EALHG               G+T FPQ I  AS++N  L   +   +  E R      + 
Sbjct: 138 EALHGCMA-----------KGSTIFPQAIGMASTWNPELIYQVATAIGKETR------SR 180

Query: 149 GL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
           G+    SP +NI RDPR GR +ET GEDP L+ + A +Y++G+Q        V A+ KHF
Sbjct: 181 GIHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQEQG-----VIATPKHF 235

Query: 208 TAYDLDNWNGVDRF--HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTC 265
            A  + +  G D +  HF+ ++    + + +   FR  + E    S+M +YN ++G+P  
Sbjct: 236 VANFVGD-GGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLDGIPCS 290

Query: 266 ADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL-----D 320
           ++  +L R +R EW   GY+VSD  SV           +  EAA  ++ AGLD+     D
Sbjct: 291 SNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKLSLEAGLDMELPDSD 350

Query: 321 CGPFL-GLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT 379
           C   + GL  ES      LS+  ++ A+   L V+  +G+FD  P   P  +    + C+
Sbjct: 351 CFEEIPGLIRESK-----LSQDTLDEAVRRVLRVKFWIGLFDN-PFVDP-DYAERINDCS 403

Query: 380 PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYT 439
            +H+ELAL  AR+ IVLLKN+G  LPL+    R++AVIGPN+   V  +G Y+G      
Sbjct: 404 -EHRELALRVARESIVLLKNEG-ILPLNK-DIRSIAVIGPNA--AVPRLGGYSGYGVKVV 458

Query: 440 TPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLD-QSIEAE 494
           TPL+GI    G   +    +GC  +       F  AI  ++++D  IL MG      E E
Sbjct: 459 TPLEGIKNKLGDKVKVYFAEGC-GLNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGE 517

Query: 495 ALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQ 554
             DR  L LPG Q++L+ ++   +  P I+VL++G  I +    +  ++ A+I A YPG+
Sbjct: 518 QRDRHNLNLPGVQEDLIKEIC-NTNTPVIVVLINGSAITMMNWID--KVQAVIEAWYPGE 574

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRT--YRFYKG 612
            GG AIAD+LFG  NPGGKLP++ +P+ Y + LP+     +PS      GR   Y   +G
Sbjct: 575 EGGNAIADVLFGDYNPGGKLPIS-FPK-YSSQLPLY-YNHKPS------GRVDDYVDLRG 625

Query: 613 -PVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
              ++PFG+G+SYT+F ++     T   +P DG                         + 
Sbjct: 626 NQYLFPFGYGLSYTDFKYSNLRI-TPEEIPRDGE------------------------VV 660

Query: 672 VQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKY 730
           +  D++N+G   G   + L      A    P K+L  FE+V +  G ++ V   ++  + 
Sbjct: 661 ITFDIENIGKYKGDEVVQLYLHDEFASVARPIKELKRFERVTLDVGERKTVSFKLN-RRD 719

Query: 731 LSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           L  +       +  G   + IG +   + L
Sbjct: 720 LEFLSMDMELVVEPGRFEVLIGSSSEDIRL 749


>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
           vinifera]
          Length = 785

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 134/302 (44%), Positives = 189/302 (62%), Gaps = 12/302 (3%)

Query: 284 YIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEID 342
           YIVSDC  + V  D Q++ +  + +A A  ++AGLDL+CG +       +V  G +S+ +
Sbjct: 10  YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           ++ AL N   + MR+G FDG P+   Y  LG KD+C  DH ELA EAARQGIVLLKN   
Sbjct: 70  LDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQGIVLLKNDYE 126

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVAC 462
            LPL     + + ++GP+++ T  MIGNYAG+   Y +PL+         +  GC D +C
Sbjct: 127 VLPLK--PGKKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASC 184

Query: 463 ADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPT 522
           ++D  F  A +A++ A+ TI+ +G D SIEAE +DR   LLPG Q EL+ +V+  S GP 
Sbjct: 185 SNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPV 244

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGG------KLPM 576
           ILV++SG  ID+ FAKN+PRI+AI+W G+PG+ GG AIAD++FG  NP        KL  
Sbjct: 245 ILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPDTIPEWLWKLDF 304

Query: 577 TW 578
           +W
Sbjct: 305 SW 306


>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
 gi|238014360|gb|ACR38215.1| unknown [Zea mays]
          Length = 344

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/338 (41%), Positives = 196/338 (57%), Gaps = 16/338 (4%)

Query: 425 VTMIGNYAGIACGYTTPLQGIGRYARTIH-QQGCKDVAC---ADDQLFGAAIDASRQADA 480
           + +I NY G  C  TTPL+G+  Y   +    GC   AC   A DQ    A+  +   D 
Sbjct: 1   MALIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQ----AVALAGSEDY 56

Query: 481 TILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKND 540
             L MGL Q  E+E  DR  LLLPG QQ L++ V+ ASK P ILVL+SGGP+D+ FA+++
Sbjct: 57  VFLFMGLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSN 116

Query: 541 PRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSK 600
           P+I AI+WAGYPGQAGG AIA +LFG  NP G+LP+TWYP+E+ T +PMT+M MR   + 
Sbjct: 117 PKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEF-TKVPMTDMRMRADPTS 175

Query: 601 RYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVP---LDGRHGSINATISGKA 657
            YPGR+YRFY+G  VY FG+G+SY+ F   + +  +V A+    L G   ++      ++
Sbjct: 176 GYPGRSYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRS 235

Query: 658 IKVT---HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHV 713
             V       C +L     V+V+N G  DG H++L+F   P      P  QL+ F   H+
Sbjct: 236 YHVDAIGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHL 295

Query: 714 PAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
            AG   ++  +I  CK+ S V   G + I +G H + +
Sbjct: 296 KAGETAKLRFDISPCKHFSRVRADGRKVIDIGSHFLMV 333


>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
 gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
          Length = 748

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 202/671 (30%), Positives = 334/671 (49%), Gaps = 71/671 (10%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  A+S+N  L E   RV + EA       T G+ + ++P ++I RDPRWGR  
Sbjct: 101 TIFPIPLGQAASWNPELIEQGARVAALEA------ATVGVNWTFAPMIDITRDPRWGRIA 154

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRL-KVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           E+ GEDP L G+  A+ VRG QG D   +  +AA  KHF  Y      GVD  +  A ++
Sbjct: 155 ESLGEDPYLCGELGAAMVRGFQGKDLSAIGSIAACAKHFAGYGAAE-GGVD--YNTAIIA 211

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
           + ++ + +  PF+  +  G VAS M ++N +NGVP   +  +LK+ +R EW   G +VSD
Sbjct: 212 ENELRNVYLPPFKAALDSG-VASFMTAFNDLNGVPASGNEFLLKQILREEWCYQGMVVSD 270

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDINNAL 347
            +S+ V      FT+  +EAA +A  AG+D++        H ES +  G +S   ++  +
Sbjct: 271 WESI-VQLTEHGFTANDKEAAFEAANAGIDMEMVSNTYSQHLESLIIEGRISLAQVDEMV 329

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
            N L ++ RLG+F+  P  QP        +   DH++ A + A + +VLLKN   SLPL 
Sbjct: 330 KNILRLKFRLGLFEN-PYPQPDKL---PALVNHDHRQAAKKLALESVVLLKNSHQSLPLR 385

Query: 408 HIRHRTVAVIGPNSDVTVTMIGN--YAGIACGYTTPLQGIGRYA---RTIHQQGCKDVAC 462
                ++A+IGP +D     +G   + G A    T LQ I  +A    T++     +   
Sbjct: 386 LSALSSIALIGPLADDAYEQLGTWIFDGDADDSETVLQAINAFAGDSLTVNVDRALETTR 445

Query: 463 ADDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGP 521
           ++  +     + A++ +DA +L +G +  +  EA  RA + LPG Q++L+  ++  +K P
Sbjct: 446 SNTFIDIDRTMAAAQSSDAIVLCLGEESILSGEAHSRADISLPGAQEQLIHLLAKTAK-P 504

Query: 522 TILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT---- 577
            IL++M+G P+ +    +   + AI++A +PG   GTA+ D+LFG  +P GKLP+T    
Sbjct: 505 MILIVMAGRPLTLEPIID--HVDAILYAWHPGTMAGTALTDLLFGEVSPSGKLPITFPRM 562

Query: 578 ------WYPQEYITNLP-------MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
                 +Y ++     P       M ++A R +Q+       +       ++PFG G+SY
Sbjct: 563 VGQVPIYYGKKNTGKPPSAESVVHMNDIAPRAAQTSLGMSAFHLDAGFTPLFPFGFGLSY 622

Query: 625 TNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDG 684
           T+F +                    N  +S   + +         + V VDV N G ++G
Sbjct: 623 TSFTYE-------------------NLHLSSSTMNIDG------VITVTVDVINCGEREG 657

Query: 685 AHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP 743
              + +++   A +   P K+L  F+KVH+ AG +Q+V   +     L+  DR   R I 
Sbjct: 658 QEVVQLYTRDLAANVTRPVKELKQFQKVHLSAGERQQVKFLLKASA-LAFYDRKMNRIIE 716

Query: 744 LGEHNIHIGGT 754
            G  ++  GG+
Sbjct: 717 PGVFHLWTGGS 727


>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 791

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 238/726 (32%), Positives = 338/726 (46%), Gaps = 109/726 (15%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLGI    +  E LHG + VG           ATSFPQ I  ASS++ T+   + +V+  
Sbjct: 138 RLGIP-ILFHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185

Query: 138 EARAMYNGGTAGL-TYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
           E RA       G+    SP V+I RDPRWGR +ET GEDP L G+   + V GLQG    
Sbjct: 186 EIRAR------GVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEGRS 239

Query: 197 RL----KVAASCKHFTAYDLDNWNGVDRFHFN---AKVSKQDIEDTFDVPFRMCVMEGKV 249
           RL     V A+ KH T +      G      N   A VS++++ + F  PF   V    +
Sbjct: 240 RLLRPGHVFATLKHLTGH------GQPESGTNVGPAPVSERELRENFFPPFEQVVKRTGI 293

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAA 309
            +VM SYN+++GVP+ A+  +L   +R EW   G +VSD  +V       H  +  EEAA
Sbjct: 294 EAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAA 353

Query: 310 ADAIRAGLDLDCGPFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQP 368
             A+ AG+D D    L   T    V+ G +SE  ++ A+   L ++ R G+F+      P
Sbjct: 354 MRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-----NP 408

Query: 369 YGHLGPKDVCTPDHQ--ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVT 426
           Y         T + +   LA  AA++ I LLKN G  LPL      T+AVIGP++   V 
Sbjct: 409 YADANAAAAITNNDEARALARTAAQRSITLLKNDG-MLPLKP--EGTIAVIGPSA--AVA 463

Query: 427 MIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK---------DVACADD-----QLF 468
            +G Y G      + L+GI    G  A  +  QG K         D     D     +L 
Sbjct: 464 RLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENRKLI 523

Query: 469 GAAIDASRQADATILVMGLDQSIEAEAL------DRAGLLLPGRQQELVSKVSMASKGPT 522
             A++A+R  D  IL +G  +    E        DR  L L G QQEL   +    K P 
Sbjct: 524 AQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PI 582

Query: 523 ILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE 582
            +VL++G P   +  K   +  AI+   Y G+ GG A+ADILFG  NPGGKLP+T  P+ 
Sbjct: 583 TVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-VPRS 639

Query: 583 YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPL 642
            +  LPM    M+PS       R Y F     +YPFG G+SYTNF  +   AP       
Sbjct: 640 -VGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTNFSLS---AP------- 682

Query: 643 DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-P 701
                 ++AT  G   K +          V VDV+N G+++G   + ++          P
Sbjct: 683 -----RLSATKIGTGGKTS----------VSVDVRNTGAREGDEVVQLYIRDKVSSVTRP 727

Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLH 761
            K+L  F++V +  G  + V   +   + L + +    R +  G+  I  G +  SV+L 
Sbjct: 728 VKELKGFQRVTLKPGESRTVTFTVG-PEALQMWNDQMRRVVEPGDFEIMTGNS--SVALQ 784

Query: 762 AATLGV 767
           + TL V
Sbjct: 785 STTLTV 790


>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
 gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
          Length = 889

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/394 (38%), Positives = 216/394 (54%), Gaps = 32/394 (8%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           DL+ +++L EK+  L++ A A+PRL I  Y WW+E+LHG           G  P  T+FP
Sbjct: 36  DLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGAL---------GSLP-TTNFP 85

Query: 115 QVITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRW 165
           + I  A++F+A+L + +   +S E R ++              GL  WSPN+NIFRDPRW
Sbjct: 86  EPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWSPNINIFRDPRW 145

Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
           GRGQET GEDP L+ +   S+V G+QG D D   V A+ KHF  +   N     R H N 
Sbjct: 146 GRGQETYGEDPYLTARMGVSFVEGMQGPDPDLPDVIATPKHFAVH---NGPESTRHHANV 202

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            VS+ D+EDT+   FR  ++EG+  SVMC+YN+V+G P CA   +L+  +   W   GY+
Sbjct: 203 FVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQELLQEHLVDAWGFQGYV 262

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
           VSDCD+V    D   +      A A A+R G+D +C  +    T+        A++RGL+
Sbjct: 263 VSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSDTDGLTDRYREALERGLI 322

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLK 398
           +  D++  L+   + ++R G   G      +      DV TP H  LAL+AA + +VLLK
Sbjct: 323 TVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTS-SAADVGTPAHGALALKAAEESLVLLK 381

Query: 399 NQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           N G  LP      + VAVIGP  D T  + GNY+
Sbjct: 382 NDG-ILPFQTAGMK-VAVIGPFGDATRVLRGNYS 413



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/289 (31%), Positives = 136/289 (47%), Gaps = 48/289 (16%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A++ AD  + V+GL   +EAE            D+  L +P  QQEL+ +     K P I
Sbjct: 614 AAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELLEQAKATGK-PLI 672

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +V M+G PI++ +AK +    AI+ A YPGQ+GG AIA++L G +NP GKLP+T+Y    
Sbjct: 673 VVAMNGSPINLHWAKEN--ADAILEAWYPGQSGGLAIANVLTGKANPTGKLPLTFY--RS 728

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           + +LP       P       GRTYR++ G  VYPFG+G+SYT F +     P  V     
Sbjct: 729 VEDLP-------PFDDYDMKGRTYRYFTGKAVYPFGYGLSYTTFGY----GPVAVEPASG 777

Query: 644 GRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK 703
           G    I  T                       V N G + G   + ++   P     P+ 
Sbjct: 778 GAQDGIRVTTQ---------------------VSNTGQRAGGDAVQLYLDFPDAPGTPNI 816

Query: 704 QLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            L  F+KV +  G  ++V   +   + LS V   G R++  G + + +G
Sbjct: 817 ALRGFQKVSLQPGETRQVTFTLS-PRDLSSVTPDGVRKVLKGHYRVTVG 864


>gi|393724082|ref|ZP_10344009.1| beta-glucosidase [Sphingomonas sp. PAMC 26605]
          Length = 900

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 170/452 (37%), Positives = 239/452 (52%), Gaps = 45/452 (9%)

Query: 1   MASTIAFFFLGLILLSASSSGLAAREPFACD-PKDATTRTLPFCQVSLPIPQRVNDLIGR 59
           MAST +     ++LL    SG+AA    A D P   T    P   V     +    ++ R
Sbjct: 1   MASTFSVLSRTVLLLG---SGMAAIASQAQDAPPQPTVLAQP-SAVRADADRLARAIVAR 56

Query: 60  LSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITT 119
           ++  EK+  L++ A A+PRL I  Y WW+E+LHG     P           T+FP+ I  
Sbjct: 57  MTPAEKLPQLLNVAPAIPRLNIPAYNWWTESLHGAIGTVP----------TTNFPEPIGL 106

Query: 120 ASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWGRGQE 170
           A+SF+A L   +   +S E RA++              GL  WSPN+NIFRDPRWGRGQE
Sbjct: 107 AASFDAPLVHRVAGAISTEVRALHTLARETGKLGRIGTGLDTWSPNINIFRDPRWGRGQE 166

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           T GEDP L+     ++V G+QG + D  +V A+ KHF  +         R   N  VS  
Sbjct: 167 TYGEDPYLTAHQGVAFVTGMQGPNPDLPEVIATPKHFAVHSGPEST---RHAANVYVSPH 223

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
           D+EDT+   FR  ++EG+  S+MC+YN+++G P CA+  +LK  +RG W  NGY+VSDCD
Sbjct: 224 DLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFNGYVVSDCD 283

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLSEIDI 343
           +V    D   +      A A A++AG+D +C     G   GL      A++R L+SE D+
Sbjct: 284 AVKDIADNHKYAPDQATAVAAALKAGVDNECNTQTIGDIGGLPDRFGEALKRNLISEGDV 343

Query: 344 NNALVNTLTVQMRLGMFDG---EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +  LV   + ++R G   G    P+  P G      + TPDH  LALEAA++ +V+LKN 
Sbjct: 344 DRTLVRLFSARIRNGDLPGIAAAPAVVPVGA-----ILTPDHAALALEAAQKSLVMLKND 398

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           G  LPL       +AVIGP  D T  + GNY+
Sbjct: 399 G-VLPLRP--GARIAVIGPLGDATRVLRGNYS 427



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/278 (31%), Positives = 136/278 (48%), Gaps = 50/278 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +EAE            D+  L LP  QQ L+ +     K P I++ M+G PI++A
Sbjct: 637 GLTSDLEAEETGVDVPGFKGGDKTTLDLPAEQQALLERAHATGK-PLIVIAMNGSPINLA 695

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK++   AAI+ A YPGQAGG A+A++L G ++PGG+LP+T+Y    + +LP       
Sbjct: 696 WAKDN--AAAIVEAWYPGQAGGLAVANVLTGKADPGGRLPLTFY--RSVADLP------- 744

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P       GRTYR++ G  VYPFG+G+S+T+F +    AP                    
Sbjct: 745 PFDDYAMTGRTYRYFTGTPVYPFGYGLSFTSFAY----AP-------------------- 780

Query: 656 KAIKVTHAKCNRLT-LGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
             + VT A     T L V  +V N G + G   + ++   P     P   L  + +V + 
Sbjct: 781 --LTVTPAAGGAETGLRVTTEVSNTGQRVGDEVVQLYLNFPDQPGVPRVALRGYRRVTLK 838

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            G  + V  ++   + LS V   G R++  G++ + +G
Sbjct: 839 PGEHRAVTFDLS-PRDLSAVGIDGKRQVMAGQYRVSVG 875


>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
           17393]
 gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 164/432 (37%), Positives = 239/432 (55%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SL   +R + L+  L+L+EKV+L++ G+  V RL IK Y WW+EALHGV+  G   
Sbjct: 22  YKNTSLSPEERADLLVKELTLEEKVQLMMDGSRPVERLDIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA---MYNGGTA-----GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA    Y+   +     GLT W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYDVFTAVSDEARAKNTYYSSKDSRERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG +DG+  K+ A  KHF  +  
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGNYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EG+V  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGSNRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE--EAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G +VSDC ++  +Y+ +   + P+   A+A A+ +G DL+CG       
Sbjct: 249 MQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGSSYKALV 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           ES V++GL+SE  +N ++   L  +  LG  D +     +  +    V +  H  LAL+ 
Sbjct: 309 ES-VKKGLISEEKVNTSVRRLLKARFELGEMD-DLEKVSWAKIPFSVVASAAHDSLALKI 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+   LPL      TVAV+GPN++ +V   GNY G+     T L G+    
Sbjct: 367 ARKSMTLLMNKDNFLPLRR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGVRNLL 425

Query: 446 GRYARTIHQQGC 457
           G   + I++QGC
Sbjct: 426 GADDKLIYEQGC 437



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 147/309 (47%), Gaps = 48/309 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  ++AD  I   G+  S+E E +          DR  + LP  Q+EL+  +  A K 
Sbjct: 592 SVEQVKEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALYRAGK- 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG PI         +  AI+ A YPGQ GGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 651 KIVLVNCSGSPI--GLEPETKKCEAILQAWYPGQQGGTAVAEVLFGDYNPAGRLPVTFYR 708

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYR+ +   ++PFG+G+SYT F +          V
Sbjct: 709 N--VSQLPDFE-------DYNMAGRTYRYMQDTPLFPFGYGLSYTTFSYD--------KV 751

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            LD      +   +G+ +K+T            V V N G +DG   + V+         
Sbjct: 752 VLDK-----SEVTAGQTLKLT------------VPVTNTGKRDGEEVVQVYLKKQGDAEG 794

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++V +PAG    +  ++   +     +++ T R+  G ++I +GG+    SL
Sbjct: 795 PVKTLRAFKRVFIPAGQTTNIEFDLKDKELEWWDEQTNTVRVCSGTYDIMVGGSSEDESL 854

Query: 761 HAATLGVIK 769
              TL  IK
Sbjct: 855 Q-QTLVTIK 862


>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 805

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 232/844 (27%), Positives = 375/844 (44%), Gaps = 151/844 (17%)

Query: 10  LGLI-LLSASSSGLAAREPFACDPKDATT-------------RTLPFCQVSLPIPQRVND 55
            GL+ LL+A++S L A +P A  PK+                +   +  +S PI +RV D
Sbjct: 17  FGLLPLLNANTSSLQASKPDA--PKNQKKIYQKGWIDFNKNGKKDIYEDLSQPIDKRVED 74

Query: 56  LIGRLSLQEKVKLL--ISGAAAVPRLGIKGYEW----WSEALHGVSNV-----GPGTKFG 104
           L+ +++++EK   L  I G  AV +  +   EW    W +   G+ N+     G   +  
Sbjct: 75  LLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKD---GIGNIDEHLNGEWKRTS 131

Query: 105 GDFP--------------------------------------GATSFPQVITTASSFNAT 126
            DFP                                       +T FP  I    +++  
Sbjct: 132 LDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGCTWDKE 191

Query: 127 LWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAAS 185
           L   IGR+  +EA+A+      G T  +SP +++ RDPRWGR  E+ GED  L+G+    
Sbjct: 192 LIYEIGRITGEEAKAL------GYTNIYSPILDLSRDPRWGRTVESYGEDSYLAGELGRQ 245

Query: 186 YVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF-HFNAKVSKQDIEDTFDVPFRMCV 244
            V G+Q +     +V ++ KHF  Y +    G D +   +   S Q++ +    PFR+  
Sbjct: 246 QVLGIQSN-----RVVSTPKHFAIYGIPG-GGRDCYSRTDPHASPQEVHELHLEPFRIAF 299

Query: 245 MEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTST 304
            E      MCS+N  NG P  A   ++   +R +W   GY+VSD  ++       H   T
Sbjct: 300 QEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAIDKNVKFYHIVDT 359

Query: 305 PEEAAADAIRAGLD----LDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMF 360
            EEA A  + AGL+     +           A+Q+GL+ E  ++  +   L V+  LG+F
Sbjct: 360 EEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEESTLDQRVREVLYVKFWLGLF 419

Query: 361 DGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVI 417
           D      PY     L  K V +  ++E++L AAR+ IVLLKN+  +LPLS    + +AVI
Sbjct: 420 D-----DPYVKDTKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPLSKTL-KNIAVI 473

Query: 418 GPNSDVTVTMIGNYAGIACGYTTPLQG----IGRYARTIHQQGCK--------------D 459
           GP +D   ++   Y        T LQG    +G     ++ +GC               +
Sbjct: 474 GPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDKNFPQSDVMYFE 533

Query: 460 VACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
           ++  + +    A++ +++A+  I+ +G D     E+  R  L L GRQ+ELV  V  A+ 
Sbjct: 534 LSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQKELVRAV-QATG 592

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVL +G P+ + +   D  + AI+ A YPG+  G A+A++LFG  NPGGKL  T +
Sbjct: 593 TPVVLVLFNGRPVTLNW--EDANLPAIVEAWYPGEFSGQAVAEVLFGDYNPGGKLSTT-F 649

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
           P+  +  +P      +P+ +    G+ +    G  +YPFG+G+SYT F   ++N      
Sbjct: 650 PKS-VGQIPWA-FPFKPNAT----GKGFARVDGE-LYPFGYGLSYTTF--EISNLQPSAT 700

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
              DG                        TL V   VKN GS  G   + ++        
Sbjct: 701 KIADGD-----------------------TLTVTCKVKNTGSVKGDEVVQLYLNDETSSI 737

Query: 700 AP-HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSV 758
           +   K+L  FE+V +  G ++ V   ++   Y    D++     P G+  +  G +  S 
Sbjct: 738 SRFEKELCGFERVALEPGEEKTVTFKVNRRAYGMYNDKNEFVVEP-GKFFLFAGNSSKST 796

Query: 759 SLHA 762
            L+A
Sbjct: 797 PLNA 800


>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
 gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
          Length = 787

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 234/824 (28%), Positives = 368/824 (44%), Gaps = 166/824 (20%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKV----------------------------------- 66
           + Q S PI +RV+DL+GR++L+EK+                                   
Sbjct: 24  YRQASAPIEERVSDLLGRMTLEEKIGQICCPLGWEMYTKVSPDSVTISDKYRQQMDEAPI 83

Query: 67  --------------KLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFP---- 108
                         K L +G    PRL  K      +     + +G    F  + P    
Sbjct: 84  GSYWAVLRADPWTQKTLETGLN--PRLAAKALNALQKYAVEHTRLGIPVLFAEECPHGHM 141

Query: 109 --GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWG 166
             GAT FP  +  AS++N +L   +G V+  EAR    G   G   + P ++I R+PRW 
Sbjct: 142 AIGATVFPTSMGQASTWNESLIRQMGEVIGLEAR--LQGANIG---YGPVLDIAREPRWS 196

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFN- 224
           R +ET GEDP L+G    ++V+G+QG D  D   V ++ KH  AY      GV R   N 
Sbjct: 197 RVEETFGEDPYLTGILGTAFVQGMQGKDFKDGRHVYSTLKHLAAY------GVPRGGHNG 250

Query: 225 --AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLN 282
             A +  + + D +   F+  V  GK A+VM SYN ++GVP  ++  ++   +R  W  +
Sbjct: 251 GPADMGLRALLDEYLPGFQRAVEVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFD 310

Query: 283 GYIVSDCDSV-GVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEI 341
           G++ SD  S+ G+     H  +  E+AA  A+ AG D+D G         AVQ G + E 
Sbjct: 311 GFVYSDLASIDGIA--GAHVAANLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKES 368

Query: 342 DINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD----VCTPDHQELALEAARQGIVLL 397
            IN A+ N L ++ R+G+F+     QPY  + P++    V   DH+ LA + AR+G VLL
Sbjct: 369 AINRAVSNVLRLKFRMGLFE-----QPY--VSPEEAARLVNCEDHRMLARKIAREGTVLL 421

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--IACGYTTPLQGIGRY---ARTI 452
           KN G  LPL  ++   +AVIGPN+DV    +G+Y          T L  +       R  
Sbjct: 422 KNNG-ILPLGKVKR--IAVIGPNADVMYNYLGDYTAPQERSKVVTLLDALRNRMPDVRID 478

Query: 453 HQQGC--KDVACADDQLFGAAIDASRQADATILVMG------------------------ 486
           + +GC  +D   ++      A++A+R+AD  IL +G                        
Sbjct: 479 YVKGCAIRDTTQSN---IKEAVEAARKADLVILAVGGSSARDFKTKYINTGAATVDSENS 535

Query: 487 --LDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA 544
             L      E  DRA L L G Q++L+  ++ A++ P + V ++G P+++  A       
Sbjct: 536 GILSDMECGEGFDRATLDLLGDQEKLIRAIA-ATEKPLVTVYIAGRPLNMNLASEVSD-- 592

Query: 545 AIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPG 604
           A++ A YPG+ GG  I D+L G  NP G+LPM+  P+ ++  +P+            Y  
Sbjct: 593 ALLTAWYPGEQGGNGIVDVLTGEYNPSGRLPMS-VPR-HVGQIPV-----------HYSQ 639

Query: 605 RTYRFYK---GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVT 661
            T R Y    G  +Y FG+G+SYT F ++                   N  +S  A   +
Sbjct: 640 GTLRDYMDCPGKPLYTFGYGLSYTTFAYS-------------------NLKLSATAKAAS 680

Query: 662 H-AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQ 719
             A  N +   +   V N G +DG   + ++        A P  +L  F+K+ +  G  +
Sbjct: 681 QPAGDNEVMQTITCTVTNTGDRDGDEVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESR 740

Query: 720 RVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            V   +   + LS+ DR+       G  N+ IGG+  ++ L  +
Sbjct: 741 EVTFQL-TRQDLSIYDRNMNFTAEPGRFNVMIGGSSDNLPLKGS 783


>gi|399070782|ref|ZP_10749848.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
 gi|398043993|gb|EJL36849.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
          Length = 900

 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 158/418 (37%), Positives = 228/418 (54%), Gaps = 38/418 (9%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+G+++L EKV+ L++ A A+PRLGI  Y WW+E+LHG     P           T+FP+
Sbjct: 52  LVGQMTLDEKVEQLLNTAPAIPRLGIPAYNWWTESLHGALGALP----------TTNFPE 101

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWG 166
            I  A++F+  L   +   V  E R ++    +         GL  WSPN+NIFRDPRWG
Sbjct: 102 PIGLAATFDPALVHDVAGAVGSEVRGLHTLARSTGRMGRIGGGLDTWSPNINIFRDPRWG 161

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+ +   ++V+G+QG + D  +V A+ KHF  +         R   N  
Sbjct: 162 RGQETYGEDPFLTARVGVAFVQGVQGPNPDLPQVIATPKHFAVHSGPEST---RHSANVF 218

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           VS+ D+EDT+   FR  + EG+  SVMC+YN+++G P CA   +LK  +R  WR +GY+V
Sbjct: 219 VSRHDLEDTYLPAFRAAITEGRAGSVMCAYNRIDGQPACASDLLLKDHLRQAWRFDGYVV 278

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQRGLLS 339
           SDCD+V    D   +      A A A+RAG+D +C          L      A++R L+S
Sbjct: 279 SDCDAVKDISDHHRYAPDAAAAVAAAMRAGVDNECNGATLTDTDGLADRYREALRRDLIS 338

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             D++ ALV   + ++R G   G    +P   +    V TP+H  LAL AA + +VLLKN
Sbjct: 339 VADVDRALVRLFSARLRTGDLPG--VRKPETPVALASVMTPEHAVLALAAAEKSLVLLKN 396

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY-AGIACGYTTPLQGIGR---YARTIH 453
            G  LPL       +AVIGP  D T  + GNY + ++    + L+G+ R    AR +H
Sbjct: 397 DG-VLPLKA--DTRIAVIGPLGDATRVLRGNYSSALSAPPISVLEGLRRAMPSARIVH 451



 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 86/290 (29%), Positives = 142/290 (48%), Gaps = 50/290 (17%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A+ QAD  + V+GL   +EAE            D+  L +P  QQ L+ +     K P +
Sbjct: 625 AAAQADVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPSDQQALLEQAKALGK-PLV 683

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +V M+G P+++++AK++   +AI+ A YPGQ+GG A+A++L G +NP G+LP+T+Y    
Sbjct: 684 VVTMNGSPLNLSWAKDN--ASAILEAWYPGQSGGLAVANVLSGKTNPSGRLPLTFY--RS 739

Query: 584 ITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLD 643
           + +LP       P       GRTYR+++G  VY FG+G+SYT F +              
Sbjct: 740 VDDLP-------PFGDYAMAGRTYRYFEGTPVYSFGYGLSYTRFAY-------------- 778

Query: 644 GRHGSINATISGKAIKV-THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH 702
                       + +KV   A      L V   V+N+G++ G     ++   P    AP 
Sbjct: 779 ------------EPLKVEAAAGGAGKGLRVTTSVRNLGTRSGDEVAQLYLDFPDAPGAPR 826

Query: 703 KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             L  F+++ +  G  + +   +   + LS VD  G RR+  G + + +G
Sbjct: 827 LALRGFQRLTLKPGESRAITFALS-SRDLSSVDPDGQRRVASGRYRVSVG 875


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 205/680 (30%), Positives = 325/680 (47%), Gaps = 74/680 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   IG +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRG+Q +     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E     PF+  + E  +  VM SYN  +GVP       L   +RGE    GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDI 343
            D+V   Y T+H T+   +EA   ++ AGL++ C         L     V+ G LSE  I
Sbjct: 343 SDAVEYLY-TKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           N+ + + L V+  +G+FD    +   G    K+V   +++ LAL+A+R+ +VLLKN+   
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK- 458
           LPL     + +AV GPN+D     + +Y  +A   TT L+GI +     A  ++ +GC  
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519

Query: 459 -DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
            D    + +L               A++ +RQAD  ++V+G  Q    E   R+ L LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLELPG 579

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           RQ +L+  V    K P +LVL++G P+ + +A  D  + AI+ A YPG  GGTA+AD+LF
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHG 621
           G  NPGGKL +T +P+  +  +P      +PS      + PG      +    +Y FG+G
Sbjct: 637 GDYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYG 693

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT F ++                   +  IS K I             V+  V N G 
Sbjct: 694 LSYTTFEYS-------------------DIEISPKVITPNQKAT------VRCKVTNTGK 728

Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           + G   + ++          + K L  FE++H+  G  + V   +   K L ++D+    
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787

Query: 741 RIPLGEHNIHIGGTKHSVSL 760
            +  G+ +I IG +   + L
Sbjct: 788 VVEPGDFSIMIGASSEDIRL 807


>gi|386724230|ref|YP_006190556.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
 gi|384091355|gb|AFH62791.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
          Length = 973

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 161/440 (36%), Positives = 237/440 (53%), Gaps = 23/440 (5%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   + P+  RVNDLI +L+L EK++L+     A+ RLGI+ Y+  +EA HG++ +G   
Sbjct: 9   YWDFNAPLEDRVNDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE-- 66

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYWSPNVNI 159
                   ATS+PQ I  A ++++ L + IG  + DEAR  Y  N    GLT W+P V++
Sbjct: 67  --------ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDL 118

Query: 160 FRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVD 219
            RDPRWGR +E  GEDPVL+GK AA+  +G+QG     LK  A+ KHF    + N N   
Sbjct: 119 ERDPRWGRTEEAYGEDPVLAGKLAAALTQGIQGDHPFYLKAVATLKHF----IGNNNEAG 174

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R   +  +  +++ + +   F +   EG   S+M +YN VNGVP      ++   ++GEW
Sbjct: 175 RGDTSVSLDPRNLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLSELVID-IVKGEW 233

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIR-AGLD--LDCGPFLGLHTESAVQRG 336
            +NG++VSD   V       H+  T +EA A +IR  G+D   D    +      A++ G
Sbjct: 234 GMNGFVVSDAFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDG 293

Query: 337 LLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           LL+E D++ AL NT  V+ RLG FD  P   PY  +    +  P+H +LA EA+++ +VL
Sbjct: 294 LLTENDLDTALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVL 352

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQG 456
           LKN G +LPL   +   VAVIGP +DV       Y+G      TPLQGI        +  
Sbjct: 353 LKNDGKTLPLQADKLSKVAVIGPLADVVYQ--DWYSGSLPYAVTPLQGIRERLAAEGKDA 410

Query: 457 CKDVACADDQLFGAAIDASR 476
               A   D++   AI + R
Sbjct: 411 VTSYAAGTDRMRIKAISSGR 430



 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 141/313 (45%), Gaps = 55/313 (17%)

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVS 512
           Q    ++ C    L  AA+ ++R ADA I+ +G    I   E +DR  + L   Q++L  
Sbjct: 560 QADTFEIECVSSGL-DAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAESQEKLA- 617

Query: 513 KVSMASKGP-TILVLMSGGPIDVAFAKN--DPRIAAIIWAGYPGQAGGTAIADILFGTSN 569
            +++  K P TI+V++   P    FA N    +  AII+  + GQ  G A+A +LFG  N
Sbjct: 618 -LAVIEKNPNTIIVVVGSYP----FALNALQEKAKAIIYTSHAGQELGRAVAAVLFGDVN 672

Query: 570 PGGKLPMTWYPQEYITNLP--MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           P G+L MTWY    +  LP  M    +R         RTY++++G  +YPFGHG+SY++F
Sbjct: 673 PAGRLNMTWYKS--VDQLPPFMDYDIIRGC-------RTYQYFEGEPLYPFGHGLSYSDF 723

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHT 687
            +       +  +P     G+  A                  + V   V N+   +G   
Sbjct: 724 RY-----EALQLLPNRAEAGTDTA------------------VQVSCRVTNISDCEGEEV 760

Query: 688 LLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD--------RSG 738
           + ++    A     P  QL  F+++ + AG    V   + + + L++ D         SG
Sbjct: 761 VQLYVRADASRVKRPRLQLADFKRIALAAGESAEVTFQLQL-ESLAIWDVTRDRFCLESG 819

Query: 739 TRRIPLGEHNIHI 751
           T  + LG  ++ +
Sbjct: 820 TYTVMLGASSVDL 832


>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
 gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
          Length = 751

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 223/739 (30%), Positives = 354/739 (47%), Gaps = 88/739 (11%)

Query: 49  IPQRVNDLIGRLSLQEKVKLL-------------ISGAAAVPRLGIKGYE--WWSEALHG 93
           I +RVN+L+GR++L+EKV  L             IS   A   L     E    ++AL  
Sbjct: 36  IERRVNELLGRMTLEEKVGQLNLVSHGPPLRWEDISEGKAGAVLNFNSAEDVARAQALVR 95

Query: 94  VSNVGPGTKFGGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT 151
            S++     FG D      T FP  +  A++F+      + R+ S+   A       G+ 
Sbjct: 96  ESHLKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPR----VSRLASE--WAAREASYVGVN 149

Query: 152 Y-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAY 210
           + ++P  ++ RD RWGR  E  GEDP L     A+ V G +     +  +AA+ KHF  Y
Sbjct: 150 WTFAPMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGFR-----KGGLAAAAKHFAGY 204

Query: 211 DLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNI 270
                    R +    + + ++ DT+  PFR  V E   AS M ++N +NG P+ A+P +
Sbjct: 205 GAPQGG---RDYDTTYIPRAEMYDTYLPPFRAAV-EAGTASFMAAFNALNGEPSTANPWL 260

Query: 271 LKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHT 329
           L   +R +W  +G++ SD   +G   +     +   EAA  AI AG+D+D  G     H 
Sbjct: 261 LTDVLRTQWGFDGFVTSDWVGIGELVN-HGIAADGAEAARKAILAGVDMDMMGQLYINHL 319

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
              V+ G + E  I+ ++   L  + RLG+FD  P      HL   +  +P+ ++ A E 
Sbjct: 320 PDEVRAGRVPESVIDESVRRVLRTKFRLGLFD-RPDVD-SSHLD-SEFPSPESRQAAREV 376

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGR 447
           AR+  VLL+N+   LP+   + R++AV+GP +D     +G +A  G      T L+GI R
Sbjct: 377 ARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSVTILEGIRR 435

Query: 448 YARTI-----HQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLL 502
            A++      H  GC D+ C +      A++A+RQ+D  I V G  Q +  EA  RA + 
Sbjct: 436 RAQSAGIAVRHAPGC-DLFCRNTDALPGALEAARQSDFVIAVFGEPQELSGEAASRANME 494

Query: 503 LPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIAD 562
           L G+Q E++ +++   K P  LV+M G P  +    +  RI +I+ A YPG   G A+AD
Sbjct: 495 LNGKQIEVLEELAKTGK-PVALVIMGGRPQVLGPVAD--RIPSILMAWYPGTEAGPAVAD 551

Query: 563 ILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVY 616
           +LFG  +P GKLP+TW P+       Y   LP      RP+ +       Y       +Y
Sbjct: 552 VLFGDVSPSGKLPLTW-PRATGQLPLYYNRLPTG----RPTLANNRFTLHYIDESIAPLY 606

Query: 617 PFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDV 676
           PFG G+SYT+F ++                   +A I+ + +           L V +DV
Sbjct: 607 PFGWGLSYTHFAYS-------------------DARIASRQLDEGQ------VLEVSLDV 641

Query: 677 KNVGSKDGAHTLLVFSTPP-AGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD 735
           KN G++DG   + +++  P A    P ++L AFEK+ + +G  +RV + + V      +D
Sbjct: 642 KNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGETKRVTLRVPVESLGFHLD 701

Query: 736 RSGTRRIPLGEHNIHIGGT 754
             GT  +  G   + +GG+
Sbjct: 702 -DGTYLVEAGAIQVFVGGS 719


>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 782

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 205/685 (29%), Positives = 318/685 (46%), Gaps = 84/685 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 148 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++  +L + +R EW+  G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVS 318

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 319 DLYSIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 378

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
              ++ VAVIGPN+D    M+G+Y          T L GI         +  +  A  D 
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494

Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALDRAG 500
            +     AI+A+R+++                      A +   G    +E  E  DRA 
Sbjct: 495 TVNEIEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 554

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG AI
Sbjct: 555 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 611

Query: 561 ADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYP-GRTYRFYKGPVVYPFG 619
           AD+LFG  NP G+LP++         +P +   +    +K+ P    Y       +Y FG
Sbjct: 612 ADVLFGDYNPSGRLPIS---------VPRSVGQIPVYYNKKAPRNHDYVEVSSSPLYSFG 662

Query: 620 HGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNV 679
           +GMSYT F +                        S   +    A+C      V   VKN 
Sbjct: 663 YGMSYTTFEY------------------------SDLQVVQKSARC----FEVSFKVKNT 694

Query: 680 GSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSG 738
           G  DG   + L      A    P KQL  FE+ H+  G +++V   +    +  +V+ + 
Sbjct: 695 GKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEEDFF-LVNYTL 753

Query: 739 TRRIPLGEHNIHIGGTKHSVSLHAA 763
            + +  G   + IG +   + L  +
Sbjct: 754 KKVVESGTFQVMIGSSSDDIRLEKS 778


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 231/827 (27%), Positives = 356/827 (43%), Gaps = 159/827 (19%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAA--------VPRLGIKGYEWWSEALH 92
           PF   SLP   RV+DL+ RL+ +E ++ + +G A         + RL I  Y+W      
Sbjct: 28  PFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW------ 81

Query: 93  GVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY 152
             +N G G          T FPQ +   ++F+      + R    E RA +N   A  TY
Sbjct: 82  -RTNPGDGR--------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTY 132

Query: 153 --------WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG------------ 192
                   ++P VN+ R P WGR QET GEDP + GK A ++VRGL G            
Sbjct: 133 RDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQ 192

Query: 193 ---SDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKV 249
              S  D L V A+CKHF  +       V R  F A V+  D+  T+   FR C+  G V
Sbjct: 193 NLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAGAV 252

Query: 250 ASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFTSTPEEA 308
            SVMC+Y+ +NG P C +  +L   +R +W+  G++V+DC ++  V +  Q F    E A
Sbjct: 253 -SVMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETA 311

Query: 309 AADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGE---PS 365
            A A+RAG++L+          S +   L S     + L+          +  GE     
Sbjct: 312 MA-AVRAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEFNPVE 370

Query: 366 SQPYGHLGPKD-VCTPDHQELALEAARQGIVLLKNQGPSLPLSH--------IRHRTVAV 416
             PY  L P++ +   DH+ +AL    + IVLL+N+   LPL +        +RH  +A+
Sbjct: 371 MDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRH--IAI 428

Query: 417 IGPNSDVTVTMIGNY-AGIACGYTTPL-QGIGRYARTIHQQG-CKD----VACADDQLFG 469
           +GP +     + G+Y          PL +G+ + +R +H    C D     +  DD L  
Sbjct: 429 VGPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLNDDALHS 488

Query: 470 AAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKG--------- 520
                    D  +L +G    +E E +DR  + LPG+Q EL+ +    S G         
Sbjct: 489 TL--GYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKR 546

Query: 521 --PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSN--------- 569
             P IL++ S GPI+++ A  +  + AI W G+PG   G A+  +L G+S          
Sbjct: 547 TVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFGPSKPI 606

Query: 570 ------------------------PGGKLPMTWYPQ-EYITNLPMTEMAMRPSQSKRY-P 603
                                   P  +LP TWY   + + N+ + EM    +Q+ RY P
Sbjct: 607 SVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEMT---NQTYRYLP 663

Query: 604 GRTY---RFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKV 660
            + +      K PV+YPFG+G+SY NF              L G  G + + +    I  
Sbjct: 664 TQCHMSSEDCKIPVLYPFGYGLSY-NF-------------NLSGASGFVYSDL----IAP 705

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW-----------APHKQLVAFE 709
           + A  +   +   V V+N G       + V++      W            P  QL  FE
Sbjct: 706 SSAVSSNQRIVFYVTVQNEGPIACEEVVQVYT-----KWLNRTENDNSRNGPLIQLAGFE 760

Query: 710 KVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIP-LGEHNIHIGGTK 755
           +V +  G  +++   +   ++L+V   S    IP  G   I +GG +
Sbjct: 761 RVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIPGRGVLQISVGGQQ 807


>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
 gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
          Length = 735

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 232/795 (29%), Positives = 369/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EK+
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        H +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+ RLG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D +R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL MT          P +   +    ++R  GR ++ FYK      +YPFGHG+SYT 
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +                      T++  A KV         L  +V V N G++DGA 
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  + AG  +    +I + +    V+  G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E++I + G    + L
Sbjct: 719 EYHILVQGQTVKIEL 733


>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
 gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
          Length = 955

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 215/759 (28%), Positives = 357/759 (47%), Gaps = 99/759 (13%)

Query: 20  SGLAAREPFACDPKDATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVP 77
           S L A +    +  + T R   +   SLP+ +RV  L+  ++ ++K++L+  G     +P
Sbjct: 151 SVLVAEKVLKDNKGEVTDRR--YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIP 208

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
            L +       EA+HG S       +G    GAT FPQ +   +++N  L E +  V+ D
Sbjct: 209 HLYVPPITK-VEAVHGFS-------YGS---GATIFPQALAMGATWNRKLTEEVAMVIGD 257

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
           E   + N   A    WSP +++ +D RWGR +ET GEDPVL  +   ++++G Q      
Sbjct: 258 ET-VVANTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRG--- 309

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
             +  + KHF  +      G D       +S++++ +   VPFR  V      S+M +Y+
Sbjct: 310 --LFTTPKHFGGHGAP-LGGRDSHDIG--LSEREMREVHLVPFRHVVRNYDCQSLMMAYS 364

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAG 316
              G+P      +L++ +R EW  NG+IVSDC ++G     +H+T+  + EAA  A+ AG
Sbjct: 365 DYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAG 424

Query: 317 LDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGP 374
           +  +CG  +       A + G ++ ++++N     L    R  +F+  P     +  + P
Sbjct: 425 IATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYP 484

Query: 375 KDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTM 427
               +  H+E+A +AAR+ IV+L+N+   LPLS    +T+AV+GP +D            
Sbjct: 485 -GWNSDRHREMARQAARESIVMLENKDNLLPLSKTL-KTIAVLGPGADDLQPGDYTPKLQ 542

Query: 428 IGNYAGIACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGL 487
            G    +  G       +G+  + +++QGC D    D      A+ A+ Q+D  ++V+G 
Sbjct: 543 PGQLKSVLSGIKA---AVGKQTKVLYEQGC-DFTTPDATNIPKAVKAASQSDVVVMVLGD 598

Query: 488 DQSIEA---------EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAK 538
             + EA         E  D A L+LPG+QQEL+  V    K P +L+L +G P D+  A 
Sbjct: 599 CSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDLLKAS 657

Query: 539 NDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQ 598
              +   + W   PGQ GG A AD+LFG  NPGG+LPMT +P+ ++  LP+       + 
Sbjct: 658 EMCKAILVNW--LPGQEGGPATADVLFGDYNPGGRLPMT-FPR-HVGQLPL--YYNFKTS 711

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
            +RY      FY    +Y FG+G+SYT+F ++                  I    +G  +
Sbjct: 712 GRRYEYVDMEFYP---LYRFGYGLSYTSFEYS---------------DLKIQEKSNGNVM 753

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHK----QLVAFEKVHVP 714
                        VQ  VKNVG   G     ++ T     +A  K    +L  F ++H+ 
Sbjct: 754 -------------VQATVKNVGGCAGDEVAQLYIT---DMYASVKTRVMELKDFTRIHLQ 797

Query: 715 AGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
            G  + V   +     +S+++    R +  GE  + +GG
Sbjct: 798 PGESKNVSFELTPYD-ISLLNDRMDRVVEKGEFKVMVGG 835


>gi|329954674|ref|ZP_08295734.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328527215|gb|EGF54219.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 855

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 212/699 (30%), Positives = 324/699 (46%), Gaps = 95/699 (13%)

Query: 88  SEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGT 147
           +EALHG ++            G+T +PQ I   S+FN  L     ++++ E  A    G 
Sbjct: 120 AEALHGSAH-----------EGSTIYPQNIALGSTFNPDLAYRKAKMIAGELHAQ---GM 165

Query: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHF 207
             +   +P V++ RD RWGR +E+ GEDP L G +  + V+G Q        ++   KHF
Sbjct: 166 KQIL--APCVDVVRDLRWGRVEESYGEDPFLCGLFGIAEVKGYQEQG-----ISPMLKHF 218

Query: 208 TAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCAD 267
             +  +   G++       V  +D+ D +  PF M +    + +VM +YN  N +P  A 
Sbjct: 219 GPHG-NPLGGLNLASVECGV--RDLHDIYLKPFEMIIKSQPLKAVMSTYNSWNRIPNSAS 275

Query: 268 PNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDL----DCGP 323
             +L   +R +W   GY+ SD  ++ +     H  ++PEE A  A+ AGLD+    +C P
Sbjct: 276 HYLLTDVLRKQWGFTGYVYSDWGAIEMLQTFHHTANSPEECALQALMAGLDVEASSECYP 335

Query: 324 FLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQ 383
            L    + AV+ G L    I+ A+   LT +   G+F+ +P  + Y       V + +  
Sbjct: 336 AL----KQAVKEGRLPVSYIDEAVRRVLTAKFETGLFE-DPFGEKY---RAGKVHSEESV 387

Query: 384 ELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGY--TTP 441
            LA E A + +VLLKN+G  LPL   R   VAVIGPN+D      G+Y          TP
Sbjct: 388 RLAREIADESVVLLKNEGKLLPLDEKRLDAVAVIGPNAD--QVQFGDYTWSRSNKDGVTP 445

Query: 442 LQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---- 493
           LQGI    G   +  + +GC D+   D  L   A++A+R++D  IL +G   +  A    
Sbjct: 446 LQGIRRLVGNKVKVHYAKGC-DMMSPDTSLIAKAVEAARKSDVAILFVGSSSASLARDYS 504

Query: 494 -----EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIW 548
                E  D   L L G Q  L+  V  A+  P +LVL+SG P  +++ K    I AI+ 
Sbjct: 505 NTNCGEGFDLNDLTLAGAQSGLIKAV-YATGTPVVLVLVSGKPFVLSWEKE--HIPAILA 561

Query: 549 AGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRY 602
             Y G+  G +IADILFG  NP G L  + +PQ       Y  +LP  +   +   S   
Sbjct: 562 QWYAGEQEGNSIADILFGRVNPSGHLTFS-FPQSTGHLPVYYNHLPSDKGFYKKPGSYSE 620

Query: 603 PGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTH 662
           PGR Y F     ++ FGHG+SYT+FV                    I+ ++   +I V  
Sbjct: 621 PGRDYVFSSPAALWSFGHGLSYTDFVFE-----------------GIDTSLRNDSIYV-- 661

Query: 663 AKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRV 721
            KC          ++NVG++ G   + ++          P +QL AFEKV + AG    V
Sbjct: 662 -KCT---------LRNVGNRSGKEVVQLYVRDLVSSVVMPVQQLKAFEKVGLKAGESSEV 711

Query: 722 GINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
            +++ V + L++ D  G R I  G     +G     + L
Sbjct: 712 QLSLPVSE-LAISDNDGKRVIEPGAFEFQLGNASDRILL 749


>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
 gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 736

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 207/696 (29%), Positives = 317/696 (45%), Gaps = 102/696 (14%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 102 GATVFPTGIGMAATWSPELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 156

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 157 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 213

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G P  ++  +L + +R EW+  G++VS
Sbjct: 214 GIRDLHQNFLPPFRKAIDAGAL-SVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVS 272

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 273 DLYSIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAV 332

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 333 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
              ++ VAVIGPN+D    M+G+Y          T L GI         +  +  A  D 
Sbjct: 390 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 448

Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIE-AEALDRAG 500
            +     AI A+R+++                      A +   G    +E  E  DRA 
Sbjct: 449 TVNEIEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 508

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG AI
Sbjct: 509 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 565

Query: 561 ADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           AD+LFG  NP G+LP++          +Y ++   N    EM+  P              
Sbjct: 566 ADVLFGDYNPSGRLPISVPRSVGQIPVYYNKKAPRNHDYVEMSSFP-------------- 611

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
               +Y FG+GMSYT F +                        S   +    A+C     
Sbjct: 612 ----LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC----F 639

Query: 671 GVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V   VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   +    
Sbjct: 640 EVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEED 699

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
           +  +V+ +  + +  G  ++ IG   + + L    L
Sbjct: 700 FF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 734


>gi|354724530|ref|ZP_09038745.1| beta-D-glucoside glucohydrolase [Enterobacter mori LMG 25706]
          Length = 765

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 208/689 (30%), Positives = 334/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A + AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++ ++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTDDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNTAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDIPIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F   V++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVSDVK--MSAPTMKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    VDV N G ++GA  + ++     A    P KQL  F+KV +  G  + V  
Sbjct: 673 ---------VDVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFDKVSLKPGETKTVSF 723

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++     +  G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQQMKYVVEPGKFNVFIG 751


>gi|295098160|emb|CBK87250.1| beta-glucosidase [Enterobacter cloacae subsp. cloacae NCTC 9394]
          Length = 765

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 214/692 (30%), Positives = 337/692 (48%), Gaps = 108/692 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKSAVGDNA 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  I V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDVVIAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNF-VHTVA-NAPTVVAVPLDGRH-GSINATISGKAIKVT 661
           R +    GP +YPFG+G+SYT F V  V  +APT+       +H G + A+         
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTFKVSDVKMSAPTL-------KHDGKVTAS--------- 672

Query: 662 HAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
                       V+V N G ++GA  + ++     A    P KQL  FEKV++  G  + 
Sbjct: 673 ------------VEVTNSGKREGATVIQMYIQDVTASMSRPVKQLRGFEKVNLKPGETRT 720

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           V   I V   L   ++        G+ N+ IG
Sbjct: 721 VSFPIDVNA-LKFWNQQMKYDAEPGKFNVFIG 751


>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
 gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
          Length = 782

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 207/696 (29%), Positives = 318/696 (45%), Gaps = 102/696 (14%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           GAT FP  I  A++++  L + +G+V++ E R+       G   + P +++ RDPRW R 
Sbjct: 148 GATVFPTGIGMAATWSLELVKEVGQVIAKEIRSQ-----GGHISYGPVLDLTRDPRWSRV 202

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 227
           +ET GEDPVLSG   AS V GL G +   +    A+ KHF AY +        +   A V
Sbjct: 203 EETFGEDPVLSGILGASMVDGLGGGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASV 259

Query: 228 SKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVS 287
             +D+   F  PFR  +  G + SVM SYN ++G+P  ++  +L + +R EW+  G++VS
Sbjct: 260 GIRDLHQNFLPPFRKAIDSGAL-SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVS 318

Query: 288 DCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNAL 347
           D  S+   +++     T E AA  ++ AG+D+D G     +   AVQ G + +  I+ A+
Sbjct: 319 DLYSIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAV 378

Query: 348 VNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLS 407
              L ++  +G+F+  P   P   +  K V   +H ELA + A+  I LLKN+   LPLS
Sbjct: 379 CRVLRMKFEMGLFE-HPYVDP--KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 408 HIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGIGRYARTIHQQGCKDVACADD 465
              ++ VAVIGPN+D    M+G+Y          T L GI         +  +  A  D 
Sbjct: 436 KTINK-VAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494

Query: 466 QL--FGAAIDASRQAD----------------------ATILVMGLDQSIEA-EALDRAG 500
            +     AI A+R+++                      A +   G    +E  E  DRA 
Sbjct: 495 TVNEIEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRAS 554

Query: 501 LLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAI 560
           L L GRQQEL+  +    K P I+V + G P++  +A       A++ A YPGQ GG AI
Sbjct: 555 LSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWASE--YADALLTAYYPGQEGGNAI 611

Query: 561 ADILFGTSNPGGKLPMT----------WYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY 610
           AD+LFG  NP G+LP++          +Y ++   N    EM+  P              
Sbjct: 612 ADVLFGDYNPSGRLPISVPRSVGQIPVYYNKKAPRNHDYVEMSSFP-------------- 657

Query: 611 KGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTL 670
               +Y FG+GMSYT F +                        S   +    A+C     
Sbjct: 658 ----LYSFGYGMSYTTFEY------------------------SDLQVVQKSARC----F 685

Query: 671 GVQVDVKNVGSKDGAH-TLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCK 729
            V   VKN G  DG   + L      A    P KQL  FE+ H+  G +++V   +    
Sbjct: 686 EVSFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTEED 745

Query: 730 YLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATL 765
           +  +V+ +  + +  G  ++ IG   + + L    L
Sbjct: 746 FF-LVNYTLKKVVESGNFHLMIGAASNDIRLQNVIL 780


>gi|281422547|ref|ZP_06253546.1| beta-glucosidase [Prevotella copri DSM 18205]
 gi|281403371|gb|EFB34051.1| beta-glucosidase [Prevotella copri DSM 18205]
          Length = 869

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 157/453 (34%), Positives = 238/453 (52%), Gaps = 28/453 (6%)

Query: 34  DATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHG 93
           +A+ + LP+   +L   +R  DL GRL+L+EK  L++  + A+PRLGIK ++WWSEALHG
Sbjct: 20  NASAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQWWSEALHG 79

Query: 94  VSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYN-----GGTA 148
           V+N+G            T FP+ I  A+SFN  +   +    SDE RA +N     GG  
Sbjct: 80  VANMG----------DVTVFPEPIGMAASFNDRMVYRVFDATSDEMRAKWNELQQKGGDV 129

Query: 149 ----GLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAAS 203
                L+ W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRGLQG  D    K+ A 
Sbjct: 130 TRFHALSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPEDTKYRKLWAC 189

Query: 204 CKHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            KH+  +    W    R   N   V+ +D+ +T+   F+  V + KV  VMC+Y + +  
Sbjct: 190 AKHYAIHSGPEWA---RHTDNITDVTPRDLWETYMPAFKSLVQDAKVREVMCAYQRWDDE 246

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           P C +  +L++ +R EW     +VSDC +V  +++    +S    AAA  + AG D++CG
Sbjct: 247 PCCGNTRLLQQILRDEWGFKYLVVSDCGAVTDFWENHKVSSNARNAAAKGVLAGTDVECG 306

Query: 323 -PFLGLHTESAVQRGLLS-EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTP 380
             ++      AV+ G L+ E +++  ++  L  +  LG  D +     +  +    +C+ 
Sbjct: 307 FNYIYKSVPEAVKYGALTEEEEVDKHVIRLLEGRFDLGEMD-DNKIVSWSKIPVSVLCSK 365

Query: 381 DHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTT 440
            H++L+L+ A Q + LL+N+   LPL   + + +A IGPN D    M GNY G      T
Sbjct: 366 AHRQLSLDMALQTMTLLQNKNEVLPLDK-KVKKIAFIGPNVDNEPMMWGNYNGTPRQTIT 424

Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAID 473
            L GI    +       K     +DQ   +  D
Sbjct: 425 ILDGIKSRLKKNQVVTFKGCDLVNDQTLDSYFD 457



 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 137/311 (44%), Gaps = 50/311 (16%)

Query: 468 FGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMA 517
           + A++   +  +  + V G+   +E E +          DR  + LP  Q+  +  +  A
Sbjct: 596 YQASLKQLKDCETVVFVGGISPQLEGEEMPIEISGFKGGDRTNIELPKVQRNFLKALKEA 655

Query: 518 SKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMT 577
            K   + V  SG  I  A         AI+ A YPGQ GG A+A +LFG  NP GKLP+T
Sbjct: 656 GK-KVVFVNCSGSAI--ALTPETESCDAILQAWYPGQEGGEAVARVLFGEYNPAGKLPIT 712

Query: 578 WYPQEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPT 636
           +Y       LP   + +M+        GRTYR Y    ++PFG+G+SYT+F         
Sbjct: 713 FYKNS--EQLPDFKDYSMK--------GRTYR-YMNDALFPFGYGLSYTSF--------- 752

Query: 637 VVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPA 696
                   R G  +AT+S   +K    K  ++TL  +V V NVG KDG   + V+   PA
Sbjct: 753 --------RMG--DATLSNSILK----KGEKITL--KVPVSNVGKKDGTEIVQVYVKDPA 796

Query: 697 GHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKH 756
               P K L AFE+V V AG      I +    +      + T R   G + ++ G +  
Sbjct: 797 DTEGPLKSLKAFERVEVKAGKTAEAVITLDSRNFELFDAATNTVRAKAGNYEVYYGSSSA 856

Query: 757 SVSLHAATLGV 767
              L    + +
Sbjct: 857 DKDLKKLDVSI 867


>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
 gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 865

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 235/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP  +R   L+  L+L+EKV L++  +  V RLGIK Y WW+EALHGV+  G   
Sbjct: 23  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA     T+        GLT W
Sbjct: 81  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYTSQESHERYQGLTMW 132

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG+ DG   K+ A  KHF  +  
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 192

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 193 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 249

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y      + P  E A+A A+ +G DL+CG       
Sbjct: 250 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEALV 309

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV++G + E  ++ A+   LT +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 310 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 367

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+  +LPL      T+AV+GPN++ +V   GNY G+     T L GI    
Sbjct: 368 ARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 426

Query: 446 GRYARTIHQQGC 457
           G   R I++QGC
Sbjct: 427 GSDDRLIYEQGC 438



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 144/300 (48%), Gaps = 47/300 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  + AD  + V G+  ++E E +          DR  + LP  Q+EL++ +  A K 
Sbjct: 594 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK- 652

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG PI         R  AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 653 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 710

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYR+     ++PFGHG+SYT+F +         AV
Sbjct: 711 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 753

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            L    GS N   SG+ ++++            V V N G  DG   + V+         
Sbjct: 754 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 796

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++VH+PAG    V  ++   + +    +S    +  G + + +GG+  +  L
Sbjct: 797 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNAMCVSEGSYELMVGGSSQTADL 856


>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 862

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 168/454 (37%), Positives = 242/454 (53%), Gaps = 35/454 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +  +SL   +R   L+  L+L+EKV L++ G+ AV RLGIK Y WW+EALHGV+  G   
Sbjct: 22  YKTISLSPEKRAELLLKELTLEEKVSLMMDGSKAVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARA--MYNGGTA------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA   Y           GLT W
Sbjct: 80  --------ATVFPQPIGMAASFSPQTVYGVFDAVSDEARAKNTYYASQGSYERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL 212
           +P VN++RDPRWGRG ET GEDP L+ +     V+GLQG +DG   K+ A  KHF  +  
Sbjct: 132 TPTVNVYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGKYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FN + +  +D+ +T+  PF   V E KV  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHEFNVENIKPRDLYETYLPPFEALVKEAKVKEVMCAYNRFEGDPCCGSDRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y+ +   + P  E A+A A+ +G DL+CG       
Sbjct: 249 MQILRNEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYKALI 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV+ G + E D++ A+   L  +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 309 E-AVKLGKIDEKDVDVAVKRLLAARFALGEMD-EPDKVSWTKIPFSVVASEKHDSLALDI 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA 449
           AR+ + LL N+   LPL      T+AV+GPN++ +V   GNY G+     T L G+ +  
Sbjct: 367 ARKSMTLLMNKNDILPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPSHTVTVLDGVRKAL 425

Query: 450 ----RTIHQQGCKDVACADDQLFGAAIDASRQAD 479
               + I++QGC      D  L  +A +  + AD
Sbjct: 426 DANDKLIYEQGC---GWVDRSLIQSAFNLCKSAD 456



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 143/300 (47%), Gaps = 47/300 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  + AD  I V G+  S+E E +          DR  + LP  Q+EL++ +  + K 
Sbjct: 592 SVEKVKGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAVQRELIAALHHSGK- 650

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             ILV  SG PI         +  AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 651 KVILVNCSGSPI--GLEPETKKCEAILQAWYPGQAGGTAVAEVLFGKYNPAGRLPVTFYR 708

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYRF     ++PFG+G+SYT F +          +
Sbjct: 709 N--VSQLPDFE-------DYDMSGRTYRFMTQEPLFPFGYGLSYTTFEYG--------EM 751

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            LD     INA   G+ +K+T            V + N G +DG   + V+         
Sbjct: 752 TLD--KNKINA---GEVLKLT------------VPITNTGKRDGEEVVQVYLRKHGDTDG 794

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AFE+V++PAG    V   +   +       S T R+  G+++I +G +  +  L
Sbjct: 795 PQKTLRAFERVNIPAGETVNVDFELKDRELEWWDVTSNTVRVCPGDYDIMVGSSSQTEDL 854


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 203/681 (29%), Positives = 331/681 (48%), Gaps = 74/681 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRGLQ +     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQHNH----QVAATGKHFAAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E+    PF+  + E  +  VM SYN  +G+P       L   +RGE    GY+VSD
Sbjct: 283 PREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
            D+V   Y T+H T+   +EA   ++ AGL++ C       F+ L     V+ G LSE  
Sbjct: 343 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLSEEV 400

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           IN+ + + L V+  +G+FD    +   G    ++V   +++ +AL+A+ + +VLLKN   
Sbjct: 401 INDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASHESVVLLKNADE 458

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK 458
            LPL     + +AV GPN++     + +Y  +A   TT L+GI       A  ++ +GC 
Sbjct: 459 LLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGCD 518

Query: 459 ------------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
                       D    DD+      A++ +RQAD  ++V+G  Q    E   R  L LP
Sbjct: 519 LVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDLP 578

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ +L+  +    K P +L+L++G P+ + +A  D  + AI+ A YPG  GGTA+ADIL
Sbjct: 579 GRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADIL 635

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGH 620
           FG  NPGGKL +T +P+  +  +P      +PS      + PG T    +    +YPFG+
Sbjct: 636 FGDYNPGGKLTVT-FPKT-VGQIPFN-FPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692

Query: 621 GMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVG 680
           G+SYT F ++  +    V  P +      +AT+    +KVT+         VQ+ +++V 
Sbjct: 693 GLSYTTFEYSDLDITPRVITPNE------SATVR---LKVTNTGKRAGDEVVQLYIRDVL 743

Query: 681 SKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           S    +                K L  F+++H+  G  Q +   I   K+L ++D     
Sbjct: 744 SSITTY---------------EKNLAGFQRIHLEPGEAQELSFTID-RKHLELLDADMKW 787

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
            +  G+  +  G +   + L+
Sbjct: 788 VVEPGDFVLMAGASSEDIRLN 808


>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 886

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 153/394 (38%), Positives = 216/394 (54%), Gaps = 37/394 (9%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+G+++  EK+  L++ A A+PRLG+  Y WW+E+LHG     P           T+FP+
Sbjct: 37  LVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTNFPE 86

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMY----------NGGTAGLTYWSPNVNIFRDPRW 165
            +  A++F+  L     + +S E R ++            GT GL  WSPN+NIFRDPRW
Sbjct: 87  PVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGT-GLDSWSPNINIFRDPRW 145

Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
           GRGQET GEDP L+ +   ++VRG+QG+D +R+ V A+ KH+  +         R   N 
Sbjct: 146 GRGQETYGEDPFLTARMGVAFVRGMQGTDPNRIDVVATPKHYAVHSGPEST---RHEANV 202

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            VS+ D+ DT+   FR  ++EG   S+MC+YN+++G P CA   +LK  +RG W   GY+
Sbjct: 203 YVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASDLLLKDYLRGRWGFRGYV 262

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
           VSDCD+V       H+   P  A A A+RAG+D +C       T+        A+ RGL+
Sbjct: 263 VSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTDTDGLTNPYRDALARGLI 322

Query: 339 SEIDINNALVNTLTVQMRLGMFDG-EPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLL 397
           S  DI+ +L      + R G   G  P S       P D+ TP HQ LALE A + +VLL
Sbjct: 323 SVADIDRSLTRLFAARYRTGDLPGVRPLSMQSSS--PADIGTPAHQALALETAEKSLVLL 380

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY 431
           KN+G  LPL       +AVIGP  D T  + GNY
Sbjct: 381 KNKG-ILPLKA--QARIAVIGPLGDATRVLRGNY 411



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/291 (32%), Positives = 147/291 (50%), Gaps = 52/291 (17%)

Query: 474 ASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTI 523
           A++ +DA + V+GL   +EAE            D+  L LP  QQ L+ +     K P I
Sbjct: 611 AAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQAKATGK-PLI 669

Query: 524 LVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEY 583
           +VLM+G P+++A+AK +    AI+ A YPGQAGG A+ ++L G +NP G+LP+T+Y    
Sbjct: 670 VVLMNGSPVNLAWAKAN--ADAIVEAWYPGQAGGLAVGNVLSGKTNPAGRLPLTFY--RS 725

Query: 584 ITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVP 641
           + +LP   + AM+        GRTYR++ G  VYPFG+G+SYT+F +  V   P V  V 
Sbjct: 726 VEDLPAFGDYAMK--------GRTYRYFTGTPVYPFGYGLSYTSFAYAPVKLTPAVTGV- 776

Query: 642 LDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAP 701
            DG                         L V  +V+N G + G     ++   P    AP
Sbjct: 777 QDG-------------------------LTVTTEVRNTGRRAGDEVAQLYLNFPKVDGAP 811

Query: 702 HKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
              L  F++VH+  G ++ +   +   + LS VD  G R +  G++ + +G
Sbjct: 812 RVALRGFQRVHLAPGERRMLRFTLD-PRDLSAVDADGHRLVMKGDYRVSVG 861


>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
 gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
          Length = 733

 Score =  258 bits (659), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 207/672 (30%), Positives = 331/672 (49%), Gaps = 77/672 (11%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  A+++N  +        S EA       T G+T+ ++P +++ RDPRWGR  
Sbjct: 112 TVFPIPLAMAAAWNPEVARQTSAAASTEAL------TDGVTWVFAPMIDVSRDPRWGRIA 165

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFNA 225
           E+ GEDP L+  Y  ++V G Q  +G     A+  KHF  Y +     ++N VD      
Sbjct: 166 ESIGEDPYLTAAYGRAWVEGSQIDNGPGRATASCPKHFAGYGMAEAGRDYNTVD------ 219

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            +S +++ D    PF+  V  G + S+M S+N++NG+P CA+  +LK  +R EW   G +
Sbjct: 220 -LSDRELRDIILPPFQDAVEAGAL-SIMASFNEINGIPACANEYLLKTILRDEWGFEGVV 277

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE---SAVQRGLLSEID 342
            SD +++ V        +  EEA    + AG D+D     G+ T      V+ G + E  
Sbjct: 278 ASDYNAL-VELIVHGVAANEEEACEMTVLAGCDMDMHS--GIFTRQLPKLVRAGRVPESV 334

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           +++++   L ++++LG+   E S               ++ ELA EAARQ IVLL+N+  
Sbjct: 335 VDDSVRRILAMKIKLGLL--EQSKSDVSQSAATQPLKSEYVELAREAARQSIVLLQNKEQ 392

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYA----RTIHQQG 456
            LPLS     ++AVIGP +D     +G +A  G +    T L+GI + A       + QG
Sbjct: 393 VLPLSKA-GASIAVIGPLADNATDPLGCWALDGRSDEVVTALEGIRQAAAEGTSIRYAQG 451

Query: 457 CKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSM 516
           C D+    ++ F AA++A+R +D  ++++G   ++  E+  RA L LPG+Q+ LV  V+ 
Sbjct: 452 C-DIDSDSEEGFEAALEAARSSDVVVMLLGESATMSGESRSRAALDLPGKQRALVEAVAK 510

Query: 517 ASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPM 576
             K P + V++SG P+   FA    + +AI+ A + G   G AIAD+LFG  NP G+LP+
Sbjct: 511 LGK-PIVAVILSGRPL--TFAWLPEQASAIVQAWHLGVQSGNAIADVLFGDFNPSGRLPV 567

Query: 577 TWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFY----KGPVVYPFGHGMSYTNFVHTVA 632
           T +PQ    N+    +     ++ R P   Y  Y        +YPFG+G++YT F     
Sbjct: 568 T-FPQ----NVGQIPIYHYRKKTGRPPAGAYSSYYIDSTTEPLYPFGYGLTYTEF----- 617

Query: 633 NAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF- 691
                        +G+I  + S               L V V ++NVG+  G   +  + 
Sbjct: 618 ------------EYGAIQTSKSSIGADE--------QLDVTVSIRNVGNLAGEEVVQCYV 657

Query: 692 STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHI 751
               A    P K+LVAF KV V AG    V   I   + L+++D+   R +  G+  + I
Sbjct: 658 RDEVASVTQPLKRLVAFRKVKVAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWI 716

Query: 752 G---GTKHSVSL 760
           G   G+  SVS+
Sbjct: 717 GPSAGSGESVSI 728


>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
 gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
          Length = 735

 Score =  258 bits (659), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EKV
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G   ++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-APTLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        H +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+ RLG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D +R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL MT          P +   +    ++R  GR ++ FYK      +YPFGHG+SYT 
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +                      T++  A KV         L  +V V N GS+DGA 
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGSRDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  + AG  +    +I + +    V+  G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E++I + G    + L
Sbjct: 719 EYHILVQGQTVKIEL 733


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 209/686 (30%), Positives = 324/686 (47%), Gaps = 103/686 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  ++  S++N  L+  + R V+ E R+       G   +SP +++ RDPRWGR 
Sbjct: 122 GGTVFPVPLSIGSTWNVDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNA-- 225
           +E  GEDP L  +YA + V GLQG   D    VAA+ KHF  Y      G      NA  
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHFVGY------GSSEGGRNAGP 230

Query: 226 -KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +  +++ +   +PF+  V E   AS+M +YN+++GVP   +  +L   +R EW  +G 
Sbjct: 231 VHMGTRELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGM 289

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
           +++DC ++ +            +AA  AIRAG+D++  G   G H + AV+   L    +
Sbjct: 290 VITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVL 349

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+   LT++ +LG+F+  P   P        + +  H  LA + A +GIVLLKN+  +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYVDP--QTAENVIGSEQHVGLARQLAAEGIVLLKNEAKA 406

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI----GRYA-RTIHQQG 456
           LPLS      +AVIGPN+D     +G+Y         TT L GI    G  A R ++  G
Sbjct: 407 LPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPG 465

Query: 457 CKDVACADD--QLFGAAIDASRQADATILVMG---------------------LDQSIE- 492
           C+     DD  + F  A+  + QAD  ++V+G                      D ++  
Sbjct: 466 CR---IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSD 522

Query: 493 ---AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
               E +DR  L L G Q ELV ++    K   I+V ++G PI   +   D    AI+ A
Sbjct: 523 MDCGEGIDRMTLQLSGVQLELVQEIHKLGK-RMIVVYINGRPIAEPWI--DEHADAILEA 579

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
            YPGQ GG A+ADILFG  NP GKL M+  P+ ++  LP+          KR  G+ Y  
Sbjct: 580 WYPGQEGGHAVADILFGDVNPSGKLTMS-IPK-HVGQLPVY------YNGKRSRGKRYLE 631

Query: 610 YKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
                 YPFG+G+SYT F ++ +   P V+                G A+          
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYSDIQMTPEVIGT-------------DGTAV---------- 668

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI-- 725
              V V+V N G  +G+  + ++ +  A  +  P ++L  F+K+ +  G +++V   I  
Sbjct: 669 ---VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKVEFTIGP 725

Query: 726 ----HVCKYLSVVDRSGTRRIPLGEH 747
               ++ +    V   G  R+ LG H
Sbjct: 726 EQLQYIGQDYRQVVEPGLFRVMLGRH 751


>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
 gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
          Length = 898

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 156/396 (39%), Positives = 223/396 (56%), Gaps = 37/396 (9%)

Query: 55  DLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFP 114
           +L+G+++L EK++ L++ A A+PRLG+  Y WW+E+LHG     P           T+FP
Sbjct: 48  ELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTNFP 97

Query: 115 QVITTASSFNATLWEAIGRVVSDE-------ARAMYNGGT--AGLTYWSPNVNIFRDPRW 165
           + I  A++F+A L   +   +  E       ARA    G    GL  WSPN+NIFRDPRW
Sbjct: 98  EPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDPRW 157

Query: 166 GRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNA 225
           GRGQET GEDP L+ +   ++V G+QG D D  ++ A+ KHF  +         R   N 
Sbjct: 158 GRGQETYGEDPFLTARMGVAFVEGIQGDDPDHPRIIATPKHFAVHSGPEST---RHGANV 214

Query: 226 KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYI 285
            VS++D+EDT+   FR  V+EG+  S+MC+YN+++G P CA   +LK  +RG W+ +GY+
Sbjct: 215 FVSRRDLEDTYLPAFRAAVVEGRAGSIMCAYNRIDGQPACASDLLLKEHLRGAWKFDGYV 274

Query: 286 VSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLGLHTE-------SAVQRGLL 338
           VSDCD+V    D   +      A A A+RAG+D +C       T+        A+ RGL+
Sbjct: 275 VSDCDAVKDISDHHKYAPEAASAVAAALRAGVDNECNGATLTDTDGLAGRYREALDRGLI 334

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKDVCTPDHQELALEAARQGIVLL 397
           S   I+ ALV   + + R G    +  S   G L GP  V T +H+ LAL A+ + +VLL
Sbjct: 335 STAQIDTALVRLFSARFRNGDLPAKGGSD--GRLAGPSVVTTHEHEALALAASEKSLVLL 392

Query: 398 KNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA 432
           KN G  LPL   +R   +AVIGP  D T  + GNY+
Sbjct: 393 KNDG-VLPLKPGLR---IAVIGPLGDATRVLRGNYS 424



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 48/284 (16%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           D  + V+GL   +EAE            D+  L LP  Q+ ++ +     K P I+V M+
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQAKALGK-PLIVVAMN 686

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           G P++ A+AK++   +A++ A YPGQ+GG AIA++L G +NP G+LP+T+Y    + +LP
Sbjct: 687 GSPLNFAWAKDN--ASALLEAWYPGQSGGLAIANVLTGKTNPAGRLPLTFY--RSVDDLP 742

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
                  P       GRTYR+++G  VYPFG+G+SYT F +     P  +     G    
Sbjct: 743 -------PFDDYAMAGRTYRYFEGTPVYPFGYGLSYTRFDY----GPLKIEPATKG---- 787

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
                +G+ ++VT              +KNVG++ G     ++   P    AP   L  F
Sbjct: 788 -----AGQGLRVT------------TTIKNVGTRPGEEVAQLYLDFPKTPGAPRLALRGF 830

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           +++ +  G  + +   +   + LS VD  G  R+  G + + +G
Sbjct: 831 QRIALKPGETRDITFALS-PRDLSSVDLDGEHRVSAGLYRVSVG 873


>gi|1483154|dbj|BAA13102.1| T-cell inhibitor(STI) [Salmonella enterica subsp. enterica serovar
           Typhimurium]
          Length = 765

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 323/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 397 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 455 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
             +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 LYFDKPNGP-LYPFGYGLSYTTF--TVSD--FTLSSPTMQRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEK+ +  G ++ V  
Sbjct: 673 ---------VKVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKITLKPGKRKTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 204/681 (29%), Positives = 319/681 (46%), Gaps = 76/681 (11%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRGLQ        VAA+ KHF AY  +          + + S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGLQQH------VAATGKHFAAYSNNKGAREGMARVDPQTS 280

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
             ++E+    PFR  + E  +  VM SYN  +G+P       L   +R E    GY+VSD
Sbjct: 281 PHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRGYVVSD 340

Query: 289 CDSVGVYYDTQHFTSTP-EEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEID 342
            D+V   Y T+H T+   +EA   ++ AGL++ C       F+ L     V+ G L E  
Sbjct: 341 SDAVEYLY-TKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFV-LPLRELVKEGGLDEET 398

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGP 402
           +N+ + + L V+  +G+FD    +   G    K+V   +++ +AL+A+R+ +VLLKN+  
Sbjct: 399 VNDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEENEAVALQASRESVVLLKNENS 456

Query: 403 SLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGCK 458
           +LPL+    + +AV GPN+D     + +Y  +A   TT L+GI       A  ++ +GC 
Sbjct: 457 TLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCD 516

Query: 459 DVACA-------------DDQL-FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
            V                D+Q     A++ +R+AD  ++V+G  Q    E   R+ L LP
Sbjct: 517 LVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLP 576

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ +L+  V    K P +L+L++G P+ V +A  D  + AI+ A YPG  GG A+ADIL
Sbjct: 577 GRQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADIL 633

Query: 565 FGTSNPGGKLPMTWYPQ---EYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHG 621
           FG  NPGGKL +T +P+   +   N P    +         P        G  +YPFG+G
Sbjct: 634 FGDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNAGPDGNMSRING-ALYPFGYG 691

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT F ++                   N  I+ K I        RL       V N G 
Sbjct: 692 LSYTTFEYS-------------------NLEITPKVITPNEKATVRLK------VTNTGK 726

Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
             G   + +++         + K L  FE++H+  G  + V   I   K+L ++D    R
Sbjct: 727 YAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTF-ILDRKHLELLDADMKR 785

Query: 741 RIPLGEHNIHIGGTKHSVSLH 761
            +  G+  I  G +   + L+
Sbjct: 786 VVEPGDFAIMAGASSEDIRLN 806


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 202/680 (29%), Positives = 313/680 (46%), Gaps = 84/680 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG V + EA A+      G T  +SP ++I +DPRWGR 
Sbjct: 158 ATYFPAQCGQGATWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP L G+     +  LQ     +  + A+ KHF  Y +       +   +  V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVA 266

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFRM   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V          +T E+  A A+ AGL++         F+ L    AV  G +S+  +
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVADGKISQETL 385

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +  +   L V+  LG+FD      PY   G    + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRVKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
              LPLS    R++AVIGPN+D    +I  Y        T  QGI     +   I+++GC
Sbjct: 441 MNLLPLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGC 499

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +                 + +L   AI A++QA+  ++V+G ++    E   R  L L
Sbjct: 500 DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+  V    K P +LVL+ G    + +A     + AI+ A +PG+  G A+A+ 
Sbjct: 560 PGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y      V+YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GVLYPFGHGLS 668

Query: 624 YTNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           YT F +       +   PL  G  G IN                     +   +KN G  
Sbjct: 669 YTTFSYG-----DLKISPLRQGVQGDIN---------------------ISCKIKNTGKI 702

Query: 683 DGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            G   + ++          + K L  FE++ + AG +Q V   +   + L + D++   R
Sbjct: 703 KGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLR-PQDLGLWDKNMNFR 761

Query: 742 IPLGEHNIHIGGTKHSVSLH 761
           +  G+  + IG +   + LH
Sbjct: 762 VEPGKFKVMIGSSSTDIRLH 781


>gi|395233504|ref|ZP_10411744.1| beta-D-glucoside glucohydrolase [Enterobacter sp. Ag1]
 gi|394732231|gb|EJF31938.1| beta-D-glucoside glucohydrolase [Enterobacter sp. Ag1]
          Length = 765

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 209/691 (30%), Positives = 328/691 (47%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRVS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSMMGRTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  AD  +LK  +R EW+  G 
Sbjct: 227 --MSPQRLFNDYMPPYKAALDAGS-GGVMVALNSLNGTPASADSWLLKDLLRDEWKFKGI 283

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P++A   A+ +G+++     +   +    V+ G +   +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPKDAVRIALNSGINMSMSDEYYSKYLPELVKSGAVPMKE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAESRLHRKEAREVARESMV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AVIGP +D     +G++  AG+A    TPL+GI    G+  
Sbjct: 397 LLKNRLQTLPLK--KTSTIAVIGPLADSKRDAMGSWSAAGVAAQSVTPLEGIKNAVGQQG 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + +  +G      KD+    +Q              +   A+  ++ AD  + V+G  Q 
Sbjct: 455 KVVFAKGANVTDEKDIVEFLNQYEPAVTVDPRTPQAMIDEAVKTAKNADVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  L LP  Q++L++ +    K P +LVLM+G P+  A  K + +  A++ A 
Sbjct: 515 MAHEASSRTNLTLPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEEQQADAMLEAW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYPG 604
           Y G  GG A+ADILFG  NP GKLPM+ +P+       Y ++L  T     P +  +Y  
Sbjct: 572 YGGTEGGNAVADILFGDYNPSGKLPMS-FPRSVGQIPTYYSHL-NTGRPYDPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F   V +    ++ P   R G + A+++          
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--KVGDVK--LSAPSMKRDGKVEASVT---------- 674

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                      V N G + G   + ++     A    P K+L  F+KV + AG  Q V  
Sbjct: 675 -----------VTNTGDRAGETVVQMYLQDVTASMSRPVKELKGFKKVALKAGESQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   +        G+ N+ IG
Sbjct: 724 PIDISALKFWNAKMKYDAEP---GKFNVFIG 751


>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
 gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
          Length = 735

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 231/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EK+
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        H +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+ RLG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNNNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D +R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL MT          P +   +    ++R  GR ++ FYK      +YPFGHG+SYT 
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +                      T++  A KV         L  +V V N G++DGA 
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  +  G  +    +I + +    V+  G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E++I + G    + L
Sbjct: 719 EYHILVQGQTVKIEL 733


>gi|381405212|ref|ZP_09929896.1| beta-D-glucoside glucohydrolase [Pantoea sp. Sc1]
 gi|380738411|gb|EIB99474.1| beta-D-glucoside glucohydrolase [Pantoea sp. Sc1]
          Length = 765

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 205/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS++    + +GRV      + Y     GL   W+P V++ R+PRWGRG 
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVTREPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   S V+ +QG S  DR  V  S KHF  Y       ++N VD     
Sbjct: 172 EGFGEDTYLTSEMGRSMVQAMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM + N +NGVP  AD  +LK  +RG+W+  G 
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRGQWKFKGI 283

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P++A   A+++G+D+     +   +    V+RG +S  +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQDAVRIALKSGVDMSMSDEYYSKYLPDLVKRGEVSMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           I++A  + L V+  +G+F+      PY HLGPK+    D       H+  A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
           LLKN+  +LPL   +  T+A+IGP +D    ++G++  AG+A    + LQG+        
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454

Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
                + A     +G +D       A + D     Q+   A+  ++QAD  +  +G  Q 
Sbjct: 455 TLLYEKGANVSDNKGIQDFLNLYEPAVSIDTRSPQQMIDDAVAKAKQADVIVAAVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
           +  EA  R+ L++P  QQ+L++ +    K P ++VLM+G  + V    N+ R+A A++  
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRALSVV---NEDRMADAMLET 570

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
            + G  GG AIAD+LFG  NP GKLP++ +P+       Y  +LP T      ++  +Y 
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
              Y    GP +YPFG+G+SYT F  TV  +P  ++ P   R+GSI A+++         
Sbjct: 629 SHYYDAVNGP-LYPFGYGLSYTRF--TV--SPVKMSSPTMPRNGSIEASVT--------- 674

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
                       V N G +DGA  + ++   P    + P ++L  F+++ + AG  Q V 
Sbjct: 675 ------------VTNSGQRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722

Query: 723 INIHV 727
             I V
Sbjct: 723 FKIDV 727


>gi|146312373|ref|YP_001177447.1| beta-galactosidase [Enterobacter sp. 638]
 gi|145319249|gb|ABP61396.1| beta-glucosidase [Enterobacter sp. 638]
          Length = 772

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 208/690 (30%), Positives = 330/690 (47%), Gaps = 104/690 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 125 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 178

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 179 EGFGEDTYLTATLGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 233

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 234 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 290

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H   S PE+A   A++AG+++     +   +    V+ G ++  +
Sbjct: 291 TVSDHGAIKEL--IKHGAASDPEDAVRVALKAGINMSMSDEYYSKYLPDLVKTGKVTMTE 348

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPKD    D       H++ A E AR+ +V
Sbjct: 349 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPADTNAESRLHRKDAREVARESLV 403

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 404 LLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNALGEDG 461

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      KD+    +Q              +   A++A++Q+D  + V+G  Q 
Sbjct: 462 KVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSAQAMIDEAVNAAKQSDVVVAVVGEAQG 521

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  A++   
Sbjct: 522 MAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADALLETW 578

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 579 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 636

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTV-VAVPLDGRHGSINATISGKAIKVTHA 663
           R +    GP +YPFG+G+SYT F     N   V ++ P   R G + A+           
Sbjct: 637 RYFDEANGP-LYPFGYGLSYTTF-----NVSDVKMSAPSLKRDGKVTAS----------- 679

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG 722
                     V+V N G ++GA  + ++     A    P KQL  FEKV +  G  + V 
Sbjct: 680 ----------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVDLKPGETKTVS 729

Query: 723 INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
             I V   L   ++        G+ N+ IG
Sbjct: 730 FPIDV-DALKFWNQQMKYDAEAGKFNVFIG 758


>gi|417419081|ref|ZP_12159904.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
 gi|353619740|gb|EHC70042.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
          Length = 769

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 205/681 (30%), Positives = 322/681 (47%), Gaps = 113/681 (16%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 108 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 161

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 162 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 216

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 217 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 273

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A++AG+D+     +   +    ++ G ++  +
Sbjct: 274 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAE 331

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 332 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESVV 386

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G  A
Sbjct: 387 LLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSVTVLAGIQNAVGDGA 444

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++QAD  + V+G  Q 
Sbjct: 445 KILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQG 504

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 505 MAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 561

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTW---------YPQEYITNLPMTEMAM------- 594
           + G  GG AIAD+LFG  NP GKLP+++         Y     T  P     +       
Sbjct: 562 FAGTEGGNAIADVLFGDYNPSGKLPISFPRSVGQIPVYYSHLNTGRPYYYSHLNTGRPGR 621

Query: 595 --RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
              P +  +Y  R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+
Sbjct: 622 PYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVTAS 676

Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKV 711
                                V+V N G ++GA  + ++     A    P KQL  FEK+
Sbjct: 677 ---------------------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKI 715

Query: 712 HVPAGAQQRVG--INIHVCKY 730
            +  G ++ V   I+I   K+
Sbjct: 716 TLKPGERKTVSFPIDIEALKF 736


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 197/680 (28%), Positives = 323/680 (47%), Gaps = 72/680 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 170 ATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 223

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRG+Q +     +VAA+ KHF AY  +          + ++S
Sbjct: 224 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 279

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E     PF+  + E  +  VM SYN  +G+P       L + +RGE    GY+VSD
Sbjct: 280 PREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGEMGFRGYVVSD 339

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDIN 344
            D+V   Y         +EA   ++ AGL++ C         L     V+ G LSE  IN
Sbjct: 340 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEDIIN 399

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
           + + + L V+  +G+FD    +   G    K+V   +++ +AL+A+R+ ++LLKN+   L
Sbjct: 400 DRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKAENEAVALQASRESLILLKNENNVL 457

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK-- 458
           PL     +T+AV GPN++     + +Y  +A    T L+GI +     A  ++ +GC   
Sbjct: 458 PLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCDLV 517

Query: 459 DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
           D    + +L               A++ +R+AD  ++V+G  Q    E   R+ L LPGR
Sbjct: 518 DANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLPGR 577

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           Q +L+  V    K P +LVL++G P+ + +A  D  + AI+   YPG  GGTA+AD+LFG
Sbjct: 578 QLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVLFG 634

Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHGM 622
             NPGGKL +T +P+  +  +P      +PS      + PG      +    +YPFG+G+
Sbjct: 635 DYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGPDGNMSRVNGSLYPFGYGL 691

Query: 623 SYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSK 682
           SYT F ++                   N  IS K +             V+  V N G +
Sbjct: 692 SYTTFEYS-------------------NIEISPKMMTANQKAT------VRCKVTNTGKR 726

Query: 683 DGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR 741
            G   + ++          + K L  FE+VH+  G  + V   I   K+L ++D+     
Sbjct: 727 AGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTF-ILDRKHLELLDKHMEWV 785

Query: 742 IPLGEHNIHIGGTKHSVSLH 761
           +  G+ +I +G +   + L+
Sbjct: 786 VEPGDFSIMVGASSEDIRLN 805


>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
           17393]
 gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 935

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 208/737 (28%), Positives = 356/737 (48%), Gaps = 91/737 (12%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
           +L +   +LP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S 
Sbjct: 148 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 205

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
                 +G    GAT FPQ +   +++N  L E +   V DE     + GT  +  WSP 
Sbjct: 206 ------YGS---GATIFPQALAMGATWNKKLTEEVAMAVGDET---LSAGT--MQAWSPV 251

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +++ +D RWGR +ET GEDPVL  +   ++++G Q      + +  + KHF  +      
Sbjct: 252 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS-----MGLYTTPKHFGGHGAP-LG 305

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           G D       +S++++ +   VPFR  +      S+M +Y+   GVP      +L   +R
Sbjct: 306 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSRELLHNILR 363

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
            EW  +G+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A +
Sbjct: 364 EEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAK 423

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
            G ++  +++      L +  R  +F+  P+    +  + P    +  H+E+A +AAR+ 
Sbjct: 424 DGRINMENLDEVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDSHKEMARQAARES 482

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNS------DVTVTMI-GNYAGIACGYTTPLQGIG 446
           IVLL+N+   LPLS    RT+AV+GP +      D T  +  G    +  G     Q +G
Sbjct: 483 IVLLENKDNILPLSK-DMRTIAVLGPGANDLQPGDYTPKLQPGQLKSVLTGIK---QAVG 538

Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
           +  + I++QGC   +  ++ +   A+  + Q+D  +LV+G   + EA         E  D
Sbjct: 539 KQTKVIYEQGCDFTSLGENNI-AKAVKVASQSDVVLLVLGDCSTSEATTDVYKTSGENHD 597

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
            A L+LPG+QQEL+  V    K P IL+L +G P +++ A    +   + W   PGQ GG
Sbjct: 598 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 654

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            A AD+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y  P+ Y 
Sbjct: 655 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEYY--PLYY- 707

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT+F ++                           +K+   +   +T  VQ  VK
Sbjct: 708 FGYGLSYTSFEYS--------------------------GLKIQEKENGNIT--VQATVK 739

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           N+G + G   + ++ T           +L  F ++H+  G  + V   +   + LS+++ 
Sbjct: 740 NIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIHLKPGEAKTVSFELTPYE-LSLLND 798

Query: 737 SGTRRIPLGEHNIHIGG 753
              R +  G   I +GG
Sbjct: 799 HMDRVVEKGAFKILVGG 815


>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
           cellulosilyticus DSM 14838]
 gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 808

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 156/403 (38%), Positives = 234/403 (58%), Gaps = 41/403 (10%)

Query: 61  SLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTA 120
           +++EK+ LL + +  + RL I  Y   +EALHGV  V PG +F       T FPQ I  A
Sbjct: 1   TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG-RF-------TVFPQAIGLA 50

Query: 121 SSFNATLWEAIGRVVSDEARAMYNGGTAG----------LTYWSPNVNIFRDPRWGRGQE 170
           +++N  L   +  V+SDEARA +N    G          LT+WSP VN+ RDPRWGR  E
Sbjct: 51  ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110

Query: 171 TPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQ 230
           T GEDP LSG    ++V+GLQG D   LK+ ++ KHF A + ++    +RF  N ++S++
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAANNEEH----NRFVCNPQISEK 166

Query: 231 DIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCD 290
            + + +   F  CV +GK AS+M +YN +N VP   +  +L + +R +W   GY+VSDC 
Sbjct: 167 QLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCG 226

Query: 291 SVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG------PFLGLHTESAVQRGLLSEIDIN 344
              +  +   +  T E AAA +I+AGLDL+CG      P L     SA ++ ++++ DI+
Sbjct: 227 GPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL-----SAYRQYMVTDADID 281

Query: 345 NALVNTLTVQMRLGMFD-GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           +A    L  +M LG+FD GE    PY  + P  + + +HQE+AL AAR+ IVLLKNQ   
Sbjct: 282 SAAYRVLRARMELGLFDSGE--QNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKM 339

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGI-ACGYTTPLQGI 445
           LPL+  + +++AV+G N+    +  G+Y+G+      + LQGI
Sbjct: 340 LPLNARKVKSIAVVGINA--GSSEFGDYSGLPVIAPISVLQGI 380



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 43/297 (14%)

Query: 467 LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVL 526
           L+G A  A R+ +  + V+G+++SIE E  DR  + LP  QQE + ++   +  P I+V+
Sbjct: 545 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVN--PNIVVV 602

Query: 527 MSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITN 586
           +  G   +A    D  I AI+ A YPG++GG A+A++LFG  NPGG+LP+T+Y    +  
Sbjct: 603 LVAGS-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY--RSLDE 659

Query: 587 LPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
           LP  +            GRTY+++KG V+YPFG+G+SYT F ++            DG  
Sbjct: 660 LPPFD------DYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVA-------DGEE 706

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQL 705
                                  + V   +KN G   G     V+   P      P K+L
Sbjct: 707 ----------------------EINVSFQLKNSGKYAGDEVAQVYVKLPERDEIMPIKEL 744

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI-PLGEHNIHIGGTKHSVSLH 761
             FE+V + +G  ++V + +     L   D +  + + P G++ I +G +   + L 
Sbjct: 745 KGFERVTLKSGENKKVTLKLR-KDLLRYWDEAKDKFVCPSGDYTIMVGASSADIRLQ 800


>gi|354583533|ref|ZP_09002432.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353198174|gb|EHB63648.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 931

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 170/467 (36%), Positives = 251/467 (53%), Gaps = 35/467 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +RVNDL+ R +L+EK++L+    A +PRLG++ Y+  +E  HGV+ +G  
Sbjct: 4   PFQNPDLPLEERVNDLVSRFTLEEKIELMCQYQAEIPRLGVQKYKHGTEGAHGVAWLGE- 62

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNVN 158
                    AT FPQ    A ++N  L   IG V++DEAR  Y    A  GLT W+P V+
Sbjct: 63  ---------ATVFPQNTGLACTWNPELMRQIGSVIADEARVYYQRDKAVNGLTIWAPTVD 113

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           + RDPRWGR +E  GEDP L+G+ +   V+G+QG      K  A+ KHF      N N V
Sbjct: 114 MERDPRWGRTEEAYGEDPHLTGELSTGLVQGMQGDHPFYYKTVATLKHFYG----NNNEV 169

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
           DR + +  +  ++  + +   F     +G   S+M +YN +NG P C   + + R ++ E
Sbjct: 170 DRGNASVSIDPRNKREYYLKAFEPAFRKGGAKSMMTAYNGINGTP-CNLNDEVNRIVKQE 228

Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
           W ++G++V D  D +G   D  H+  +  EA A +++AG+D   D           A+++
Sbjct: 229 WGMDGFVVGDAGDVLGTVMD-HHYVDSYAEAVARSVKAGIDSITDDQEISFRALRDALEQ 287

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GLL+E D+++AL NT  V+ RLG FD E    PY H+    +C P+H EL+L AAR+ +V
Sbjct: 288 GLLTEQDLDHALRNTFRVRFRLGEFDPE-DRNPYSHVPESKLCAPEHAELSLRAARESVV 346

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYARTIHQQ 455
           LLKN G  LPL++    + AVIGP +D   T    Y+G      TPLQGI     T    
Sbjct: 347 LLKNDG-LLPLNN-NMPSAAVIGPLADEAFT--DWYSGTPPYRITPLQGI-----TEKMN 397

Query: 456 GCK-DVACADDQL---FGAAIDASRQADATILVMGLDQSIEAEALDR 498
           G K +     DQ+     A+      AD   L+ G ++S EA   +R
Sbjct: 398 GRKVNFHTGLDQVRLRSAASGQYVTLADDNRLLFGTNESSEAALFER 444



 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 130/261 (49%), Gaps = 38/261 (14%)

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E +DR  ++LP  QQ+L+  V  A+   T++V++   P  V + +    + +I++  + G
Sbjct: 562 ETIDRPDIVLPPSQQQLIQAVHAANPN-TVVVIVGSYPFAVNWEQE--HVPSIVFTSHAG 618

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           Q  G A+AD+L+G  NPGG+L MTWY    +  LP   M     + K    RTY+++ G 
Sbjct: 619 QELGRAVADVLYGDYNPGGRLNMTWYKS--VDQLPDL-MDYDIIKGK----RTYQYFDGD 671

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
           V+YPFGHG+SYT F +              GR      +IS   +K       ++T+ VQ
Sbjct: 672 VLYPFGHGLSYTRFAY--------------GR-----LSISEPVMKAD----GQVTVSVQ 708

Query: 674 VDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
             V+N G + G   + ++ +        P K L  F ++H+  GA + V   +   + L+
Sbjct: 709 --VRNEGDRAGDEVVQLYVSANNSRVVRPLKTLKGFRRIHLDPGASETVSFEL-AAEELA 765

Query: 733 VVDRSGTRR-IPLGEHNIHIG 752
             D +  R  I  GE+++ +G
Sbjct: 766 FWDVTRERYCIESGEYSVMVG 786


>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 864

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 169/432 (39%), Positives = 234/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP  +R   L+  L+L+EKV L++  +  V RLGIK Y WW+EALHGV+  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA   Y           GLT W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG+ DG   K+ A  KHF  +  
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y      + P  E A+A A+ +G DL+CG       
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEALV 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV++G + E  ++ A+   LT +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+  +LPL      TVAV+GPN++ +V   GNY G+     T L GI    
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425

Query: 446 GRYARTIHQQGC 457
           G   R I++QGC
Sbjct: 426 GTDDRLIYEQGC 437



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 144/300 (48%), Gaps = 47/300 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  + AD  + V G+  ++E E +          DR  + LP  Q+EL++ +  A K 
Sbjct: 593 SVERVKDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK- 651

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG PI         R  AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYR+     ++PFGHG+SYT+F +         AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFCYG--------AV 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            L    GS N   SG+ ++            + V V N G  DG   + V+         
Sbjct: 753 VL----GSDNIK-SGEKLR------------LNVPVTNTGKCDGEEVVQVYLKKNDDVEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++VH+PAG    V  ++   + +    +S T  +  G + + +GG+  +  L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTAGL 855


>gi|157144933|ref|YP_001452252.1| hypothetical protein CKO_00662 [Citrobacter koseri ATCC BAA-895]
 gi|157082138|gb|ABV11816.1| hypothetical protein CKO_00662 [Citrobacter koseri ATCC BAA-895]
          Length = 823

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 211/691 (30%), Positives = 335/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 176 TVFPISLGLASSFNLEAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 229

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 230 EGFGEDTYLTAIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 284

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 285 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKEVLRDEWGFKGI 341

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 342 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 399

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A + AR+ +V
Sbjct: 400 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKDARDVARESLV 454

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL+  +  T+AV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 455 LLKNRLETLPLN--KSGTIAVVGPLADSQRDVMGSWSAAGVASQSVTVLTGIKNALGDKG 512

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + I+ +G     D    D                + +   A++A++Q+D  + V+G  Q 
Sbjct: 513 KVIYAKGANVTNDKGIVDFLNQYEEAVKVAPRSPEAMIDEAVNAAKQSDVVVAVVGEAQG 572

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 573 MAHEASSRTDIQIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 629

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 630 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 687

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 688 RYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSAPTMKRDGAVTAS------------ 730

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 731 ---------VRVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVTLKPGETQTVSF 781

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 782 PIDIEALKFWN--QRMKYDAEP-GKFNVFIG 809


>gi|283832442|ref|ZP_06352183.1| periplasmic beta-glucosidase [Citrobacter youngae ATCC 29220]
 gi|291072102|gb|EFE10211.1| periplasmic beta-glucosidase [Citrobacter youngae ATCC 29220]
          Length = 765

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPVDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 397 LLKNRLDTLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAIGEQG 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + I+ +G     D    D                  +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVIYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIEALKFWN--QRMKYDAEP-GKFNVFIG 751


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 200/679 (29%), Positives = 311/679 (45%), Gaps = 82/679 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQ 169
           AT FP      +++N  L   IG V + EA A+          +SP ++I +DPRWGR  
Sbjct: 158 ATYFPAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCV 212

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSK 229
           ET GEDP L G+     +  LQ     +  + A+ KHF  Y +       +   +  V+ 
Sbjct: 213 ETYGEDPYLVGELGKQMITSLQ-----KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAP 267

Query: 230 QDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDC 289
           +++   +  PFRM   E     VM SYN  +G P     + L   +R EW   GY+VSD 
Sbjct: 268 REMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDS 327

Query: 290 DSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDIN 344
           ++V          +T E+  A A+ AGL++         F+ L    AV  G +S+  ++
Sbjct: 328 EAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVADGKISQETLD 386

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQG 401
             +   L V+  LG+FD      PY   G    + V + +HQ ++LEAARQ +VLLKN+ 
Sbjct: 387 KRVAEILRVKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEM 441

Query: 402 PSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGCK 458
             LPLS    R++AVIGPN+D    +I  Y        T  QGI     +   I+++GC 
Sbjct: 442 NLLPLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCD 500

Query: 459 DV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLP 504
            +                 + +L   AI A++QA+  ++V+G ++    E   R  L LP
Sbjct: 501 IIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLP 560

Query: 505 GRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADIL 564
           GRQ+EL+  V    K P +LVL+ G    + +A     + AI+ A +PG+  G A+A+ L
Sbjct: 561 GRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEAL 617

Query: 565 FGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSY 624
           FG  NPGG+L +T +P+  +  +P      +P   +      Y      V+YPFGHG+SY
Sbjct: 618 FGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GVLYPFGHGLSY 669

Query: 625 TNFVHTVANAPTVVAVPL-DGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           T F +       +   PL  G  G IN                     +   +KN G   
Sbjct: 670 TTFSYG-----DLKISPLRQGVQGDIN---------------------ISCKIKNTGKIK 703

Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G   + ++          + K L  FE++ + AG +Q V   +   + L + D++   R+
Sbjct: 704 GDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLR-PQDLGLWDKNMNFRV 762

Query: 743 PLGEHNIHIGGTKHSVSLH 761
             G+  + IG +   + LH
Sbjct: 763 EPGKFKVMIGSSSTDIRLH 781


>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
 gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
          Length = 869

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/403 (39%), Positives = 224/403 (55%), Gaps = 36/403 (8%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           L+ R++L EK+  L++ A A+PRLGI  Y WW+E+LHG     P           T+FP+
Sbjct: 25  LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
            I   +SF+A L + +   +S E R ++              GL  WSPN+NIFRDPRWG
Sbjct: 75  PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+ +   ++V G+QG + DR +V A+ KHF  +   N     R   N  
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPNADRPEVIATPKHFAVH---NGPESTRHEANVF 191

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           VS+ D+ DT+   FR  + E K  S+MC+YN+++G P CA+  +LK  +RG W   GY+V
Sbjct: 192 VSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACANDLLLKHYLRGAWGFTGYVV 251

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFL----GLHTE--SAVQRGLLS 339
           SDCD+V    D  H+      A A A+RAG+D +C G  L    GL      A+QR L+S
Sbjct: 252 SDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSDTAGLEDRYSEALQRELIS 311

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKN 399
             DI+ ALV     + R+G   G P + P   + P  + +  + +LAL+AA++ +VLLKN
Sbjct: 312 VADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAANSQLALDAAQKTLVLLKN 369

Query: 400 QGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL 442
            G  LPL       VAVIGP  D T  + GNY+    G  TP+
Sbjct: 370 DG-VLPLKA--GVKVAVIGPLGDATRVLRGNYSSPLSG--TPI 407



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/332 (32%), Positives = 161/332 (48%), Gaps = 52/332 (15%)

Query: 434 IACGYTTPLQGIGRYARTIHQQGCKDVACADDQLFGAA-ID-ASRQADATILVMGLDQSI 491
           +A   T  L+   RY  T+   G  D+A         A +D A+ QAD  + V+GL   +
Sbjct: 552 LATMKTVTLERGHRYPITVDATGGVDLAWKRISTTPEADLDRAAAQADVLVAVVGLTSDL 611

Query: 492 EAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDP 541
           EAE            D+  L LP  Q  ++ K + A+  P I+V M+G PID+A++K + 
Sbjct: 612 EAEETSVTVPGFAGGDKTTLDLPADQIAML-KQARATGKPLIVVAMNGSPIDLAWSKQN- 669

Query: 542 RIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKR 601
             AAI+ A YPGQ+GG AIA++L G ++PGG+LP+T+Y    + +LP       P    R
Sbjct: 670 -AAAILEAWYPGQSGGLAIANVLSGKTDPGGRLPLTFY--RSVADLP-------PFGDYR 719

Query: 602 YPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT-ISGKAIKV 660
             GRTYR++KG  VYPFG G+SYT F                  +G +N T  +G A   
Sbjct: 720 MEGRTYRYFKGEPVYPFGFGLSYTEFT-----------------YGPLNVTPAAGGAANG 762

Query: 661 THAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQR 720
            H         V   V N G ++G     ++   PA   AP   L  F++V +  G  + 
Sbjct: 763 LH---------VSTVVTNAGQREGGDVAQLYLRFPAIPGAPQVALRGFQRVALKPGETRT 813

Query: 721 VGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           V  ++   + LS V   G R +  G++ + +G
Sbjct: 814 VSFDLS-PRDLSAVTADGQREVMAGDYTVSVG 844


>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
 gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
          Length = 864

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 170/432 (39%), Positives = 235/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP  +R   L+  L+L+EKV L++  +  V RLGIK Y WW+EALHGV+  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA   Y           GLT W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG+ DG   K+ A  KHF  +  
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYY-DTQHFTS-TPEEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y D  H T    E A+A A+ +G DL+CG       
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSSYEALV 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV++G + E  ++ A+   LT +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+  +LPL      T+AV+GPN++ +V   GNY G+     T L GI    
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425

Query: 446 GRYARTIHQQGC 457
           G   R I++QGC
Sbjct: 426 GSDDRLIYEQGC 437



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 145/300 (48%), Gaps = 47/300 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  + AD  + V G+  ++E E +          DR  + LP  Q+EL++ +  A K 
Sbjct: 593 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK- 651

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG PI         R  AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYR+     ++PFGHG+SYT+F +         AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            L    GS N   SG+ ++++            V V N G  DG   + V+         
Sbjct: 753 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++VH+PAG    V  ++   + +    +S T  +  G + + +GG+  +  L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTADL 855


>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
 gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
 gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
          Length = 861

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 153/437 (35%), Positives = 231/437 (52%), Gaps = 29/437 (6%)

Query: 35  ATTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGV 94
           A+ +TLP+   +L   +R  DL  RL+L+EK  L++  + A+PRLGIK + WWSEALHG 
Sbjct: 17  ASAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGA 76

Query: 95  SNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------- 145
           +N+G            T+FP+ +  A+SFN  L   +  + S E RA YN          
Sbjct: 77  ANMG----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGED 126

Query: 146 -GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASC 204
                L+ W+PNVNIFRDPRWGRGQET GEDP L+       V+GLQG +  R +   +C
Sbjct: 127 MKMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGPEDARYRKLWAC 186

Query: 205 -KHFTAYDLDNWNGVDRFHFN-AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGV 262
            KH+  +    +    R   N   VS +D  +T+   F+  V + KV  VMC+Y +++  
Sbjct: 187 AKHYAVHSGPEYT---RHTANLTDVSARDFWETYMPAFKTLVKDAKVREVMCAYQRLDDD 243

Query: 263 PTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG 322
           P C    +L++ +R EW     +VSDC +V  +Y+    +S      + A+ AG D++CG
Sbjct: 244 PCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVLAGTDVECG 303

Query: 323 PFLGLHT-ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD 381
                 +   AV++GLLSE +++  ++  L  +  LG  D +PS   +  +    + T  
Sbjct: 304 FNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMD-DPSLVEWSKIPYSAMSTKA 362

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTP 441
              +AL+ ARQ IVLL+N+   LPL     + +A+IGPN+     M GNY G      T 
Sbjct: 363 SANVALDMARQTIVLLQNKNNILPLKKNAEK-IAIIGPNAHNEPMMWGNYNGTPNHTVTI 421

Query: 442 LQGI-GRYARTIHQQGC 457
           L G+  +  + ++  GC
Sbjct: 422 LDGVKAKQKKLVYIPGC 438



 Score = 99.4 bits (246), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 137/323 (42%), Gaps = 62/323 (19%)

Query: 459 DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQ 508
           DVA   +  +   I   +  +  I   G+  S+E E +          DR  + LP  Q+
Sbjct: 583 DVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQR 642

Query: 509 ELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTS 568
           E +  +  A K   I V  SG  I  A         AI+ A YPGQ GGTA+AD+LFG  
Sbjct: 643 EFLKALKAAGK-QVIYVNCSGSAI--ALQPETESCDAIVQAWYPGQEGGTAVADVLFGDY 699

Query: 569 NPGGKLPMTWYPQEYITNLPMTE-MAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNF 627
           NPGGKL +T+Y  +    LP  E  +M+        GRTYR++    ++PFG+G+SYT F
Sbjct: 700 NPGGKLSVTFYKND--QQLPDYEDYSMK--------GRTYRYFDD-ALFPFGYGLSYTTF 748

Query: 628 VHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG----VQVDVKNVGSKD 683
                                          +V  AK    T G    VQ+ V N G+K+
Sbjct: 749 -------------------------------EVGEAKVEAATDGALYNVQIPVTNTGTKN 777

Query: 684 GAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVD-RSGTRRI 742
           G+ T+ ++         P K L  FE++ + AG      + +   + L   D  + T R 
Sbjct: 778 GSETIQLYIRNLQDPDGPLKSLRGFERLDIKAGKTATANLKL-TKESLEFWDAETNTMRT 836

Query: 743 PLGEHNIHIGGTKHSVSLHAATL 765
             G++ I  G +     L   T+
Sbjct: 837 KPGKYEILYGTSSLDKDLKKLTI 859


>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
 gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
 gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
          Length = 735

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 231/795 (29%), Positives = 369/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EK+
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKI 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        + +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRYLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+ RLG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D +R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL MT          P +   +    ++R  GR ++ FYK      +YPFGHG+SYT 
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +                      T++  A KV         L  +V V N G++DGA 
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGARDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  + AG  +    +I + +    V+  G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E++I + G    + L
Sbjct: 719 EYHILVQGQTVKIEL 733


>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 770

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 203/685 (29%), Positives = 330/685 (48%), Gaps = 78/685 (11%)

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDP 163
           G + G+T +P  I  A++F+  L E + R  + E RA      +G+ + ++PNV + RD 
Sbjct: 141 GLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVEVARDA 194

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFT--AYDLDNWNGVDR 220
           RWGR  ET GEDP L G+  A+ VRG Q  D     KV A  KH    +   +  NG   
Sbjct: 195 RWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVIACAKHLVGGSQPANGINGAP- 253

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
               A++S++ +++ F  PF+ C +E  V +VM ++N++NG+P   +  ++   +R +W+
Sbjct: 254 ----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNKYLMTEVLRNQWK 308

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLS 339
            +G++VSD   +   +D  +   T ++A   ++ AG+ +   GP         V+ G + 
Sbjct: 309 FDGFVVSDWMDIERMHDYHNVAETLKDAYQISVDAGMGMHMHGPEFYEAIIECVKEGSIP 368

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--VCTPDHQELALEAARQGIVLL 397
           E  I+ A+   L V+ RLG+F+      P+  L  KD  V    HQ+ ALE AR+ IVLL
Sbjct: 369 EKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQTALEGARKSIVLL 423

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYARTIHQQ 455
           KN+G  LPL   +++ V V G N++   +++G++A        TT L+G+   +   +  
Sbjct: 424 KNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLKGLKAISPETNYN 482

Query: 456 GCK---DVACADDQLFGAAIDASRQADATILVMG-------LDQSIEAEALDRAGLLLPG 505
                 +V    D     A+  +R +D  ILV+G        ++    E  DR  L LPG
Sbjct: 483 FLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSMRYHWNEKTCGENSDRYELSLPG 542

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           RQQELV  V+ A+  PT+++L++G P+   +   D  +  II A  PG AGG A+A+IL+
Sbjct: 543 RQQELVEAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPGVAGGQALAEILY 599

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMS 623
           G  NP GKLP+T         +P +   ++   + ++    + +  G    +Y FG+G+S
Sbjct: 600 GKVNPSGKLPIT---------IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYEFGYGLS 650

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT + +           P              K++KVT            VDV N G  D
Sbjct: 651 YTTYKYENLKLSEATITP-------------DKSVKVT------------VDVTNTGKMD 685

Query: 684 GAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G  T+ L      +    P K+L  F ++ + AG  + V   +   + LS  D +    +
Sbjct: 686 GEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTL-TPEMLSYYDANMHYGV 744

Query: 743 PLGEHNIHIGGTKHSVSLHAATLGV 767
             G   I +G +     L +  L V
Sbjct: 745 EKGTFKIMVGASSRDTDLQSIILTV 769


>gi|390433227|ref|ZP_10221765.1| beta-D-glucoside glucohydrolase [Pantoea agglomerans IG1]
          Length = 765

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 205/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS++    + +GRV      + Y     GL   W+P V++ R+PRWGRG 
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   S V+ +QG S  DR  V  S KHF  Y       ++N VD     
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM + N +NGVP  AD  +LK  +R +W+  G 
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P+EA   A+++G+D+     +   +  + V+ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           I++A  + L V+  +G+F+      PY HLGPK+    D       H+  A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
           LLKN+  +LPL   +  T+A+IGP +D    ++G++  AG+A    + LQG+        
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454

Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
                + A     +G +D       A + D     QL   A+  ++QAD  +  +G  Q 
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQLIDDAVAKAKQADVVVAAVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
           +  EA  R+ L++P  QQ+L++ +    K P ++VLM+G P+ V    N+ R+A A++  
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSVV---NEDRMADAMLET 570

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
            + G  GG AIAD+LFG  NP GKLP++ +P+       Y  +LP T      ++  +Y 
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
              Y    GP +YPFG+G+SYT F  TV  +P  ++ P   R+GSI A+++         
Sbjct: 629 SHYYDAANGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIEASVT--------- 674

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
                       V N G +DGA  + ++   P    + P ++L  F+++ + AG  Q V 
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722

Query: 723 INIHV 727
             I V
Sbjct: 723 FKIDV 727


>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
 gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
          Length = 735

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EKV
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        H +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+  LG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFCLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNDNQILPLTN--KKKIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D +R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVARWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PVILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL MT          P +   +    ++R  GR ++ FYK      +YPFGHG+SYT 
Sbjct: 573 SGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F +                      T++  A KV         L  +V V N GS+DGA 
Sbjct: 624 FKY---------------------GTVTPSATKVKRGD----KLSAEVTVTNTGSRDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  + AG  +    +I + +    V+  G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELRHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E++I + G    + L
Sbjct: 719 EYHILVQGQTVKIEL 733


>gi|365108332|ref|ZP_09336233.1| periplasmic beta-glucosidase [Citrobacter freundii 4_7_47CFAA]
 gi|363640688|gb|EHL80138.1| periplasmic beta-glucosidase [Citrobacter freundii 4_7_47CFAA]
          Length = 765

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 864

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 169/432 (39%), Positives = 234/432 (54%), Gaps = 32/432 (7%)

Query: 42  FCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGT 101
           +   SLP  +R   L+  L+L+EKV L++  +  V RLGIK Y WW+EALHGV+  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 102 KFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM--YNGGTA------GLTYW 153
                   AT FPQ I  A+SF+      +   VSDEARA   Y           GLT W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDL 212
           +P VNI+RDPRWGRG ET GEDP L+ +     V+GLQG+ DG   K+ A  KHF  +  
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSG 191

Query: 213 DNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNIL 271
             WN   R  FNA+ +  +D+ +T+  PF   V EGKV  VMC+YN+  G P C    +L
Sbjct: 192 PEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDRLL 248

Query: 272 KRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHT 329
            + +R EW  +G ++SDC ++  +Y      + P  E A+A A+ +G DL+CG       
Sbjct: 249 MQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSSYEALV 308

Query: 330 ESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEA 389
           E AV++G + E  ++ A+   LT +  LG  D EP    +  +    V +  H  LAL+ 
Sbjct: 309 E-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLALDM 366

Query: 390 ARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI---- 445
           AR+ + LL N+  +LPL      TVAV+GPN++ +V   GNY G+     T L GI    
Sbjct: 367 ARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRKAL 425

Query: 446 GRYARTIHQQGC 457
           G   R I++QGC
Sbjct: 426 GTDDRLIYEQGC 437



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 145/300 (48%), Gaps = 47/300 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKG 520
           +++  + AD  + V G+  ++E E +          DR  + LP  Q+EL++ +  A K 
Sbjct: 593 SVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK- 651

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             +LV  SG PI         R  AI+ A YPGQAGGTA+A++LFG  NP G+LP+T+Y 
Sbjct: 652 KVVLVNCSGSPI--GLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNPAGRLPVTFYR 709

Query: 581 QEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAV 640
              ++ LP  E            GRTYR+     ++PFGHG+SYT+F +         AV
Sbjct: 710 N--VSQLPDFE-------DYNMTGRTYRYMTQEPLFPFGHGLSYTSFSYG--------AV 752

Query: 641 PLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA 700
            L    GS N   SG+ ++++            V V N G  DG   + V+         
Sbjct: 753 VL----GSDNIK-SGEKLRLS------------VPVTNTGKCDGEEVVQVYLKKNDDVEG 795

Query: 701 PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           P K L AF++VH+PAG    V  ++   + +    +S T  +  G + + +GG+  +  L
Sbjct: 796 PSKALRAFKRVHIPAGKTVDVEFDLGDKELVWWNPQSNTMCVSEGSYELMVGGSSQTADL 855


>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 954

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 212/737 (28%), Positives = 356/737 (48%), Gaps = 91/737 (12%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
           +L +   +LP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S 
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
                 +G    GAT FPQ +   +++N  L E +   V DE  A    GT  +  WSP 
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKKLTEDVAMAVGDETLA---AGT--MQAWSPV 270

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +++ +D RWGR +ET GEDPVL  +   ++++G Q S G    +  + KHF  +      
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           G D       +S++++ +   VPFR  +      SVM +Y+   GVP      +L   +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILR 382

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
            EW  +G+IVSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A +
Sbjct: 383 EEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSS-QPYGHLGPKDVCTPDHQELALEAARQG 393
            G ++  +++      L +  R  +F+  P+    +  + P    +  H+E+A +AAR+ 
Sbjct: 443 DGRINMENLDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARES 501

Query: 394 IVLLKNQGPSLPLSHIRHRTVAVIGPNSD-------VTVTMIGNYAGIACGYTTPLQGIG 446
           IV+L+N+   LPL+    RT+AV+GP +D           + G    +  G     Q +G
Sbjct: 502 IVMLENKDNILPLAK-DMRTIAVVGPGADDLQPGDYTPKLLPGQLKSVLTGIK---QAVG 557

Query: 447 RYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA---------EALD 497
           +  + +++QGC D   ++      A+ A+ Q+D  +LV+G   + E+         E  D
Sbjct: 558 KQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHD 616

Query: 498 RAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGG 557
            A L+LPG+QQEL+  V    K P IL+L +G P +++ A    +   + W   PGQ GG
Sbjct: 617 YATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNW--LPGQEGG 673

Query: 558 TAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYP 617
            A AD+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      FY  P+ Y 
Sbjct: 674 PATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDMEFY--PLYY- 726

Query: 618 FGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVK 677
           FG+G+SYT+F +                        SG  I+    + +   + +Q  VK
Sbjct: 727 FGYGLSYTSFEY------------------------SGLKIQ----EKDNGNVAIQATVK 758

Query: 678 NVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDR 736
           NVG + G   + ++ T           +L  F +VH+     + V   +   + LS+++ 
Sbjct: 759 NVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPDESKIVSFELTPYE-LSLLND 817

Query: 737 SGTRRIPLGEHNIHIGG 753
              R +  GE  I +GG
Sbjct: 818 RMDRVVEKGEFKILVGG 834


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 198/679 (29%), Positives = 312/679 (45%), Gaps = 82/679 (12%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT FP      +++N  L   IG V + EA A+      G T  +SP ++I +DPRWGR 
Sbjct: 158 ATYFPAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRC 211

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
            ET GEDP L G+     +  LQ     +  + A+ KHF  Y +       +   +  V+
Sbjct: 212 VETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVA 266

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++   +  PFRM   E     VM SYN  +G P     + L   +R EW   GY+VSD
Sbjct: 267 PREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDI 343
            ++V    +      T E+  A A+ AGL++         F+ L    AV  G +S+  +
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-LPLRKAVDDGKISQETL 385

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQ 400
           +  +   L ++  LG+FD      PY   G    + V + +HQ ++LEAARQ +VLLKN+
Sbjct: 386 DKRVAEILRIKFWLGLFD-----NPYRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNE 440

Query: 401 GPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR---YARTIHQQGC 457
              LPLS    R++AVIGPN+D    +I  Y        T  QGI     +A  I+++GC
Sbjct: 441 THLLPLSK-SIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGC 499

Query: 458 KDV--------------ACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLL 503
             +                 + +L    I A++QA+  ++V+G ++    E   R  L L
Sbjct: 500 DIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNL 559

Query: 504 PGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADI 563
           PGRQ+EL+  V    K P ILV++ G    + +A     + AI+ A +PG+  G A+A+ 
Sbjct: 560 PGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEA 616

Query: 564 LFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMS 623
           LFG  NPGG+L +T +P+  +  +P      +P   +      Y       +YPFGHG+S
Sbjct: 617 LFGDYNPGGRLAVT-FPKS-VGQIPFA-FPFKPGSDESSSTSVY-----GALYPFGHGLS 668

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT F ++  +    ++    G  G I            H  C          +KN G   
Sbjct: 669 YTTFTYSDLH----ISPSHQGVQGDI------------HVSCK---------IKNTGKIK 703

Query: 684 GAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G   + ++          + K L  FE++ + AG +Q V   +   + L + D++   R+
Sbjct: 704 GDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLR-PQDLGLWDKNMNFRV 762

Query: 743 PLGEHNIHIGGTKHSVSLH 761
             G   + +G +   + LH
Sbjct: 763 EPGSFKVMLGASSTDIRLH 781


>gi|440286925|ref|YP_007339690.1| beta-glucosidase-like glycosyl hydrolase [Enterobacteriaceae
           bacterium strain FGI 57]
 gi|440046447|gb|AGB77505.1| beta-glucosidase-like glycosyl hydrolase [Enterobacteriaceae
           bacterium strain FGI 57]
          Length = 765

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 211/689 (30%), Positives = 330/689 (47%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKEILRDKWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   A+++G+++     +   +    V+ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRKDARDVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 397 LLKNRLETLPLK--KSGTVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNALGDKG 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++Q+D  + V+G  Q 
Sbjct: 455 KVVYAKGANVTNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVAAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTNIEIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSAPSMTRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                    VDV N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 673 ---------VDVTNSGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVTLKPGETQTVSF 723

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I V   L   ++S       G+ N+ IG
Sbjct: 724 PIDV-DALKFWNQSMKYDAEPGKFNVFIG 751


>gi|237732180|ref|ZP_04562661.1| yohA [Citrobacter sp. 30_2]
 gi|226907719|gb|EEH93637.1| yohA [Citrobacter sp. 30_2]
          Length = 765

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 204/669 (30%), Positives = 324/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKY 730
            I+I   K+
Sbjct: 724 PIDIEALKF 732


>gi|455646191|gb|EMF25234.1| beta-D-glucoside glucohydrolase [Citrobacter freundii GTC 09479]
          Length = 765

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 210/691 (30%), Positives = 333/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIEALKFWN--QRMQYDAEP-GKFNVFIG 751


>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
 gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
          Length = 770

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 203/685 (29%), Positives = 330/685 (48%), Gaps = 78/685 (11%)

Query: 105 GDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDP 163
           G + G+T +P  I  A++F+  L E + R  + E RA      +G+ + ++PNV + RD 
Sbjct: 141 GLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVEVARDA 194

Query: 164 RWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFT--AYDLDNWNGVDR 220
           RWGR  ET GEDP L G+  A+ VRG Q  D     KV A  KH    +   +  NG   
Sbjct: 195 RWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVIACAKHLVGGSQPANGINGAP- 253

Query: 221 FHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWR 280
               A++S++ +++ F  PF+ C +E  V +VM ++N++NG+P   +  ++   +R +W+
Sbjct: 254 ----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNKYLMTEVLRNQWK 308

Query: 281 LNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-GPFLGLHTESAVQRGLLS 339
            +G++VSD   +   +D  +   T ++A   ++ AG+ +   GP         V+ G + 
Sbjct: 309 FDGFVVSDWMDIERMHDYHNVAETLKDAYRISVDAGMGMHMHGPEFYEAIIECVKEGSIP 368

Query: 340 EIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD--VCTPDHQELALEAARQGIVLL 397
           E  I+ A+   L V+ RLG+F+      P+  L  KD  V    HQ+ ALE AR+ IVLL
Sbjct: 369 EKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQTALEGARKSIVLL 423

Query: 398 KNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA--GIACGYTTPLQGIGRYARTIHQQ 455
           KN+G  LPL   +++ V V G N++   +++G++A        TT L+G+   +   +  
Sbjct: 424 KNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLKGLKAISPETNYN 482

Query: 456 GCK---DVACADDQLFGAAIDASRQADATILVMG-------LDQSIEAEALDRAGLLLPG 505
                 +V    D     A+  +R +D  ILV+G        ++    E  DR  L LPG
Sbjct: 483 FLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSMRYHWNEKTCGENSDRYELSLPG 542

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           RQQELV  V+ A+  PT+++L++G P+   +   D  +  II A  PG AGG A+A+IL+
Sbjct: 543 RQQELVKAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPGVAGGQALAEILY 599

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV--VYPFGHGMS 623
           G  NP GKLP+T         +P +   ++   + ++    + +  G    +Y FG+G+S
Sbjct: 600 GKVNPSGKLPIT---------IPRSTGQIQCMYNHKFTNHWFPYATGNSLPLYEFGYGLS 650

Query: 624 YTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKD 683
           YT + +           P              K++KVT            VDV N G  D
Sbjct: 651 YTTYKYENLKLSEATITP-------------DKSVKVT------------VDVTNTGKMD 685

Query: 684 GAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRI 742
           G  T+ L      +    P K+L  F ++ + AG  + V   +   + LS  D +    +
Sbjct: 686 GEETVQLYIRDEYSSATRPVKELKDFARIPLKAGETKEVSFTL-TPEMLSYYDANMHYGV 744

Query: 743 PLGEHNIHIGGTKHSVSLHAATLGV 767
             G   I +G +     L +  L V
Sbjct: 745 EKGTFKIMVGASSRDTDLQSIILTV 769


>gi|229818089|ref|ZP_04448371.1| hypothetical protein BIFANG_03380 [Bifidobacterium angulatum DSM
           20098 = JCM 7096]
 gi|229784693|gb|EEP20807.1| hypothetical protein BIFANG_03380 [Bifidobacterium angulatum DSM
           20098 = JCM 7096]
          Length = 775

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 240/815 (29%), Positives = 366/815 (44%), Gaps = 133/815 (16%)

Query: 40  LPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLG-IKGYEWWSEALH------ 92
           LP+    LPI +R+ DL+GR++L+EKV  ++   A    L  I   ++    LH      
Sbjct: 6   LPYKNPDLPIQERIADLLGRMTLEEKVGQMMQLDARNGNLDDIIINKYVGSILHTSPADL 65

Query: 93  --GVSNVGPGTKFG-----GD--------FPGATSFPQVITTASSFNATLWEAIGRVVSD 137
              V  V   T+ G     GD        +PG+T FP  +  A+SF+    +A GR  ++
Sbjct: 66  PRAVETVNTKTRLGIPLIIGDDCIHGYSFWPGSTIFPSQLGMATSFDPAKVQAAGRATAE 125

Query: 138 EARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG--SD 194
           E        T G+ + +SP + I RD RWGR  ET GEDP L G+ A++ V+G QG    
Sbjct: 126 EV------STTGVHWTFSPVLCIARDTRWGRVDETFGEDPYLIGEMASAIVKGYQGGAKA 179

Query: 195 GDRLK---VAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVAS 251
           G+ L    + A  KHF  Y  +   G D     A +S + +E  F  PF     EG   +
Sbjct: 180 GEPLAKDAILACAKHFAGYS-ETQGGRDASE--ADLSHRKLESWFLPPFERVAKEG-CGT 235

Query: 252 VMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAAD 311
            M  Y  ++GVP   +  +L   +RG+W   G +++D D+VG     Q   +   +AAAD
Sbjct: 236 FMLGYESIDGVPVTFNKWLLSDKLRGDWNYQGTLITDWDNVGRAVWEQKVKADYVQAAAD 295

Query: 312 AIRAGLDL-DCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYG 370
           A++AG DL    P        AV+RGLL E  I+ A+   L ++ RLG+F  E    P  
Sbjct: 296 AVKAGNDLIMTTPKFYEGAIEAVKRGLLDESLIDEAVSRILALKFRLGLF--EDPRLPDE 353

Query: 371 HLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGN 430
               + + + +HQEL L+ AR+ I LLKN G SLP +    + +AV+GP +D     +G+
Sbjct: 354 QRMKEAIGSSEHQELNLQIARESIALLKNNG-SLPFTATSGKRIAVVGPLADDAQEQLGD 412

Query: 431 YAG-------IACG-----YTTPLQGIGRYA----RTIHQQGCKDVACADD--------- 465
           +AG       I  G      TT L G  + A      ++ +G   +   DD         
Sbjct: 413 WAGNSGQVNWIPDGQPRNMITTVLDGFKQLAPEGCEVVYSRGANILDLVDDPEGKFYPDG 472

Query: 466 --------------QLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
                          L   A++ +RQ+D  + V+G    +  E    A L L G Q  L+
Sbjct: 473 QPRPKLGVSAAIDQALLDEAVENARQSDLVVAVVGDVVQLVGETCSTATLELLGGQNALL 532

Query: 512 SKVSMASK---GPTILVLMSGGP--------------IDVAFAKNDPRIAAIIWAGYPGQ 554
             +S  +K    P ++VLMS  P              +D + A      +A +WA  PG 
Sbjct: 533 EALSDVAKETGKPLVVVLMSSKPQVMPACVIGTNGVIVDESAADG---TSAFMWAPNPGM 589

Query: 555 AGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPV 614
            GG AIA+I+ G +NP G+LP+T +P+ +   LP+    +R     RY   T    + P 
Sbjct: 590 KGGQAIAEIILGKTNPSGRLPIT-FPR-HAGQLPVYYNQIRGQHGNRYADLT----QDP- 642

Query: 615 VYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQV 674
            + FG G+SYT F +     PT+  VP  G                T  K +  T+  ++
Sbjct: 643 AFAFGEGLSYTTFAY---GEPTITNVPDSG----------------TFTKTD--TVHAEI 681

Query: 675 DVKNVGSKDGAHTLLVF--STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
            + N G + G   +  +      +  W   ++L AF++V +  G  + V  +I V    +
Sbjct: 682 TLTNTGERRGTEVVQAYIGDIVTSYSWT-DRELKAFQRVELEPGETRTVAFDIPVAD-CT 739

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSLHAATLGV 767
           +VD    R +  GE  + IG +     L   T  V
Sbjct: 740 IVDPDARRIVEPGEFEVLIGHSSRRKDLRRTTFTV 774


>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 869

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 155/447 (34%), Positives = 237/447 (53%), Gaps = 49/447 (10%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   +RV +L+  L+ +EKV L+++ + ++ RLGI  Y WWSEA HGV      
Sbjct: 24  PYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQ---- 79

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNG--------------- 145
                   G T +PQ I  A++FNA L+  +   VSDEARA +N                
Sbjct: 80  -------DGYTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMGV 132

Query: 146 ----GTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVA 201
               G   LT+W PNVNIFRDPRWGRGQET GEDP L+       V G+QG++    K  
Sbjct: 133 TYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNNDKYFKTH 192

Query: 202 ASCKHFTAYDLDNWNGVD--RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQV 259
           A  KH+  +     +G +  R   N + + +D+ +T+   F+  V +G V  VMC+Y + 
Sbjct: 193 ACAKHYAVH-----SGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVREVMCAYQRF 247

Query: 260 NGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT-QHFTSTPE-EAAADAIRAGL 317
            G P C    +L   +R +W  +  +++DCD++  +++  QH T      A+ DA+  G 
Sbjct: 248 EGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFNRGQHETHKDGLSASVDAVLNGT 307

Query: 318 DLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKD 376
           DL+CG    +     +++GL+ E D++N L  TL  +  LGMFD  P+   P+  LG   
Sbjct: 308 DLECGKVF-MSLVEGLKKGLIKESDLDNHLRKTLMGRFELGMFD--PADMLPWAKLGADV 364

Query: 377 VCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG--I 434
           + +  +  +A++AAR+ +VLL+N+   LPLS    +T+AV+GPN+D    + GNY G   
Sbjct: 365 ISSEKNDAMAVQAARESMVLLENKAGILPLSK-SIKTLAVLGPNADDVNMLNGNYGGTPT 423

Query: 435 ACGYTTPLQGIGRY---ARTIHQQGCK 458
           A    + L GI      A+ I+ + C+
Sbjct: 424 AAHQHSLLSGIKAAVPGAKIIYNKACE 450



 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/251 (33%), Positives = 123/251 (49%), Gaps = 39/251 (15%)

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           ++ADA +++ G+   +E E  D+  + LP  QQ LV  +    K P I V  SG  I  A
Sbjct: 614 KEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK-PVIFVNCSGSAI--A 670

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP-MTEMAM 594
           F   +    A++ A Y GQ G  A+A++LFG  NPGGKLP+T+Y      +LP   + +M
Sbjct: 671 FGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFYRSN--NDLPDFLDYSM 728

Query: 595 RPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATIS 654
           +         RTYR++ G   Y FG+G+SYT F                      +A IS
Sbjct: 729 K--------NRTYRYFTGVPQYAFGYGLSYTTFALG-------------------DAKIS 761

Query: 655 GKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVP 714
            K +K    K  ++TL   V V N G ++G  T+ V+        AP K L  F+K+++ 
Sbjct: 762 AKQMK----KNGKVTL--TVPVTNTGKREGTETVQVYVKRLDDAGAPIKALKGFQKLNLK 815

Query: 715 AGAQQRVGINI 725
           AG  Q+  I +
Sbjct: 816 AGETQKATITL 826


>gi|304395778|ref|ZP_07377661.1| glycoside hydrolase family 3 domain protein [Pantoea sp. aB]
 gi|304357072|gb|EFM21436.1| glycoside hydrolase family 3 domain protein [Pantoea sp. aB]
          Length = 765

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 203/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS++    + +GRV      + Y     GL   W+P V++ R+PRWGRG 
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   S V+ +QG S  DR  V  S KHF  Y       ++N VD     
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM + N +NGVP  AD  +LK  +R +W+  G 
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P+EA   A+++G+D+     +   +  + V+ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           I++A  + L V+  +G+F+      PY HLGPK+    D       H+  A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
           LLKN+  +LPL   +  T+A+IGP +D    ++G++  AG+A    + LQG+        
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454

Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
                + A     +G +D       A + D     Q+   A+  ++QAD  +  +G  Q 
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQMIDDAVAKAKQADVVVAAVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
           +  EA  R+ L++P  QQ+L++ +    K P ++VLM+G P+ +    N+ R+A A++  
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSIV---NEDRMADAVLET 570

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
            + G  GG AIAD+LFG  NP GKLP++ +P+       Y  +LP T      ++  +Y 
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
              Y    GP +YPFG+G+SYT F  TV  +P  ++ P   R+GSI A+++         
Sbjct: 629 SHYYDAVNGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIKASVT--------- 674

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
                       V N G +DGA  + ++   P    + P ++L  F+++ + AG  Q V 
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722

Query: 723 INIHV 727
             I V
Sbjct: 723 FKIDV 727


>gi|372277745|ref|ZP_09513781.1| beta-D-glucoside glucohydrolase [Pantoea sp. SL1_M5]
          Length = 765

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 205/665 (30%), Positives = 329/665 (49%), Gaps = 103/665 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS++    + +GRV      + Y     GL   W+P V++ R+PRWGRG 
Sbjct: 118 TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   S V+ +QG S  DR  V  S KHF  Y       ++N VD     
Sbjct: 172 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM + N +NGVP  AD  +LK  +R +W+  G 
Sbjct: 227 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 283

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P+EA   A+++G+D+     +   +  + V+ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           I++A  + L V+  +G+F+      PY HLGPK+    D       H+  A + AR+ IV
Sbjct: 342 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
           LLKN+  +LPL   +  T+A+IGP +D    ++G++  AG+A    + LQG+        
Sbjct: 397 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 454

Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
                + A     +G +D       A + D     QL   A+  ++QAD  +  +G  Q 
Sbjct: 455 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQLIDDAVAKAKQADVVVAAVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
           +  EA  R+ L++P  QQ+L++ +    K P ++VLM+G P+ V    N+ R+A A++  
Sbjct: 515 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSVV---NEDRMADAMLET 570

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
            + G  GG AIAD+LFG  NP GKLP++ +P+       Y  +LP T      ++  +Y 
Sbjct: 571 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 628

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
              Y    GP +YPFG+G+SYT F  TV  +P  ++ P   R GSI A+++         
Sbjct: 629 SHYYDAANGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRDGSIEASVT--------- 674

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
                       V N G +DGA  + ++   P    + P ++L  F+++ + AG  Q V 
Sbjct: 675 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 722

Query: 723 INIHV 727
             I V
Sbjct: 723 FKIDV 727


>gi|429116223|ref|ZP_19177141.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
 gi|426319352|emb|CCK03254.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
          Length = 625

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 201/662 (30%), Positives = 322/662 (48%), Gaps = 106/662 (16%)

Query: 143 YNGGTAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKV 200
           Y     GL   W+P V++ RDPRWGRG E  GED  L+     + V  +QG S  DR  V
Sbjct: 4   YEAADDGLNMTWAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSV 63

Query: 201 AASCKHFTAYDL----DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSY 256
             S KHF AY        +N VD       +S Q + + +  P++  +  G    VM + 
Sbjct: 64  MTSVKHFAAYGAVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIAL 115

Query: 257 NQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFT-STPEEAAADAIRA 315
           N +NG P  +D  +LK  +RG+W   G  +SD  ++      +H T S PE+A   AI++
Sbjct: 116 NSLNGTPAASDGWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKS 173

Query: 316 GLDLDCG-PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGP 374
           G+D+     +   +  + ++ G ++  ++++A  + L V+  +G+F+      PY HLGP
Sbjct: 174 GVDMSMADEYYSKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGP 228

Query: 375 KDVCTPD-------HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTM 427
           KD    D       H++ A + AR+ +VLLKN+  +LPL   +  T+AV+GP +D    M
Sbjct: 229 KDSDPKDTNAESRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDM 286

Query: 428 IGNY--AGIACGYTTPLQGI----GRYARTIHQQGCKDVACADDQ--------------- 466
           +G++  AG+A    T LQG+    G  A+ ++ +G       DD+               
Sbjct: 287 MGSWSAAGVADQSVTLLQGMKNVAGDKAKILYAKGAN---VTDDKGIVDFLNLYEPAVVV 343

Query: 467 -------LFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
                  +   A++ ++Q+D  + V+G  Q +  EA  R  L +P  Q++L+S +    K
Sbjct: 344 DKRTPKAMIDEAVNVAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISALKATGK 403

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVLM+G P+  A  K D +  AI+   + G  GG AIAD+LFG  NP GKLP++ +
Sbjct: 404 -PLVLVLMNGRPL--ALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-F 459

Query: 580 PQEYITNLPM------TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           P+  +  +P+      T     P +  +Y  R +    GP +YPFG+G+SYT F  +V++
Sbjct: 460 PRS-VGQIPVYYSHLNTGRPYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--SVSD 515

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-S 692
               ++ P   R G + A+++                     V N G ++GA  + ++  
Sbjct: 516 VK--LSAPTMKRDGKVTASVT---------------------VTNTGKREGATAVQMYLQ 552

Query: 693 TPPAGHWAPHKQLVAFEKVHVPAGAQQRVG--INIHVCKYLSVVDRSGTRRIPLGEHNIH 750
              A    P KQL  FEK+ +  G  Q V   I+I   K+ +   R      P G+ N+ 
Sbjct: 553 DVTASMSRPVKQLHGFEKITLKPGESQTVSFPIDIDALKFWN--QRMKYDAEP-GKFNVF 609

Query: 751 IG 752
           IG
Sbjct: 610 IG 611


>gi|417342342|ref|ZP_12123176.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Baildon str. R6-199]
 gi|357956757|gb|EHJ82061.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Baildon str. R6-199]
          Length = 765

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 205/683 (30%), Positives = 324/683 (47%), Gaps = 117/683 (17%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 104 TVFPISLGLASSFNLDAVRTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 157

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V+ +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 158 EGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 212

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R EW   G 
Sbjct: 213 --MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGI 269

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLD---------------CGPFLGLH 328
            VSD  ++      +H T+  PE+A   A++AG+D+                   +   +
Sbjct: 270 TVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSIMSMADEYYRVDMSMADEYYSKY 327

Query: 329 TESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD------- 381
               ++ G ++  ++++A  + L V+  +G+F+      PY HLGPK+    D       
Sbjct: 328 LPGLIKSGTVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRL 382

Query: 382 HQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYT 439
           H++ A E AR+ +VLLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    
Sbjct: 383 HRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVANQSV 440

Query: 440 TPLQGI----GRYARTIHQQGCK---DVACAD----------------DQLFGAAIDASR 476
           T L GI    G  A+ ++ +G     D    D                  +   A+ A++
Sbjct: 441 TVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVLAAK 500

Query: 477 QADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAF 536
           QAD  + V+G  Q +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A 
Sbjct: 501 QADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--AL 557

Query: 537 AKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------T 590
            K D +  AI+   + G  GG AIAD+LFG  NP GKLP++ +P+  +  +P+      T
Sbjct: 558 VKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS-VGQIPVYYSHLNT 615

Query: 591 EMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSIN 650
                P +  +Y  R +    GP +YPFG+G+SYT F  TV++    ++ P   R G + 
Sbjct: 616 GRPYNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTF--TVSD--VTLSSPTMQRDGKVT 670

Query: 651 ATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFE 709
           A+                     V+V N G ++GA  + ++     A    P KQL  FE
Sbjct: 671 AS---------------------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709

Query: 710 KVHVPAGAQQRVG--INIHVCKY 730
           K+ +  G  + V   I+I   K+
Sbjct: 710 KITLKPGESKTVSFPIDIEALKF 732


>gi|440759115|ref|ZP_20938268.1| Periplasmic beta-glucosidase [Pantoea agglomerans 299R]
 gi|436427131|gb|ELP24815.1| Periplasmic beta-glucosidase [Pantoea agglomerans 299R]
          Length = 737

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 203/665 (30%), Positives = 330/665 (49%), Gaps = 103/665 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS++    + +GRV      + Y     GL   W+P V++ R+PRWGRG 
Sbjct: 90  TIFPIPLALASSWDLDAVKEVGRV------SAYEAADDGLNMTWAPMVDVSREPRWGRGS 143

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   S V+ +QG S  DR  V  S KHF  Y       ++N VD     
Sbjct: 144 EGFGEDTYLTSEMGRSMVQSMQGKSAADRYSVMTSVKHFALYGAVEGGRDYNTVD----- 198

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM + N +NGVP  AD  +LK  +R +W+  G 
Sbjct: 199 --MSPQRMFQDYLPPYKASLDAGS-GGVMVALNSINGVPATADSWLLKDLLRDKWKFKGI 255

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S P+EA   A+++G+D+     +   +  + V+ G ++  +
Sbjct: 256 TISDHGAIKEL--IKHGVASDPQEAVRIALKSGVDMSMSDEYYSKYLPALVKSGDVTMAE 313

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           I++A  + L V+  +G+F+      PY HLGPK+    D       H+  A + AR+ IV
Sbjct: 314 IDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAESRLHRAEARDVARKSIV 368

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI-------- 445
           LLKN+  +LPL   +  T+A+IGP +D    ++G++  AG+A    + LQG+        
Sbjct: 369 LLKNRLETLPLK--KSGTIALIGPLADSQRDIMGSWSAAGVAKQSVSLLQGMRNATEGKA 426

Query: 446 ----GRYARTIHQQGCKDV------ACADD-----QLFGAAIDASRQADATILVMGLDQS 490
                + A     +G +D       A + D     Q+   A+  ++QAD  +  +G  Q 
Sbjct: 427 TLLYEKGANVTDNKGIQDFLNLYEQAVSVDTRSPQQMIDDAVAKAKQADVVVAAVGEAQG 486

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIA-AIIWA 549
           +  EA  R+ L++P  QQ+L++ +    K P ++VLM+G P+ +    N+ R+A A++  
Sbjct: 487 MAHEASSRSDLVIPPSQQKLLAALKATGK-PLVIVLMNGRPLSIV---NEDRMADAVLET 542

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQE------YITNLPMTEMAMRPSQSKRYP 603
            + G  GG AIAD+LFG  NP GKLP++ +P+       Y  +LP T      ++  +Y 
Sbjct: 543 WFSGTEGGNAIADVLFGDYNPSGKLPVS-FPRSVGQIPIYYNHLP-TGRPYNFAKPNKYT 600

Query: 604 GRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHA 663
              Y    GP +YPFG+G+SYT F  TV  +P  ++ P   R+GSI A+++         
Sbjct: 601 SHYYDAVNGP-LYPFGYGLSYTTF--TV--SPVKMSSPTMPRNGSIEASVT--------- 646

Query: 664 KCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVG 722
                       V N G +DGA  + ++   P    + P ++L  F+++ + AG  Q V 
Sbjct: 647 ------------VTNSGKRDGATVVQMYLNDPVASISRPVQELRGFKRIMLKAGESQTVT 694

Query: 723 INIHV 727
             I V
Sbjct: 695 FKIDV 699


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 201/680 (29%), Positives = 320/680 (47%), Gaps = 74/680 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRG+Q S     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHSH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E     PF+  + E  +  VM SYN  +GVP       L   +RGE    GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPFLG-----LHTESAVQRGLLSEIDI 343
            D+V   Y         +EA   ++ AGL++ C  F       L     V+ G LSE  I
Sbjct: 343 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVI 401

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           N+ + + L V+  +G+FD    +   G    K+V   +++ LAL+A+R+ +VLLKN+   
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK- 458
           LPL     + +AV GPN+D     + +Y  +A   TT L+GI +     A  ++ +GC  
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519

Query: 459 -DVACADDQL------------FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPG 505
            D    + +L               A++ +RQAD  ++V+G  Q    E   R+ L LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579

Query: 506 RQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILF 565
           RQ +L+  V    K P +LVL++G P+ + +A  D  +  I+ A YPG  GGTA+AD+LF
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKGGTAVADVLF 636

Query: 566 GTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHG 621
           G  NPGGKL +T +P+  +  +P      +PS      + PG      +    +Y FG+G
Sbjct: 637 GDYNPGGKLTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYG 693

Query: 622 MSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           +SYT F ++                   +  IS K I             V+  V N G 
Sbjct: 694 LSYTTFEYS-------------------DIEISPKVITPNQKAT------VRCKVTNTGK 728

Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           + G   + ++          + K L  FE++H+  G  + V   +   K L ++D+    
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787

Query: 741 RIPLGEHNIHIGGTKHSVSL 760
            +  G+ +I +G +   + L
Sbjct: 788 VVEPGDFSIMVGASSEDIRL 807


>gi|251795943|ref|YP_003010674.1| beta-glucosidase [Paenibacillus sp. JDR-2]
 gi|247543569|gb|ACT00588.1| Beta-glucosidase [Paenibacillus sp. JDR-2]
          Length = 940

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 155/424 (36%), Positives = 233/424 (54%), Gaps = 25/424 (5%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
           + +  PF    LP+ QRVNDL+ R +L+EK+ L+      +  LG+K Y+  +EA HG++
Sbjct: 5   SEQKYPFQNPDLPLEQRVNDLVNRFTLEEKINLMCQYQDEITHLGVKAYKHGTEAAHGMA 64

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
            +G           AT+FPQ I  A +++  L + IG V+ DEAR  Y  N    GLT W
Sbjct: 65  WLGE----------ATTFPQPIGLACTWDKDLMKKIGSVIGDEARGFYSQNPTHNGLTLW 114

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +P V++ RDPRWGR +E  GEDP L+GK +A+  +G+QG     +K  AS KHF    + 
Sbjct: 115 APTVDMERDPRWGRTEEAYGEDPHLTGKLSAALTQGIQGDHPFYVKAVASLKHF----IG 170

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           N N ++R   +  +  ++  + +   F +   EG   S+M +YN VNGVP   +P++   
Sbjct: 171 NNNEINRGECSVSLDPRNKNEYYLKAFEIPFKEGGALSMMTAYNSVNGVPANINPDV-NN 229

Query: 274 TIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTES 331
            ++ +W +NG++VSD   V    +  H+  T +EA A  I+AG+D   D  P        
Sbjct: 230 IVKRDWGMNGFVVSDAGDVLGTVNDHHYVDTYKEAVALTIKAGVDSITDDHPISKQAIRD 289

Query: 332 AVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQ-PYGHLGPKDVCTPDHQELALEAA 390
           A+  G+L+E D++ AL NT  V+ RLG FD  P+ + PY  +    +  P+H ELAL+A 
Sbjct: 290 ALAEGMLTENDLDIALRNTFRVRFRLGEFD--PADRNPYAAIDESVIMKPEHAELALDAV 347

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
           R+ IVLLKN G  LPLS  +    AVIGP + +       Y+G+     +P +GI +   
Sbjct: 348 RKSIVLLKNDG-ILPLSAEKLSKAAVIGPLAGIVYRDW--YSGLMPYSISPFEGIRKKMN 404

Query: 451 TIHQ 454
             HQ
Sbjct: 405 GAHQ 408



 Score = 83.6 bits (205), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 68/258 (26%), Positives = 117/258 (45%), Gaps = 43/258 (16%)

Query: 471 AIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
           AI A+R  +  ++V+G    I   E +DR  L L   Q++L+ +V   +  P  +V++ G
Sbjct: 552 AIAAARNNEVAVVVVGNHPLINGKETIDRPDLTLAAYQEKLIREVYAVN--PNTVVVVVG 609

Query: 530 G-PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
             P  + + + +  I AI++  + GQ  G A+AD+LFG  NP G++ MTWY         
Sbjct: 610 SYPFAMPWVQEN--IPAIVYLSHAGQELGHALADVLFGDYNPAGRVNMTWYESADQLGEF 667

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
           M    ++         RTY+++ G  +YPFGHG+SY+ F                 ++ +
Sbjct: 668 MDYDIIKSE-------RTYQYFAGKPLYPFGHGLSYSRF-----------------QYDA 703

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVA 707
           +    SG             +L V   V N    DG   + +++         P +QL +
Sbjct: 704 LAQEQSGN------------SLTVSFRVTNTSDVDGEEVVQLYTRALESRVKRPLRQLQS 751

Query: 708 FEKVHVPAGAQQRVGINI 725
           FE++ + AG  + V   +
Sbjct: 752 FERIALAAGESREVSFKL 769


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 206/715 (28%), Positives = 336/715 (46%), Gaps = 92/715 (12%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLGI   ++ +E + GV N             AT+FP  +    ++N  L   IG +   
Sbjct: 118 RLGIPA-DFTNEGIRGVENYI-----------ATNFPTQLALGHTWNRELIRQIGYITGR 165

Query: 138 EARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGD 196
           EAR +      G T  ++P +++ RD RWGR +E  GE P L  +   +  +GLQ     
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQTD--- 216

Query: 197 RLKVAASCKHFTAYDLDNWNGVDRF-HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCS 255
            ++VA++ KHF AY  +N    + F   + ++S +++E+    PF   + E  +  VM S
Sbjct: 217 -MQVASTAKHFIAYS-NNKGAREGFARVDPQMSWREVENIHAYPFTRVIQEAGILGVMSS 274

Query: 256 YNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRA 315
           YN  +G P  +    L + +RG     GY+VSD D+V   Y         +EA   ++ A
Sbjct: 275 YNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDSDAVEYLYSKHKTAKDMKEAVRQSVEA 334

Query: 316 GLDLDCG----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGH 371
           GL++ C         L     +Q G LS   I+N + + L V+   G+FD      PY  
Sbjct: 335 GLNVRCTFRSPESYVLPLRELIQEGGLSMETIDNRVRDILRVKFLTGLFD-----TPYQT 389

Query: 372 ---LGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMI 428
              L  K+V +  HQ++AL+A+R+G+VLLKN    LPL   + + +AV GPN+D     +
Sbjct: 390 DLALADKEVNSEAHQQVALQASREGLVLLKNANNLLPLDKSQIKRIAVCGPNADEASFAL 449

Query: 429 GNYAGIACGYTTPLQGIGRY----ARTIHQQGCKDV--------------ACADDQLFGA 470
            +Y  +A   TT L+GI +      +  + +GC  V                 +      
Sbjct: 450 THYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVDANWPESEIISYPLTAEEKTEIQK 509

Query: 471 AIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGG 530
           A+D  +++D  ++V+G       E   R  L LPG QQ+L+  + +A+  P +LVL++G 
Sbjct: 510 AVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQQQLLEAI-VATGKPVVLVLINGR 568

Query: 531 PIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMT 590
           P+ + +A  D  + AI+ A YPG  GGTAIA+ LFG  NPGGKL +T +P+  +  +P  
Sbjct: 569 PLSINWA--DKFVPAILEAWYPGSQGGTAIAEALFGDYNPGGKLTVT-FPKT-VGQIPFN 624

Query: 591 EMAMRPSQSK--RYPGR--TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRH 646
             A   SQ    + PG         GP +YPFG+G+SYT F ++                
Sbjct: 625 FPAKPASQVDGGQTPGMKGNQSRINGP-LYPFGYGLSYTTFEYS---------------- 667

Query: 647 GSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQL 705
              N  +S   I       ++  + V   +KN G++ G   + +++         + K L
Sbjct: 668 ---NLQLSSPVI------TDKEPVTVTCKIKNTGTRSGDEVVQLYTRDVISSVTTYEKNL 718

Query: 706 VAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
             FE+VH+  G  ++V   + + +   ++++     +  G   I IG +   + L
Sbjct: 719 RGFERVHLEPGETKKVSFQL-LPRDFQLLNKDNHWVVEPGMFQIMIGASSEDIRL 772


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 199/680 (29%), Positives = 324/680 (47%), Gaps = 74/680 (10%)

Query: 110 ATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRG 168
           AT+FP  +    ++N  L   +G +   EAR +      G T  ++P +++ RD RWGR 
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWGRY 226

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVS 228
           +E  GE P L  +     VRG+Q +     +VAA+ KHF AY  +          + ++S
Sbjct: 227 EEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSNNKGAREGMARVDPQMS 282

Query: 229 KQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSD 288
            +++E     PF+  + E  +  VM SYN  +GVP       L   +RGE    GY+VSD
Sbjct: 283 PREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSD 342

Query: 289 CDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG----PFLGLHTESAVQRGLLSEIDIN 344
            D+V   Y         +EA   ++ AGL++ C         L     V+ G LSE  IN
Sbjct: 343 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIN 402

Query: 345 NALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSL 404
           + + + L V+  +G+FD    +   G    ++V   +++ LAL+A+R+ +VLLKN+   L
Sbjct: 403 DRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQASRESLVLLKNENNVL 460

Query: 405 PLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGR----YARTIHQQGCK-- 458
           PL     + +AV GPN+D     + +Y  +A   TT L+GI +     A  ++ +GC   
Sbjct: 461 PLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEVLYTKGCDLV 520

Query: 459 ----------DVACADDQL--FGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGR 506
                     D    D++      A++ +RQAD  ++V+G  Q    E   R+ L LPGR
Sbjct: 521 DANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPGR 580

Query: 507 QQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFG 566
           Q +L+  V    K P +LVL++G P+ + +A  D  + AI+ A YPG  GGTA+AD+LFG
Sbjct: 581 QLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLFG 637

Query: 567 TSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQS---KRYPGRTYRFYK-GPVVYPFGHGM 622
             NPGGK+ +T +P+  +  +P      +PS      + PG      +    +Y FG+G+
Sbjct: 638 DYNPGGKMTVT-FPKS-VGQIPFN-FPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGYGL 694

Query: 623 SYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGS 681
           SYT F ++ +  +P V+             T + KA              V+  V N G 
Sbjct: 695 SYTTFEYSGIEISPKVI-------------TPNQKAT-------------VRCKVTNTGK 728

Query: 682 KDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTR 740
           + G   + ++          + K L  FE++H+  G  + V   +   K L ++D+    
Sbjct: 729 RAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLELLDKHMEW 787

Query: 741 RIPLGEHNIHIGGTKHSVSL 760
            +  G+ +I +G +   + L
Sbjct: 788 VVEPGDFSIMVGASSEDIRL 807


>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
           oral taxon 329 str. F0087]
 gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
          Length = 675

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 157/432 (36%), Positives = 229/432 (53%), Gaps = 30/432 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   +R +DL+ RL+L+EK+ L+ + +  V RLGIK Y WWSEALHGV+  G  
Sbjct: 24  PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL- 82

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
                    AT +P  +  AS F+  L E I   VSDE RA          Y  G  GLT
Sbjct: 83  ---------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLT 133

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRG+QG +D    K  A  KH+  +
Sbjct: 134 FWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYDKTHACAKHYAVH 193

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
                    R  F+ + +  +D+ +T+   F+  V E  V  VMC+Y +  G P C    
Sbjct: 194 SGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNR 250

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA--AADAIRAGLDLDCGPFLGL 327
           +L + +R EW     +VSDC ++  ++      + P+ A  +A A+  G DL+CG     
Sbjct: 251 LLTQILRDEWGYKHLVVSDCGAISDFFYQGRHETHPDAATSSASAVINGTDLECGVEYA- 309

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
           H + AV+RGL++E  I+ +L   L  +  LG  D + +  P+  +    V    H+ +AL
Sbjct: 310 HLDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMAL 368

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
           +  R+ +VLL N G  LPL       +AV+GPN+  +V   GNY G+     T L+GI G
Sbjct: 369 DVTRKSMVLLHNNG-ILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRG 427

Query: 447 RYARTIHQQGCK 458
                 +++GC+
Sbjct: 428 AIGNVPYEKGCE 439


>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 819

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 202/712 (28%), Positives = 320/712 (44%), Gaps = 81/712 (11%)

Query: 78  RLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSD 137
           RLGI   ++ +E +HG+++             AT  P  I   S++NA L    G +   
Sbjct: 151 RLGIP-VDFSNETIHGLNHTK-----------ATPLPAPIGIGSTWNAPLVYKAGSIAGK 198

Query: 138 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDR 197
           EA+A+  G T     ++P +++ RDPRWGR  E  GEDP L        V+G+Q      
Sbjct: 199 EAKAL--GYT---NIYAPILDLARDPRWGRVLECYGEDPFLVATLGTQMVKGIQEQG--- 250

Query: 198 LKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYN 257
             VAA+ KHF  Y +           +  V+ +++      PF+  + +     VM SYN
Sbjct: 251 --VAATLKHFAVYSVPKGGRDGSVRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYN 308

Query: 258 QVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGL 317
             +GVP  A    L + +R E+  +GY+VSD D+V   Y+  H   T EEA    + AGL
Sbjct: 309 DWDGVPVTASYYFLTQLLRQEFGFDGYVVSDSDAVEYVYNKHHVAETYEEAVRMVLEAGL 368

Query: 318 DLDCG----PFLGLHTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLG 373
           ++           L     V+ G LS   I+  + + L V+ RLG+FD    + P     
Sbjct: 369 NVRTTFAAPDIFILPARKLVKEGRLSMKVIDERVADVLRVKFRLGLFDQPFVADPKA--A 426

Query: 374 PKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG 433
            K V    +++  L+  RQ +VLLKN+   LPL   +   + + GP +     M+  Y  
Sbjct: 427 DKIVGADKNKDFVLDIQRQSLVLLKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGP 486

Query: 434 IACGYTTPLQGIGRY----ARTIHQQGCK--------------DVACADDQLFGAAIDAS 475
                 T  +GI  Y        +  GCK               +   + Q    A++ +
Sbjct: 487 QELENITVYEGIKNYLGNKVAVDYALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKA 546

Query: 476 RQADATILVMGLDQSIEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           + +D  I V+G D+    E+  R+GL LPGRQQ+L+  +    K P +LVL++G P+ + 
Sbjct: 547 KLSDIVIAVLGEDEESTGESKSRSGLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTIN 605

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQ---EYITNLPMTEM 592
           +A  D  I AI+ A +PGQ GGTAIA+ LFG  NPGGKLP+T +P+   +   N P  + 
Sbjct: 606 WA--DRYIPAILEAWFPGQMGGTAIAETLFGDYNPGGKLPVT-FPKTLGQIELNFPF-KP 661

Query: 593 AMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINAT 652
           A +  Q +  P    +      +YPFG G+SYT F ++       V+    G  G I   
Sbjct: 662 ASQSKQPEAGPNGYGKTRVNGALYPFGFGLSYTTFEYSNLK----VSPERQGPKGDIQ-- 715

Query: 653 ISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLV-AFEKV 711
                              V  D+ N G + G   + ++          ++ L+  FE+V
Sbjct: 716 -------------------VSFDITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERV 756

Query: 712 HVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVSLHAA 763
            +  G  + +   +H  + L ++D +    +  GE  + IG +   + L  +
Sbjct: 757 SLQPGETKNIQFTLH-PEDLEILDINMNWNVEPGEFEVRIGASSEDIKLKKS 807


>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
 gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
          Length = 676

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 157/432 (36%), Positives = 229/432 (53%), Gaps = 30/432 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   +R +DL+ RL+L+EK+ L+ + +  V RLGIK Y WWSEALHGV+  G  
Sbjct: 24  PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL- 82

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
                    AT +P  +  AS F+  L E I   VSDE RA          Y  G  GLT
Sbjct: 83  ---------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLT 133

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   + VRG+QG +D    K  A  KH+  +
Sbjct: 134 FWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYDKTHACAKHYAVH 193

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
                    R  F+ + +  +D+ +T+   F+  V E  V  VMC+Y +  G P C    
Sbjct: 194 SGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNR 250

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEA--AADAIRAGLDLDCGPFLGL 327
           +L + +R EW     +VSDC ++  ++      + P+ A  +A A+  G DL+CG     
Sbjct: 251 LLTQILRDEWGYKHLVVSDCGAISDFFYQGRHETHPDAATSSASAVINGTDLECGVEYA- 309

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
           H + AV+RGL++E  I+ +L   L  +  LG  D + +  P+  +    V    H+ +AL
Sbjct: 310 HLDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMAL 368

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
           +  R+ +VLL N G  LPL       +AV+GPN+  +V   GNY G+     T L+GI G
Sbjct: 369 DVTRKSMVLLHNNG-ILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRG 427

Query: 447 RYARTIHQQGCK 458
                 +++GC+
Sbjct: 428 AIGNVPYEKGCE 439


>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
          Length = 857

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 221/786 (28%), Positives = 358/786 (45%), Gaps = 122/786 (15%)

Query: 38  RTLPFCQVSLPIPQRVNDLIGRLSLQEKV---------KLLISGAAAVPRLG--IKGYEW 86
           + L + Q SLPI +RV+DL+GR++L+EK+          +       + +LG    G  W
Sbjct: 24  QQLSYRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSW 83

Query: 87  -WSEA--LHGVS---NVGPGTKF-------------------GGDFPGATSFPQVITTAS 121
            + E   L GV+   N+    KF                   G    G+T +PQ I   S
Sbjct: 84  GFVEGFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTVAESLHGSVHEGSTIYPQNIAMGS 143

Query: 122 SFNATL-WEAIGRVVSD-EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 179
           +F   L +     +  D  A+ M+          +P +++ RD RWGR +E+ GEDPVL 
Sbjct: 144 TFRPELAYRKAAMITKDLHAQGMHQ-------VLAPCIDVVRDLRWGRVEESFGEDPVLC 196

Query: 180 GKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHF-NAKVSKQDIEDTFDV 238
           G +  + V+G   +      ++   KH+  +     N +   +  + +   +D+ + +  
Sbjct: 197 GLFGIAEVKGYMDNG-----ISPMLKHYGPHG----NPLSGLNLASVECGLRDLHEVYLK 247

Query: 239 PFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDT 298
           PF M +    V +VM +YN  N VP  A   +L   +RG++   GY+ SD  ++ +    
Sbjct: 248 PFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLLTEVLRGQFGFKGYVYSDWGAIEMLKTL 307

Query: 299 QHFTSTPEEAAADAIRAGLDLD----CGPFLGLHTESAVQRGLLSEIDINNALVNTLTVQ 354
                  EEAA  A  AGLD++    C P L       +Q+G L E  +N ++   L  +
Sbjct: 308 HRVAHNSEEAAMQAFTAGLDVEASSNCYPLLA----GLIQKGKLDEEVLNESVRRVLYAK 363

Query: 355 MRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTV 414
            ++G+F+ +P  + Y H    ++   +   L+ E A + +VLLKN+   LPL+  + ++V
Sbjct: 364 FKMGLFE-DPYGEQYSH---SEMHGAESIRLSKEIADESVVLLKNENGLLPLNADKLKSV 419

Query: 415 AVIGPNSDVTVTMIGNYAGIACGY--TTPLQGIGRY----ARTIHQQGCKDVACADDQLF 468
           AVIGPN+D      G+Y          TPL+GI R     A   + +GC D+   +    
Sbjct: 420 AVIGPNAD--QVQFGDYTWSRNNKDGVTPLEGIRRLLGGKATVRYAKGC-DLVSLNAGGI 476

Query: 469 GAAIDASRQADATILVMG---------LDQSIEAEALDRAGLLLPGRQQELVSKVSMASK 519
             A++A+R+++  IL  G            S   E  D   L L G Q +L+ +V   + 
Sbjct: 477 KEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDLNDLNLTGVQGQLIKEV-YETG 535

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
            P +LVL++G P  +++ K    I AI+   Y G+  G +IADILFG+ +P G+L  + Y
Sbjct: 536 TPVVLVLVTGKPFAISWEKK--HIPAILTQWYAGEQAGNSIADILFGSISPSGRLTFS-Y 592

Query: 580 PQE------YITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVAN 633
           PQ       Y   LP  +   +   S   PGR Y F     ++ FGHG++YT+FV+   N
Sbjct: 593 PQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPDALWAFGHGLTYTSFVY--KN 650

Query: 634 APTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFST 693
             T      D  H  +N TI                  + VD+KN G ++G   + ++  
Sbjct: 651 LRT------DKEHYGLNDTIY-----------------IDVDIKNTGKREGKEVVQLYVN 687

Query: 694 PPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
                   P KQL  F+KV V AG  + V + + V   L +V+    R +  GE  + +G
Sbjct: 688 DKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKVAVND-LYIVNAGNKRVVEPGEFELQVG 746

Query: 753 GTKHSV 758
               ++
Sbjct: 747 AASDNI 752


>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
 gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
          Length = 727

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 212/722 (29%), Positives = 341/722 (47%), Gaps = 110/722 (15%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIK--GYEWWSEALHGVSNVG 98
           PF   SL   +R+++L+  +++ EK+  L S    VPRLGI+  G+   SE LHG++  G
Sbjct: 25  PFQNTSLSDEKRLDNLLSIMTIDEKINAL-STNLGVPRLGIRNTGH---SEGLHGMALGG 80

Query: 99  PGTKFGG----------DFPGATSFPQVITTASSFNATLWEAIGRVVSDEAR-----AMY 143
           PG  +GG          D    T+FPQ      +++  L + +  + + E R       Y
Sbjct: 81  PGN-WGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERY 139

Query: 144 NGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAAS 203
             G  GL   +PN ++ RDPRWGR +E+ GEDP L  + A ++++GLQG +    K A+ 
Sbjct: 140 TKG--GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGENPRYWKSASL 197

Query: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVP 263
            KHF A    N N   R   ++    +   + +  PFR  + +G   + M +YN  N +P
Sbjct: 198 MKHFLA----NSNEDGRDSTSSNFDNRLFHEYYSYPFRKGIEKGGSQAFMAAYNSWNEIP 253

Query: 264 TCADPNILKRTIRGEWRLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGP 323
               P ILK+ IR +W   G I +D  ++ +         T  E +A  ++AG+    G 
Sbjct: 254 MTIHP-ILKK-IRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAAIVKAGV----GQ 307

Query: 324 FLGL---HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCT- 379
           FL     +   A+++G+L+E +I+ A+     + ++LG+ DG+ +  PY H+G  D  + 
Sbjct: 308 FLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIGVTDTVSV 367

Query: 380 ---PDHQELALEAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC 436
               + Q+       + +VLLKN+   LPL+    + +AVIGP ++    ++  Y+G   
Sbjct: 368 WRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN--EVLLDWYSGTPP 425

Query: 437 GYTTPLQGI----GRYARTIHQQGCKDVACADDQLFGAAIDASRQADATIL-----VMGL 487
              + LQGI    G     I++   +            A  A+++AD  I+     V G 
Sbjct: 426 YTVSILQGIKNAVGNNVEVIYESSNE---------IDKAYLAAQKADIAIVCVGNHVYGT 476

Query: 488 DQSIE--------AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKN 539
           D   +         EA+DR  L L   Q++LV  V  A+   T++VL+S  P  + +++ 
Sbjct: 477 DPKWKYSPVPSDGREAVDRKALSL--EQEDLVKIVHKANPN-TVMVLVSSFPFAINWSQE 533

Query: 540 DPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNL-PMTEMAMRPSQ 598
           +  I AI+      Q  G  +AD++FG  NP G+   TW     I +L PM +  +R   
Sbjct: 534 N--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS--IADLPPMMDYDIR--- 586

Query: 599 SKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAI 658
                GRTY + K   +YPFG+G+SYTNF ++                 S +A   GK +
Sbjct: 587 ----NGRTYMYAKEKPLYPFGYGLSYTNFTYSDMAL-------------SSSALSKGKNL 629

Query: 659 KVTHAKCNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGA 717
           KV+            V+VKN G  DG     L  S P +    P KQL  F+++ +  G 
Sbjct: 630 KVS------------VNVKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLKGFDRISIKKGE 677

Query: 718 QQ 719
            +
Sbjct: 678 SK 679


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 208/686 (30%), Positives = 322/686 (46%), Gaps = 103/686 (15%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  ++  S++N  L+  + R V+ E R+       G   +SP +++ RDPRWGR 
Sbjct: 122 GGTVFPVPLSIGSTWNLDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRT 176

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSDGDR-LKVAASCKHFTAYDLDNWNGVDRFHFNA-- 225
           +E  GEDP L  +YA + V GLQG   D    VAA+ KHF  Y      G      NA  
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHFVGY------GSSEGGRNAGP 230

Query: 226 -KVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +  +++ +   +PF+  V E   AS+M +YN+++GVP   +  +L   +R EW  +G 
Sbjct: 231 VHMGTRELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGM 289

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLD-CGPFLGLHTESAVQRGLLSEIDI 343
           +++DC ++ +            +AA  AIRAG+DL+  G   G H + AV+   L    +
Sbjct: 290 VITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVL 349

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + A+   LT++ +LG+F+  P   P        + +  H  LA + A +GIVLLKN+  +
Sbjct: 350 DEAVRRVLTLKFKLGLFE-NPYVDP--QTAENVIGSGQHIGLARQLAAEGIVLLKNEAKA 406

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIA--CGYTTPLQGI----GRYA-RTIHQQG 456
           LPLS      +AVIGPN+D     +G+Y         TT L GI    G  A R ++  G
Sbjct: 407 LPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPG 465

Query: 457 CKDVACADD--QLFGAAIDASRQADATILVMG---------------------LDQSIE- 492
           C+     DD  + F  A+  + QAD  ++V+G                      D ++  
Sbjct: 466 CR---IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSD 522

Query: 493 ---AEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWA 549
               E +DR  L L G Q +L  ++    K   I+V ++G PI   +   D    AI+ A
Sbjct: 523 MDCGEGIDRMTLQLSGVQLDLAQEIHKLGK-RMIVVYINGRPIAEPWI--DEHADAILEA 579

Query: 550 GYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRF 609
            YPGQ GG AIADILFG  NP GKL M+    +++  LP+          KR  G+ Y  
Sbjct: 580 WYPGQEGGHAIADILFGDVNPSGKLTMSI--PKHVGQLPVY------YNGKRSRGKRYLE 631

Query: 610 YKGPVVYPFGHGMSYTNFVHT-VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRL 668
                 YPFG+G+SYT F ++ +   P V+                G A+          
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYSDIQMTPEVIGT-------------DGTAV---------- 668

Query: 669 TLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWA-PHKQLVAFEKVHVPAGAQQRVGINI-- 725
              V V+V N G  +G+  + ++ +  A  +  P ++L  F+K+ +  G +++V   I  
Sbjct: 669 ---VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKVEFTIGP 725

Query: 726 ----HVCKYLSVVDRSGTRRIPLGEH 747
               ++ +    V   G  R+ LG H
Sbjct: 726 EQLQYIGQDYRQVVEPGLFRVMLGRH 751


>gi|404254492|ref|ZP_10958460.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26621]
          Length = 898

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 155/414 (37%), Positives = 226/414 (54%), Gaps = 41/414 (9%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ R++ +EK+  L++ A A+PR+ I  Y WW+E+LHG     P           T+FP+
Sbjct: 51  IVARMTTEEKLPQLLNVAPAIPRIKIPAYNWWTESLHGALGAVP----------TTNFPE 100

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
            I  A+SF+A L   +   +S E RA++  G           GL  WSPN+NIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHQVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+     ++V G+QG + D   V A+ KHF  +         R   N  
Sbjct: 161 RGQETYGEDPYLAAHMGVAFVTGMQGPNPDLPDVIATPKHFAVHSGPEST---RHGANVF 217

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           VS  D+EDT+   FR  ++EG+  S+MC+YN+++G P CA+  +LK  +RG W   GY+V
Sbjct: 218 VSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFTGYVV 277

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLS 339
           SDCD+V    D   +      A A A++AG D +C     G   GL      +++RGL+S
Sbjct: 278 SDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNECHTQTIGDMGGLPERFGDSLRRGLIS 337

Query: 340 EIDINNALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           + DI+ +LV   + ++R G      G P++ P      + + +PDH  LAL+AA + +VL
Sbjct: 338 QDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPV-----RAILSPDHMALALQAAEKSLVL 392

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPL-QGIGRYA 449
           LKN G  LPL       +AVIGP  D T  + GNY+        P+ +G+ R A
Sbjct: 393 LKNDG-VLPLRP--GARIAVIGPLGDATRVLRGNYSSTQSAPPVPIVEGLRRAA 443



 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 141/277 (50%), Gaps = 48/277 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +EAE            D+  L LP  Q  ++ +     K P I+V M+G PI++A
Sbjct: 635 GLTSDLEAEETGTDIPGFQGGDKTSLDLPAEQMAMLERAKATGK-PVIVVAMNGSPINLA 693

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK++   AAI+ A YPGQ+GG A+A++L G ++PGG+LP+T+Y    + +LP       
Sbjct: 694 WAKDN--AAAIVEAWYPGQSGGLAVANVLTGKTDPGGRLPLTFY--RSVADLP------- 742

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  +    GRTYR++ G  VYPFGHG+SYT+F +T          PL      +    SG
Sbjct: 743 PFDNYAMTGRTYRYFTGKPVYPFGHGLSYTSFAYT----------PL------VLNPASG 786

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
            A +          L V  DV N G + G   + ++ + P    AP   L  ++++ +  
Sbjct: 787 GAEQ---------GLRVTTDVSNTGKRAGDEVVQLYLSFPDVPGAPRIALRGYQRITLAP 837

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           G ++ V  ++   + LS V  +G R +  G++ + +G
Sbjct: 838 GERRSVTFDLS-PRDLSAVRANGVREVMQGQYRVSVG 873


>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 862

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 160/409 (39%), Positives = 223/409 (54%), Gaps = 34/409 (8%)

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V L++  + AV RLGIK Y WW+EALHGV+  G           AT FPQ I  A+SF+ 
Sbjct: 46  VALMMDNSQAVERLGIKPYNWWNEALHGVARAGL----------ATVFPQPIGMAASFSP 95

Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
                +   VSDEARA  N   A         GLT W+P VNI+RDPRWGRG ET GEDP
Sbjct: 96  ESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDP 154

Query: 177 VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIED 234
            L+ +     V+GLQG+ DG   K+ A  KHF  +    WN   R  FN + +S +D+ +
Sbjct: 155 YLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSGPEWN---RHSFNVEDLSARDLYE 211

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
           T+  PF   V E KV  VMC+YN   G P C    +L + +R +W  +G I+SDC ++  
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271

Query: 295 YYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
           +Y+     + P  E A+A A+  G DL+CG       + AV+ G ++E DI+ A++  L 
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLNGTDLECGSSYKALVK-AVRAGQINEKDIDKAVMRLLE 330

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
            +  LG  D +P +  +  +    V + +H  LAL+ AR+ + LL+N G  LPL      
Sbjct: 331 ARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALDMARKSMTLLQNTGNILPLKR-GGL 388

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
           TVAV+GPN++ +V   GNY G+     T L GI    G   + I++QGC
Sbjct: 389 TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 91/306 (29%), Positives = 140/306 (45%), Gaps = 47/306 (15%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           A++   + AD  +   G+   +E E +          DR  + LP  Q+EL+S +  A K
Sbjct: 591 ASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRAGK 650

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              ILV  SG PI  A         AI+ A YPGQ GGTA+A++LFG  NPGG+LP+T+Y
Sbjct: 651 -KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVTFY 707

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
               ++ LP  E            GRTYR+     ++PFG+G+SYT F            
Sbjct: 708 RN--MSQLPDFE-------DYNMTGRTYRYMTQQPLFPFGYGLSYTTF------------ 746

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                 +G +  ++  + IK          L + V V N G +DG   + V+        
Sbjct: 747 -----DYGKL--SLGKEQIKQGEP------LKLAVSVTNSGQRDGEEVVQVYLKKQDDAE 793

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P K L AF++V++PAG    V  ++   +      +S T R+  G+++I +G +     
Sbjct: 794 GPGKTLRAFKRVYIPAGKSVDVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVGRSSQDQD 853

Query: 760 LHAATL 765
           L   + 
Sbjct: 854 LQRGSF 859


>gi|365836152|ref|ZP_09377549.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
 gi|364564272|gb|EHM42040.1| glycosyl hydrolase family 3 protein [Hafnia alvei ATCC 51873]
          Length = 810

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 205/689 (29%), Positives = 332/689 (48%), Gaps = 102/689 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASS+N      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 163 TVFPISLGLASSWNRDAVTTVGRI------SAYEASEDGLNMTWAPMVDVTRDPRWGRTS 216

Query: 170 ETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +     V  +QG++  DR  V  S KHF AY       ++N VD     
Sbjct: 217 EGFGEDTYLTSEMGRLMVEAMQGNNPADRHSVMTSVKHFAAYGAVEGGRDYNTVD----- 271

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q +   +  P++  +  G    VM S N +NG P  ++  +LK  +R EW   G 
Sbjct: 272 --MSPQRLFQDYMPPYKAALDAGS-GGVMVSLNSINGTPATSNSWLLKDVLRDEWNFKGI 328

Query: 285 IVSDCDSVGVYYDTQH-FTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H   S PE+A   A+++G+D+     +   +  S V+ G +SE +
Sbjct: 329 TISDHGAIKEL--IKHGVASDPEDAVRVAVKSGIDMSMSDEYYSKYLPSLVKSGRVSEKE 386

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F     +  Y HLGP      D       H+  A   AR+ +V
Sbjct: 387 VDDAARHVLNVKYDMGLF-----TDAYSHLGPVGSDPVDTNAESRLHRPEARSVARESMV 441

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQG----IGRYA 449
           LLKN+  +LPL+  +  T+A+IGP +D    ++G++  AG+     T LQG    +G  A
Sbjct: 442 LLKNRLNTLPLA--KSGTIALIGPLADSKRDVMGSWSAAGVVDQSVTVLQGLRNAVGYKA 499

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                  +   A+ A++++D  + V+G  Q 
Sbjct: 500 QIVYAKGANVSNDPGITDFLNLYEKAVTVDPRSPQAMIDEAVAAAKKSDVIVAVVGEAQG 559

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R+ + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+ + 
Sbjct: 560 MAHEASSRSDITIPQSQRDLIAALKQTGK-PLVLVLMNGRPL--ALEKEDQQADAILESW 616

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+      T     P + ++Y  
Sbjct: 617 FSGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPIYYSHLNTGRPYNPEKPEKYTS 674

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
             Y    GP +YPFG+G+SYT F  +V++    ++ P+  R GS+ A+++          
Sbjct: 675 HYYDAANGP-LYPFGYGLSYTTF--SVSDVK--MSSPIMKRDGSVTASVT---------- 719

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGI 723
                      VKN G +DGA  + L      A    P K+L  FEKV + +G Q+ +  
Sbjct: 720 -----------VKNTGKRDGATVVQLYIQDKTATMSRPVKELKGFEKVMLKSGEQKTISF 768

Query: 724 NIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I   + L   + S       G+ N+ IG
Sbjct: 769 KIDANQ-LKFWNASMKYVSEPGKFNVFIG 796


>gi|94495010|ref|ZP_01301591.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
 gi|94425276|gb|EAT10296.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
          Length = 872

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 150/398 (37%), Positives = 224/398 (56%), Gaps = 43/398 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ +L+LQEK + L++ A A+PRLGI  Y WW+E+LHG   +GP        P  T+FP+
Sbjct: 38  IVRQLTLQEKTEQLLNTAPAIPRLGIPAYNWWTESLHGA--IGP-------VP-TTNFPE 87

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
            I  A++F+A L   +  V+S E +A++  G           GL  WSPN+NIFRDPRWG
Sbjct: 88  PIGLAATFDAPLIHTVASVISTEVQALHTLGRQTGHLGRIGTGLDTWSPNINIFRDPRWG 147

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP L+ +   ++++G+QG + D   V A+ KHF  +         R   N  
Sbjct: 148 RGQETYGEDPYLTARMGVAFIQGMQGDNPDLPDVVATPKHFAVHSGPE---PSRHTDNIF 204

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
            +K+D+EDT+   FR  ++EGK  S+MC+YN+V+G P C    +L   +R  W   GY+V
Sbjct: 205 ATKRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGEPACGSAMLLTDYLRNAWGFRGYVV 264

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-------LGLHTESAVQRGLLS 339
           SDCD+V   Y    +  +P    + A+R G+D +C          LG   + A+    +S
Sbjct: 265 SDCDAVVDIYAHHKYAPSPATGVSVALRRGVDSECNNATLSGQAGLGDRYKDALAANHIS 324

Query: 340 EIDINNALVNTLTVQMRLGMFDG----EPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
             DI++ALV   + ++R G   G    +P+S P   +G     TP H  LAL+AA + +V
Sbjct: 325 IGDIDSALVRLFSARLRNGDLPGLSARKPNSTPVSAIG-----TPAHGALALDAAVKSLV 379

Query: 396 LLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYA 432
           LLKN+G +LPL   +R   + + GP +D T  + GNY+
Sbjct: 380 LLKNEG-ALPLKPGVR---IVLAGPLADATRVLRGNYS 413



 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 104/289 (35%), Positives = 153/289 (52%), Gaps = 50/289 (17%)

Query: 476 RQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILV 525
           R AD  + VMGL   +E E +          DR  L LP  Q   + K     K P ILV
Sbjct: 599 RDADVIVAVMGLTSDLEGEEMPVKVEGFEGGDRTTLALPADQIAFLEKAKATGK-PLILV 657

Query: 526 LMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYIT 585
           +M+G  ID+ +AK++   AAI+ A YPGQ+GG AIA++L G ++PGG+LP+T+Y    + 
Sbjct: 658 MMNGSAIDLGWAKDN--AAAILEAWYPGQSGGLAIANVLSGKADPGGRLPLTFY--HSVD 713

Query: 586 NL-PMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDG 644
           +L P T+ +M         GRTYR+++G  VYPFGHG+SYT F +    AP VV  P+DG
Sbjct: 714 DLPPFTDYSME--------GRTYRYFRGAPVYPFGHGLSYTRFRY----APLVVE-PIDG 760

Query: 645 RHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQ 704
                      K ++VT     R+T        NVG + G     ++ TPPA   AP   
Sbjct: 761 --------AVEKGLRVT----TRIT--------NVGQRPGDEVAQLYITPPAFEGAPRTA 800

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGG 753
           L  F+++ + AG  + +   +   + LS V  +G R +  G +++ +GG
Sbjct: 801 LRGFQRLSLKAGESRAISFTLS-PRDLSFVTMAGDRMLIPGNYDLSVGG 848


>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
 gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 862

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 160/409 (39%), Positives = 222/409 (54%), Gaps = 34/409 (8%)

Query: 66  VKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQVITTASSFNA 125
           V L++  + AV RLGIK Y WW+EALHGV+  G           AT FPQ I  A+SF+ 
Sbjct: 46  VALMMDNSQAVERLGIKPYNWWNEALHGVARAGL----------ATVFPQPIGMAASFSP 95

Query: 126 TLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFRDPRWGRGQETPGEDP 176
                +   VSDEARA  N   A         GLT W+P VNI+RDPRWGRG ET GEDP
Sbjct: 96  ESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDP 154

Query: 177 VLSGKYAASYVRGLQGS-DGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK-VSKQDIED 234
            L+ +     V+GLQG+ DG   K+ A  KHF  +    WN   R  FN + +S +D+ +
Sbjct: 155 YLTSRMGVMVVKGLQGTNDGKYDKLHACAKHFAVHSGPEWN---RHSFNVEDLSARDLYE 211

Query: 235 TFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGV 294
           T+  PF   V E KV  VMC+YN   G P C    +L + +R +W  +G I+SDC ++  
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271

Query: 295 YYDTQHFTSTP--EEAAADAIRAGLDLDCGPFLGLHTESAVQRGLLSEIDINNALVNTLT 352
           +Y+     + P  E A+A A+  G DL+CG       + AV+ G ++E DI+ A++  L 
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLNGTDLECGSSYKALVK-AVRAGQINEKDIDKAVMRLLE 330

Query: 353 VQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSHIRHR 412
            +  LG  D +P +  +  +    V + +H  LAL  AR+ + LL+N G  LPL      
Sbjct: 331 ARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALNMARKSMTLLQNTGNILPLKR-GGL 388

Query: 413 TVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI----GRYARTIHQQGC 457
           TVAV+GPN++ +V   GNY G+     T L GI    G   + I++QGC
Sbjct: 389 TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437



 Score =  122 bits (306), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 91/306 (29%), Positives = 140/306 (45%), Gaps = 47/306 (15%)

Query: 470 AAIDASRQADATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASK 519
           A++   + AD  +   G+   +E E +          DR  + LP  Q+EL+S +  A K
Sbjct: 591 ASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRAGK 650

Query: 520 GPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWY 579
              ILV  SG PI  A         AI+ A YPGQ GGTA+A++LFG  NPGG+LP+T+Y
Sbjct: 651 -KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPVTFY 707

Query: 580 PQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
               ++ LP  E            GRTYR+     ++PFG+G+SYT F            
Sbjct: 708 RN--MSQLPDFE-------DYNMTGRTYRYMTQQPLFPFGYGLSYTTF------------ 746

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
                 +G +  ++  + IK          L + V V N G +DG   + V+        
Sbjct: 747 -----DYGKL--SLDKEQIKQGEP------LKLAVSVTNSGQRDGEEVVQVYLKKQDDAE 793

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGTKHSVS 759
            P K L AF++V++PAG    V  ++   +      +S T R+  G+++I +G +     
Sbjct: 794 GPGKTLRAFKRVYIPAGKSVNVEFDLKDKELEWWDAQSNTMRVCAGKYDIMVGRSSQDQD 853

Query: 760 LHAATL 765
           L   + 
Sbjct: 854 LQRGSF 859


>gi|421846569|ref|ZP_16279716.1| beta-D-glucoside glucohydrolase [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|411772163|gb|EKS55801.1| beta-D-glucoside glucohydrolase [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
          Length = 765

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 209/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A++A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVNAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+     K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--VLVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMQYDAEP-GKFNVFIG 751


>gi|395230595|ref|ZP_10408899.1| periplasmic beta-glucosidase [Citrobacter sp. A1]
 gi|424731344|ref|ZP_18159928.1| periplasmic beta-glucosidase [Citrobacter sp. L17]
 gi|394715980|gb|EJF21765.1| periplasmic beta-glucosidase [Citrobacter sp. A1]
 gi|422893995|gb|EKU33810.1| periplasmic beta-glucosidase [Citrobacter sp. L17]
          Length = 765

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 210/691 (30%), Positives = 332/691 (48%), Gaps = 106/691 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN      +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVRTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM   N +NG P  +D  +LK  +R EW   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVGLNSLNGTPATSDAWLLKDVLRDEWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T S PE+A   AI++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTASDPEDAVRVAIKSGINMSMSDEYYSKYLPGLIKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGAKESDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGIGR------ 447
           LLKN+  +LPL   +  TVAV+GP +D    ++G++  AG+A    T L GI        
Sbjct: 397 LLKNRLETLPLK--KSATVAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGDQG 454

Query: 448 ---YAR--------------TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQS 490
              YA+               ++++  K    +   +   A+ A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANITNDKGIVDFLNLYEEAVKVDPRSPQAMIDEAVIAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
           R +    GP +YPFG+G+SYT F  TV++    ++ P   R G++ A+            
Sbjct: 630 RYFDEANGP-LYPFGYGLSYTTF--TVSDVK--LSSPTMKRDGTVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V V N G ++GA  + ++     A    P KQL  FEKV++  G  Q V  
Sbjct: 673 ---------VQVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFEKVNLKPGETQTVSF 723

Query: 723 -INIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
            I+I   K+ +   R      P G+ N+ IG
Sbjct: 724 PIDIDALKFWN--QRMQYDAEP-GKFNVFIG 751


>gi|380692851|ref|ZP_09857710.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 854

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 165/447 (36%), Positives = 241/447 (53%), Gaps = 34/447 (7%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           P+    L   +R +DL+GRL+L+EK+ L+ + + AV RLGI  Y WWSE LHGV+  G  
Sbjct: 15  PYKNHQLSPKERADDLLGRLTLKEKIGLMQNSSNAVERLGIAPYNWWSEGLHGVARNGL- 73

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAM---------YNGGTAGLT 151
                    AT FP  +  AS+F+    E I   VSDE RA          Y  G  GLT
Sbjct: 74  ---------ATVFPITMGMASTFDDEAIEKIYTAVSDEGRAKFHDAHRRNRYGCGNEGLT 124

Query: 152 YWSPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAY 210
           +W+PNVNIFRDPRWGRGQET GEDP L+ +   S V+G+QG +D    K  A  KH+  +
Sbjct: 125 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVSVVKGMQGPADAAYDKTHACAKHYAVH 184

Query: 211 DLDNWNGVDRFHFNAK-VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPN 269
                    R  F+ + +S +D+ +T+   F+  V E  V  VMC+Y +  G P C    
Sbjct: 185 SGPE---AKRHSFDVEHLSPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCDSNR 241

Query: 270 ILKRTIRGEWRLNGYIVSDCDSVG-VYYDTQHFT-STPEEAAADAIRAGLDLDCGPFLGL 327
           +L + +R EW     +VSDC ++   +Y  +H T     +A+A A+  G DL+CG     
Sbjct: 242 LLTQILRDEWGYKHLVVSDCGAISDFFYKDRHGTHKDAADASASAVINGTDLECGVEYA- 300

Query: 328 HTESAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELAL 387
           H E AV+RGL++E  IN +L   L  +  LG  D + S   +  +G   V    H+++AL
Sbjct: 301 HLEEAVKRGLITEERINTSLRRLLEARFALGEMDPD-SIVAWSKIGIDTVDCDLHRQMAL 359

Query: 388 EAARQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGI-G 446
           +  R+ +VLL N G  LPL+    R +AV+GPN+  +V   GNY G+     T L+GI  
Sbjct: 360 DITRKSMVLLHNNG-ILPLAKTSTR-IAVMGPNAVDSVMQWGNYKGVPSHTYTILEGIRN 417

Query: 447 RYARTIHQQGCKDVACADDQLFGAAID 473
           +     +++GC+     ++Q+F +  +
Sbjct: 418 KVGNVPYEKGCE---LLENQVFDSYFN 441



 Score = 99.0 bits (245), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 86/295 (29%), Positives = 127/295 (43%), Gaps = 50/295 (16%)

Query: 472 IDASRQADATILVMGLDQSIEAEAL-----------DRAGLLLPGRQQELVSKVSMASKG 520
           +D  + AD  I + G+   +E E             DR  + LP  Q++ + K    +  
Sbjct: 584 VDKVKDADIVIFLGGISPDLEGEEKHFVNCPGFSGGDRTSIELPQVQRD-ILKALKKAGK 642

Query: 521 PTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYP 580
             + V  SG    VA         AI+ A YPGQAGG A+AD+LFG  NP GKLP+T+Y 
Sbjct: 643 KVVFVNCSGSA--VALVPEMESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYK 700

Query: 581 QEYITNLP-MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVA 639
                 LP   + +M+         RTYR+     ++PFG+G+SYT F            
Sbjct: 701 S--TDQLPDFEDYSMK--------NRTYRYMVEAPLFPFGYGLSYTTF------------ 738

Query: 640 VPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHW 699
              D   G +N     K+I           L  +V+VKN G  DGA  + V+        
Sbjct: 739 ---DISKGRLNK----KSISAGK------DLNFKVNVKNTGKCDGAEVIQVYVRKVDDME 785

Query: 700 APHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIGGT 754
            P K L AF +V + AG    V I++    +      + T R+  G++ +  G +
Sbjct: 786 GPVKSLRAFRRVPLKAGESCVVSIDLLPATFEFFDPVTNTMRVMPGKYEVMYGNS 840


>gi|253574725|ref|ZP_04852065.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845771|gb|EES73779.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 953

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 164/446 (36%), Positives = 238/446 (53%), Gaps = 38/446 (8%)

Query: 36  TTRTLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVS 95
            T   PF    LP+ +RV+DL+ R +L EK++L+      VPRLG+K Y+  +E  HGV+
Sbjct: 4   NTYQYPFQNPDLPLAERVDDLVSRFTLDEKIELMCQFQTEVPRLGVKKYKHGTEGAHGVA 63

Query: 96  NVGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMY--NGGTAGLTYW 153
            +G           ATSFPQ    A ++N  L   IG V+ DEAR  +  N    GLT W
Sbjct: 64  WLGE----------ATSFPQNTGLACTWNPELMRQIGSVIGDEARVYFQRNPEVNGLTIW 113

Query: 154 SPNVNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLD 213
           +P V++ RDPRWGR +E  GEDP L+G      V+GLQG      K  A+ KHF A    
Sbjct: 114 APTVDMERDPRWGRTEEAYGEDPHLTGALTTELVKGLQGDHPFYYKAVATLKHFYA---- 169

Query: 214 NWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKR 273
           N N + R   +  +  ++  + +   F     EG+  S+M +YN +NG P C   + +  
Sbjct: 170 NNNEIGRGSESVSIDPRNKREYYLKAFERAFREGRAGSMMTAYNGINGTP-CNLNSEVNE 228

Query: 274 TIRGEWRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTE 330
            ++GEW ++G++V D  D +G   D ++  S   EA A +I+AG+D   D  P +     
Sbjct: 229 IVKGEWEMDGFVVGDAGDVMGTVLDHKYVESYA-EAVAGSIKAGIDSMTDEQPIMFHALR 287

Query: 331 SAVQRGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAA 390
           +A+++GLL+E D++ AL N   V+ RLG FD E    PY  +  + +C P+H  LAL+A+
Sbjct: 288 NALEQGLLAESDLDRALRNAFRVRFRLGEFDPE-ERNPYSRVPEEKLCAPEHAALALKAS 346

Query: 391 RQGIVLLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYAR 450
           R+ IVLLKN+G  LPL      +VAV+GP ++   T    Y+G      TPLQGI     
Sbjct: 347 REAIVLLKNEG-LLPLPQTLG-SVAVVGPLANEAYTDW--YSGTPPYRITPLQGIK---- 398

Query: 451 TIHQQGCKDVACADDQLFGAAIDASR 476
              + G + V      LFG  +D  R
Sbjct: 399 --EKMGDRPV------LFGTGLDRIR 416



 Score = 96.3 bits (238), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 138/297 (46%), Gaps = 47/297 (15%)

Query: 471 AIDASRQADATILVMGLDQSIEA-EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSG 529
           A+ A++ A+  ++ +G +  I A E +DR  ++LP  QQ L+  V   +   T++V++  
Sbjct: 559 AVQAAKSAETAVVFVGNNPFINAKETIDRVDIVLPPAQQALIQAVRAVNPN-TVVVIVGS 617

Query: 530 GPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM 589
            P  V +   +  + +I++  +  Q  G A+ D+LFG  NP G+L MTWY  + +  LP 
Sbjct: 618 YPFAVNW--ENAHVPSILFTSHSAQELGHAVTDVLFGDYNPAGRLNMTWY--QSVDQLPD 673

Query: 590 TEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSI 649
             M     + KR    TY+++ G V+YPFGHG+SYT F ++   A    A+  DG     
Sbjct: 674 I-MDYDIIKGKR----TYQYFDGEVLYPFGHGLSYTTFEYSDL-ALDRAAIGQDGE---- 723

Query: 650 NATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF-----STPPAGHWAPHKQ 704
                               + +  DV+N G+  G     ++     S  P     P K 
Sbjct: 724 --------------------VQISFDVRNAGNIAGDEVPQLYVRIGQSRVP----RPLKT 759

Query: 705 LVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRR-IPLGEHNIHIGGTKHSVSL 760
           L  F ++H+     QRV   +   + L+  D +  R  +  GE+ + +G +   ++L
Sbjct: 760 LKGFTRLHLQPDETQRVTFTLK-AEELAFWDVTRDRYCVESGEYTVMVGPSSDEIAL 815


>gi|218130693|ref|ZP_03459497.1| hypothetical protein BACEGG_02282 [Bacteroides eggerthii DSM 20697]
 gi|217987037|gb|EEC53368.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 954

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 213/743 (28%), Positives = 355/743 (47%), Gaps = 103/743 (13%)

Query: 39  TLPFCQVSLPIPQRVNDLIGRLSLQEKVKLLISG--AAAVPRLGIKGYEWWSEALHGVSN 96
           TL +    LP+ +RV  L+  ++ ++K++L+  G     +P L +       EA+HG S 
Sbjct: 167 TLRYMDPRLPVNERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFS- 224

Query: 97  VGPGTKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPN 156
                 +G    GAT FPQ +   +++N  L E +   + +E  A    GT  +  WSP 
Sbjct: 225 ------YGS---GATIFPQALAMGATWNKNLTEKVAMAIGEETLA---AGT--MQAWSPV 270

Query: 157 VNIFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWN 216
           +++ +D RWGR +ET GEDPVL  +   ++++G Q S G    +  + KHF  +      
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-SKG----LFTTPKHFGGHGAP-LG 324

Query: 217 GVDRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIR 276
           G D       +S++++ +   VPFR  +      S+M +Y+   GVP      +L+  +R
Sbjct: 325 GRDSHDIG--LSEREMREVHLVPFRHVIRNYSCQSLMMAYSDFLGVPVAKSKELLRNILR 382

Query: 277 GEWRLNGYIVSDCDSVGVYYDTQHFTSTPE-EAAADAIRAGLDLDCG-PFLGLHTESAVQ 334
            EW  +G++VSDC ++G     +H+T+  + EAA  A+ AG+  +CG  +       A +
Sbjct: 383 EEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAK 442

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKD-------VCTPDHQELAL 387
            G L   +++N     L +  R  +F+ E       H  P D         +  H+E+A 
Sbjct: 443 DGRLDMANLDNVCRTMLRMMFRNELFEKE-------HKEPLDWNKIYPGWNSDSHKEIAR 495

Query: 388 EAARQGIVLLKNQGPSLPLS-HIRHRTVAVIGPNSDVTVTMIGNYA------GIACGYTT 440
           ++AR+ IV+L+N+   LPLS H+  RT+AV+GP +D      G+Y        +    T 
Sbjct: 496 QSARESIVMLENKDDVLPLSKHV--RTIAVLGPGAD--NLQPGDYTPKLRPGQLKSVLTG 551

Query: 441 PLQGIGRYARTIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEA------- 493
             Q +G   + +++QGC +   ++ +    A+  + Q+D  +LV+G   + EA       
Sbjct: 552 IKQAVGNQTKILYEQGC-EFTGSNGENIPNAVKVAAQSDVVVLVLGDCSTSEATTDVYKT 610

Query: 494 --EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
             E  D A L+LPGRQQEL+  V    K P +LVL  G P ++   K      AII    
Sbjct: 611 SGENHDYATLILPGRQQELLEAVCATGK-PVVLVLQIGRPYNL--TKESELCKAIIVNWL 667

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYK 611
           PGQ GG A AD+LFG  NP G+LPMT +P+ ++  LP+       +  +RY      +Y 
Sbjct: 668 PGQEGGLATADVLFGDYNPAGRLPMT-FPR-HVGQLPL--YYNFKTSGRRYEYSDLEYYP 723

Query: 612 GPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
              +Y FG+G+SYT+F ++            +  +G+I A                    
Sbjct: 724 ---LYYFGYGLSYTSFEYSDLKVE-------ERENGNIIA-------------------- 753

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKY 730
             V+VKN+G + G   + ++ T           +L  F +VH+  G  + +   +   + 
Sbjct: 754 -HVNVKNIGHRAGDEVVQLYVTDMYASVKTRITELKDFTRVHLRPGESKSISFELTPYE- 811

Query: 731 LSVVDRSGTRRIPLGEHNIHIGG 753
           LS+++ +  R +  G   I +GG
Sbjct: 812 LSLLNDNMDRVVEKGTFKILVGG 834


>gi|329928264|ref|ZP_08282164.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328937943|gb|EGG34345.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 934

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 163/465 (35%), Positives = 247/465 (53%), Gaps = 28/465 (6%)

Query: 41  PFCQVSLPIPQRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPG 100
           PF    LP+ +RV+DL+ RL+L EK++L+      +PRLG++ Y+  +E  HGV+ +G  
Sbjct: 4   PFQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE- 62

Query: 101 TKFGGDFPGATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA--GLTYWSPNVN 158
                    AT FPQ    A ++N  L   IG V+++EAR  Y    A  GLT W+P V+
Sbjct: 63  ---------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTVD 113

Query: 159 IFRDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGV 218
           + RDPRWGR +E  GEDP L+G+ +   V+G+QG      K  A+ KHF   + +   G 
Sbjct: 114 LERDPRWGRTEEAYGEDPHLTGQLSTGLVKGMQGDHPFYYKTVATLKHFYGNNNEADRGS 173

Query: 219 DRFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGE 278
                + +  ++     F+ PFR    EGK  S+M +YN +NG P C   + +   ++ E
Sbjct: 174 ASVSIDPRNKREYYLKAFEAPFR----EGKAGSMMTAYNGINGTP-CNLNHEVNDIVKQE 228

Query: 279 WRLNGYIVSDC-DSVGVYYDTQHFTSTPEEAAADAIRAGLD--LDCGPFLGLHTESAVQR 335
           W ++G++V D  D +G   D  H+ ++  EA AD+++AG+D   D           A+++
Sbjct: 229 WGMDGFVVGDAGDVLGTVMD-HHYVASYAEAVADSVKAGIDSITDDQDISFRALRDALEQ 287

Query: 336 GLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIV 395
           GLL E  +++AL NT  V+ RLG FD E    PY  +    +C P+H EL+L AAR+ IV
Sbjct: 288 GLLEEQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPESKLCAPEHAELSLRAARESIV 346

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRYA--RTIH 453
           LLKN G  LPL     ++ AVIGP ++   T    Y+G      TPLQG+   A  R++ 
Sbjct: 347 LLKNDG-LLPLPRDPFKSAAVIGPLANEAFT--DWYSGTPPYRITPLQGVQAKAGDRSVQ 403

Query: 454 QQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDR 498
                D       + G  +  S + +  IL+ G  +S +A   +R
Sbjct: 404 FHTGLDQVRLRSAVSGTYVALSSE-EQGILLAGTSESADAAVFER 447



 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 110/235 (46%), Gaps = 40/235 (17%)

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E +DR  + LP  QQ L+  V  A+   T++V++ G P  V + +    + AI++  + G
Sbjct: 565 ETIDRPDITLPPAQQALIQAVFEANP-RTVVVIVGGYPFAVNWEQE--HLPAILFTSHAG 621

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           Q  G A+AD+L+G  NPGG+L MTWY       LP   M     + K    RTY+++ G 
Sbjct: 622 QELGHAVADVLYGDYNPGGRLNMTWYKS--TEQLPDI-MDYDIIKGK----RTYQYFDGD 674

Query: 614 VVYPFGHGMSYTNFVHT--VANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLG 671
           V+YPFGHG+SY+ F ++  + + P + A       GS+  T++                 
Sbjct: 675 VLYPFGHGLSYSRFKYSGLLLSQPKMTA------EGSVTVTVA----------------- 711

Query: 672 VQVDVKNVGSKDGAHTLLVFSTPPAGH-WAPHKQLVAFEKVHVPAGAQQRVGINI 725
               V+N   + G   + ++    +     P K L  F +VHV     + V   +
Sbjct: 712 ----VRNESQRAGDEVVQLYVKADSSRVIRPLKTLKGFRRVHVAPDTSETVTFEL 762


>gi|149280000|ref|ZP_01886125.1| putative beta-glucosidase [Pedobacter sp. BAL39]
 gi|149229197|gb|EDM34591.1| putative beta-glucosidase [Pedobacter sp. BAL39]
          Length = 793

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 202/688 (29%), Positives = 330/688 (47%), Gaps = 89/688 (12%)

Query: 109 GATSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 168
           G T FP  I  +S+++  L + +   ++ E R    GG  G   + P +++ R+PRW R 
Sbjct: 152 GTTVFPTSIGQSSTWDPALIKEMAAAIAMETR--LQGGHIG---YGPVLDLAREPRWSRV 206

Query: 169 QETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVDRFHFN--- 224
           +ET GEDPVL+ +   + V G QG++ G  + + ++ KHFTAY      GV     N   
Sbjct: 207 EETYGEDPVLNSRMGEAMVSGFQGTNIGSGVNILSTLKHFTAY------GVPEGGHNGGS 260

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             V  +++  ++  PF+  V  G + SVM +YN V+G+P  ++  +L   +RG+W  NG+
Sbjct: 261 VTVGNRELFQSYLPPFKAAVKAGAL-SVMTAYNSVDGIPCSSNRYLLTDILRGQWGFNGF 319

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLLSEIDI 343
           +VSD +S+       H  S+  EAAA A+ AGLD D   +  G     AV  GL+    +
Sbjct: 320 VVSDLNSISGLEGNHHVASSATEAAALAMNAGLDADLSGYGYGPALVKAVNGGLVKMATV 379

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVLLKNQGPS 403
           + AL   L ++  +G+F+  P   P      K V    H  LA + A++ +VLLKN+   
Sbjct: 380 DTALARVLRLKFNMGLFE-NPYVNP--KQAEKQVMNAKHVTLARKVAQESVVLLKNEKNI 436

Query: 404 LPLSHIRHRTVAVIGPNSDVTVTMIGNYAGIAC--GYTTPLQGI----GRYARTIHQQGC 457
           LPLS    + +AVIGPN+D     +G+Y          T L GI     +     +Q+GC
Sbjct: 437 LPLSKAL-KNIAVIGPNADNVYNQLGDYTAPQADGKVITVLNGIRAKVSKETGVFYQKGC 495

Query: 458 -------------------KDVAC-----ADDQLFGAAIDASRQADATILVMGLDQSIEA 493
                               DVA      +  + F      +  A+     + +      
Sbjct: 496 AIRDTASAGIAAAVALASKSDVAIVVLGGSSARDFKTEYQNTGAAEVKASAVAVSDMESG 555

Query: 494 EALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPG 553
           E  DR+ L L GRQ EL+  V + +  P ++VL+ G P+ + +A  +  +AA++ A YPG
Sbjct: 556 EGFDRSTLDLMGRQMELLRAV-VKTGTPVVVVLIKGRPLTLNWAAEN--VAAMVDAWYPG 612

Query: 554 QAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGP 613
           Q GG AIAD+LFG  NP G+L ++  P+  +  LP+     RP      P       + P
Sbjct: 613 QEGGNAIADVLFGDYNPAGRLSVS-VPKS-VGQLPVYYNKKRP-----LPHNYVELDEQP 665

Query: 614 VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQ 673
            +Y FG+G+SY+ F ++  N  T V+    GR         GK ++V           V 
Sbjct: 666 -LYSFGYGLSYSTFEYS--NLKTNVS----GR---------GKDVRVQ----------VT 699

Query: 674 VDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLS 732
            D+KN GS+DG   + L      +    P +QL  F ++ + +G QQ++   +   + L 
Sbjct: 700 FDLKNTGSRDGDEVVQLYLRDEQSSVVTPMQQLKQFRRLSLKSGQQQQLSFELS-AEDLQ 758

Query: 733 VVDRSGTRRIPLGEHNIHIGGTKHSVSL 760
           ++++    ++  G+ ++ +G +   + L
Sbjct: 759 LMNQQMEWQVEPGDFSLMVGASSSDIRL 786


>gi|395490413|ref|ZP_10421992.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 898

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 152/396 (38%), Positives = 218/396 (55%), Gaps = 40/396 (10%)

Query: 56  LIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGATSFPQ 115
           ++ R++ +EK+  L++ A A+PRL I  Y WW+E+LHG     P           T+FP+
Sbjct: 51  IVARMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGAVP----------TTNFPE 100

Query: 116 VITTASSFNATLWEAIGRVVSDEARAMYNGGT---------AGLTYWSPNVNIFRDPRWG 166
            I  A+SF+A L   +   +S E RA++  G           GL  WSPN+NIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHEVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160

Query: 167 RGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAK 226
           RGQET GEDP LS     ++V G+QG + D   V A+ KHF  +         R   N  
Sbjct: 161 RGQETYGEDPYLSAHMGVAFVTGMQGPNPDLPDVIATPKHFAVHSGPEST---RHGANVF 217

Query: 227 VSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIV 286
           VS  D+EDT+   FR  ++EG+  S+MC+YN+++G P CA+  +LK  +RG W   GY+V
Sbjct: 218 VSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWGFTGYVV 277

Query: 287 SDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTE--SAVQRGLLS 339
           SDCD+V    D   +      A A A++AG D +C     G   GL      +++RGL+S
Sbjct: 278 SDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNECHTQTIGDMGGLPERFGDSLRRGLIS 337

Query: 340 EIDINNALVNTLTVQMRLGMFD---GEPSSQPYGHLGPKDVCTPDHQELALEAARQGIVL 396
           + DI+ +LV   + ++R G      G P++ P      + + +PDH  LAL+A  + +VL
Sbjct: 338 QDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPV-----RAILSPDHMALALQATEKSLVL 392

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYA 432
           LKN G  LPL       +AVIGP  D T  + GNY+
Sbjct: 393 LKNDG-VLPLRP--GARIAVIGPLGDATRVLRGNYS 425



 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 140/277 (50%), Gaps = 48/277 (17%)

Query: 486 GLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVA 535
           GL   +EAE            D+  L LP  Q  ++ +     K P I+V M+G PI++A
Sbjct: 635 GLTSDLEAEETGTDIPGFQGGDKTSLDLPAEQMAMLERAKATGK-PVIVVAMNGSPINLA 693

Query: 536 FAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAMR 595
           +AK++   AAI+ A YPGQ+GG A+A++L G ++PGG+LP+T+Y    + +LP       
Sbjct: 694 WAKDN--AAAIVEAWYPGQSGGVAVANVLTGKTDPGGRLPLTFY--RSVADLP------- 742

Query: 596 PSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISG 655
           P  +    GRTYR++ G  VYPFGHG+SYT+F +T          PL      +    SG
Sbjct: 743 PFDNYTMTGRTYRYFTGKPVYPFGHGLSYTSFAYT----------PL------VLNPASG 786

Query: 656 KAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAFEKVHVPA 715
            A +          L V  DV N   + G   + ++ + P    AP   L  ++++ +  
Sbjct: 787 GAEQ---------GLRVTTDVSNTVKRAGDEVVQLYLSFPDVPGAPRIALRGYQRITLAP 837

Query: 716 GAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           G ++ V  ++   + LS V  +G R +  G++ + +G
Sbjct: 838 GERRSVTFDLS-PRDLSAVRANGVREVMQGQYRVSVG 873


>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 233/795 (29%), Positives = 368/795 (46%), Gaps = 109/795 (13%)

Query: 8   FFLGLILLSASSSGLAAREPFACDPKDATTRTLP-FCQVSLPIPQRVNDLIGRLSLQEKV 66
             LGL LL  + S L++++           +++P +     PI +R++DLI R++L+EKV
Sbjct: 6   LILGLCLLGVTHS-LSSKD----------KKSIPLYKDAKAPIEKRIDDLISRMTLEEKV 54

Query: 67  KLLI--------------SGAAAVP-RLGIKGYEWWSEALHGV--------SNVGPGTKF 103
             L                    VP  +G   Y   +  L           S +G    F
Sbjct: 55  LQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFDINPELRNSMQKKAMEESRLGIPIIF 114

Query: 104 GGDFPGA--TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIF 160
           G D      T +P  +  A S+N  L E    V + EAR       +G+ + +SP +++ 
Sbjct: 115 GYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSAQEAR------MSGVDWTFSPMIDVA 168

Query: 161 RDPRWGRGQETPGEDPVLSGKYAASYVRGLQGSD-GDRLKVAASCKHFTAYDLDNWNGVD 219
           RDPRWGR  E  GEDP  +G +AA+ VRG QG D     ++AA  KH+  Y         
Sbjct: 169 RDPRWGRVAEGYGEDPYTNGVFAAASVRGYQGDDMSAENRMAACLKHYVGYGASE---AG 225

Query: 220 RFHFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEW 279
           R +   ++S Q + DT+ +P+ M V  G  A++M S+N ++GVP  A+P I+   ++  W
Sbjct: 226 RDYVYTEISAQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRW 284

Query: 280 RLNGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCGPF-LGLHTESAVQRGLL 338
           + +G+IVSD  +V      Q   +T ++AA  A  AGL++D        H +  V+ G +
Sbjct: 285 KHDGFIVSDWGAVE-QLKNQGLAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKV 343

Query: 339 SEIDINNALVNTLTVQMRLGMFDGEPSSQPYGHL-GPKD-VCTPDHQELALEAARQGIVL 396
           +   ++ ++   L V+ RLG+F+     +PY  +   KD    P    +A + A + +VL
Sbjct: 344 TMAQVDESVRRVLRVKFRLGLFE-----RPYTPVTNEKDRFFRPQSMAVAAQLAAESMVL 398

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNYAG------IACGYTTPLQGIGRYAR 450
           LKN    LPL++   + +AV+GP +     ++G++ G      +   Y       G  A 
Sbjct: 399 LKNDNQILPLTN--KKRIAVVGPMAKNGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAE 456

Query: 451 TIHQQGCKDVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQEL 510
             +  GCK     D   F  A+D  R +D  I+ +G   +   E   R+ + LP  Q+EL
Sbjct: 457 LRYAMGCKPQG-NDRSGFAGALDVVRWSDVVIVCLGEMLTWSGENASRSTIALPQIQEEL 515

Query: 511 VSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNP 570
           V ++  A K P ILVL +G P+++   + +P   AI+    PG  G  ++A IL G  NP
Sbjct: 516 VKELKEAGK-PIILVLSNGRPLEL--NRMEPLCDAILEIWQPGINGARSMAGILSGRINP 572

Query: 571 GGKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYR-FYKGPV---VYPFGHGMSYTN 626
            GKL +T          P +   +    ++R  GR ++ FYK       Y FG+G+SYT 
Sbjct: 573 SGKLAIT---------FPYSTGQIPIYYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTE 623

Query: 627 FVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAH 686
           F + V   P+   V    + G                      L V+V V NVG +DGA 
Sbjct: 624 FQYGVVT-PSSTTV----KRGE--------------------KLSVEVTVTNVGKRDGAE 658

Query: 687 TLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLG 745
           T+  F S P      P K+L  FEK  +  G  +    ++ + + L  VD +G R +  G
Sbjct: 659 TVHWFISDPYCSITRPVKELKHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAG 718

Query: 746 EHNIHIGGTKHSVSL 760
           E+NI +   K  + L
Sbjct: 719 EYNIWVQDQKVKIEL 733


>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
 gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
           ATCC 21756]
          Length = 895

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 151/401 (37%), Positives = 224/401 (55%), Gaps = 39/401 (9%)

Query: 51  QRVNDLIGRLSLQEKVKLLISGAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGDFPGA 110
           Q   DL+ +++ +EK+  L++ A A+PRLG+  Y WW+E+LHG           G  P  
Sbjct: 41  QMAADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGAL---------GSLP-T 90

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTA---------GLTYWSPNVNIFR 161
           T+FP+ +  A++F+  L + +G  +  E R ++    A          L  WSPN+NIFR
Sbjct: 91  TNFPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTALNTWSPNINIFR 150

Query: 162 DPRWGRGQETPGEDPVLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRF 221
           DPRWGRGQET GEDP L+ +   ++V G+QG D D  +V A+ KH+  +         R 
Sbjct: 151 DPRWGRGQETYGEDPHLAARMGVAFVEGVQGPDPDLPQVIATPKHYAVHSGPEST---RH 207

Query: 222 HFNAKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRL 281
           H N  VS++D+EDT+   FR  ++E K  S+MC+YN+++G P CA   +LK  +R  W+ 
Sbjct: 208 HANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAYNRIDGQPACASDMLLKDYLRTAWKF 267

Query: 282 NGYIVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-------PFLGLHTESAVQ 334
           +GY+VSDCD+V    D   +      A A A+RAG+D +C          L      A+Q
Sbjct: 268 DGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAGVDNECNGATLTDTAGLSNRYRDALQ 327

Query: 335 RGLLSEIDINNALVNTLTVQMRLGMFDGEPSSQPY--GHLGPKDVCTPDHQELALEAARQ 392
           +GL++  D++ ALV   + ++R+G     P  +P       P  V TP+H+ LAL A+ +
Sbjct: 328 QGLITIGDVDQALVRLYSARLRVGDL---PGVRPVDTAPKAPSAVMTPEHEALALAASEK 384

Query: 393 GIVLLKNQGPSLPLSH-IRHRTVAVIGPNSDVTVTMIGNYA 432
            +VLLKN G  LPL   +R   VAVIGP  D T  + GNY+
Sbjct: 385 SLVLLKNDG-LLPLKPGLR---VAVIGPLGDATRVLRGNYS 421



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 146/284 (51%), Gaps = 48/284 (16%)

Query: 479 DATILVMGLDQSIEAEAL----------DRAGLLLPGRQQELVSKVSMASKGPTILVLMS 528
           D  + V+GL   +EAE            D+  L +P  QQ L+ +     K P ++V M+
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQAKALGK-PLVVVAMN 683

Query: 529 GGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLP 588
           G P+++++AK++   AAI+ A YPGQ+GG AIA++L G +NP G+LP+T+Y  + + +LP
Sbjct: 684 GSPLNLSWAKDN--AAAILEAWYPGQSGGLAIANVLTGKTNPSGRLPLTFY--KSVEDLP 739

Query: 589 MTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGS 648
                  P    R  GRTYR++ G  VYPFG+G+SYT F +    AP  V  P+ G    
Sbjct: 740 -------PFGDYRMDGRTYRYFTGQPVYPFGYGLSYTRFDY----APLKVE-PIKGD--- 784

Query: 649 INATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVFSTPPAGHWAPHKQLVAF 708
                +G+ ++VT              V+NVG + G     ++   P    AP   L  F
Sbjct: 785 -----AGQGLRVT------------TTVRNVGQRAGDEVAQLYLNFPNSPGAPRVALRGF 827

Query: 709 EKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           ++V +  G  + V  ++   + LS VD  G R++  G + + +G
Sbjct: 828 QRVSLKPGEAKSVTFSLS-SRDLSSVDPDGVRQVMTGRYKVSVG 870


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 225/791 (28%), Positives = 351/791 (44%), Gaps = 131/791 (16%)

Query: 46  SLPIPQRVNDLIGRLSLQEKVKLL--ISGAAAVPRLGIKGYEWWSEALH-GVSNV----- 97
           S PI +RV DL+ ++++ EK   +  + G+  V    +   +W SE    G+ N+     
Sbjct: 36  SAPIEERVKDLLSQMNMDEKTCQMATLYGSGRVLADALPTEKWKSEIWKDGIGNIDEEHN 95

Query: 98  GPGTKFGGD--FP--------------------------------------GATSFPQVI 117
           G G KFG +  FP                                       AT FP   
Sbjct: 96  GLG-KFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFFPAQC 154

Query: 118 TTASSFNATLWEAIGRVVSDEARAMYNGGTAGLT-YWSPNVNIFRDPRWGRGQETPGEDP 176
              S++N  L   IG V + EA A+      G T  +SP ++I +DPRWGR  E  GEDP
Sbjct: 155 GQGSTWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRAVECYGEDP 208

Query: 177 VLSGKYAASYVRGLQGSDGDRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTF 236
            L G+     ++ LQ     + K+ A+ KHF  Y +           +  V+ +++   +
Sbjct: 209 YLVGQLGKQMIQSLQ-----KHKLVATPKHFAVYSIPVGGRDGGTRTDPHVAPREMRTLY 263

Query: 237 DVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGYIVSDCDSVGVYY 296
             PFR+   E     VM SYN  +G P       L + +R EW   GY+VSD D+V    
Sbjct: 264 LEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKGYVVSDSDAVEFIS 323

Query: 297 DTQHFTSTPEEAAADAIRAGLDLDC-----GPFLGLHTESAVQRGLLSEIDINNALVNTL 351
                    EEA   ++ AGL++         F+     SA+ +G +S+  I+  +   L
Sbjct: 324 SKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIK-PLRSAIAKGKVSQATIDQRVSEIL 382

Query: 352 TVQMRLGMFDGEPSSQPY---GHLGPKDVCTPDHQELALEAARQGIVLLKNQGPSLPLSH 408
            V+  LG+FD      PY   G L  K V   +HQ +ALEAARQ IVLLKNQ   LPL  
Sbjct: 383 YVKFWLGLFD-----NPYRGDGKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLLPLQK 437

Query: 409 IRHRTVAVIGPNSDVTVTMIGNYAGIACGYTTPLQGIGRY---ARTIHQQGCK------- 458
              ++VAVIGPN+D    +I  Y        T  +GI      A+ ++++GC+       
Sbjct: 438 TL-KSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPGAKVVYKKGCEIVDPHFP 496

Query: 459 -------DVACADDQLFGAAIDASRQADATILVMGLDQSIEAEALDRAGLLLPGRQQELV 511
                  D+   + Q+   AI+A++ A+  I+V+G  +    E   R  L LPGRQ+EL+
Sbjct: 497 ESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGRQEELL 556

Query: 512 SKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGYPGQAGGTAIADILFGTSNPG 571
             V    K PTILV++ G    + +AK    + AI+ A +PG+  G A+A+ +FG +NPG
Sbjct: 557 KAVCKLGK-PTILVMIDGRASSINYAKK--YVPAILHAWFPGEFCGQAVAETIFGDNNPG 613

Query: 572 GKLPMTWYPQEYITNLPMTEMAMRPSQSKRYPGRTYRFYKGPVVYPFGHGMSYTNFVHTV 631
           GKL +T +P+  +  +P      +P  S    G +        ++PFGHG+SYT F +  
Sbjct: 614 GKLAVT-FPKS-VGQIPFA-FPFKPG-SDSGCGTSVT----GALFPFGHGLSYTTFEYN- 664

Query: 632 ANAPTVVAVPLDGRHGSINATISGKAIKVTHAKCNRLTLGVQVDVKNVGSKDGAHTLLVF 691
                             N  IS +   V         + V   VKN G + G   + ++
Sbjct: 665 ------------------NLKISPEQQGVLG------EVKVSCTVKNTGKRPGDEVVQLY 700

Query: 692 STPPAGHWAPH-KQLVAFEKVHVPAGAQQRVGINIHVCKYLSVVDRSGTRRIPLGEHNIH 750
                     + K L  FE++ +    +++V   +   + L++ D++   ++  G   + 
Sbjct: 701 LRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLS-PQDLAIWDKNMKFQVEPGTFKVM 759

Query: 751 IGGTKHSVSLH 761
           IG +   + L 
Sbjct: 760 IGASSKDIRLE 770


>gi|419862653|ref|ZP_14385247.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
           CVM9340]
 gi|388344793|gb|EIL10617.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
           CVM9340]
          Length = 765

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 200/668 (29%), Positives = 322/668 (48%), Gaps = 101/668 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTST-PEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            VSD  ++      +H T+  PE+A   A+++G+++     +   +    ++ G ++  +
Sbjct: 284 TVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYYSKYLPGLIKSGKVTMEE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLGPK+    D       H++ A E AR+ +V
Sbjct: 342 LDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   +  T+AV+GP +D    ++G++  AG+A    T L GI    G   
Sbjct: 397 LLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVADQSVTVLTGIKNAVGENG 454

Query: 450 RTIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQS 490
           + ++ +G     D    D                 ++   A+  ++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L++ +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RPSQSKRYPGRTY 607
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+    +   RP  + +    T 
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTS 629

Query: 608 RFYKGP--VVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
           R++      +YPFG+G+SYT F  TV++    ++ P   R G + A+             
Sbjct: 630 RYFDEANGALYPFGYGLSYTTF--TVSDVK--LSAPTMKRDGKVTAS------------- 672

Query: 666 NRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG-- 722
                   V V N G +DGA  + ++     A    P KQL  FEK+ +  G  Q V   
Sbjct: 673 --------VQVTNTGKRDGATVVQMYLQDVTASMSRPVKQLKGFEKITLKPGETQTVSFP 724

Query: 723 INIHVCKY 730
           I+I   K+
Sbjct: 725 IDIEALKF 732


>gi|423121272|ref|ZP_17108956.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
 gi|376394107|gb|EHT06758.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
          Length = 765

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 208/688 (30%), Positives = 326/688 (47%), Gaps = 100/688 (14%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GR+      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPISLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVSRDPRWGRAS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+ +   + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTTEMGRAMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G   +VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSDAWLLKEVLRDDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFTSTPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEIDI 343
            VSD  ++          + PE+A   A+++G+++     +   +    V+ G ++  ++
Sbjct: 284 TVSDHGAIKELI-KHGVAADPEDAVRVALKSGINMSMSDEYYSKYLPGLVKSGKVTMAEL 342

Query: 344 NNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIVL 396
           ++A  + L V+  +G+F+      PY HLGP+     D       H++ A E ARQ +VL
Sbjct: 343 DDATRHVLNVKYDMGLFN-----DPYSHLGPQGSDPQDTNAESRLHRQEAREVARQSLVL 397

Query: 397 LKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYAR 450
           LKN+  +LPL   +  TVAVIGP +D    M+G++  AG+A    T L GI    G   +
Sbjct: 398 LKNRLETLPLK--KSGTVAVIGPLADSKRDMMGSWSAAGVADQSVTVLTGIKEALGDKGK 455

Query: 451 TIHQQGCK---DVACAD----------------DQLFGAAIDASRQADATILVMGLDQSI 491
            I+ +G     D    D                  +   A+ A++Q+D  + V+G  Q +
Sbjct: 456 VIYARGANVTNDKGIVDFLNLYEKAVQVDPRSPQAMIDEAVAAAKQSDVVVAVVGEAQGM 515

Query: 492 EAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAGY 551
             EA  R  + LP  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   +
Sbjct: 516 AHEASSRTDITLPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETWF 572

Query: 552 PGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPMTEMAM---RP---SQSKRYPGR 605
            G  GG AIAD+LFG +NP GKLPM+ +P+  +  +P+    +   RP    +  +Y  R
Sbjct: 573 AGTEGGHAIADVLFGDANPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNADKPNKYTSR 630

Query: 606 TYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAKC 665
            +    GP +YPFG+G+SYT F  +V++    ++ P   R GS+ A+             
Sbjct: 631 YFDEANGP-LYPFGYGLSYTTF--SVSDVK--MSAPTLQRDGSVTAS------------- 672

Query: 666 NRLTLGVQVDVKNVGSKDGAHTL-LVFSTPPAGHWAPHKQLVAFEKVHVPAGAQQRVGIN 724
                   V V N G ++GA  + L      A    P K L  F++V +  G  Q V   
Sbjct: 673 --------VQVTNTGKREGATVIQLYLQDVTASMSRPVKMLRGFKRVTLKPGETQTVAFP 724

Query: 725 IHVCKYLSVVDRSGTRRIPLGEHNIHIG 752
           I +   L   ++        G+ N+ IG
Sbjct: 725 IEI-DALKFWNQKMKYDAEPGKFNVFIG 751


>gi|365850358|ref|ZP_09390822.1| glycosyl hydrolase family 3 protein [Yokenella regensburgei ATCC
           43003]
 gi|364567365|gb|EHM45033.1| glycosyl hydrolase family 3 protein [Yokenella regensburgei ATCC
           43003]
          Length = 765

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 202/669 (30%), Positives = 323/669 (48%), Gaps = 103/669 (15%)

Query: 111 TSFPQVITTASSFNATLWEAIGRVVSDEARAMYNGGTAGLTY-WSPNVNIFRDPRWGRGQ 169
           T FP  +  ASSFN    + +GRV      + Y     GL   W+P V++ RDPRWGR  
Sbjct: 118 TVFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRVS 171

Query: 170 ETPGEDPVLSGKYAASYVRGLQG-SDGDRLKVAASCKHFTAYDL----DNWNGVDRFHFN 224
           E  GED  L+     + V  +QG S  DR  V  S KHF AY        +N VD     
Sbjct: 172 EGFGEDTYLTAIMGKTMVESMQGKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVD----- 226

Query: 225 AKVSKQDIEDTFDVPFRMCVMEGKVASVMCSYNQVNGVPTCADPNILKRTIRGEWRLNGY 284
             +S Q + + +  P++  +  G    VM + N +NG P  +D  +LK  +R +W   G 
Sbjct: 227 --MSPQRLFNDYLPPYKAALDAGS-GGVMVALNSLNGTPASSDSWLLKDILRDDWGFKGI 283

Query: 285 IVSDCDSVGVYYDTQHFT-STPEEAAADAIRAGLDLDCG-PFLGLHTESAVQRGLLSEID 342
            +SD  ++      +H T S PE+A   A+ +G+++     +   +    V+ G ++  +
Sbjct: 284 TISDHGAIKEL--IKHGTASDPEDAVRIALTSGINMSMSDEYYSKYLPGLVKSGKVTMAE 341

Query: 343 INNALVNTLTVQMRLGMFDGEPSSQPYGHLGPKDVCTPD-------HQELALEAARQGIV 395
           +++A  + L V+  +G+F+      PY HLG K+    D       H++ A E AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGAKETDPQDTNAESRLHRKEAREVARESLV 396

Query: 396 LLKNQGPSLPLSHIRHRTVAVIGPNSDVTVTMIGNY--AGIACGYTTPLQGI----GRYA 449
           LLKN+  +LPL   ++  +AV+GP +D    M+G++  AG+A    T L GI    G   
Sbjct: 397 LLKNRLETLPLK--KNAIIAVVGPLADSQRDMMGSWSAAGVASQSVTLLTGIQNAVGTEG 454

Query: 450 RTIHQQGC-----KDVACADDQ--------------LFGAAIDASRQADATILVMGLDQS 490
           + ++ +G      K +    +Q              +   A+ A++Q+D  + V+G  Q 
Sbjct: 455 KVLYAKGANVTNDKGIVEFLNQYEPAVVVDPRSPQAMIDEAVKAAKQSDVVVAVVGEAQG 514

Query: 491 IEAEALDRAGLLLPGRQQELVSKVSMASKGPTILVLMSGGPIDVAFAKNDPRIAAIIWAG 550
           +  EA  R  + +P  Q++L+S +    K P +LVLM+G P+  A  K D +  AI+   
Sbjct: 515 MAHEASSRTDITIPQSQRDLISALKATGK-PLVLVLMNGRPL--ALVKEDQQADAILETW 571

Query: 551 YPGQAGGTAIADILFGTSNPGGKLPMTWYPQEYITNLPM------TEMAMRPSQSKRYPG 604
           + G  GG AIAD+LFG  NP GKLPM+ +P+  +  +P+      T     P +  +Y  
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPMS-FPRS-VGQIPVYYSHLNTGRPYNPEKPNKYTS 629

Query: 605 RTYRFYKGPVVYPFGHGMSYTNFVHTVANAPTVVAVPLDGRHGSINATISGKAIKVTHAK 664
             +    GP +YPFG+G+SYT F  TV++    ++ P+  R G + A+            
Sbjct: 630 HYFDEANGP-LYPFGYGLSYTTF--TVSDVK--MSSPVMKRDGKVTAS------------ 672

Query: 665 CNRLTLGVQVDVKNVGSKDGAHTLLVF-STPPAGHWAPHKQLVAFEKVHVPAGAQQRVG- 722
                    V+V N G ++GA  + ++     A    P KQL  FEKV +  G  Q V  
Sbjct: 673 ---------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGFEKVTLKPGETQTVSF 723

Query: 723 -INIHVCKY 730
            I+++  K+
Sbjct: 724 PIDVNALKF 732


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.416 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,862,684,050
Number of Sequences: 23463169
Number of extensions: 573245454
Number of successful extensions: 1194741
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6066
Number of HSP's successfully gapped in prelim test: 1659
Number of HSP's that attempted gapping in prelim test: 1145659
Number of HSP's gapped (non-prelim): 13788
length of query: 770
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 619
effective length of database: 8,816,256,848
effective search space: 5457262988912
effective search space used: 5457262988912
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)