BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047621
(550 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/565 (78%), Positives = 474/565 (83%), Gaps = 46/565 (8%)
Query: 32 AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
AQSSPVFACDV SNPSLAS GFCN SLGI RV DLV RLTLQEKI FLV+SAGSVSRLG
Sbjct: 35 AQSSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSRLG 94
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
IPKYEWWSEALHGVSYVGPGTHFSN+VPGATSFPQVILTAASFNASLF+AIGKVVSTEAR
Sbjct: 95 IPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTEAR 154
Query: 152 AMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
AMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL+SKY + YV+GLQQTD G R
Sbjct: 155 AMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTDNGDSER 214
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYY 246
LKVAACCKHYTAYDLDNWKGTDRYHFNA+V + +Y
Sbjct: 215 LKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYN 274
Query: 247 LIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+ K YIVSDCDSVDV+YNSQHYTKTPEEAAA +ILAGL
Sbjct: 275 QVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEAAAITILAGL 334
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCGSFLGKHTEAAV AGL+N SA+D A+SNNFATLMRLGFFDG PSKQ YG+LGPKDV
Sbjct: 335 DLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDV 394
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
CT NQ+LA +AARQGIVLLKN+ GSLPLSPTAIK LAVIGPNANVTKTMIGNYEGTPCK
Sbjct: 395 CTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCK 454
Query: 406 YTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
YTTPLQGL A VAT Y AGCSNV C AQVDDAKK AASADATVLVMGADQSIEAES DR
Sbjct: 455 YTTPLQGLTASVATTYLAGCSNVACAAAQVDDAKKLAASADATVLVMGADQSIEAESRDR 514
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+D+LLPGQQQLLIT+VA V+KGPVIL+IMSGGG D+SFAK + KITSILWVGYPG+AGGA
Sbjct: 515 VDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGA 574
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG YNPSGRLPMTWYPQ+Y
Sbjct: 575 AIADVIFGYYNPSGRLPMTWYPQAY 599
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 876 bits (2263), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/592 (74%), Positives = 487/592 (82%), Gaps = 49/592 (8%)
Query: 8 NRAPKVSVLFLFLTYCSLQHSSSS---AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
N APKV+V FL+ S SS AQSSPVFACDV +NP+L GFCNTSL RV
Sbjct: 6 NSAPKVTVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARV 65
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLVKRLTL+EKI FLV+SA SVSRLGIPKYEWWSEALHGVSYVGPGTHF++VVPGATSF
Sbjct: 66 ADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSF 125
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQVILTAASFNASLF+AIGK VSTEARAMYNVGLAGLTFWSPN+NIFRDPRWGRGQETPG
Sbjct: 126 PQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPG 185
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV--- 241
EDPLL+SKYA+GYV+GLQQ+D GSP+RLKVAACCKHYTAYDLDNWKG DR+HFNA+V
Sbjct: 186 EDPLLSSKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQ 245
Query: 242 ----------------------IYTYYLIKFK---------------------YIVSDCD 258
+ +Y + K YIVSDCD
Sbjct: 246 DMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCD 305
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
SVDV YNSQHYTKTPEEAAAK+ILAGLDLNCGSFLG+HTEAAVK GLV+ESA+D A+SNN
Sbjct: 306 SVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNN 365
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
FATLMRLGFFDG+PSK YG+LGPKDVCT +Q+LA +AARQGIVLLKN+ GSLPLSPTA
Sbjct: 366 FATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTA 425
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL A+VAT Y GCSNV CGTAQ+D+A
Sbjct: 426 IKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQIDEA 485
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
KK AA+ADATVL++G DQSIEAE DR+++ LPGQQ LLITEVAK +KG VIL++MSGGG
Sbjct: 486 KKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGG 545
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FDISFAKND KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 546 FDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSY 597
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 876 bits (2263), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/592 (74%), Positives = 487/592 (82%), Gaps = 49/592 (8%)
Query: 8 NRAPKVSVLFLFLTYCSLQHSSSS---AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
N APKV+V FL+ S SS AQSSPVFACDV +NP+L GFCNTSL RV
Sbjct: 6 NSAPKVTVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARV 65
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLVKRLTL+EKI FLV+SA SVSRLGIPKYEWWSEALHGVSYVGPGTHF++VVPGATSF
Sbjct: 66 ADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSF 125
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQVILTAASFNASLF+AIGK VSTEARAMYNVGLAGLTFWSPN+NIFRDPRWGRGQETPG
Sbjct: 126 PQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPG 185
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV--- 241
EDPLL+SKYA+GYV+GLQQ+D GSP+RLKVAACCKHYTAYDLDNWKG DR+HFNA+V
Sbjct: 186 EDPLLSSKYASGYVRGLQQSDDGSPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQ 245
Query: 242 ----------------------IYTYYLIKFK---------------------YIVSDCD 258
+ +Y + K YIVSDCD
Sbjct: 246 DMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCD 305
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
SVDV YNSQHYTKTPEEAAAK+ILAGLDLNCGSFLG+HTEAAVK GLV+ESA+D A+SNN
Sbjct: 306 SVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNN 365
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
FATLMRLGFFDG+PSK YG+LGPKDVCT +Q+LA +AARQGIVLLKN+ GSLPLSPTA
Sbjct: 366 FATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTA 425
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL A+VAT Y GCSNV CGTAQ+D+A
Sbjct: 426 IKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQIDEA 485
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
KK AA+ADATVL++G DQSIEAE DR+++ LPGQQ LLITEVAK +KG VIL++MSGGG
Sbjct: 486 KKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGG 545
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FDISFAKND KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 546 FDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSY 597
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/564 (78%), Positives = 471/564 (83%), Gaps = 48/564 (8%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
SPVFACDVVSNPSLAS GFCNTSLG+ RV DLVKRLTLQEKI FLV+SAGSVSRLGIPK
Sbjct: 1 SPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPK 60
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGTHFS+VVPGATSFPQVILTAASFN SLF AIGKVVSTEARAMY
Sbjct: 61 YEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMY 120
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL+SKY +GYVKGLQQ D G+P+ LKV
Sbjct: 121 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDGLKV 180
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----------------------- 251
AACCKHYTAYDLDNWKG DRYHFNA+V F+
Sbjct: 181 AACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVN 240
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG--LD 286
YIV+DCDS+DV YNSQHYTKTPEEAAAK+ILAG LD
Sbjct: 241 GIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIRLD 300
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
LNCGSFLGKHTEAAV AGLVNESAID A+SNNFATLMRLGFFDG PSKQ YG+LGPKDVC
Sbjct: 301 LNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVC 360
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T NQ+LA +AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY
Sbjct: 361 TAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 420
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
TTPLQGLAA+VAT Y GCSNV C TAQVDDAKK AA+ADATVLVMGAD SIEAES DR+
Sbjct: 421 TTPLQGLAALVATTYLPGCSNVACSTAQVDDAKKIAAAADATVLVMGADLSIEAESRDRV 480
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
D+LLPGQQQLLIT VA + GPVIL+IMSGGG D+SFAK + KITSILWVGYPG+AGGAA
Sbjct: 481 DILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 540
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
IAD+ FG YNPSGRLPMTWYPQSY
Sbjct: 541 IADIIFGSYNPSGRLPMTWYPQSY 564
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/596 (75%), Positives = 488/596 (81%), Gaps = 46/596 (7%)
Query: 1 MAPAQNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGI 60
MA +N PKV V LF + + SAQSSPVFACDVVSNPSLASLGFCNTS+GI
Sbjct: 1 MASVSPQNSVPKVPVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGI 60
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
+ RV DLVKRLTLQEKI FLV+SAG+VSRLGIPKYEWWSEALHGVSYVGPGTHFS+ V G
Sbjct: 61 NDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAG 120
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
ATSFPQVILTAASFN SLF+AIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ
Sbjct: 121 ATSFPQVILTAASFNTSLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ETPGEDPLL+SKY + YVKGLQQ D G P++LKVAACCKHYTAYDLDNWKG+DRYHFNA+
Sbjct: 181 ETPGEDPLLSSKYGSCYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAV 240
Query: 241 V-------------------------IYTYYLIKFK---------------------YIV 254
V + +Y + K YIV
Sbjct: 241 VTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIV 300
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
+DCDS+DV Y SQ+YTKTPEEAAA +ILAG+DLNCGSFLG+HTEAAVK GLVNE AID A
Sbjct: 301 TDCDSLDVFYKSQNYTKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIA 360
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+SNNFATLMRLGFFDG PSKQ YG+LGPKDVCT NQ+LA +AARQGIVLLKNTAGSLPL
Sbjct: 361 VSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPL 420
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA VAT Y GCSNV C TAQ
Sbjct: 421 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVATTYLPGCSNVACSTAQ 480
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
VDDAKK AA+ADATVLVMGAD SIEAES DR+D+LLPGQQQLLIT VA V+ GPVIL+IM
Sbjct: 481 VDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIM 540
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
SGGG D+SFA+ + KITSILWVGYPG+AGGAAIAD+ FG YNPSGRLPMTWYPQSY
Sbjct: 541 SGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSY 596
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/592 (73%), Positives = 479/592 (80%), Gaps = 49/592 (8%)
Query: 8 NRAPKVSVLFLFLTYCSLQHSSSS---AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
N APKV+V FL+ S SS QSSPVFACDV +NP+L GFCNTSL RV
Sbjct: 6 NSAPKVTVFLCFLSCFSHFLSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARV 65
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLVKRLTL+EKI FLV+SA SVSRLGIPKYEWWSEALHGVSYVGPGTHF+++VPGATSF
Sbjct: 66 ADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSF 125
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQVILTAASFNASLF+AIGKVVSTEARAMYNVGLAGLTFWSPN+NIFRDPRWGRGQETPG
Sbjct: 126 PQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPG 185
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT 244
EDPLL+SKYA+ YV+GLQQ D GSP+RLKVAACCKHYTAYDLDNWKG DR HFNA+V
Sbjct: 186 EDPLLSSKYASAYVRGLQQGDDGSPDRLKVAACCKHYTAYDLDNWKGVDRLHFNAVVTKQ 245
Query: 245 YYLIKFK----------------------------------------------YIVSDCD 258
F+ YIVSDCD
Sbjct: 246 DMDDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYIVSDCD 305
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
SVDV YNSQHYTKTPEEAAAK+ILAGLDLNCGSFLG+HTEAAVK GLV+ESA+D A+SNN
Sbjct: 306 SVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNN 365
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
FATLMRLGFFDG+PSK YG+LGPKDVCT +Q++A +AARQGIVLLKN+ GSLPLSPTA
Sbjct: 366 FATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLPLSPTA 425
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
IK LA+IGPNANVTKTMIGNYEGTPCKYTTPLQGL A+VAT Y GCSNV CGTAQ+D+A
Sbjct: 426 IKTLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQIDEA 485
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
KK AA+ADATVL++G DQSIEAE DR+ + LPGQQ LLITEVAK +KG VIL++MSGGG
Sbjct: 486 KKIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVILVVMSGGG 545
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FDISFAKND KI SILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 546 FDISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSY 597
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 863 bits (2230), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/598 (72%), Positives = 472/598 (78%), Gaps = 50/598 (8%)
Query: 2 APAQNRNRAPKVSVLFLFLTY-CSLQH--SSSSAQSSPVFACDVVSNPSLASLGFCNTSL 58
+P RNRAP SV +FLT+ C + S+AQSSPVFACDV +NPSLA+ GFCNT L
Sbjct: 5 SPLTRRNRAPPSSVSSVFLTFLCFFLYFLDLSNAQSSPVFACDVAANPSLAAYGFCNTVL 64
Query: 59 GIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVV 118
I+ RV DLV RLTLQEKI FLV A V+RLGIP YEWWSEALHGVSY+GPGTHFS+ V
Sbjct: 65 KIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQV 124
Query: 119 PGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGR 178
PGATSFPQVILTAASFN SLFQAIGKVVSTEARAMYNVGLAGLT+WSPN+NIFRDPRWGR
Sbjct: 125 PGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGR 184
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
GQETPGEDPLLASKYA+GYVKGLQ+TDGG NRLKVAACCKHYTAYD+DNWKG +RY FN
Sbjct: 185 GQETPGEDPLLASKYASGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFN 244
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A+V + Y FK Y
Sbjct: 245 AVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGY 304
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
IVSDCDSVDVLY +QHYTKTP EAAA SILAGLDLNCGSFLG+HTE AVK+GLVNE+AID
Sbjct: 305 IVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAID 364
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
AISNNF TLMRLGFFDG+P Q YG LGP DVCT ANQ+LA DAARQGIVLLKNT G L
Sbjct: 365 KAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GFL 423
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT 432
PLSP +IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA V+T Y GCSNV C
Sbjct: 424 PLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTTYLPGCSNVACAV 483
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A V A K AA+AD TVL++GADQSIEAES DR+DL LPGQQQ L+ +VAK AKGPV+L+
Sbjct: 484 ADVAGATKLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLV 543
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
IMSGGGFDI+FAKNDPKI ILWVGYPG+AGG AIAD+ FGRYNPSGRLPMTWYPQSY
Sbjct: 544 IMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSY 601
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 859 bits (2219), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/598 (72%), Positives = 470/598 (78%), Gaps = 50/598 (8%)
Query: 2 APAQNRNRAPKVSV---LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSL 58
+P RNRAP SV +FL + + S+AQSSPVFACDV +NPSLA+ GFCNT L
Sbjct: 5 SPLTRRNRAPPSSVSSVYLIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVL 64
Query: 59 GIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVV 118
I+ RV DLV RLTLQEKI FLV A V+RLGIP YEWWSEALHGVSY+GPGTHFS+ V
Sbjct: 65 KIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQV 124
Query: 119 PGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGR 178
PGATSFPQVILTAASFN SLFQAIGKVVSTEARAMYNVGLAGLT+WSPN+NIFRDPRWGR
Sbjct: 125 PGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGR 184
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
GQETPGEDPLLASKYA+GYVKGLQ+TDGG NRLKVAACCKHYTAYD+DNWKG +RY FN
Sbjct: 185 GQETPGEDPLLASKYASGYVKGLQETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFN 244
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A+V + Y FK Y
Sbjct: 245 AVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGY 304
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
IVSDCDSVDVLY +QHYTKTP EAAA SILAGLDLNCGSFLG+HTE AVK+GLVNE+AID
Sbjct: 305 IVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAID 364
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
AISNNF TLMRLGFFDG+P Q YG LGP DVCT ANQ+LA DAARQGIVLLKNT G L
Sbjct: 365 KAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GCL 423
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT 432
PLSP +IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA V+T Y GCSNV C
Sbjct: 424 PLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAV 483
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A V A K AA+AD +VLV+GADQSIEAES DR+DL LPGQQQ L+ +VAK AKGPV+L+
Sbjct: 484 ADVAGATKLAATADVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLV 543
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
IMSGGGFDI+FAKNDPKI ILWVGYPG+AGG AIAD+ FGRYNPSG+LPMTWYPQSY
Sbjct: 544 IMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSY 601
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 842 bits (2175), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/568 (73%), Positives = 470/568 (82%), Gaps = 46/568 (8%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
++SAQS FACD +NPS++ FC++SLG + RV DLVKRLTLQEKI FL+++A +V+
Sbjct: 38 TTSAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVT 97
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLGIPKYEWWSEALHGVSYVGPGT FSNVVPGATSFPQVILTAASFNASLF+AIGKVVST
Sbjct: 98 RLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVST 157
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
EARAMYNVGLAGLT+WSPN+NIFRDPRWGRGQETPGEDPLL+SKYA GYV+GLQQ D G
Sbjct: 158 EARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD 217
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IY 243
P+RLKVAACCKHYTAYDLDNWKGTDRYHFNA+V +
Sbjct: 218 PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMC 277
Query: 244 TYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
+Y + K YIVSDCDSVDVLYNSQHYTK+PEEAAAK+IL
Sbjct: 278 SYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTIL 337
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FLGKHTEAAV GLVNE+AI A+ NN TLMRLGFFDG+PSKQ YG+LGP
Sbjct: 338 AGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGP 397
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
KDVCT +Q+LA +AARQGIVLLKN+ SLPLS +AIK+LAVIGPNANVTKTMIGNYEGT
Sbjct: 398 KDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGT 457
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
PCKYTTPLQGL+AVV+T +Q GC+NV C +AQ+D+AKK AASADATVLV+G+DQSIEAES
Sbjct: 458 PCKYTTPLQGLSAVVSTSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSDQSIEAES 517
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR+DL LPGQQ LLITEVAK +KGPVIL+IM+GGG DI+FAK D KITSILWVG+PG+A
Sbjct: 518 RDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEA 577
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GGAAIADV FG +NPSGRLPMTWYPQSY
Sbjct: 578 GGAAIADVIFGSFNPSGRLPMTWYPQSY 605
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/568 (73%), Positives = 470/568 (82%), Gaps = 46/568 (8%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
++SAQS FACD +NPS++ FC++SLG + RV DLVKRLTLQEKI FL+++A +V+
Sbjct: 65 TTSAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVT 124
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLGIPKYEWWSEALHGVSYVGPGT FSNVVPGATSFPQVILTAASFNASLF+AIGKVVST
Sbjct: 125 RLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVST 184
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
EARAMYNVGLAGLT+WSPN+NIFRDPRWGRGQETPGEDPLL+SKYA GYV+GLQQ D G
Sbjct: 185 EARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD 244
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IY 243
P+RLKVAACCKHYTAYDLDNWKGTDRYHFNA+V +
Sbjct: 245 PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMC 304
Query: 244 TYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
+Y + K YIVSDCDSVDVLYNSQHYTK+PEEAAAK+IL
Sbjct: 305 SYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTIL 364
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FLGKHTEAAV GLVNE+AI A+ NN TLMRLGFFDG+PSKQ YG+LGP
Sbjct: 365 AGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGP 424
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
KDVCT +Q+LA +AARQGIVLLKN+ SLPLS +AIK+LAVIGPNANVTKTMIGNYEGT
Sbjct: 425 KDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGT 484
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
PCKYTTPLQGL+AVV+T +Q GC+NV C +AQ+D+AKK AASADATVLV+G+DQSIEAES
Sbjct: 485 PCKYTTPLQGLSAVVSTSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSDQSIEAES 544
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR+DL LPGQQ LLITEVAK +KGPVIL+IM+GGG DI+FAK D KITSILWVG+PG+A
Sbjct: 545 RDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEA 604
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GGAAIADV FG +NPSGRLPMTWYPQSY
Sbjct: 605 GGAAIADVIFGSFNPSGRLPMTWYPQSY 632
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/595 (71%), Positives = 460/595 (77%), Gaps = 49/595 (8%)
Query: 2 APAQNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGID 61
+P RNRAP SV L L + L SS+AQS+PVFACDV NPSLA+ GFCNT++ I+
Sbjct: 5 SPPTRRNRAPS-SVFSLSLIFLCLL-DSSNAQSTPVFACDVAGNPSLAAYGFCNTAIKIE 62
Query: 62 LRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGA 121
RV DLV RLTLQEKI L V+RLGIP YEWWSEALHGVSYVGPGT FS VPGA
Sbjct: 63 YRVADLVARLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGA 122
Query: 122 TSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQE 181
TSFPQVILTAASFN SLFQAIGKVVSTEARAMYNVGLAGLT+WSPN+NIFRDPRWGRGQE
Sbjct: 123 TSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQE 182
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
TPGEDPLL+SKYA+GYVKGLQ+TD NRLKVAACCKHYTAYD+DNWKG +RY FNA+V
Sbjct: 183 TPGEDPLLSSKYASGYVKGLQETDSSDANRLKVAACCKHYTAYDVDNWKGVERYSFNAVV 242
Query: 242 ----IYTYYLIKFK------------------------------------------YIVS 255
+ Y FK YIVS
Sbjct: 243 NQQDLDDTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVS 302
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAI 315
DCDSVDVLY +QHYTKTPEEAAA SI AGLDLNCG FLG HTEAAVKAGLV E+AID AI
Sbjct: 303 DCDSVDVLYKNQHYTKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAI 362
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+NNF TLMRLGFFDG P KQ YG LGPKDVCT ANQ+LA +AARQGIVLLKNT G+LPLS
Sbjct: 363 TNNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKNT-GALPLS 421
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV 435
P IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA V T Y GCSNV C A V
Sbjct: 422 PKTIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVHTTYLPGCSNVACAVADV 481
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+ K AA++DATVLV+GADQSIEAES DR+DL LPGQQQ L+T+VAK AKGPV L+IMS
Sbjct: 482 AGSTKLAAASDATVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMS 541
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GGGFDI+FAKND KI ILWVGYPG+AGG A ADV FGRYNPSGRLPMTWYPQSY
Sbjct: 542 GGGFDITFAKNDAKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSY 596
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 837 bits (2163), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/596 (70%), Positives = 470/596 (78%), Gaps = 49/596 (8%)
Query: 4 AQNRNRAPKVSV-LFLFLTYCSLQHSSS--SAQSSPVFACDVVSNPSLASLGFCNTSLGI 60
A NR P VSV L F+ + +L S S+Q+S VFACDV NP+LA+ GFCN L +
Sbjct: 2 ASVENRTPNVSVFLCFFVLFATLLLSGGRVSSQTSAVFACDVAKNPALANYGFCNKKLSV 61
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
D RV DLV+RLTLQEK+ LV+SA VSRLGIPKYEWWSEALHGVS +GPGTHFSNV+PG
Sbjct: 62 DARVKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPG 121
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
ATSFP IL AASFNASLFQ IGKVVSTEARAM+NVGLAGLT+WSPNINIFRDPRWGRGQ
Sbjct: 122 ATSFPMPILIAASFNASLFQTIGKVVSTEARAMHNVGLAGLTYWSPNINIFRDPRWGRGQ 181
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ETPGEDPLLASKYA GYVKGLQQTD G N+LKVAACCKHYTAYD+D+WKG RY FNA+
Sbjct: 182 ETPGEDPLLASKYAAGYVKGLQQTDDGDSNKLKVAACCKHYTAYDVDDWKGVQRYTFNAV 241
Query: 241 V----IYTYYLIKFK------------------------------------------YIV 254
V + Y FK YIV
Sbjct: 242 VTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIV 301
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCDSVDVL+ +QHYTKTPEEAAAKSILAGLDLNCGSFLG++TE AVK GL+ E++I++A
Sbjct: 302 SDCDSVDVLFKNQHYTKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNA 361
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ NNFATLMRLGFFDG PSKQPYG LGPKDVCT ANQ+LA +AARQGIVLLKN AGSLPL
Sbjct: 362 VYNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPL 421
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
+ AIK+LAVIGPNAN T+ MIGNYEG PCKYT+PLQGL A+V T + AGC +VQC A
Sbjct: 422 NAKAIKSLAVIGPNANATRAMIGNYEGIPCKYTSPLQGLTALVPTSFAAGCPDVQCTNAA 481
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
+DDAKK AASADATV+V+GA+ +IEAESHDR+++LLPGQQQ L+TEVA VAKGPVIL IM
Sbjct: 482 LDDAKKIAASADATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIM 541
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
SGGG D+SFAK + KITSILWVGYPG+AGGAAIADV FG +NPSGRLPMTWYPQSY
Sbjct: 542 SGGGMDVSFAKTNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSY 597
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/620 (71%), Positives = 484/620 (78%), Gaps = 71/620 (11%)
Query: 1 MAPAQNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGI 60
MA +N PKV V LF + + SAQSSPVFACDVVSNPSLASLGFCNTS+GI
Sbjct: 3 MASVSPQNSVPKV-VFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGI 61
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
+ RV DLVKRLTLQEKI FLV+SAG+VSRLGIPKYEWWSEALHGVSYVGPGTHFS+ V G
Sbjct: 62 NDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAG 121
Query: 121 ATSFPQVILTAASFNASLFQAIGK-----VVSTEARAMYNVGLAGLTFWSPNINIFRDPR 175
ATSFPQVILTAASFN SLF+AIGK VVSTEARAMYNVGLAGLTFWSPNINIFRDPR
Sbjct: 122 ATSFPQVILTAASFNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPR 181
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY 235
WGRGQETPGEDPLL+SKY + YVKGLQQ D G P++LKVAACCKHYTAYDLDNWKG+DRY
Sbjct: 182 WGRGQETPGEDPLLSSKYGSCYVKGLQQRDDGDPDKLKVAACCKHYTAYDLDNWKGSDRY 241
Query: 236 HFNAMVIY--------------------------TYYLIKFKYIVSDCD----------- 258
HFNA+V+ +Y + K +D D
Sbjct: 242 HFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWN 301
Query: 259 -----------------SVDVLYNSQHYTKTPEEAAAKSILA------GLDLNCGSFLGK 295
S+DV Y SQ+YTKTPEEAAA +ILA G+DLNCGSFLG+
Sbjct: 302 LNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCGSFLGQ 361
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
HTEAAVK GLVNE AID A+SNNFATLMRLGFFDG PSKQ YG+LGPKDVCT NQ+LA
Sbjct: 362 HTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAR 421
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE-GTPCKYTTPLQGLA 414
+AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE GTPCKYTTPLQGLA
Sbjct: 422 EAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTPLQGLA 481
Query: 415 AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQ 474
A VAT Y GCSNV C TAQVDDAKK AA+ADATVLVMGAD SIEAES DR+D+LLPGQQ
Sbjct: 482 ASVATTYLPGCSNVACSTAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQ 541
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
QLLIT VA V+ GPVIL+IMSGGG D+SFA+ + KITSILWVGYPG+AGGAAIAD+ FG
Sbjct: 542 QLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGY 601
Query: 535 YNPS----GRLPMTWYPQSY 550
YNPS GRLPMTWYPQSY
Sbjct: 602 YNPSTHQPGRLPMTWYPQSY 621
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 821 bits (2120), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/595 (67%), Positives = 467/595 (78%), Gaps = 51/595 (8%)
Query: 7 RNRAPKVSVLFLFLTY-----CSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGID 61
NR PKV V F ++ L + S Q+S VFACDV NP+LA GFC+ SL ++
Sbjct: 5 ENRQPKVPVFLCFFSFTFVASVLLNCNRVSGQTSAVFACDVAKNPALAGYGFCDKSLSVE 64
Query: 62 LRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGA 121
RV DLVKRLTLQEKI LV+SA SVSRLGIPKYEWWSEALHGVS VGPGTHFS++VPGA
Sbjct: 65 DRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGA 124
Query: 122 TSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQE 181
TSFP ILTAASFNASLF+AIG+VVSTEARAMYNVGLAGLT+WSPNINIFRDPRWGRGQE
Sbjct: 125 TSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 184
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
TPGEDPLL+SKYATGYVKGLQQTD G N+LKVAACCKHYTAYDLDNWKG RY FNA+V
Sbjct: 185 TPGEDPLLSSKYATGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVV 244
Query: 242 -------------------------IYTYYLIKFK---------------------YIVS 255
+ +Y + K YIVS
Sbjct: 245 TQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVS 304
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAI 315
DCDSV+VL+ QHYTKTPEEAAA++ILAGLDLNCG++LG++TE AVK GL++E++I++A+
Sbjct: 305 DCDSVEVLFKDQHYTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAV 364
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
SNNFATLMRLGFFDG PSKQPYG LGPKDVCT N++LA +AARQGIVLLKN+ GSLPL+
Sbjct: 365 SNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLN 424
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV 435
IK+LAVIGPNAN T+ MIGNYEG PC Y +PLQ L A+V T Y AGC NVQC A++
Sbjct: 425 AKTIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQTLTALVPTSYAAGCPNVQCANAEL 484
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
DDA + AASADATV+++GA +IEAES DR+++LLPGQQQLL++EVA +KGPVIL+IMS
Sbjct: 485 DDATQIAASADATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMS 544
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GGG D+SFAK++ KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQ+Y
Sbjct: 545 GGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAY 599
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 820 bits (2119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/598 (68%), Positives = 467/598 (78%), Gaps = 51/598 (8%)
Query: 4 AQNRNRAPKVSVL-----FLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSL 58
A NR PKV V F+F+ L S Q+S VFACDV NP+LA GFC+ SL
Sbjct: 2 ACTENRQPKVPVFLCFFSFMFVATVLLNCDRVSGQTSSVFACDVAKNPALAGYGFCDKSL 61
Query: 59 GIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVV 118
++ RV DLVKRLTLQEKI LV+SA SVSRLGIPKYEWWSEALHGVS VGPGTHFS++V
Sbjct: 62 SLEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLV 121
Query: 119 PGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGR 178
PGATSFP ILTAASFNASLF+AIG+VVSTEARAMYNVGLAGLT+WSPNINIFRDPRWGR
Sbjct: 122 PGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGR 181
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
GQETPGEDPLL+SKYATGYVKGLQQTD G N+LKVAACCKHYTAYDLDNWKG RY FN
Sbjct: 182 GQETPGEDPLLSSKYATGYVKGLQQTDDGDSNKLKVAACCKHYTAYDLDNWKGIQRYTFN 241
Query: 239 AMV-------------------------IYTYYLIKFK---------------------Y 252
A+V + +Y + K Y
Sbjct: 242 AVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGY 301
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
IVSDCDSV+VL+ QHYTKTPEEAAA++ILAGLDLNCG++LG++TE AVK GL++E++I+
Sbjct: 302 IVSDCDSVEVLFKDQHYTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASIN 361
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+A+SNNFATLMRLGFFDG PSKQ YG LGP DVCT N++LA +AARQGIVLLKN+ GSL
Sbjct: 362 NAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSL 421
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT 432
PL+ AIK+LAVIGPNAN T+ MIGNYEG PC Y +PLQ L A+V T Y AGC NVQC
Sbjct: 422 PLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQALTALVPTSYAAGCPNVQCAN 481
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A++DDA + AASADATV+V+GA +IEAES DR+++LLPGQQQLL++EVA +KGPVIL+
Sbjct: 482 AELDDATQIAASADATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILV 541
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
IMSGGG D+SFAK++ KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 542 IMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSY 599
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 808 bits (2086), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/614 (70%), Positives = 461/614 (75%), Gaps = 96/614 (15%)
Query: 1 MAPAQNRNRAPKVSVLFLFLTYCSLQHSSSS---AQSSPVFACDVVSNPSLASLGFCNTS 57
MA ++NR PKVSV F C L SSS AQSSPVFACDVVSNPSLAS GFCNTS
Sbjct: 1 MASFCSQNRVPKVSVFLFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTS 60
Query: 58 LGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV 117
LG+ RV DLVKRLTLQEKI FLV+SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFS+V
Sbjct: 61 LGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSV 120
Query: 118 VPGATSFPQVILTAASFNASLFQAIGK----VVSTEARAMYNVGLAGLTFWSPNINIFRD 173
VPGATSFPQVILTAASFN SLF AIGK VVSTEARAMYNVGLAGLTFWSPNINIFRD
Sbjct: 121 VPGATSFPQVILTAASFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRD 180
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD 233
PRWGRGQETPGEDPLL+SKY +GYVKGLQQ D G+P+ LKVAACCKHYTAYDLDNWKG D
Sbjct: 181 PRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDGLKVAACCKHYTAYDLDNWKGVD 240
Query: 234 RYHFNAMVIY-------------------------------------------------T 244
RYHFNA+V+
Sbjct: 241 RYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGE 300
Query: 245 YYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA--GLDLNCGSFLGKHTEAAVK 302
+ L + YIV+DCDS+DV YNSQHYTKTPEEAAAK+ILA GLDLNCGSFLGKHTEAAV
Sbjct: 301 WKLNGYVYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAVT 360
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGI 362
AGLVNESAID A+SNNFATLMRLGFFDG PSKQ YG+LGPKDVCT NQ+LA +AARQGI
Sbjct: 361 AGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGI 420
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQ 422
VLLKNT GTPCKYTTPLQGLAA+VAT Y
Sbjct: 421 VLLKNT--------------------------------GTPCKYTTPLQGLAALVATTYL 448
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
GCSNV C TAQVDDAKK AA+ADATVLVMGAD SIEAES DR+D+LLPGQQQLLIT VA
Sbjct: 449 PGCSNVACSTAQVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVA 508
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS---- 538
+ GPVIL+IMSGGG D+SFAK + KITSILWVGYPG+AGGAAIAD+ FG YNPS
Sbjct: 509 NASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTHQP 568
Query: 539 --GRLPMTWYPQSY 550
GRLPMTWYPQSY
Sbjct: 569 PGGRLPMTWYPQSY 582
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/565 (69%), Positives = 447/565 (79%), Gaps = 46/565 (8%)
Query: 32 AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
AQ+SPVFACDV SNP+L +L FC+ SL ++ RV DLV RLTL EKI FLV AG VSRLG
Sbjct: 29 AQNSPVFACDVTSNPALGNLTFCDASLAVENRVNDLVNRLTLGEKIGFLVSGAGGVSRLG 88
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
IPKYEWWSEALHGV+Y GPG HF+++VPGATSFPQVILTAASFN +LFQ IGKVVSTEAR
Sbjct: 89 IPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVILTAASFNVTLFQTIGKVVSTEAR 148
Query: 152 AMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
AMYNVGLAGLT+WSPN+NIFRDPRWGRGQETPGEDP L SKY YV+GLQQTD GS N+
Sbjct: 149 AMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQQTDDGSTNK 208
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYY 246
LKVAACCKHYTAYD+DNWKG +RY FNA+V + +Y
Sbjct: 209 LKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRSCVLEGAVASVMCSYN 268
Query: 247 LIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+ K YIV+DCDS+ V++ SQ+YTKTPEEAAA + +G+
Sbjct: 269 QVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNYTKTPEEAAALGLNSGV 328
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCGS+L +T+ AV LVNES ID AISNNFATLMRLGFFDG+P + YG LGPKDV
Sbjct: 329 DLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDGNPKSRIYGNLGPKDV 388
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
CT NQ+LA +AARQGIVLLKNTAGSLPL+PTAIK+LAVIGPNANVTKTMIGNYEG PCK
Sbjct: 389 CTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNANVTKTMIGNYEGIPCK 448
Query: 406 YTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
YTTPLQGL A VATIY+ GC++V C TAQ+DDAK+ A +ADA VLVMG+DQSIE ES DR
Sbjct: 449 YTTPLQGLTASVATIYKPGCADVSCNTAQIDDAKQIATTADAVVLVMGSDQSIEKESLDR 508
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LPGQQ +L+ EVAKVAKGPVIL+IMSGGG D+ FA ++PKITSILWVG+PG+AGGA
Sbjct: 509 TSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPKITSILWVGFPGEAGGA 568
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ADV FG YNPSGRLPMTWYPQSY
Sbjct: 569 ALADVIFGYYNPSGRLPMTWYPQSY 593
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 803 bits (2074), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/592 (68%), Positives = 446/592 (75%), Gaps = 50/592 (8%)
Query: 6 NRNRAP-KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
+RNRA VS LFL C + S++ QSSPVFACDV NPSLA L FCN L I RV
Sbjct: 3 SRNRALFSVSTLFLCFIVCISEQSNN--QSSPVFACDVTGNPSLAGLRFCNAGLSIKARV 60
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLV RLTL+EKI FL A VSRLGIP Y+WWSEALHGVS VG G+ F+ VPGATSF
Sbjct: 61 TDLVGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSF 120
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQVILTAASFN SLFQAIGKVVSTEARAMYNVG AGLTFWSPN+NIFRDPRWGRGQETPG
Sbjct: 121 PQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPG 180
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV--- 241
EDP L+SKYA YVKGLQ+TDGG PNRLKVAACCKHYTAYD+DNW+ +R FNA+V
Sbjct: 181 EDPTLSSKYAVAYVKGLQETDGGDPNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQ 240
Query: 242 ----------------------IYTYYLIKFK---------------------YIVSDCD 258
+ +Y + K YIVSDCD
Sbjct: 241 DLADTFQPPFKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCD 300
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
SVDVL+ QHY KTPEEA AKS+LAGLDLNC F G+H AVKAGLVNE+AID AISNN
Sbjct: 301 SVDVLFRKQHYAKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNN 360
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
FATLMRLGFFDG P KQ YG LGPKDVCT NQ+LA D ARQGIVLLKN+AGSLPLSP+A
Sbjct: 361 FATLMRLGFFDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSA 420
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
IK LAVIGPNAN T+TMIGNY G PCKYTTPLQGLA V++ YQ GC NV C A + A
Sbjct: 421 IKTLAVIGPNANATETMIGNYHGVPCKYTTPLQGLAETVSSTYQLGC-NVACVDADIGSA 479
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
AASADA VLV+GADQSIE E HDR+DL LPG+QQ L+T VA A+GPV+L+IMSGGG
Sbjct: 480 VDLAASADAVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGG 539
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FDI+FAKND KITSI+WVGYPG+AGG AIADV FGR+NPSG LPMTWYPQSY
Sbjct: 540 FDITFAKNDKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSY 591
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/559 (69%), Positives = 443/559 (79%), Gaps = 46/559 (8%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACDV +P++A GFC+ SLG++ RV DLV RLTLQEKI LV+SA VSRLGIPKYEW
Sbjct: 30 FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEW 89
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT FSNV+PGATSFP ILTAASFN SLF+ IG+VVSTEARAMYNVG
Sbjct: 90 WSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVG 149
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
LAGLT+WSPNINIFRDPRWGRG ETPGEDP+L SKYA GYVKGLQQTDGG PN+LKVAAC
Sbjct: 150 LAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVAAC 209
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIKFK- 251
CKHYTAYD+DNWKG RY FNA+V + +Y + K
Sbjct: 210 CKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 269
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV+VLY QHYTKTPEEAAA SILAGLDLNCG
Sbjct: 270 TCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLDLNCGR 329
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FLG++TE AVK GL++E++I++A++NNFATLMRLGFFDG P KQPYG LGPKDVCTQ NQ
Sbjct: 330 FLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTQENQ 389
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LA +AARQGIVLLKN+ SLPL+ AIK+LAVIGPNAN T+ MIGNYEG PCKY +PLQ
Sbjct: 390 ELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISPLQ 449
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
GL A T Y AGC +V+C +DDAKK AASADATV+V+GA +IEAES DR+++LLP
Sbjct: 450 GLTAFAPTSYAAGCLDVRCPNPVLDDAKKIAASADATVIVVGASLAIEAESLDRVNILLP 509
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
GQQQLL++EVA +KGPVIL+IMSGGG D+SFAKN+ KITSILWVGYPG+AGGAAIADV
Sbjct: 510 GQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAAIADVI 569
Query: 532 FGRYNPSGRLPMTWYPQSY 550
FG +NPSGRLPMTWYPQSY
Sbjct: 570 FGFHNPSGRLPMTWYPQSY 588
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 790 bits (2039), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/597 (67%), Positives = 463/597 (77%), Gaps = 50/597 (8%)
Query: 4 AQNRNRAPKVSVLFL----FLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLG 59
A +NR PKVS +FL F L + Q+S VFACDV N +++S GFC+ SL
Sbjct: 2 ANTKNREPKVSSVFLCFSIFYVAVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDKSLS 61
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
++ RV DLVKRLTLQEKI L +SA VSRLGIPKYEWWSEALHGVS +GPGTHFS++VP
Sbjct: 62 VEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVP 121
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
GATSFP ILTAASFN SLFQAIG VVS EARAMYNVGLAGLT+WSPNINIFRDPRWGRG
Sbjct: 122 GATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRG 181
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
QETPGEDPLL+SKYA GYVKGLQQTD G ++LKVAACCKHYTAYD+DNWKG RY F+A
Sbjct: 182 QETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAACCKHYTAYDVDNWKGVQRYTFDA 241
Query: 240 MV-------------------------IYTYYLIKFK---------------------YI 253
+V + +Y + K YI
Sbjct: 242 VVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYI 301
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
VSDCDSV+VL+ QHYTKTPEEAAAK+IL+GLDL+CGS+LG++T AVK GLV+E++I++
Sbjct: 302 VSDCDSVEVLFKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASINN 361
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+SNNFATLMRLGFFDG PSKQPYG LGPKDVCT NQ+LA +AARQGIVLLKN+ GSLP
Sbjct: 362 AVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPGSLP 421
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
LS AIK+LAVIGPNAN T+ MIGNYEG PCKYT+PLQGL A V T Y GC +VQC A
Sbjct: 422 LSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDVQCANA 481
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
Q+DDA K AASADAT++V+GA+ +IEAES DR+++LLPGQQQ L+ EVA V+KGPVIL+I
Sbjct: 482 QIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVI 541
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
MSGGG D+SFAK + KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 542 MSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSY 598
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 786 bits (2031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/597 (67%), Positives = 461/597 (77%), Gaps = 50/597 (8%)
Query: 4 AQNRNRAPKVSVLFL----FLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLG 59
A +NR PKVS +FL F L + Q+S VFACDV N +++S GFC+ SL
Sbjct: 1 ANTKNREPKVSSVFLCFSIFYVTVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDNSLS 60
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
++ RV DLVKRLTLQEKI L +SA VSRLGIPKYEWWSEALHGVS +GPGTHFS++VP
Sbjct: 61 VEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVP 120
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
GAT+FP ILTAASFN SLFQAIG VVS EARAMYNVGLAGLT+WSPNINIFRDPRWGRG
Sbjct: 121 GATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRG 180
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
QETPGEDPLL+SKYA GYVKGLQQTD G ++LKVAACCKHYTAYD+DNWKG RY F+A
Sbjct: 181 QETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKLKVAACCKHYTAYDVDNWKGVQRYTFDA 240
Query: 240 MV-------------------------IYTYYLIKFK---------------------YI 253
+V + +Y + K YI
Sbjct: 241 VVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYI 300
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
VSDCDSV+VLY QHYTKTPEEAAAK+IL+GLDL+CGS+LG++T AVK GLV+E++I +
Sbjct: 301 VSDCDSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASITN 360
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+SNNFATLMRLGFFDG PSKQPYG LGPKDVCT NQ+LA +AARQGIVLLKN+ SLP
Sbjct: 361 AVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPRSLP 420
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
LS AIK+LAVIGPNAN T+ MIGNYEG PCKYT+PLQGL A V T Y GC +VQC A
Sbjct: 421 LSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDVQCANA 480
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
Q+DDA K AASADAT++V+GA+ +IEAES DR+++LLPGQQQ L+ EVA V+KGPVIL+I
Sbjct: 481 QIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVI 540
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
MSGGG D+SFAK + KITSILWVGYPG+AGGAAIADV FG YNPSGRLPMTWYPQSY
Sbjct: 541 MSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSY 597
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 783 bits (2023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/592 (69%), Positives = 446/592 (75%), Gaps = 51/592 (8%)
Query: 7 RNRAP-KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVG 65
RNRA VS L L C + S+ AQSSPVFACDV NPSLA L FCNT L I RV
Sbjct: 9 RNRALFSVSTLLLCFLLCISEQSN--AQSSPVFACDVTGNPSLAGLRFCNTGLNIKSRVT 66
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
DLV RLTL+EKI FL +A VSRLGIP Y+WWSEALHGVS VG G+ FS VPGATSFP
Sbjct: 67 DLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGATSFP 126
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGE 185
QVILTAASFN SLFQAIGKVVSTEARAMYNVG AGLTFWSPN+NIFRDPRWGRGQETPGE
Sbjct: 127 QVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGE 186
Query: 186 DPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV---- 241
DP L+SKYA YV+GLQ+TDGG PNRLKVAACCKHYTAYD+DNWK R+ FNA+V
Sbjct: 187 DPELSSKYAVAYVRGLQETDGGDPNRLKVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQD 246
Query: 242 ---------------------IYTYYLIKFK---------------------YIVSDCDS 259
+ +Y + K YIVSDCDS
Sbjct: 247 MADTFQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDS 306
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
VDVLY QHYTKTPEEA AKSILAGLDLNC F G++ AVK GLVNE+AID AISNNF
Sbjct: 307 VDVLYTKQHYTKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNETAIDKAISNNF 366
Query: 320 ATLMRLGFFDGHPSKQP-YGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
ATLMRLGFFDG P KQ YG LGP DVCT NQ+LA DAARQGIVLLKN+AGSLPLSP+A
Sbjct: 367 ATLMRLGFFDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKNSAGSLPLSPSA 426
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
IK LAVIGPNAN T+TMIGNY G PCKYTTPLQGLA V++ YQ GC NV C + A
Sbjct: 427 IKTLAVIGPNANATETMIGNYNGIPCKYTTPLQGLAETVSSTYQLGC-NVACAEPDLGSA 485
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
AASADA VLVMGADQSIE E+ DRLDL LPG+QQ L+T+VAKVAKGPV+L+IMSGG
Sbjct: 486 AALAASADAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGA 545
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FDI+FAKN+ KIT I+WVGYPG+AGG AIADV FGR+NPSG LPMTWYPQSY
Sbjct: 546 FDITFAKNEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSY 597
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/546 (68%), Positives = 426/546 (78%), Gaps = 47/546 (8%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACDV +P++A GFC+ SLG++ RV DLV RLTLQEKI LV+SAG VSRLGIP+YEW
Sbjct: 30 FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRYEW 89
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VG GT FSNVVPGATSFP ILTAASFN SLF+ IG+VVSTEA AMYNVG
Sbjct: 90 WSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYNVG 149
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
LAGLT+WSPNINIFRDPRWGRG ETPGEDP+L SKYA GYVKGLQQTDGG PN+LKVAAC
Sbjct: 150 LAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKLKVAAC 209
Query: 218 CKHYTAYDLDNWKGTDRYHFNAM-------------------------VIYTYYLIKFK- 251
CKHYTAYD+D WKG RY FNA+ V+ +Y + K
Sbjct: 210 CKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKVNGKP 269
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+VSDCDSV+VLY QHYTKTPEEAAA SILAGLDLNCG
Sbjct: 270 TCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLDLNCGR 329
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FLG++TE AVK GL++ES I++A+SNNFATLMRLGFFDG P KQPYG LGPKDVCT ANQ
Sbjct: 330 FLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTPANQ 388
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LA +AARQGIV LKN+ SLPL+ AIK+LAVIGPNAN T+ MIGNYEG PCKY +PLQ
Sbjct: 389 ELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYISPLQ 448
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
GL A V T Y AGC +V+C +DDAKK +AS DATV+V+GA +IEAES DR+++LLP
Sbjct: 449 GLTAFVPTSYAAGCLDVRCPNPVLDDAKKISASGDATVIVVGASLAIEAESLDRVNILLP 508
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
GQQQLL+TEVA +KGPVIL+IMSGGG D+SFAK++ KITSILWVGYPG+AGGAAIADV
Sbjct: 509 GQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGAAIADVI 568
Query: 532 FGRYNP 537
FG +NP
Sbjct: 569 FGFHNP 574
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/564 (66%), Positives = 430/564 (76%), Gaps = 51/564 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+PVFACD SN +LAS GFCN S R DLV RLTL EK+ FLVD ++ RLGIP
Sbjct: 31 TPVFACDA-SNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPL 89
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGT FS++VP ATSFPQ ILTAASFNA+LF+AIG+VVS EARAM+
Sbjct: 90 YEWWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMH 149
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL SKYA GYV GLQ GS LKV
Sbjct: 150 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGSGS-GSLKV 208
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIK 249
AACCKHYTAYD+DNWKG +RY FNA+V + +Y +
Sbjct: 209 AACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVN 268
Query: 250 FK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
K YI SDCDSVDVLYN+QHYTKTPE+AAA SI AGLDLN
Sbjct: 269 GKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLN 328
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+FL +HT AAV+AG ++ES +D AI+NNF TLMRLGFFDG P K P+G LGP DVCT
Sbjct: 329 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCTS 388
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+NQ+LA +AARQGIVLLKN+ G+LPLS ++IK+LAVIGPNAN + TMIGNYEGTPCKYTT
Sbjct: 389 SNQELAREAARQGIVLLKNS-GALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYTT 447
Query: 409 PLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PLQGL A VAT+YQ GC+NV C + Q+D A KAAASAD TVLV+GADQSIE ES DR
Sbjct: 448 PLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRT 507
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
LLLPGQQ L++ VA ++GP IL+IMSGG FDISFAK+ KI +ILWVGYPG+AGGAA
Sbjct: 508 SLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAA 567
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
IADV FG +NPSGRLP+TWYP+S+
Sbjct: 568 IADVLFGHHNPSGRLPVTWYPESF 591
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/564 (64%), Positives = 431/564 (76%), Gaps = 51/564 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+PVFACD SN +++ GFC+ + R DL+ RLTL EK+ FLV+ ++ RLGIP
Sbjct: 28 TPVFACDA-SNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPA 86
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VVSTEARAM+
Sbjct: 87 YEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMH 146
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GYV GLQ GGS + LKV
Sbjct: 147 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGS-DALKV 205
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIK 249
AACCKHYTAYD+DNWKG +RY F+A+V + +Y +
Sbjct: 206 AACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVN 265
Query: 250 FK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
K YIVSDCDSVDVLYN+QHYTK PE+AAA +I +GLDLN
Sbjct: 266 GKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLN 325
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+FL +HT AAV+AG ++ES +D AI+NNF LMRLGFFDG P K P+G LGPKDVCT
Sbjct: 326 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTS 385
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+NQ+LA +AARQGIVLLKNT G+LPLS +IK++AVIGPNAN + TMIGNYEGTPCKYTT
Sbjct: 386 SNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 444
Query: 409 PLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PLQGL A VAT+YQ GC+NV C + Q+ A +AAASAD TVLV+GADQS+E ES DR
Sbjct: 445 PLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRT 504
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
LLLPGQQ L++ VA ++GPVIL++MSGG FDISFAK+ KI++ILWVGYPG+AGGAA
Sbjct: 505 SLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGAA 564
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+AD+ FG +NP GRLP+TWYP S+
Sbjct: 565 LADILFGYHNPGGRLPVTWYPASF 588
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/564 (65%), Positives = 426/564 (75%), Gaps = 51/564 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P FACD SN +LAS GFCN S R DLV RLTL EK+ FLVD ++ RLG+P
Sbjct: 37 TPAFACDA-SNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPL 95
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNA+LF+AIG+VVS EARAM+
Sbjct: 96 YEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMH 155
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL SKYA GYV GLQ G+ LKV
Sbjct: 156 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA-GALKV 214
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIK 249
AACCKHYTAYD+DNWKG +RY F+A+V + +Y +
Sbjct: 215 AACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVN 274
Query: 250 FK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
K YI SDCDSVDVLYN+QHYTKTPE+AAA SI AGLDLN
Sbjct: 275 GKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLN 334
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+FL +HT AAV+AG ++ES +D A++NN TLMRLGFFDG P + P+G LGP DVCT
Sbjct: 335 CGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTP 394
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+NQ+LA +AARQGIVLLKNT G LPLS T+IK++AVIGPNAN + TMIGNYEGTPCKYTT
Sbjct: 395 SNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453
Query: 409 PLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PLQGL A VAT+YQ GC+NV C + Q+D A KAAASAD TVLV+GADQSIE ES DR
Sbjct: 454 PLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDRT 513
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
LLLPGQQ L++ VA + GP IL++MSGG FDISFAK+ KI +ILWVGYPG+AGGAA
Sbjct: 514 SLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAA 573
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
IADV FG +NPSGRLP+TWYP+S+
Sbjct: 574 IADVLFGYHNPSGRLPVTWYPESF 597
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/574 (63%), Positives = 424/574 (73%), Gaps = 52/574 (9%)
Query: 27 HSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS 86
++ ++ +PVFACD SN +LA+ GFCN R DLV RLTL EK+ FLV+ +
Sbjct: 26 NAGTAEAQAPVFACDA-SNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPA 84
Query: 87 VSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVV 146
+ RLGIP YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VV
Sbjct: 85 LGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVV 144
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GYV GLQ
Sbjct: 145 STEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGA 204
Query: 207 G--SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----------------------- 241
G + LKVAACCKHYTAYD+DNWKG +RY F+A V
Sbjct: 205 GGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVA 264
Query: 242 --IYTYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
+ +Y + K YIVSDCDSVDVLY QHYTKTPEEAAA
Sbjct: 265 SVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAA 324
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
+I +GLDLNCG+FL +HT AAV+AG ++E +D AI+NNF LMRLGFFDG P + +G
Sbjct: 325 ITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFG 384
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
LGPKDVCT +N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIGPNAN + TMIGN
Sbjct: 385 SLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGN 443
Query: 399 YEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQ 456
YEGTPCKYTTPLQGL A V T+YQ GC+NV C + Q+ A AAASAD TVLV+GADQ
Sbjct: 444 YEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQ 503
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
SIE ES DR LLLPGQQ L++ VA + GPVIL++MSGG FDISFAK KI +ILWV
Sbjct: 504 SIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWV 563
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPG+AGGAA+AD+ FG +NPSGRLP+TWYP SY
Sbjct: 564 GYPGEAGGAALADILFGSHNPSGRLPVTWYPASY 597
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/564 (64%), Positives = 419/564 (74%), Gaps = 50/564 (8%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+PVFACD +N ++A FC+ + R DLV RLTL +K+ FLV+ +++RLGIP
Sbjct: 35 TPVFACDA-ANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPA 93
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VVS EARAM+
Sbjct: 94 YEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMH 153
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS+YA GYV GLQ + LKV
Sbjct: 154 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADADGPLKV 213
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIK 249
AACCKHYTAYD+DNWKG +RY F+A V + +Y +
Sbjct: 214 AACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNKVN 273
Query: 250 FK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
K YIVSDCDSVDVLY+ QHYTKTPEEAAA +I +GLDLN
Sbjct: 274 GKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLDLN 333
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG FL KHT AAV+AG ++ES +D AI+NNF LMRLGFFDG P K YG LGPKDVCT
Sbjct: 334 CGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVCTS 393
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+NQ+LA + ARQGIVLLKN G+LPLS +IK++AVIGPNAN + TMIGNYEGTPCKYTT
Sbjct: 394 SNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 452
Query: 409 PLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PL GL VAT+YQ GCSNV C + Q+ A AAASAD TVLV+GADQSIE E+ DR
Sbjct: 453 PLHGLGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALDRT 512
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
LLLPGQQ LI+ VA +KG VIL++MSGG FDISFAK KI++ILWVGYPG+AGGAA
Sbjct: 513 SLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGGAA 572
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
IAD+ FG+YNPSGRLP+TWYP S+
Sbjct: 573 IADIIFGKYNPSGRLPVTWYPASF 596
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/574 (63%), Positives = 424/574 (73%), Gaps = 52/574 (9%)
Query: 27 HSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS 86
++ ++ +PVFACD SN +LA+ GFCN R DLV RLTL EK+ FLV+ +
Sbjct: 26 NAGTAEAQAPVFACDA-SNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPA 84
Query: 87 VSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVV 146
+ RLGIP YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VV
Sbjct: 85 LGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVV 144
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GYV GLQ
Sbjct: 145 STEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGA 204
Query: 207 G--SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----------------------- 241
G + LKVAACCKHYTAYD+DNWKG +RY F+A V
Sbjct: 205 GGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVA 264
Query: 242 --IYTYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
+ +Y + K YIVSDCDSVDVLY QHYTKTPEEAAA
Sbjct: 265 SVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAA 324
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
+I +GLDLNCG+FL +HT AAV+AG ++E +D AI+NNF LMRLGFFDG P + +G
Sbjct: 325 ITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFG 384
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
LGPKDVCT +N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIGPNAN + TMIGN
Sbjct: 385 SLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGN 443
Query: 399 YEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQ 456
YEGTPCKYTTPLQGL A V T+YQ GC+NV C + Q+ A AAASAD TVLV+GADQ
Sbjct: 444 YEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQ 503
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
SIE ES DR LLLPGQQ L++ VA + GPVIL++MSGG FDISFAK KI +ILWV
Sbjct: 504 SIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWV 563
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPG+AGGAA+AD+ FG +NPSG+LP+TWYP SY
Sbjct: 564 GYPGEAGGAALADILFGSHNPSGKLPVTWYPASY 597
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/565 (64%), Positives = 416/565 (73%), Gaps = 52/565 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
PVFACD SN +LA+ GFCN R DLV RLTL EK+ FLV+ ++ RLGIP Y
Sbjct: 37 PVFACDA-SNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAY 95
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
EWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VVSTEARAM+N
Sbjct: 96 EWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHN 155
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG--SPNRLK 213
VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GYV GLQ G + LK
Sbjct: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALK 215
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLI 248
VAACCKHYTAYD+DNWKG +RY F+A V + +Y +
Sbjct: 216 VAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKV 275
Query: 249 KFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
K YIVSDCDSVDVLY QHYTKTPEEAAA +I +G+DL
Sbjct: 276 NGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGVDL 335
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
NCG+FL +HT AAV+AG ++E +D AI+NNF LMRLGFFDG P + +G LGPKDVCT
Sbjct: 336 NCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCT 395
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIGPNAN + TMIGNYEGTPCKYT
Sbjct: 396 SSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYT 454
Query: 408 TPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
TPLQGL A V T+YQ GC+NV C + Q+ A AAASAD TVLV+GADQSIE ES DR
Sbjct: 455 TPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 514
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
LLLPGQQ L++ VA + GPVIL++MSGG FDISFAK KI + LWVGYPG+AGGA
Sbjct: 515 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAGGA 574
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ D FG +NPSGRLP+TWYP SY
Sbjct: 575 ALDDTLFGSHNPSGRLPVTWYPASY 599
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/580 (59%), Positives = 408/580 (70%), Gaps = 53/580 (9%)
Query: 18 LFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKI 77
L + L SS A+ S FACD + + +L FCN L I RV DL+ RLT+QEK+
Sbjct: 10 LITLFILLLQSSCDARDS--FACDA-KDAATKNLPFCNVKLAIPERVKDLIGRLTMQEKV 66
Query: 78 TFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNAS 137
LV++A +V R+G+ YEWWSEALHGVS VGPGT F V P ATSFPQVI TAASFNAS
Sbjct: 67 NLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFPAATSFPQVITTAASFNAS 126
Query: 138 LFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGY 197
L++AIG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA +YA Y
Sbjct: 127 LWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASY 186
Query: 198 VKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-- 251
VKGLQ TDG N+LKVAACCKH+TAYD+DNW G DR+HFNA+V I + + F+
Sbjct: 187 VKGLQGTDG---NKLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMC 243
Query: 252 ----------------------------------------YIVSDCDSVDVLYNSQHYTK 271
YIVSDCDSV VLYNSQHYT
Sbjct: 244 VKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHYTS 303
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
TPEEAAA +I AGLDL+CG FLG HT+ AVK GL+ E+ +++A+ N MRLG FDG
Sbjct: 304 TPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGE 363
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
PS Q YG+LGPKDVC A+Q+LAL+AARQGIVLLKNT +LPLSP + +AVIGPN++V
Sbjct: 364 PSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDV 423
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVL 450
T TMIGNY G C YT+PLQG+ TI+Q GCSNV C Q A AA ADAT+L
Sbjct: 424 TVTMIGNYAGIACGYTSPLQGIGRYAKTIHQQGCSNVACRDDKQFGPALDAARHADATIL 483
Query: 451 VMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKI 510
V+G DQSIEAE+ DR LLLPG QQ L+++VA +KGP IL++MSGG DI+FAKNDPK+
Sbjct: 484 VIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKV 543
Query: 511 TSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ILW GYPGQAGGAAIAD+ FG +P G+LP+TWYPQ Y
Sbjct: 544 AGILWAGYPGQAGGAAIADILFGTASPGGKLPVTWYPQEY 583
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/560 (60%), Positives = 395/560 (70%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD + + +L FC L I RV DL+ RLTL EK+ LV++AG+VSRLGI YEW
Sbjct: 28 FACDS-KDGTTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEW 86
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGATSFPQVI TAASFN++L++AIG+VVS EARAMYN G
Sbjct: 87 WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGG 146
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NI RDPRWGRGQETPGEDPLL KYA YVKGLQ DG RLKVAAC
Sbjct: 147 AAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDG---ERLKVAAC 203
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+TAYDLDNW G DR+HFNA V + + + F+
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 263
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V Y+ QHYT TPEEAAA +I AGLDL+CG
Sbjct: 264 TCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDLDCGP 323
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HT+ AVK GL++E+ ++ A+ N MRLG FDG PS QPYG LGPKDVCT A+Q
Sbjct: 324 FLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPAHQ 383
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LAL+A RQGIVLLKN SLPLSP + +A+IGPN+NVT TMIGNY G C+YTTPLQ
Sbjct: 384 ELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTPLQ 443
Query: 412 GLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ + TI+Q GC++V C T Q+ A AA ADATVLVMG DQSIEAE DR LLL
Sbjct: 444 GIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGLLL 503
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+++VA +KGP IL++MSGG D+SFAK DPKI +ILW GYPGQAGGAAIADV
Sbjct: 504 PGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIADV 563
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQ Y
Sbjct: 564 LFGTINPGGKLPMTWYPQEY 583
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/560 (60%), Positives = 393/560 (70%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N + +L FC SL RV DL+ RLTLQEK+ LV++A +V RLGI YEW
Sbjct: 28 FACDP-KNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 86
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F P ATSFPQVI TAASFNASL++AIG+V S EARAMYN G
Sbjct: 87 WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 146
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA KYA YV+GLQ TDG NRLKVAA
Sbjct: 147 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDG---NRLKVAAS 203
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+TAYDLDNW G DR+HFNA V I + + F+
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 263
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V YNSQHYT TPEEAAA +I AGLDL+CG
Sbjct: 264 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLDCGP 323
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FLG+HT+ AVK GL++E+ ++ A+ N MRLG +DG PS PY LGP+DVCTQ++Q
Sbjct: 324 FLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQSHQ 383
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LAL+AARQGIVLLKN SLPLS + +AVIGPN+NVT TMIGNY G C YT+PLQ
Sbjct: 384 ELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTSPLQ 443
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ TIY+ GC+NV C Q A AA ADATVLVMG DQSIEAE+ DR LLL
Sbjct: 444 GIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRASLLL 503
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+++VA +KGP IL+IMSGG DI+FAKNDP+I ILW GYPGQAGGAAIAD+
Sbjct: 504 PGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAIADI 563
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQ Y
Sbjct: 564 LFGTSNPGGKLPMTWYPQGY 583
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/569 (60%), Positives = 399/569 (70%), Gaps = 51/569 (8%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
S +S FACD N SL FC TS+ I +RV DL+ RLTLQEKI LV++A +V
Sbjct: 19 SRFVESRVPFACDP-RNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVP 77
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI TAA+FN SL++ IG+VVS
Sbjct: 78 RLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSD 137
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
EARAMYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA KYA YV+ LQ G +
Sbjct: 138 EARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQ---GNT 194
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
+RLKVAACCKHYTAYDLDNW G DRYHFNA V + Y + FK
Sbjct: 195 GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 254
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCDSV VLYN+QHYT+TPEEAAA +I
Sbjct: 255 SYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 314
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FL HTE AV+ GL+ E ++ A + MRLG FDG PS QP+G LGP
Sbjct: 315 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGP 374
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+DVCT A+Q LAL AA QGIVLLKN+A +LPLS +AVIGPN++VT TMIGNY G
Sbjct: 375 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 434
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAE 461
C YTTPLQG++ TI+QAGC V C G + A+ AA ADATVLVMG DQSIEAE
Sbjct: 435 ACGYTTPLQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 494
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
DR LLLPG+QQ L++ VAK ++GPV+L++M GG D+SFAKNDP+I +ILWVGYPGQ
Sbjct: 495 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 554
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AGGAAIADV FGR NP G+LPMTWYPQ Y
Sbjct: 555 AGGAAIADVLFGRANPGGKLPMTWYPQDY 583
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 671 bits (1731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/570 (60%), Positives = 403/570 (70%), Gaps = 51/570 (8%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSV 87
S + + P FACD NP +L FC + I +RV DL+ RLTLQEKI LV++A +V
Sbjct: 22 SIAVVHARPPFACDP-RNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAV 80
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
RLGI YEWWSEALHGVS VGPGT F + GATSFPQVI TAASFN SL++ IG+VVS
Sbjct: 81 PRLGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVS 139
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP+LA+KY YVKGLQ G
Sbjct: 140 DEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQ--GDG 197
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
+ NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y + F+
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVM 257
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YIVSDCDSV V Y++QHYTKTPEEAAA +I
Sbjct: 258 CSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAI 317
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
AGLDL+CG FLG HTEAAV+ G VNE I++A++N MRLG FDG PS Q YG LG
Sbjct: 318 KAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLG 377
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
DVC ++ +LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT+TMIGNY G
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEA 460
C YTTPLQG+A TI+QAGC++V C G + A+ AA ADATVLV+G DQSIEA
Sbjct: 438 IACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEA 497
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR DLLLPG QQ L++ VA+ ++GP IL+IMSGG D++FAKNDP+I +I+WVGYPG
Sbjct: 498 EFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPG 557
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
QAGG AIADV FG NPSG+LPMTWYPQ+Y
Sbjct: 558 QAGGTAIADVLFGTTNPSGKLPMTWYPQNY 587
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/584 (57%), Positives = 411/584 (70%), Gaps = 55/584 (9%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
S L +FL ++ S ++ FACD + + A FC S+GI RV DL+ RLTL
Sbjct: 9 SSLLIFLVVLAV----VSGEARDPFACDP-KDGANAGFPFCRKSIGIGERVKDLIGRLTL 63
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
+EK+ LV++A V RLGI YEWWSEALHGVS VGPGT FS PGATSFPQVI TAAS
Sbjct: 64 EEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAAS 123
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
FN+SL++AIG+VVS EARAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP+LA KY
Sbjct: 124 FNSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKY 183
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
A YV+GLQ G + +RLKVAACCKH+TAYDLDNW G DR+HF+A V + + +
Sbjct: 184 AARYVRGLQ---GNAGDRLKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVP 240
Query: 250 FK------------------------------------------YIVSDCDSVDVLYNSQ 267
F+ Y+VSDCDSV V Y++Q
Sbjct: 241 FRSCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQ 300
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
HYT TPEEAAA +I AGLDL+CG FL HT+ A+K GLV+E+ +D A+ N MRLG
Sbjct: 301 HYTNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGM 360
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGP 387
FDG PS QP+G LGPKDVC+ A+Q+LA++AARQGIVLLKN SLPLS + +++AVIGP
Sbjct: 361 FDGEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGP 420
Query: 388 NANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASAD 446
N++ TMIGNY G PC+YTTPLQG+ TI+Q GC++V C Q+ A AA+ AD
Sbjct: 421 NSDANVTMIGNYAGIPCEYTTPLQGIGRYSRTIHQKGCADVACSEDQLFAGAIDAASQAD 480
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
ATVLVMG DQSIEAE+ DR DLLLPG+QQ L+++VA ++GP +L++MSGG D+SFAK
Sbjct: 481 ATVLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKK 540
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
DP+I +I+W GYPGQAGGAAIAD+ FG NP G+LPMTWYPQ Y
Sbjct: 541 DPRIAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEY 584
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/560 (60%), Positives = 394/560 (70%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N + +L FC L RV DL+ RLTLQEK+ LV++A +V RLGI YEW
Sbjct: 27 FACDP-KNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 85
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F P ATSFPQVI TAASFNASL++AIG+V S EARAMYN G
Sbjct: 86 WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 145
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA KYA YV+GLQ+TDG NRLKVAA
Sbjct: 146 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDG---NRLKVAAS 202
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+TAYDLDNW G DR+HFNA V I + + F+
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 262
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V YNSQHYT TPEEAAA +I AGLDL+CG
Sbjct: 263 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLDCGP 322
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FLG+HT+ AVK GL++E+ ++ A+ N MRLG +DG PS PYG+LGP+DVCT ++Q
Sbjct: 323 FLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTPSHQ 382
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LAL+AARQGIVLLKN SLPLS +AVIGPN+NVT TMIGNY G C YT+PL+
Sbjct: 383 ELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTSPLE 442
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ TI++ GC+NV C Q A A ADATVLVMG DQSIEAE+ DR LLL
Sbjct: 443 GIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAGLLL 502
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+++VA +KGP IL+IMSGG DI+FAKN+P+I +ILW GYPGQAGGAAIAD+
Sbjct: 503 PGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAIADI 562
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQ Y
Sbjct: 563 LFGTSNPGGKLPMTWYPQGY 582
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/573 (59%), Positives = 401/573 (69%), Gaps = 58/573 (10%)
Query: 28 SSSSAQSSPVFACDVVSNPSLA---SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA 84
S+ +S FACD P L SL FC +L I +RV DL+ RLTLQEKI LV++A
Sbjct: 18 SNHIVESREPFACD----PKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNA 73
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
+V RLGI YEWWSEALHGVS VGPGT F PGAT+FPQVI TAASFN SL++ IG+
Sbjct: 74 AAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGR 133
Query: 145 VVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT 204
VVS EARAMYN G+AGLT+WSPN+N+FRDPRWGRGQETPGEDP++A KYA YV+GLQ
Sbjct: 134 VVSDEARAMYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGN 193
Query: 205 DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------- 251
+G RLKVAACCKHYTAYDLDNW G DRYHFNA V + Y + FK
Sbjct: 194 NG---LRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVA 250
Query: 252 ---------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
YIVSDCDSV VL+++QHYT TPEEAAA
Sbjct: 251 SVMCSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAA 310
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
+I AGLDL+CG FL HTE AVK GL+ E ++ A++N MRLG FDG PS QP+G
Sbjct: 311 STIRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFG 370
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
LGP+DVCT A+Q LAL AARQGIVLL+N +LPLS T ++ +AVIGPN++VT TMIGN
Sbjct: 371 NLGPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPLSRT-LQTVAVIGPNSDVTVTMIGN 429
Query: 399 YEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQS 457
Y G C YTTPLQG+ T++ GC++V C G Q + A+ AA ADAT+LVMG DQS
Sbjct: 430 YAGVACGYTTPLQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQS 489
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
IEAE DR LLLPG QQ L++ VA+ ++GP IL++MSGG D+SFAKNDP+I +ILWVG
Sbjct: 490 IEAEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVG 549
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
YPGQAGGAAIADV FG NP G+LPMTWYP +Y
Sbjct: 550 YPGQAGGAAIADVLFGTANPGGKLPMTWYPHNY 582
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/588 (57%), Positives = 406/588 (69%), Gaps = 54/588 (9%)
Query: 10 APKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVK 69
+P ++++ +FL ++H+ ++ FACD N + ++ FC SL I RV DLV
Sbjct: 6 SPLLNLIAVFLLLFLVRHT---CEARDPFACDP-KNGATENMPFCKASLAIPERVKDLVG 61
Query: 70 RLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVIL 129
RLTLQEK+ LV++A +V RLG+ YEWWSEALHGVS VGPG F+ PGATSFPQVI
Sbjct: 62 RLTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVIT 121
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL 189
TAASFNASL++AIG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L
Sbjct: 122 TAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL 181
Query: 190 ASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTY 245
A YA YV+GLQ TDG NRLKVAACCKH+TAYDLDNW G DR+HFNA V I
Sbjct: 182 AGTYAASYVRGLQGTDG---NRLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEET 238
Query: 246 YLIKFK------------------------------------------YIVSDCDSVDVL 263
+ + F+ YIVSDCDSV V
Sbjct: 239 FDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVF 298
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
Y++QHYT TPEEAAA +I AGLDL+CG FL HT+ AV+ GL++E+ ++ A+ N M
Sbjct: 299 YDNQHYTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLLSEADVNGALVNTLTVQM 358
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
RLG FDG PS YG+LGPKDVC A+Q+LAL+AARQGIVLLKNT LPLSP +A
Sbjct: 359 RLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSPQRHHTVA 418
Query: 384 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAA 442
VIGPN+ T TMIGNY G C YT PLQG+ TI+Q GC NV C ++ A AA
Sbjct: 419 VIGPNSKATVTMIGNYAGVACGYTNPLQGIGRYAKTIHQLGCENVACKNDKLFGSAINAA 478
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
ADATVLVMG DQSIEAE+ DR LLLPG+QQ L+++VA +KGP IL+IMSGG DI+
Sbjct: 479 RQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAASKGPTILVIMSGGSVDIT 538
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FAKN+P+I ILW GYPGQAGGAAIAD+ FG NP G+LP+TWYPQ Y
Sbjct: 539 FAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTWYPQEY 586
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/565 (60%), Positives = 400/565 (70%), Gaps = 50/565 (8%)
Query: 33 QSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
+ P FACD NP L FC ++ I +RV DL+ RLTLQEKI LV++A +V RLGI
Sbjct: 27 HARPPFACDP-HNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA 152
YEWWSEALHGVS VGPGT F PGATSFPQVI TAASFN SL+Q IG+VV EARA
Sbjct: 86 QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARA 145
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LASKYA YVKGLQ G+ NRL
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQ--GDGAGNRL 203
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
KVAACCKHYTAYDLDNW G +R+HFNA V + Y + FK
Sbjct: 204 KVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQ 263
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
YIVSDCDSV VLY QHYT+TPEEAAA +I AGLD
Sbjct: 264 VNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLD 323
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
L+CG FL HTEAAV+ GLV++ I+ A++N MRLG FDG PS YG LGP+DVC
Sbjct: 324 LDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVC 383
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T A+Q LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT TMIGNY G C Y
Sbjct: 384 TPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGY 443
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDR 465
TTPLQG+ TI+QAGC++V C Q+ A+ AA ADATVLVMG DQSIEAE DR
Sbjct: 444 TTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDR 503
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LLLPG QQ L++ VA+ ++GP IL++MSGG D++FAKNDP+I++I+WVGYPGQAGG
Sbjct: 504 VGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGT 563
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG NP G+LPMTWYPQ+Y
Sbjct: 564 AIADVLFGTTNPGGKLPMTWYPQNY 588
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/585 (57%), Positives = 399/585 (68%), Gaps = 59/585 (10%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
++++FL L S+++ FACD N S + FC SL I RV DL+ RLT
Sbjct: 6 ITIVFLLLLM--------SSEARDPFACDP-KNTSTNNFPFCKASLPIPTRVNDLIGRLT 56
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
LQEK++ LV++A +V R+GI YEWWSEALHGVS VGPGT F+ P ATSFPQVI T A
Sbjct: 57 LQEKVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVA 116
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFNASL++AIG+V S EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA K
Sbjct: 117 SFNASLWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGK 176
Query: 193 YATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YA YV+GLQ TD +RLKVAA CKH+TAYDLDNW G DR+HFNA V + + +
Sbjct: 177 YAASYVRGLQGTDS---SRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNV 233
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDSV V Y +
Sbjct: 234 PFRMCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTN 293
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QHYT TPEEAAA +I AGLDL+CG FL +HT+ AVK GL+ E+ ++ A++N MRLG
Sbjct: 294 QHYTSTPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLG 353
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FDG PS QPYG LGP DVCT +Q+LALDAARQGIVLLKNT SLPLS + +AVIG
Sbjct: 354 MFDGEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIG 413
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCG-TAQVDDAKKAAASA 445
PN+N T TMIGNY G C YT+PLQG+ TI++ GC+NV C Q A AA A
Sbjct: 414 PNSNATVTMIGNYAGIACGYTSPLQGIGKYARTIHEPGCANVACNDDKQFGSALNAARQA 473
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
DATVLVMG DQSIEAE DR LLLPG QQ L+++VA ++GP IL++MSGG DI+FAK
Sbjct: 474 DATVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAK 533
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
NDP+I ILW GYPGQAGGAAIAD+ FG NP +LPMTWYPQ Y
Sbjct: 534 NDPRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGY 578
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/565 (60%), Positives = 399/565 (70%), Gaps = 50/565 (8%)
Query: 33 QSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
+ P FACD NP L FC ++ I +RV DL+ RLTLQEKI LV++A +V RLGI
Sbjct: 27 HARPPFACDP-HNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA 152
YEWWSEALHGVS VGPGT F PGATSFPQVI TAASFN SL+Q IG+ V EARA
Sbjct: 86 QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARA 145
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LASKYA YVKGLQ G+ NRL
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQ--GDGAGNRL 203
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
KVAACCKHYTAYDLDNW G +R+HFNA V + Y + FK
Sbjct: 204 KVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQ 263
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
YIVSDCDSV VLY QHYT+TPEEAAA +I AGLD
Sbjct: 264 VNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLD 323
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
L+CG FL HTEAAV+ GLV++ I+ A++N MRLG FDG PS YG LGP+DVC
Sbjct: 324 LDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVC 383
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T A+Q LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT TMIGNY G C Y
Sbjct: 384 TPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGY 443
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDR 465
TTPLQG+ TI+QAGC++V C Q+ A+ AA ADATVLVMG DQSIEAE DR
Sbjct: 444 TTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDR 503
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LLLPG QQ L++ VA+ ++GP IL++MSGG D++FAKNDP+I++I+WVGYPGQAGG
Sbjct: 504 VGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGT 563
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG NP G+LPMTWYPQ+Y
Sbjct: 564 AIADVLFGTTNPGGKLPMTWYPQNY 588
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/586 (58%), Positives = 413/586 (70%), Gaps = 52/586 (8%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
K+S++ L L +L + A+ P FACD NP FC T + + +RV DL+ RL
Sbjct: 8 KLSLIALVLCVSALLFNLVHAR--PPFACDP-RNPLTRGFKFCRTRVPVHVRVQDLIGRL 64
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TLQEKI LV++A +V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI TA
Sbjct: 65 TLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTA 124
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN SL+Q IG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L++
Sbjct: 125 ASFNQSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSA 184
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYL 247
KYA YVKGLQ G+ NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y
Sbjct: 185 KYAASYVKGLQ--GDGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLADTYD 242
Query: 248 IKFK------------------------------------------YIVSDCDSVDVLYN 265
+ F+ YIVSDCDSV V Y+
Sbjct: 243 VPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYD 302
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
QHYT+TPEEAAA++I AGLDL+CG FL HTE A+KAGL+ E +D+A++N MRL
Sbjct: 303 QQHYTRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRL 362
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G FDG PS Q YG LGP+DVCT A+Q+LAL+A+RQGIVLL+N +LPLS + +AV+
Sbjct: 363 GMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVV 422
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAAS 444
GPN++VT+TMIGNY G C YTTPLQG+ TI+Q GC+NV C T Q+ A+ AA
Sbjct: 423 GPNSDVTETMIGNYAGVACGYTTPLQGIGRYTKTIHQQGCTNVACTTNQLFGAAEAAARQ 482
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
ADATVLVMG DQSIEAE DR DL++PG QQ L++ VA+ ++GP +L++MSGG D+SFA
Sbjct: 483 ADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFA 542
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
KNDPKI +I+WVGYPGQAGG A+ADV FG NPSG+LPMTWYPQ Y
Sbjct: 543 KNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDY 588
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/526 (65%), Positives = 395/526 (75%), Gaps = 51/526 (9%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
+K+ FLV+ ++ RLGIP YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASF
Sbjct: 7 QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASF 66
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
NASLF+AIG+VVSTEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA
Sbjct: 67 NASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 126
Query: 195 TGYVKGLQQTDGG--SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----------- 241
GYV GLQ G + LKVAACCKHYTAYD+DNWKG +RY F+A V
Sbjct: 127 VGYVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 186
Query: 242 --------------IYTYYLIKFK---------------------YIVSDCDSVDVLYNS 266
+ +Y + K YIVSDCDSVDVLY
Sbjct: 187 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 246
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QHYTKTPEEAAA +I +GLDLNCG+FL +HT AAV+AG ++E +D AI+NNF LMRLG
Sbjct: 247 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 306
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FFDG P + +G LGPKDVCT +N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIG
Sbjct: 307 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIG 365
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAAS 444
PNAN + TMIGNYEGTPCKYTTPLQGL A V T+YQ GC+NV C + Q+ A AAAS
Sbjct: 366 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAAS 425
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
AD TVLV+GADQSIE ES DR LLLPGQQ L++ VA + GPVIL++MSGG FDISFA
Sbjct: 426 ADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFA 485
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
K KI +ILWVGYPG+AGGAA+AD+ FG +NPSGRLP+TWYP SY
Sbjct: 486 KASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASY 531
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/560 (60%), Positives = 394/560 (70%), Gaps = 50/560 (8%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N FCNT + I +RV DL+ RLTL EKI +V++A +V RLGI YEW
Sbjct: 37 FACDP-RNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEW 95
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGAT FPQVI TAASFN SL+Q IG+VVS EARAMYN G
Sbjct: 96 WSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGG 155
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP LA+KYA YVKGLQ G N LKVAAC
Sbjct: 156 QAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDSAG--NHLKVAAC 213
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 214 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 273
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V +++QHYTKTPEEAAA++I AGLDL+CG
Sbjct: 274 TCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKTPEEAAAEAIKAGLDLDCGP 333
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HT++A++ GL++E+ ++ A++N + MRLG FDG PS QPYG LGP+DVCT A+Q
Sbjct: 334 FLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTSAHQ 393
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL+AAR+ IVLL+N SLPLSP+ ++ + V+GPNA+ T TMIGNY G C YTTPLQ
Sbjct: 394 QLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTTPLQ 453
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+A V T +Q GC V C G A+ A ADA VLVMG DQ++EAE+ DR+ LLL
Sbjct: 454 GIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVGLLL 513
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+T VA+ AKGPVIL+IMSGG DISFAKNDPKI++ILWVGYPGQAGG AIADV
Sbjct: 514 PGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAIADV 573
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP GRLPMTWYPQ Y
Sbjct: 574 IFGTTNPGGRLPMTWYPQGY 593
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/565 (59%), Positives = 397/565 (70%), Gaps = 51/565 (9%)
Query: 33 QSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
+S FACD N +L FC +L I +RV DL+ RLTLQEKI LV++A +V RLGI
Sbjct: 37 ESRAPFACDP-RNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGI 95
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA 152
YEWWSEALHGVS VGPG F PGATSFPQVI TAASFN SL++ IG+VVS EARA
Sbjct: 96 QGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARA 155
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
MYN GLAGLT+WSPN+N+FRDPRWGRGQETPGEDP+LA KYA YV+GLQ + G +L
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTG---LKL 212
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
KVAACCKHYTAYDLDNW G DRYHFNA V + Y + FK
Sbjct: 213 KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQ 272
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
YIVSDCDSV VLY++QHYT TPEEAAA +I AGLD
Sbjct: 273 VNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTSTPEEAAAATIKAGLD 332
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
L+CG FL HTE AVK GL+ E ++ A++N MRLG FDG PS PYG LGP+DVC
Sbjct: 333 LDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVC 392
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T A+Q+LAL+AARQGIVLL+N +LPLS + +AVIGPN++VT TMIGNY G CKY
Sbjct: 393 TPAHQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKY 452
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
T+PLQG++ T++Q GC +V C + Q A+ AA ADATVLVMG DQSIEAE DR
Sbjct: 453 TSPLQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDR 512
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LLLPG QQ L++ VA+ ++GP IL++MSGG D+SFAKNDP++ +ILW GYPGQAGGA
Sbjct: 513 VGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGA 572
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG NP G+LPMTWYPQ Y
Sbjct: 573 AIADVLFGTTNPGGKLPMTWYPQGY 597
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/570 (59%), Positives = 401/570 (70%), Gaps = 51/570 (8%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSV 87
S + + P FACD NP +L FC + I +RV DL+ RLTLQEKI LV++A +V
Sbjct: 22 SMAVVHARPPFACDP-RNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAV 80
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
RLGI YEWWSEALHGVS VGPGT F + GATSFPQVI TAASFN SL++ IG+VVS
Sbjct: 81 PRLGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVS 139
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP+LA+KY YVKGLQ G
Sbjct: 140 DEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGD--G 197
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
+ NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YIVSDCDSV V Y++QHYTKTPE AAA +I
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
AGLDL+CG FLG HTEAA++ G VNE I++A++N MRLG FDG PS Q YG LG
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
DVC ++ +LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT+TMIGNY G
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEA 460
C YTTPLQG+A TI+QAGC++V C G + A+ AA ADATVLV+G DQSIEA
Sbjct: 438 IACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEA 497
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR LLLPG QQ L++ VA+ ++GP IL+IMSGG D++FAKNDP+I +I+WVGYPG
Sbjct: 498 EFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPG 557
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
QAGG AIADV FG NPSG+LPMTWYPQ+Y
Sbjct: 558 QAGGTAIADVLFGTTNPSGKLPMTWYPQNY 587
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/586 (58%), Positives = 412/586 (70%), Gaps = 56/586 (9%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
KV V+ +FL L HSS S + P+FACD +N +L FC ++ I +RV DL+ RL
Sbjct: 13 KVVVILVFL--LCLVHSSESLR--PLFACDP-ANGLTRTLRFCRANVPIHVRVQDLLGRL 67
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TLQEKI LV++A +V RLGI YEWWSEALHG+S VGPG F PGATSFPQVI TA
Sbjct: 68 TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTA 127
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN SL++ IG+VVS EARAMYN G+AGLT+WSPN+NI RDPRWGRGQETPGEDP++A+
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYL 247
KYA YV+GLQ T G NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y
Sbjct: 188 KYAASYVRGLQGTAAG--NRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245
Query: 248 IKFK------------------------------------------YIVSDCDSVDVLYN 265
+ FK YIVSDCDSVDV +N
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFN 305
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
QHYT TPEEAAA+SI AGLDL+CG FL TE AVK GL+ E+ I+ A++N MRL
Sbjct: 306 QQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G FDG+ PY LGP+DVCT A++ LAL+AA QGIVLLKN+A SLPLSP + +AVI
Sbjct: 366 GMFDGNLG--PYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVI 423
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAAS 444
GPN++VT+TMIGNY G C YT+PLQG++ T++QAGC+ V C G A+ AA
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAARE 483
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
ADATVLVMG DQSIEAE+ DR LLLPG QQ L+T VA+ ++GPVIL++MSGG D++FA
Sbjct: 484 ADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFA 543
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
KNDP++ +I+W GYPGQAGGAAIA++ FG NP G+LPMTWYPQ Y
Sbjct: 544 KNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDY 589
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/583 (57%), Positives = 397/583 (68%), Gaps = 55/583 (9%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
+L +F+ ++ +S A FACD +N FC SL ++ RV DL+ RLTL+
Sbjct: 8 ILSIFILLSAIHGRASRAP----FACDP-NNSVTTDYPFCRRSLVVEERVKDLIGRLTLE 62
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EK+ LV +AG V RLGI Y+WWSEALHGVS VGPGT F P ATSFPQVI TAASF
Sbjct: 63 EKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASF 122
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
NASL++AIG+VVS EARAMYN G+ GLT+WSPN+NIFRDPRWGRGQETPGEDP+LA YA
Sbjct: 123 NASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYA 182
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF 250
YV+GLQ T+G NRLKVAACCKH+TAYDLDNW G DR+HFNA V I + + F
Sbjct: 183 VNYVRGLQGTEG---NRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 251 K------------------------------------------YIVSDCDSVDVLYNSQH 268
+ YIVSDCDSV V YNSQH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
YT TPEEAAA +I AGLDL+CGSFL HTE AVK GL+NES I+ A+SN + MRLG F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
DG QPY LG K VC+ N+ LA+DAARQGIVLL+N GSLPLS + +AV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 389 ANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADA 447
+N T TMIGNY G C+Y TPLQG++ TI+Q GC V C + + A +AA ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479
Query: 448 TVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
VLVMG DQSIEAE DR LLLPG Q L+ +VA VAKGPVIL++MSGG D+SFAK+
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PKI+ I+W GYPGQAGG AIADV FG+ NP G+LPMTWYPQ Y
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDY 582
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/570 (59%), Positives = 400/570 (70%), Gaps = 51/570 (8%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSV 87
S + + P FACD NP +L FC + I +RV DL+ RLTLQEKI LV++A +V
Sbjct: 22 SMAVVHARPPFACDP-RNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAV 80
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
RLGI YEWWSEALHGVS VGPGT F + GATSFPQVI TAASFN SL++ IG+VVS
Sbjct: 81 PRLGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVS 139
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP+LA+KY YVKGLQ G
Sbjct: 140 DEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQ--GDG 197
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
+ NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YIVSDCDSV V Y++QHYTKTPE AAA +I
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
AGLDL+CG FLG HTEAA++ G VNE I++A++N MRLG FDG PS Q YG LG
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
DVC ++ +LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT+TMIGNY G
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEA 460
C YTTPLQG+A TI+QAGC++V C G + A+ AA ADATVLV+G DQSIEA
Sbjct: 438 IACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEA 497
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR LLLPG QQ L++ VA+ ++GP IL+IMSGG D++FAKNDP I +I+WVGYPG
Sbjct: 498 EFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPG 557
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
QAGG AIADV FG NPSG+LPMTWYPQ+Y
Sbjct: 558 QAGGTAIADVLFGTTNPSGKLPMTWYPQNY 587
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/586 (58%), Positives = 411/586 (70%), Gaps = 56/586 (9%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
KV V+ +FL L HSS S + P+FACD +N +L FC ++ I +RV DL+ RL
Sbjct: 13 KVVVILVFL--LCLVHSSESLR--PLFACDP-ANGLTRTLRFCRVNVPIHVRVQDLIGRL 67
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TLQEKI LV++A +V RLGI YEWWSEALHGVS VGPG+ F PGATSFPQVI TA
Sbjct: 68 TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTA 127
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN SL++ IG+VVS EARAMYN G+AGLT+WSPN+NI RDPRWGRGQETPGEDP++A+
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYL 247
KYA YV+GLQ T G NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y
Sbjct: 188 KYAASYVRGLQGTAAG--NRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245
Query: 248 IKFK------------------------------------------YIVSDCDSVDVLYN 265
+ FK YIVSDCDSVDV +N
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFN 305
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
QHYT TPEEAAA SI AGLDL+CG FL TE AVK GL+ E+ I+ A++N MRL
Sbjct: 306 QQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G FDG+ PY LGP+DVC+ A++ LAL+AA QGIVLLKN+ SLPLSP + +AVI
Sbjct: 366 GMFDGNLG--PYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVI 423
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAAS 444
GPN++VT+TMIGNY G C YTTPLQG++ T++QAGC+ V C G A+ AA
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAARE 483
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
ADATVLVMG DQSIEAE+ DR LLLPG QQ L+T VA+ ++GPVIL++MSGG D++FA
Sbjct: 484 ADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFA 543
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
KNDP++ +I+W GYPGQAGGAAIA++ FG NP G+LPMTWYPQ Y
Sbjct: 544 KNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDY 589
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/583 (57%), Positives = 396/583 (67%), Gaps = 55/583 (9%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
+L +F+ ++ +S A FACD +N FC SL + RV DL+ RLTL+
Sbjct: 8 ILSIFILLSAIHGRASRAP----FACDP-NNSVTTDYPFCRRSLVVGERVKDLIGRLTLE 62
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EK+ LV +AG V RLGI Y+WWSEALHGVS VGPGT F P ATSFPQVI TAASF
Sbjct: 63 EKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASF 122
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
NASL++AIG+VVS EARAMYN G+ GLT+WSPN+NIFRDPRWGRGQETPGEDP+LA YA
Sbjct: 123 NASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYA 182
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF 250
YV+GLQ T+G NRLKVAACCKH+TAYDLDNW G DR+HFNA V I + + F
Sbjct: 183 VNYVRGLQGTEG---NRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239
Query: 251 K------------------------------------------YIVSDCDSVDVLYNSQH 268
+ YIVSDCDSV V YNSQH
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
YT TPEEAAA +I AGLDL+CGSFL HTE AVK GL+NES I+ A+SN + MRLG F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
DG QPY LG K VC+ N+ LA+DAARQGIVLL+N GSLPLS + +AV+GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419
Query: 389 ANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADA 447
+N T TMIGNY G C+Y TPLQG++ TI+Q GC V C + + A +AA ADA
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADA 479
Query: 448 TVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
VLVMG DQSIEAE DR LLLPG Q L+ +VA VAKGPVIL++MSGG D+SFAK+
Sbjct: 480 VVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDH 539
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PKI+ I+W GYPGQAGG AIADV FG+ NP G+LPMTWYPQ Y
Sbjct: 540 PKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDY 582
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/583 (59%), Positives = 404/583 (69%), Gaps = 59/583 (10%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
VLF+ L S+ A+++P+ ACD S + A L FC +L I RV DL+ RL LQ
Sbjct: 12 VLFILL--------SAEARAAPL-ACDPKSGLTRA-LRFCRVNLPIRARVQDLIGRLNLQ 61
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EK+ LV++A V RLGI YEWWSEALHGVS VGPGT F P ATSFPQVI TAASF
Sbjct: 62 EKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSFPQVITTAASF 121
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
NASL++AIG+VVS EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP LAS+YA
Sbjct: 122 NASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLASQYA 181
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF 250
YV+GLQ + NRLKVAACCKHYTAYDLDNW DR+HFNA V + Y + F
Sbjct: 182 ASYVRGLQGIY--NKNRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPF 239
Query: 251 K------------------------------------------YIVSDCDSVDVLYNSQH 268
K YIVSDCDSV VLY+ QH
Sbjct: 240 KGCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQH 299
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
YT+TPEEAAA +I AGLDL+CG FL HTEAAVK GL+ E+ ++ A++N F MRLG F
Sbjct: 300 YTRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMF 359
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
DG + QP+G LGPKDVC+ A+QDLAL AARQGIVLL+N SLPLS +N+AVIGPN
Sbjct: 360 DGEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPN 419
Query: 389 ANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADA 447
A+ T TMIGNY G C YT+PLQG+A T++QAGC V C Q A AAA ADA
Sbjct: 420 ADATVTMIGNYAGVACGYTSPLQGIARYAKTVHQAGCIGVACTSNQQFGAATAAAAHADA 479
Query: 448 TVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
TVLVMG DQSIEAE DR +LLPG QQ L+++VA ++GP IL++M GG D++FAKND
Sbjct: 480 TVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKND 539
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PKI++ILWVGYPGQAGG AIADV FG NP G+LP TWYPQSY
Sbjct: 540 PKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSY 582
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/560 (60%), Positives = 394/560 (70%), Gaps = 50/560 (8%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N FCNT + I +RV DL+ RLTL EKI +V++A +V RLGI YEW
Sbjct: 36 FACDP-RNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEW 94
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGAT FPQVI TAASFN SL+Q IG+VVS EARAMYN G
Sbjct: 95 WSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGG 154
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP LA+KYA YVKGLQ G+ NRLKVAAC
Sbjct: 155 QAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQ--GDGAGNRLKVAAC 212
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 213 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKP 272
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V +++QHYT+TPEEAAA++I AGLDL+CG
Sbjct: 273 TCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTRTPEEAAAEAIKAGLDLDCGP 332
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HT++A++ GL++E+ ++ A++N MRLG FDG PS QP+G LGP+DVCT A+Q
Sbjct: 333 FLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTPAHQ 392
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL+AAR+ IVLL+N SLPLSP+ ++ + VIGPN + T TMIGNY G C YTTPLQ
Sbjct: 393 QLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTTPLQ 452
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+A V T +Q GC V C G A+ A DATVLVMG DQ+IEAE+ DR+ LLL
Sbjct: 453 GIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVGLLL 512
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+T VA+ AKGPVIL+IMSGG D+SFAKN+PKI++ILWVGYPGQAGG AIADV
Sbjct: 513 PGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAIADV 572
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP GRLPMTWYPQ Y
Sbjct: 573 IFGATNPGGRLPMTWYPQGY 592
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/584 (56%), Positives = 397/584 (67%), Gaps = 52/584 (8%)
Query: 17 FLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
F+ L+ + SS +S FACD+ S P+ A GFCN SL + R DLV RL+L+EK
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDI-SAPATAKYGFCNVSLSYEARAKDLVSRLSLKEK 64
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
+ LV+ A V RLG+P YEWWSEALHGVS VGPG HF+ VPGATSFP ILTAASFN
Sbjct: 65 VQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNT 124
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
SL+ +G+VVSTEARAM+NVGLAGLT+WSPN+N+FRDPRWGRGQETPGEDPL+ SKYA
Sbjct: 125 SLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184
Query: 197 YVKGLQQT-DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK 251
YVKGLQ D G RLKV++CCKHYTAYDLDNWKG DR+HF+A V + Y FK
Sbjct: 185 YVKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFK 244
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YIVSDCDS+ V +N HY
Sbjct: 245 SCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY 304
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
TKT E+A A ++ AGL++NCG FLGK+TE AVK +N S +D A+ N+ LMRLGFFD
Sbjct: 305 TKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFD 364
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
G P P+G LGP DVC++ +Q LAL+AA+QGIVLL+N G LPL T +K LAVIGPNA
Sbjct: 365 GDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENR-GDLPLPKTTVKKLAVIGPNA 423
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVV--ATIYQAGCSNVQCGTAQ-VDDAKKAAASAD 446
N TK MI NY G PCKYT+P+QGL V +Y+ GC +V+CG + A KA + AD
Sbjct: 424 NATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEAD 483
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
TVLV+G DQ++EAE DR++L LPG Q+ L+ +VA AK V+L+IMS G DISFAKN
Sbjct: 484 VTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKN 543
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I ++LWVGYPG+AGG AIA V FG YNPSGRLP TWYPQ +
Sbjct: 544 LSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEF 587
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/584 (56%), Positives = 397/584 (67%), Gaps = 52/584 (8%)
Query: 17 FLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
F+ L+ + SS +S FACD+ S P+ A GFCN SL + R DLV RL+L+EK
Sbjct: 6 FVRLSLLIIALVSSLCESQKNFACDI-SAPATAKYGFCNVSLSYEARAKDLVSRLSLKEK 64
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
+ LV+ A V RLG+P YEWWSEALHGVS VGPG HF+ VPGATSFP ILTAASFN
Sbjct: 65 VQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNT 124
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
SL+ +G+VVSTEARAM+NVGLAGLT+WSPN+N+FRDPRWGRGQETPGEDPL+ SKYA
Sbjct: 125 SLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184
Query: 197 YVKGLQQT-DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK 251
YVKGLQ D G RLKV++CCKHYTAYDLDNWKG DR+HF+A V + Y FK
Sbjct: 185 YVKGLQDVHDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFK 244
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YIVSDCDS+ V +N HY
Sbjct: 245 SCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY 304
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
TKT E+A A ++ AGL++NCG FLGK+TE AVK +N S +D A+ N+ LMRLGFFD
Sbjct: 305 TKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFD 364
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
G P P+G LGP DVC++ +Q LAL+AA+QGIVLL+N G LPL T +K LAVIGPNA
Sbjct: 365 GDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENR-GDLPLPKTTVKKLAVIGPNA 423
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVV--ATIYQAGCSNVQCGTAQ-VDDAKKAAASAD 446
N TK MI NY G PCKYT+P+QGL V +Y+ GC +V+CG + A KA + AD
Sbjct: 424 NATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEAD 483
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
TVLV+G DQ++EAE DR++L LPG Q+ L+ +VA AK V+L+IMS G DISFAKN
Sbjct: 484 VTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKN 543
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I ++LWVGYPG+AGG AIA V FG YNPSGRLP TWYPQ +
Sbjct: 544 LSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEF 587
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/558 (60%), Positives = 396/558 (70%), Gaps = 50/558 (8%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CD N ++GFC SLGI+ RV DL+ RLTL EKI LV++A +V RLGI YEWWS
Sbjct: 34 CDK-RNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWS 92
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
EALHGVS VGPGT F PGATSFPQVI TAASFN SL+ IG+VVS EARAMYN G A
Sbjct: 93 EALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA 152
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GLT+WSPN+NIFRDPRWGRGQETPGEDP+LA+KYA YV+GLQ DG RLKVAACCK
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--KKRLKVAACCK 210
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
HYTAYDLDNW G DRYHFNA V + Y + FK
Sbjct: 211 HYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 270
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
YIVSDCDSV VLY+SQH+T TPEEAAA +I AGLDL+CG FL
Sbjct: 271 ADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFL 330
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
HT AV GL+ E +++A++N + MRLG FDG P+ QPYG LGPKDVCT A++ L
Sbjct: 331 AVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHL 390
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL+AARQGIVLL+N AG+LPLSPT + +AVIGPN++ T TMIGNY G C+YTTP+QG+
Sbjct: 391 ALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGI 450
Query: 414 AAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
+ V TI+ GC+NV C G + +A+ AA ADA V+V+G DQSIEAES DR +LLPG
Sbjct: 451 SKYVKTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPG 510
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
+Q+ L+ + KGP ++++MSGG D+SFAKND KI+ ILWVGYPGQAGGAAIADV F
Sbjct: 511 KQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLF 570
Query: 533 GRYNPSGRLPMTWYPQSY 550
G NP G+LPMTWYPQSY
Sbjct: 571 GATNPGGKLPMTWYPQSY 588
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/560 (59%), Positives = 393/560 (70%), Gaps = 52/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N S FCNT + I RV DL+ RL L EKI +V++A +V RLGI YEW
Sbjct: 27 FACDP-KNGLTRSYKFCNTRVPIHARVQDLIGRLALPEKIRLVVNNAIAVPRLGIQGYEW 85
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F ATSFPQVI TAASFN SL+ IG++VS EARAMYN G
Sbjct: 86 WSEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFNQSLWLEIGRIVSDEARAMYNGG 145
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLTFWSPN+NIFRDPRWGRGQETPGEDP +A KYA YV+GLQ G+ NRLKVAAC
Sbjct: 146 AAGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQGN--GAGNRLKVAAC 203
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 204 CKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACVRDGKVASVMCSYNQVNGKP 263
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV VLY++QHYT+TPE+AAA +I AGLDL+CG
Sbjct: 264 TCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTRTPEQAAAAAIKAGLDLDCGP 323
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HT+ A+K GL++E+ ++ A++N MRLG FDG QPYG LG +DVC ++
Sbjct: 324 FLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDG--DAQPYGNLGTRDVCLPSHN 381
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
D+AL+AARQGIVLL+N +LPLSPT + + VIGPN++VT TMIGNY G C YTTPLQ
Sbjct: 382 DVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVTVTMIGNYAGIACGYTTPLQ 441
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDD-AKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+A V TI+QAGC +V CG Q+ +++ A ADATVLVMG DQSIEAE DR LLL
Sbjct: 442 GIARYVKTIHQAGCKDVGCGGNQLFGLSEQVARQADATVLVMGLDQSIEAEFRDRTGLLL 501
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L++ VA+ A+GPVIL++MSGG D++FAKNDPKI++ILWVGYPGQ+GG AIADV
Sbjct: 502 PGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKISAILWVGYPGQSGGTAIADV 561
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FGR NPSGRLP TWYPQ Y
Sbjct: 562 IFGRTNPSGRLPNTWYPQDY 581
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/585 (57%), Positives = 406/585 (69%), Gaps = 57/585 (9%)
Query: 14 SVLFLFL-TYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
S LF+F+ + S+Q + P FACD N + + FC T+L I RV DL+ RLT
Sbjct: 12 SSLFIFIFLFVSIQ------AARPPFACDQ-KNRAFRNFPFCQTNLPIGDRVRDLIGRLT 64
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
LQEK+ L ++A +V RLGI YEWWSEALHGVS VGPGT F PGATSFPQVI TAA
Sbjct: 65 LQEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAA 124
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFNASL++ IG+VVS EARAMYN + GLT+WSPN+NIFRDPRWGRGQETPGEDP++A+
Sbjct: 125 SFNASLWEEIGRVVSDEARAMYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAAL 184
Query: 193 YATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YA YV+GLQ + G + LKVAACCKHYTAYDLDNW G DR+HFNA V I + +
Sbjct: 185 YAERYVRGLQGNEDG--DSLKVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDV 242
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDSV V Y++
Sbjct: 243 PFRSCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDT 302
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QHYT TPEEAAA +I AGLDL+CG FL +HTE AV G++ E+AID ++N A MRLG
Sbjct: 303 QHYTSTPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLG 362
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FDG PS Q YG LGP+DVC+ A+Q+LA++AARQGIVLLKN +LPLSP + +AVIG
Sbjct: 363 MFDGEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIG 422
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASA 445
PN++VT TMIGNY G C YT+PLQG++ TI++ GC +V C ++ A AA A
Sbjct: 423 PNSDVTVTMIGNYAGVACGYTSPLQGISKYAKTIHEKGCGDVACSDDKLFAGAVNAARQA 482
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
DATVLVMG DQSIEAE DR LLLPG QQ LI+EV+K ++GPV+L++MSGG D++FA
Sbjct: 483 DATVLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFAN 542
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
NDP+I +I+W GYPGQ GGAAIADV FG +NP G+LPMTWYPQ Y
Sbjct: 543 NDPRIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEY 587
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/585 (56%), Positives = 396/585 (67%), Gaps = 55/585 (9%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
++V+ FL + SS S FACD + + A+L FC S+ I RV DL+ RLT
Sbjct: 4 LAVILFFL----ISSSSVCVHSRETFACDT-KDAATATLRFCQLSVPIPERVRDLIGRLT 58
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L EK++ L ++A ++ RLGI YEWWSEALHGVS VGPGT F V P ATSFPQVI T A
Sbjct: 59 LAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVA 118
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFNASL+++IG+VVS EARAMYN G+ GLT+WSPN+NI RDPRWGRGQETPGEDP++A K
Sbjct: 119 SFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGK 178
Query: 193 YATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YA YV+GLQ G +RLKVAACCKH+TAYDLDNW G DR+HFNA V I + +
Sbjct: 179 YAASYVRGLQ---GNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 235
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDSV VLY++
Sbjct: 236 PFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDT 295
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QHYT TPEEAAA SI AGLDL+CG FLG HT AVK L+ ES +D+A+ N MRLG
Sbjct: 296 QHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLG 355
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FDG + QPYG LGP VCT ++ LAL+AA+QGIVLLKN SLPLS + +AVIG
Sbjct: 356 MFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIG 415
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASA 445
PN++ T TMIGNY G C YT+P+QG+ TI+Q GC +V C ++ D A +AA A
Sbjct: 416 PNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGA 475
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
DATVLVMG DQSIEAE DR LLLPG+QQ L++ VAK AKGPVIL++MSGG DISFA+
Sbjct: 476 DATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAE 535
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D KI +I+W GYPGQ GG AIAD+ FG NP G+LPMTWYPQ Y
Sbjct: 536 KDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDY 580
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/585 (56%), Positives = 396/585 (67%), Gaps = 55/585 (9%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
++V+ FL + SS S FACD + + A+L FC S+ I RV DL+ RLT
Sbjct: 9 LAVILFFL----ISSSSVCVHSRETFACDT-KDAATATLRFCQLSVPIPERVRDLIGRLT 63
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L EK++ L ++A ++ RLGI YEWWSEALHGVS VGPGT F V P ATSFPQVI T A
Sbjct: 64 LAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVA 123
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFNASL+++IG+VVS EARAMYN G+ GLT+WSPN+NI RDPRWGRGQETPGEDP++A K
Sbjct: 124 SFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGK 183
Query: 193 YATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YA YV+GLQ G +RLKVAACCKH+TAYDLDNW G DR+HFNA V I + +
Sbjct: 184 YAASYVRGLQ---GNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDV 240
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDSV VLY++
Sbjct: 241 PFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDT 300
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QHYT TPEEAAA SI AGLDL+CG FLG HT AVK L+ ES +D+A+ N MRLG
Sbjct: 301 QHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLG 360
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FDG + QPYG LGP VCT ++ LAL+AA+QGIVLLKN SLPLS + +AVIG
Sbjct: 361 MFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIG 420
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASA 445
PN++ T TMIGNY G C YT+P+QG+ TI+Q GC +V C ++ D A +AA A
Sbjct: 421 PNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGA 480
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
DATVLVMG DQSIEAE DR LLLPG+QQ L++ VAK AKGPVIL++MSGG DISFA+
Sbjct: 481 DATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAE 540
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D KI +I+W GYPGQ GG AIAD+ FG NP G+LPMTWYPQ Y
Sbjct: 541 KDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDY 585
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/584 (56%), Positives = 396/584 (67%), Gaps = 56/584 (9%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
++LF ++ SS QS FACD+ + + A+L FC S+ I RV DL+ RLTL
Sbjct: 6 AILFFLIS-----SSSVCVQSRETFACDI-KDAATATLRFCQLSVPITERVKDLIGRLTL 59
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
EK++ L ++A ++ RLGI YEWWSEALHGVS VGPGT F V P ATSFPQVI T AS
Sbjct: 60 VEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVAS 119
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
FNASL+++IG+VVS EARAMYN G+ GLT+WSPN+NI RDPRWGRGQETPGEDP++A KY
Sbjct: 120 FNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKY 179
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
A YV+GLQ G +RLKVAACCKH+TAYDLDNW G DR+HFNA V I + +
Sbjct: 180 AASYVRGLQ---GNDRSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVP 236
Query: 250 FK------------------------------------------YIVSDCDSVDVLYNSQ 267
F+ YIVSDCDSV VLY++Q
Sbjct: 237 FRMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQ 296
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
HYT TPEEAAA SI AGLDL+CG FLG HT AVK L+ ES +D+A+ N MRLG
Sbjct: 297 HYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGM 356
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGP 387
FDG + QPYG LGP VCT ++ LAL+AA+QGIVLLKN SLPLS + +AVIGP
Sbjct: 357 FDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGP 416
Query: 388 NANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASAD 446
N++ T MIGNY G C YT+P+QG+ T++Q GC +V C ++ D A +AA AD
Sbjct: 417 NSDATVAMIGNYAGIACGYTSPVQGITGYARTVHQKGCVDVHCMDDRLFDAAVEAARGAD 476
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
ATVLVMG DQSIEAE DR LLLPG+QQ LI+ VAK AKGPVIL++MSGG DISFA+
Sbjct: 477 ATVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEK 536
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D KI +I+W GYPGQ GG AIAD+ FG NP G+LPMTWYPQ Y
Sbjct: 537 DRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDY 580
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/568 (59%), Positives = 401/568 (70%), Gaps = 51/568 (8%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
S A++ FACD N +L FC SL I R DLV RLTLQEKI LV++A V R
Sbjct: 19 SFAEAREPFACDP-RNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPR 77
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
LGI YEWWSEALHGVS VGPGT F PGATSFPQVI TAASFNASL++ IG+VVS E
Sbjct: 78 LGIKGYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDE 137
Query: 150 ARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP 209
ARAMYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP +A+KYA YV+GLQ G +
Sbjct: 138 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQ---GNAR 194
Query: 210 NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
+RLKVAACCKHYTAYDLD+W G DR+HFNA V + Y + FK
Sbjct: 195 DRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCS 254
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YIVSDCDSV V Y+ QHYT TPEEAAA +I A
Sbjct: 255 YNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKA 314
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
GLDL+CG FL HTEAA++ G + E+ ++ A+ N + MRLG FDG PS QPYG LGP+
Sbjct: 315 GLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPR 374
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
DVCT A+Q LAL+AARQGIVL++N +LPLS + + +AVIGPN++VT+TMIGNY G
Sbjct: 375 DVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVA 434
Query: 404 CKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C YTTPLQG+ TI+QAGCS V C Q A AA ADATVLVMG DQSIEAE
Sbjct: 435 CGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEF 494
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR+D+LLPG+QQ L+++VA ++GP +L++MSGG D+SFAKNDP+I +I+WVGYPGQA
Sbjct: 495 RDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQA 554
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIADV FGR NP G+LP+TWYPQSY
Sbjct: 555 GGTAIADVLFGRTNPGGKLPVTWYPQSY 582
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/560 (58%), Positives = 384/560 (68%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N + FC SL I RV DL+ RLTL+EK+ LV++A +V RLG+ YEW
Sbjct: 27 FACDP-KNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEW 85
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS +GP F+ P ATSFPQVI TAASFNASL++AIG+VVS EARAMYN G
Sbjct: 86 WSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGG 145
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA YA YV+GLQ T NRLKVAAC
Sbjct: 146 TAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHA---NRLKVAAC 202
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+TAYDLDNW G DR+HFNA V I + + FK
Sbjct: 203 CKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNGVP 262
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V Y++QHYT TPEEAAA +I AGLDL+CG
Sbjct: 263 TCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDCGP 322
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HT+ AVK GL++E+ ++ A+ N MRLG FDG P+ PYG LGPKDVC A+Q
Sbjct: 323 FLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPAHQ 382
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LAL+AARQGIVLLKNT LPLS + +AVIGPN+ T TMIGNY G C YT PLQ
Sbjct: 383 ELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNPLQ 442
Query: 412 GLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ T++Q GC NV C ++ A AA ADATVLVMG DQSIEAE+ DR LLL
Sbjct: 443 GIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGLLL 502
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+Q L+++VA +KGP IL++MSGG DI+FAKN+P+I ILW GYPGQAGGAAIAD+
Sbjct: 503 PGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIADI 562
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LP+TWYP+ Y
Sbjct: 563 LFGTANPGGKLPVTWYPEEY 582
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/568 (59%), Positives = 401/568 (70%), Gaps = 51/568 (8%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
S A++ FACD N +L FC SL I R DLV RLTLQEKI LV++A V R
Sbjct: 19 SFAEAREPFACDP-RNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPR 77
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
LGI YEWWSEALHGVS VGPGT F PGATSFPQVI TAASFNASL++ IG+VVS E
Sbjct: 78 LGIKGYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDE 137
Query: 150 ARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP 209
ARAMYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP +A+KYA YV+GLQ G +
Sbjct: 138 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQ---GNAR 194
Query: 210 NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
+RLKVAACCKHYTAYDLD+W G DR+HFNA V + Y + FK
Sbjct: 195 DRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCS 254
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YIVSDCDSV V Y+ QHYT TPEEAAA +I A
Sbjct: 255 YNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKA 314
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
GLDL+CG FL HTEAA++ G + E+ ++ A+ N + MRLG FDG PS QPYG LGP+
Sbjct: 315 GLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPR 374
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
DVCT A+Q LAL+AARQGIVL++N +LPLS + + +AVIGPN++VT+TMIGNY G
Sbjct: 375 DVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVA 434
Query: 404 CKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C YTTPLQG+ TI+QAGCS V C Q A AA ADATVLVMG DQSIEAE
Sbjct: 435 CGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEF 494
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR+D+LLPG+QQ L+++VA ++GP +L++MSGG D+SFAKNDP+I +I+WVGYPGQA
Sbjct: 495 RDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQA 554
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIADV FGR NP G+LP+TWYPQSY
Sbjct: 555 GGTAIADVLFGRTNPGGKLPVTWYPQSY 582
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 340/560 (60%), Positives = 397/560 (70%), Gaps = 52/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD N SL FC ++ + +RV DL+ RLTLQEKI LV++A +V RLGI YEW
Sbjct: 20 FACDA-KNGLTRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEW 78
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGATSFPQVI TAASFN SL++ IG+VVS EARAM+N G
Sbjct: 79 WSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGG 138
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
+AGLT+WSPN+N+FRDPRWGRGQETPGEDP++A KYA YV+GLQ G S RLKVAAC
Sbjct: 139 MAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQ---GNSGFRLKVAAC 195
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHYTAYDLDNW G DRYHFNA V + Y + FK
Sbjct: 196 CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNGKP 255
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV VLY +QHYT TPEEAAA +I AGLDL+CG
Sbjct: 256 TCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDLDCGP 315
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HTE AVK GL+NE ++ A++N MRLG FDG PS QP+G+LGP+DVCT A+Q
Sbjct: 316 FLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPAHQ 375
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL AA+QGIVLL+N+ +LPLS + +AVIGP A+VT TMIGNY G C YTTPLQ
Sbjct: 376 QLALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNYAGVACGYTTPLQ 434
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G++ TI+Q+GC +V C G Q A+ AA+ ADATVLVMG DQSIEAE DR DLLL
Sbjct: 435 GISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDLLL 494
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ LI+ VA+ ++GP IL++MSGG D+SFAKNDP+I +ILW GYPGQAGGAAIADV
Sbjct: 495 PGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIADV 554
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQ Y
Sbjct: 555 LFGTTNPGGKLPMTWYPQDY 574
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 635 bits (1639), Expect = e-179, Method: Compositional matrix adjust.
Identities = 326/560 (58%), Positives = 391/560 (69%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD + +L+ FC +L I RV DL+ RLTLQEK+ LV++A +V RLGI YEW
Sbjct: 29 FACDP-KDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEW 87
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGATSFPQVI T ASFN SL++AIG+VVS EARAMYN G
Sbjct: 88 WSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGG 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP++A +YA Y+KGLQ DG +RLKVAAC
Sbjct: 148 AAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQGNDG---DRLKVAAC 204
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------- 251
CKH+TAYDLDNW GTDR+HFNA V + F+
Sbjct: 205 CKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVP 264
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V Y++QHYT T EEAAA +I AGLDL+CG
Sbjct: 265 TCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGP 324
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HTE AVK GL+ ++ I++A++N MRLG FDG PS YG+LGPK+VC+ ++Q
Sbjct: 325 FLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQ 384
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALDAARQGIVLLKN LPLS + +AVIGPN++V TMIGNY G C Y TPL+
Sbjct: 385 QLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLE 444
Query: 412 GLAAVVATIYQAGCSNVQCGTA-QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ +++ GC NV C T DA AA++ADATVLVMG DQS+EAE+ DR LLL
Sbjct: 445 GIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLL 504
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+ +VA ++GP ++I+MSGG D+SFA NDP+I++ILWVGYPGQAGGAAIADV
Sbjct: 505 PGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADV 564
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQSY
Sbjct: 565 LFGTTNPGGKLPMTWYPQSY 584
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 329/567 (58%), Positives = 384/567 (67%), Gaps = 50/567 (8%)
Query: 31 SAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRL 90
S +S FACD +N + +L FC TSL I +RV DL+ RLTLQEKI LV++A V RL
Sbjct: 18 SGESRQPFACDP-ANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKIRLLVNNAAPVQRL 76
Query: 91 GIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
GI YEWWSEALHGVS G G F PGATSFPQVI TAASFNASL++ IG+VVS E
Sbjct: 77 GISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNASLWEEIGRVVSEEG 136
Query: 151 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN 210
RAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP L ++Y YVKGLQ G
Sbjct: 137 RAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYVKGLQGGGGRGNT 196
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------- 251
RLKVAACCKHYTAYDLD+W G DRYHFNA V + Y FK
Sbjct: 197 RLKVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSY 256
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YIVSDCDSV VL+ QHYT+ PE+AAA +I AG
Sbjct: 257 NQINGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTRYPEDAAAITIKAG 316
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
LDL+CG FL HT+ AV G V++ I++A++N MRLG FDG PY LGPKD
Sbjct: 317 LDLDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDG--PNGPYANLGPKD 374
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+ A+Q LAL AAR+GIVLLKN +LPLS + +AVIGPN++ T MIGNY G PC
Sbjct: 375 VCSPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPC 434
Query: 405 KYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
Y +PLQG++ TI+Q GC V C G A+ AA ADATVLVMG DQSIEAE+
Sbjct: 435 GYISPLQGISRYARTIHQQGCMGVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAK 494
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+ LLLPG QQ LI+ VA +KGPV+L++MSGG D++FAKNDP+++SI+WVGYPGQAG
Sbjct: 495 DRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAG 554
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
GAAIADV FG NP G+LPMTWYPQ Y
Sbjct: 555 GAAIADVLFGATNPGGKLPMTWYPQDY 581
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 325/560 (58%), Positives = 390/560 (69%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD + +L+ FC +L I RV DL+ RLTLQEK+ LV++A +V RLGI YEW
Sbjct: 29 FACDP-KDAALSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEW 87
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGATSFPQVI T ASFN SL++AIG+VVS EARAMYN G
Sbjct: 88 WSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGG 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NIFRDPRWGRGQETPGEDP++A +YA Y+KGLQ DG +RLKVAAC
Sbjct: 148 AAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQGNDG---DRLKVAAC 204
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------- 251
CKH+TAYDLDNW GTDR+HFNA V + F+
Sbjct: 205 CKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVP 264
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDSV V Y++QHYT T EEAAA +I AGLDL+CG
Sbjct: 265 TCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGP 324
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL HTE AVK L+ ++ I++A++N MRLG FDG PS YG+LGPK+VC+ ++Q
Sbjct: 325 FLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQ 384
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALDAARQGIVLLKN LPLS + +AVIGPN++V TMIGNY G C Y TPL+
Sbjct: 385 QLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLE 444
Query: 412 GLAAVVATIYQAGCSNVQCGTA-QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
G+ +++ GC NV C T DA AA++ADATVLVMG DQS+EAE+ DR LLL
Sbjct: 445 GIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLL 504
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+ +VA ++GP ++I+MSGG D+SFA NDP+I++ILWVGYPGQAGGAAIADV
Sbjct: 505 PGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADV 564
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG NP G+LPMTWYPQSY
Sbjct: 565 LFGTTNPGGKLPMTWYPQSY 584
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 325/565 (57%), Positives = 388/565 (68%), Gaps = 52/565 (9%)
Query: 31 SAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRL 90
+++S P FACD + +L FC SL I RV DL+ RLTLQEKI LV++A +V RL
Sbjct: 24 TSESRPAFACDGATR----NLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRL 79
Query: 91 GIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
GI YEWWSEALHGVS PG F PGATSFPQVI TAASFNASL++ IG+VVS EA
Sbjct: 80 GIKGYEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEA 139
Query: 151 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN 210
RAMYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LA KYA YV+GLQ G S N
Sbjct: 140 RAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQ---GNSGN 196
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----------YTYYLIKFKYIVSDCDS 259
+LKVAACCKHYTAYDLDNW DRY FNA V + +++ KY V +
Sbjct: 197 QLKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKYQVYCAHT 256
Query: 260 VDVLYNS--------QHY-------------------------TKTPEEAAAKSILAGLD 286
+ ++ N QH+ TPE+AAA +I AGLD
Sbjct: 257 IKLMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGFICHSTLHSTPEDAAAATIKAGLD 316
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
L CG FL HTE AV+ G + E+ ++ A+ N + MRLG FDG PS QPYG LGP+DVC
Sbjct: 317 LECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPRDVC 376
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T A+Q LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT TM+GNY G C +
Sbjct: 377 TPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVACGF 436
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDR 465
TTPLQG+ V TI+Q+GC +V C Q+ A+ AA ADATVLVMG DQSIE E DR
Sbjct: 437 TTPLQGIERYVRTIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEFKDR 496
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LLLPG QQ L++ VA ++GPV+L++MSGG D+SFAKNDP+I +ILWVGYPGQAGG
Sbjct: 497 VGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGT 556
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FGR NP GRLPMTWYPQ Y
Sbjct: 557 AIADVLFGRTNPGGRLPMTWYPQDY 581
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 323/572 (56%), Positives = 386/572 (67%), Gaps = 63/572 (11%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
SS +S FACD ++P+ A GFCN SL + R DLV RL+L+EK+ LV+ A VS
Sbjct: 18 SSLCESQKNFACDR-NDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVS 76
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLG+P YEWWSEALHGVS VGPG F+ VPGATSFP ILTAASFN SL+ +G+VVST
Sbjct: 77 RLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTSLWLKMGEVVST 136
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT-DGG 207
EARAM+NVGLAGLT+WSPN+NIFRDPRWGRGQETPGEDPL+ SKYA YVKGLQ D G
Sbjct: 137 EARAMHNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVQDAG 196
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
RLKV++CCKHYTAYDLDNWKG DR+HF+A V + Y FK
Sbjct: 197 KSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKSCVEEGDVSSVM 256
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YIVSDCDS+ V ++ HYTKT
Sbjct: 257 CSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHYTKTR-------- 308
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
L++NCG FLGK+TE AVK +N S +D A+ N+ LMRLGFFDG P P+GQLG
Sbjct: 309 ---LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDGDPKSLPFGQLG 365
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
P DVC++ +Q LAL+AA+QGIVLL+N G LPLS TA+K +AVIGPNAN TK MI NY G
Sbjct: 366 PSDVCSKDHQMLALEAAKQGIVLLENR-GDLPLSKTAVKKIAVIGPNANATKVMISNYAG 424
Query: 402 TPCKYTTPLQGLAAVV--ATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSI 458
PCKYT+PLQGL V +Y+ GC +V CG + A KA + AD TVLV+G DQ++
Sbjct: 425 VPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSEADVTVLVVGLDQTV 484
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
EAE DR++L LPG Q+ L+ +VA AK V+L+IMS G DISFAKN I+++LWVGY
Sbjct: 485 EAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTISAVLWVGY 544
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PG+AGG AIA V FG YNPSGRLP TWY Q +
Sbjct: 545 PGEAGGDAIAQVIFGDYNPSGRLPETWYSQEF 576
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 317/571 (55%), Positives = 389/571 (68%), Gaps = 60/571 (10%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FAC P+ ASL FC SL + R DLV RLT EK+ LV++A V RLG+ YEW
Sbjct: 27 FAC-APGGPA-ASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEW 84
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS GPG F PGAT+FPQVI TAAS NA+L++ +G+ VS EARAMYN G
Sbjct: 85 WSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGG 144
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT----DGGSPNRLK 213
AGLTFWSPN+NIFRDPRWGRGQETPGEDP ++++YA YV+GLQQ +GG NRLK
Sbjct: 145 RAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHRNRLK 204
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------ 251
+AACCKH+TAYDLD W GTDR+HFNA+V + + + F+
Sbjct: 205 LAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQV 264
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
YIVSDCDSVDV + QHYT+TPE+AAA ++ AGLDL
Sbjct: 265 NGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLRAGLDL 324
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
+CG FL + +AV AG V ++ +D A+ N MRLG FDG P+ P+G+LGP DVCT
Sbjct: 325 DCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVCT 384
Query: 348 QANQDLALDAARQGIVLLKNTAGS------LPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
+ +QDLALDAARQG+VLLKN G+ LPL P A + +AV+GP+A+ T MIGNY G
Sbjct: 385 REHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAG 444
Query: 402 TPCKYTTPLQGLAAVVATI-YQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIE 459
PC+YTTPLQG+AA A + +QAGC++V C G + A +AA ADATV+V G DQ +E
Sbjct: 445 KPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRVE 504
Query: 460 AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
AE DR LLLPG+Q LI+ VAK +KGPVIL++MSGG DI+FA+NDP+I ILWVGYP
Sbjct: 505 AEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYP 564
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GQAGG AIADV FG +NP +LP+TWY Q Y
Sbjct: 565 GQAGGQAIADVIFGHHNPGAKLPVTWYHQDY 595
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 331/558 (59%), Positives = 385/558 (68%), Gaps = 94/558 (16%)
Query: 83 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAI 142
S ++ RLGIP YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AI
Sbjct: 41 SGRALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAI 100
Query: 143 G------------------------------------------KVVSTEARAMYNVGLAG 160
G +VVSTEARAM+NVGLAG
Sbjct: 101 GESACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAG 160
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GYV GLQ GGS + LKVAACCKH
Sbjct: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGS-DALKVAACCKH 219
Query: 221 YTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLIKFK---- 251
YTAYD+DNWKG +RY F+A+V + +Y + K
Sbjct: 220 YTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCA 279
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
YIVSDCDSVDVLYN+QHYTK PE+AAA +I +GLDLNCG+FL
Sbjct: 280 DKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFLA 339
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+HT AAV+AG ++ES +D AI+NNF LMRLGFFDG P K P+G LGPKDVCT +NQ+LA
Sbjct: 340 QHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQELA 399
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
+AARQGIVLLKNT G+LPLS +IK++AVIGPNAN + TMIGNYEGTPCKYTTPLQGL
Sbjct: 400 REAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLG 458
Query: 415 AVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
A VAT+YQ GC+NV C + Q+ A +AAASAD TVLV+GADQS+E ES DR LLLPG
Sbjct: 459 ANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLPG 518
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
QQ L++ VA ++GPVIL++MSGG FDISFAK+ KI++ILWVGYP ++
Sbjct: 519 QQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRPRRH-- 576
Query: 533 GRYNPSGRLPMTWYPQSY 550
P LP+TWYP S+
Sbjct: 577 PLRIPQSWLPVTWYPASF 594
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 310/570 (54%), Positives = 384/570 (67%), Gaps = 53/570 (9%)
Query: 32 AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
+Q + +ACD ++P+ + FCNTSL R DLV RLTLQEK L++SA +SRLG
Sbjct: 21 SQPTHRYACDR-TDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLG 79
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
+P YEWWSEALHGVS G G HF + +P T FP VIL+AASFN SL+ +G+VVSTE R
Sbjct: 80 VPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGR 139
Query: 152 AMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT--DGG-S 208
AMYNVG AGLT+WSPN+NIFRDPRWGRGQETPGEDPL+ S+YA YV+GLQ+ +G +
Sbjct: 140 AMYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGKEGNFA 199
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
+RLKV++CCKHYTAYD+D WKG DR+HF+A V + Y FK
Sbjct: 200 ADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMC 259
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCDS+ V + +YT+TPE+A A ++
Sbjct: 260 SYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPEDAVALALK 319
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGL+LNCGS+LG +T+ AV G V ES +B A+ N+ LMRLGFFDG P+ P+G++GP
Sbjct: 320 AGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGP 379
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
DVCT +Q LALDAA+QGIVLL N G+LPLSP K LAVIGPNA+ T TM+ NY G
Sbjct: 380 SDVCTVDHQLLALDAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGV 438
Query: 403 PCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEA 460
PC+YT+PLQGL V+ + Y+ GC+NV C ++ A A+ ADATV+V+G D IEA
Sbjct: 439 PCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEA 498
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR++L LPG Q+ L+ E AK A G VIL++MS G DISF KN KI ILWVGYPG
Sbjct: 499 EDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPG 558
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
QAGG AI+ V FG YNP GR P TWYPQ Y
Sbjct: 559 QAGGDAISQVIFGDYNPGGRSPFTWYPQEY 588
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 316/599 (52%), Positives = 397/599 (66%), Gaps = 57/599 (9%)
Query: 3 PAQNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDL 62
P++ + ++ +L +FL+ L S Q + +ACD ++P+ + FCNTSL
Sbjct: 20 PSKEAMKKNQLILLPIFLSLPLLAIS----QPTHRYACDR-TDPNSSQFPFCNTSLPYQD 74
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R DLV RLTLQEK L++SA +SRLG+P YEWWSEALHGVS G G HF + +P T
Sbjct: 75 RASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVT 134
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQET 182
FP VIL+AASFN SL+ +G+VVSTE RAMYNVG AGLT+WSPN+NIFRDPRWGRGQET
Sbjct: 135 IFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQET 194
Query: 183 PGEDPLLASKYATGYVKGLQQT--DGG-SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
PGEDPL+ S+YA YV+GLQ+ +G + +RLKV++CCKHYTAYD+D WKG DR+HF+A
Sbjct: 195 PGEDPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDA 254
Query: 240 MV----IYTYYLIKFK------------------------------------------YI 253
V + Y FK YI
Sbjct: 255 KVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYI 314
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
VSDCDS+ V + +YT+TPE+A A ++ AGL+LNCGS+LG +T+ AV G V ES ++
Sbjct: 315 VSDCDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQ 374
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N+ LMRLGFFDG P+ P+G++GP DVCT +Q LALDAA+QGIVLL N G+LP
Sbjct: 375 ALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNN-GALP 433
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGT 432
LSP K LAVIGPNA+ T TM+ NY G PC+YT+PLQGL V+ + Y+ GC+NV C
Sbjct: 434 LSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSE 493
Query: 433 AQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
++ A A+ ADATV+V+G D IEAE DR++L LPG Q+ L+ E AK A G VIL
Sbjct: 494 ETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVIL 553
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++MS G DISF KN KI ILWVGYPGQAGG AI+ V FG YNP GR P TWYPQ Y
Sbjct: 554 VVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEY 612
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 315/599 (52%), Positives = 398/599 (66%), Gaps = 57/599 (9%)
Query: 3 PAQNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDL 62
P++ + ++ +L +FL+ L + +Q + +ACD ++P+ + FCNTSL
Sbjct: 20 PSKEAMKKNQLILLPIFLSLPLL----AISQPTHRYACDR-TDPNSSQFPFCNTSLPYQD 74
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R DLV RLTLQEK L++SA +SRLG+P YEWWSEALHGVS G G HF + +P T
Sbjct: 75 RASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVT 134
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQET 182
FP VIL+AASFN SL+ +G+VVSTE RAMYNVG AGLT+WSPN+NIFRDPRWGRGQET
Sbjct: 135 IFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNVNIFRDPRWGRGQET 194
Query: 183 PGEDPLLASKYATGYVKGLQQT--DGG-SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
PGEDPL+ S+YA YV+GLQ+ +G + +RLKV++CCKHYTAYD+D WKG DR+HF+A
Sbjct: 195 PGEDPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDA 254
Query: 240 MV----IYTYYLIKFK------------------------------------------YI 253
V + Y FK YI
Sbjct: 255 KVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYI 314
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
VSDCDS+ V + +YT+TPE+A A ++ AGL+LNCGS+LG +T+ AV G V ES ++
Sbjct: 315 VSDCDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQ 374
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N+ LMRLGFFDG P+ P+G++GP DVCT +Q LALDAA+QGIVLL N G+LP
Sbjct: 375 ALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNN-GALP 433
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGT 432
LSP K LAVIGPNA+ T TM+ NY G PC+YT+PLQGL V+ + Y+ GC+NV C
Sbjct: 434 LSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSE 493
Query: 433 AQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
++ A A+ ADATV+V+G D IEAE DR++L LPG Q+ L+ E AK A G VIL
Sbjct: 494 ETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVIL 553
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++MS G DISF KN KI ILWVGYPGQAGG AI+ V FG YNP GR P TWYPQ Y
Sbjct: 554 VVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEY 612
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 325/585 (55%), Positives = 390/585 (66%), Gaps = 66/585 (11%)
Query: 17 FLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
FL L S +H+ SA++S + FC+TSL + R DLV RLTLQEK
Sbjct: 14 FLILPTASQKHACGSAKTS--------------NFPFCDTSLSYEDRAKDLVSRLTLQEK 59
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
LV+ + +SRLG+P YEWWSEALHGVS +GPGT F VPGATSFP VIL+AASFNA
Sbjct: 60 TQQLVNPSAGISRLGVPAYEWWSEALHGVSNLGPGTRFDKKVPGATSFPAVILSAASFNA 119
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
SL+Q +G+VVSTEARAMYNV LAGLTFWSPN+N+FRDPRWGRGQETPGEDPL+ S+YA
Sbjct: 120 SLWQKMGQVVSTEARAMYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVM 179
Query: 197 YVKGLQQTD---GGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
Y++GLQ+ + +RLKV++CCKHYTAYDLDNWKG DR+HF+A V + Y
Sbjct: 180 YLRGLQEVEDEASAKADRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPP 239
Query: 250 FK------------------------------------------YIVSDCDSVDVLYNSQ 267
FK YIVSDCDSV+V YN+
Sbjct: 240 FKSCVVEGHVSSVMCSYNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAI 299
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
HYT TPE+A A ++ AGL++NCG FL K+T AV V+ + +D A+ N+ LMRLGF
Sbjct: 300 HYTATPEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGF 359
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGP 387
FD P P+ LGP DVCT+ NQ LALDAA+QGIVLL+N G+LPLS T IK LAVIGP
Sbjct: 360 FD-DPKSLPFANLGPSDVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGP 418
Query: 388 NANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQC-GTAQVDDAKKAAASA 445
NAN T MI NY G PC+YT+PLQGL ++++ Y GCSNV+C + + A KAAASA
Sbjct: 419 NANATTVMISNYAGIPCRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASA 478
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
DA VLV+G DQSIEAE DR +L LPG Q+ + +VA KG VIL+IM+ G DIS K
Sbjct: 479 DAVVLVVGLDQSIEAEGLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTK 538
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ I ILWVGYPGQAGG AIA V FG YNP GR P TWYPQSY
Sbjct: 539 SVSNIGGILWVGYPGQAGGDAIAQVIFGDYNPGGRSPFTWYPQSY 583
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 322/563 (57%), Positives = 387/563 (68%), Gaps = 53/563 (9%)
Query: 39 ACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWW 98
ACD S P ++ FCNTSL + R DLV RLTLQEK LV+ + +SRLG+P YEWW
Sbjct: 25 ACDKGS-PKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWW 83
Query: 99 SEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
SEALHGVS VGPGT F + VPGATSFP VIL+AASFN +L+ +G+VVS EARAMYNV L
Sbjct: 84 SEALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDL 143
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT-DGGSP--NRLKVA 215
AGLTFWSPN+N+FRDPRWGRGQETPGEDPL+ S+YA YV+GLQ+ D S +RLKV+
Sbjct: 144 AGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEASAKGDRLKVS 203
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------- 251
+CCKHYTAYD+DNWKG DR+HF+A V + Y FK
Sbjct: 204 SCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCSYNRVNG 263
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
YIVSDCDSV+V YNS HYTKTPE+A A ++ AGL++NC
Sbjct: 264 IPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPEDAVALALKAGLNMNC 323
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G FL K+T AV V+ S +D A+ N+ LMRLGFF+ +P P+ LGP DVCT+
Sbjct: 324 GDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSLPFANLGPSDVCTKE 382
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
NQ LAL+AA+QGIVLL+N G+LPLS T IKNLAVIGPNAN T MI NY G PC+Y++P
Sbjct: 383 NQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYAGIPCRYSSP 442
Query: 410 LQGLAAVVATI-YQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDRLD 467
LQGL ++++ Y GCS+V+C + A KAAASADA VLV+G DQSIEAE DR++
Sbjct: 443 LQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSIEAEGLDRVN 502
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
L LPG Q+ L+ +VA KG +IL+IM+ G DISF K+ I ILWVGYPGQ GG AI
Sbjct: 503 LTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGYPGQDGGNAI 562
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
A V FG YNP GR P TWYPQSY
Sbjct: 563 AQVIFGDYNPGGRSPFTWYPQSY 585
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 305/564 (54%), Positives = 379/564 (67%), Gaps = 54/564 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACDV SN SL S FC+T LGID+RV DLV RLTL EK+ +V++A + RLG+P Y+W
Sbjct: 36 YACDVSSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQW 95
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHGV+ PG F + P ATSFP I TAASFN++LF +IG+ VS+EARA++N+G
Sbjct: 96 WQEALHGVA-SSPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARALHNLG 154
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---QTDGGSPNRLKV 214
AGLTFWSPN+NIFRDPRWGRGQETPGEDPLLASK+A+ YV+GLQ S LKV
Sbjct: 155 RAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEGSASDGFLKV 214
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
+ACCKH TAYD+DNWKG DRYHFNA V + Y F+
Sbjct: 215 SACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCSYNRVN 274
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
YIVSDCD++ VL+ Y + E+A A SILAGLDLN
Sbjct: 275 GVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADSILAGLDLN 334
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+FLGKH ++A++AG + E+ +DHA+SN T MRLG FDG P+ QPY LG D+C+
Sbjct: 335 CGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSLGATDICSN 394
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+Q LALDAA QG+VLLKN GSLPLS TA+K +A+IGPNAN T TM+GNYEG PCKY +
Sbjct: 395 DHQQLALDAALQGVVLLKND-GSLPLS-TALKTVALIGPNANATYTMLGNYEGIPCKYIS 452
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PLQG+ + I Y GC NV C V A + A ADA VLV+G DQS E E+ DR
Sbjct: 453 PLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQERETFDRT 512
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
LLLPG Q L++ +A P++L+IMS G DIS K++ +I+S++W+GYPGQ+GGAA
Sbjct: 513 SLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGYPGQSGGAA 572
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+A V FG YNP GRLP TWY + +
Sbjct: 573 LAHVVFGAYNPGGRLPNTWYHEEF 596
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 600 bits (1546), Expect = e-169, Method: Compositional matrix adjust.
Identities = 315/564 (55%), Positives = 380/564 (67%), Gaps = 53/564 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD S P FCNTSL R DL+ RLTLQEK+ +V+ A + RLGIP YEW
Sbjct: 25 FACDKNS-PQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEW 83
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VG G F+ VPGATSFP +IL+AASFN +L+ +G+VVSTEAR M++VG
Sbjct: 84 WSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVG 143
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT-DGG--SPNRLKV 214
LAGLT+WSPN+N+FRDPRWGRGQETPGEDPL+ S+YA YV+GLQ+ D G + ++LKV
Sbjct: 144 LAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTADKLKV 203
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
++CCKHYTAYDLD WKG DR+HF+A V + Y F+
Sbjct: 204 SSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYNRVN 263
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
YIVSDCDS++V Y+S +YT TPE+A A ++ AGL++N
Sbjct: 264 GIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKAGLNMN 323
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG FLGK+T AVK V ES +D A+ NF LMRLGFFDG P +G LGP DVC+
Sbjct: 324 CGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDVCSD 383
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+Q LALDAARQGIVLL N G+LPLS +NLAVIGPNANVT TMI NY G PCKYTT
Sbjct: 384 GHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCKYTT 442
Query: 409 PLQGLAAVVATI-YQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
PLQGL V+T+ Y AGC +V C +D A +AAA+ADA VL++G DQSIE E DR
Sbjct: 443 PLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGLDRE 502
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+L LPG Q+ L+ +V G V+L++MS D+SFA N KI ILWVGYPGQAGG A
Sbjct: 503 NLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAGGDA 562
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+A V FG YNP+GR P TWYPQ Y
Sbjct: 563 VAQVMFGDYNPAGRSPFTWYPQEY 586
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 598 bits (1542), Expect = e-168, Method: Compositional matrix adjust.
Identities = 301/568 (52%), Positives = 380/568 (66%), Gaps = 52/568 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P F+CD SNPS +S FC TSL I RV DLV RLTL EKI+ LV SA S+ RLGIP
Sbjct: 25 TEPPFSCDP-SNPSTSSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIPRLGIP 83
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
YEWWSEALHGV+ VG G HF + ATSFPQVILTAASF+A + IG+V+ EARA+
Sbjct: 84 AYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGREARAV 143
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD---GGSP 209
YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYA YV+G+Q G
Sbjct: 144 YNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQGGKLK 203
Query: 210 NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
L+ +ACCKH+TAYDLDNWKG +R+ F+A V + Y F+
Sbjct: 204 GHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPFQSCVQQGKASGIMCA 263
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YI SDCD+V ++Y++Q Y K+PE+A + A
Sbjct: 264 YNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQGYAKSPEDAVVDVLKA 323
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G+D+NCGS+L KHT+AAV+ + E++ID A+ N F+ MRLG F+G+P++QP+ +GP
Sbjct: 324 GMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGLFNGNPTEQPFSNIGPD 383
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
VC+Q +Q LAL+AAR GIVLLKN+A LPL + +LAVIGPNAN +T++GNY G P
Sbjct: 384 QVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGPNANSVQTLLGNYAGPP 443
Query: 404 CKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
CK TPLQ L V TIY +GC V+C +A +D A A D V++MG DQ+ E E
Sbjct: 444 CKTVTPLQALQYYVKNTIYYSGCDTVKCSSASIDKAVDIAKGVDRVVMIMGLDQTQEREE 503
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DRLDL+LPG+QQ LIT VAK AK P++L+++SGG DISFAK D I SILW GYPG+A
Sbjct: 504 LDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAKYDENIGSILWAGYPGEA 563
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+A++ FG +NP G+LPMTWYPQ +
Sbjct: 564 GGIALAEIIFGDHNPGGKLPMTWYPQEF 591
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/570 (54%), Positives = 380/570 (66%), Gaps = 58/570 (10%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P FAC PS L FC +L R DLV RLT EK+ LV++A V RLG+
Sbjct: 23 TRPPFAC-APGGPS-TRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVE 80
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
YEWWSEALHGVS GPG F PGAT+FPQVI TAASFNASL++ IG+ VS E RA+
Sbjct: 81 GYEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAI 140
Query: 154 YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
YN AGLTFWSPN+NIFRDPRWGRGQETPGEDP ++ +YA YV+GLQQ G RLK
Sbjct: 141 YNGRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAG---RLK 197
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYLI 248
AACCKH+TAYDLD W G DR+HFNA+V + +Y +
Sbjct: 198 TAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQV 257
Query: 249 ------------------KFK---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
K+K YIVSDCDSVDV Y QHYT+T E+A A ++ AGLDL
Sbjct: 258 NGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAGLDL 317
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
+CG FL ++TEAAV G V E+ ID A+ N MRLG FDG + QP+G LGP+ VCT
Sbjct: 318 DCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCT 377
Query: 348 QANQDLALDAARQGIVLLKNTAGS---LPLSPTAIK-NLAVIGPNANVTKTMIGNYEGTP 403
A+++LAL+AA Q IVLLKN G+ LPLS + +AV+GP++ T MIGNY G P
Sbjct: 378 PAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYAGKP 437
Query: 404 CKYTTPLQGLAAVV-ATIYQAGCSNVQC-GTAQ-VDDAKKAAASADATVLVMGADQSIEA 460
C YTTPLQG+ AT++QAGC++V C G+ Q +D A AA ADATV+V+G DQS+EA
Sbjct: 438 CAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEA 497
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR LLLPG+Q L++ VA+ +KGPVIL++MSGG DI+FA+ND + +ILW GYPG
Sbjct: 498 EGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPG 557
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
QAGG AIADV FG +NP G+LP+TWYP+ Y
Sbjct: 558 QAGGQAIADVIFGHHNPGGKLPVTWYPEDY 587
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 595 bits (1533), Expect = e-167, Method: Compositional matrix adjust.
Identities = 308/572 (53%), Positives = 384/572 (67%), Gaps = 56/572 (9%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
S+AQ P +ACDV SN SL S FC+T LG+D+RV DLV RLTL EK+ +V++A + R
Sbjct: 31 STAQ--PRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPR 88
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
LG+P Y+WW EALHGV+ PG F + P ATSFP I AASFN++LF +IG+ VS+E
Sbjct: 89 LGVPSYQWWQEALHGVA-SSPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSE 147
Query: 150 ARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ-TDGGS 208
ARA++N+G AGLTFWSPN+NIFRDPRWGRGQETPGEDPLLASK+A+ YV+GLQ GGS
Sbjct: 148 ARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGGS 207
Query: 209 PNR--LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
+ LKV+ACCKH TAYD+DNWKG DRYHFNA V + Y F+
Sbjct: 208 ASDGFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSV 267
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
YIVSDCD++ VL+ Y + E+A A S
Sbjct: 268 MCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADS 327
Query: 281 ILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
ILAGLDLNCG+FLGKH ++A++AG V E+ +DHAISN T MRLG FDG + +PY L
Sbjct: 328 ILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSL 387
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
G D+C+ +Q LALDAA QG+VLLKN GSLPLS TA+K +A+IGPNAN T TM+GNYE
Sbjct: 388 GATDICSNDHQQLALDAALQGVVLLKND-GSLPLS-TALKTVALIGPNANATYTMLGNYE 445
Query: 401 GTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSI 458
G PCKY +PLQG+ I Y GC +V C V A + A ADA VLV+G DQS
Sbjct: 446 GIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQ 505
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E E+ DR LLLPG Q L++ +A P++L+IMS G DIS K++ +I+S++W+GY
Sbjct: 506 ERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGY 565
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQ+GGAA+A V FG YNP GRLP TWY + +
Sbjct: 566 PGQSGGAALAHVVFGAYNPGGRLPNTWYHEEF 597
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 309/469 (65%), Positives = 349/469 (74%), Gaps = 52/469 (11%)
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WS L + FSN V +F + S F + KVVSTEARAMYNVG
Sbjct: 90 WSHQLSSAHLLASTIGFSNGVGFIFAFWGIFWWGTKI-VSKFMRLRKVVSTEARAMYNVG 148
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
LAGLTFWSPN+NIF+DPRWGRGQETPGEDPLL+SKYA+GYV+GLQQ+D GSP+RLKVAAC
Sbjct: 149 LAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRLKVAAC 208
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------YIVSDCDSVD 261
CKHYTAYDLDNWKG D +HFNA+V + + FK YIVSDCDSVD
Sbjct: 209 CKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNVASVIYIVSDCDSVD 268
Query: 262 VLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFAT 321
V YNSQHYTKTPEEAAAK+ILAGLDLNCGSFLG+HTEAAVK GLV+ESA+D A+SNNFAT
Sbjct: 269 VFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFAT 328
Query: 322 LMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKN 381
LMRLGFFDG+PSK YG+LGPKDVCT +Q+ A +A RQGIV
Sbjct: 329 LMRLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV------------------ 370
Query: 382 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKA 441
+ GTPCKYTTPLQGL A+VAT Y GCSNV CGTAQ+D+AKK
Sbjct: 371 -----------------FAGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQIDEAKKI 413
Query: 442 AASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDI 501
AA+ADATVL++G DQSIEAE DR+++ LPGQQ LLITEVAK +KG VIL++MSGGGFDI
Sbjct: 414 AAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDI 473
Query: 502 SFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
SFAKND KITSI WVGYPG+AGGAAIADV FG YNPSG+LPMTWYPQSY
Sbjct: 474 SFAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSY 522
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 314/502 (62%), Positives = 363/502 (72%), Gaps = 52/502 (10%)
Query: 27 HSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS 86
++ ++ +PVFACD SN +LA+ GFCN R DLV RLTL EK+ FLV+ +
Sbjct: 26 NAGTAEAQAPVFACDA-SNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPA 84
Query: 87 VSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVV 146
+ RLGIP YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+VV
Sbjct: 85 LGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVV 144
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL SKYA GYV GLQ
Sbjct: 145 STEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGA 204
Query: 207 G--SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM------------------------ 240
G + LKVAACCKHYTAYD+DNWKG +RY F+A
Sbjct: 205 GGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVA 264
Query: 241 -VIYTYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
V+ +Y + K YIVSDCDSVDVLY QHYTKTPEEAAA
Sbjct: 265 SVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAA 324
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
+I +GLDLNCG+FL +HT AAV+AG ++E +D AI+NNF LMRLGFFDG P + +G
Sbjct: 325 ITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFG 384
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
LGPKDVCT +N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIGPNAN + TMIGN
Sbjct: 385 SLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGN 443
Query: 399 YEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQ 456
YEGTPCKYTTPLQGL A V T+YQ GC+NV C + Q+ A AAASAD TVLV+GADQ
Sbjct: 444 YEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQ 503
Query: 457 SIEAESHDRLDLLLPGQQQLLI 478
SIE ES DR LLLPGQQ L+
Sbjct: 504 SIERESLDRTSLLLPGQQTQLV 525
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 590 bits (1520), Expect = e-166, Method: Compositional matrix adjust.
Identities = 299/568 (52%), Positives = 373/568 (65%), Gaps = 52/568 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P F+CD SNPS + FC T+L I R DLV RLTL EKI+ LV+SA + RLGIP
Sbjct: 24 TQPPFSCDS-SNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIP 82
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
YEWWSEALHGVS GPG HF++ + GATSFPQVILTAASF+A + IG+ + EARA+
Sbjct: 83 GYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARAL 142
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS-P 209
YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ YA YVKG+Q +GG
Sbjct: 143 YNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGGKIK 202
Query: 210 NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
L+ +ACCKH+TAYDLDNWKG +R+ F+A V + Y FK
Sbjct: 203 GHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCA 262
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YI SDCD+V ++++ Q Y K+PE+A + A
Sbjct: 263 YNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDVLKA 322
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G+D+NCGS+L KH + AV+ ++ES ID A+ N F+ MRLG F+G P Q +G +GP
Sbjct: 323 GMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPD 382
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
VC+Q +Q LAL+AAR GIVLLKN+A LPLS + K+LAVIGPNAN + ++GNY G P
Sbjct: 383 QVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPP 442
Query: 404 CKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C++ TPLQ L + + T+Y C VQC +A VD A A AD VL+MG DQ+ E E
Sbjct: 443 CRFVTPLQALQSYIKQTVYHPACDTVQCSSASVDRAVDVAKGADNVVLMMGLDQTQEREE 502
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR DLLLPG+QQ LI VAK AK PV+L++ SGG DISFAKND I SILW GYPG+
Sbjct: 503 LDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGEG 562
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+A++ FG +NP GRLPMTWYPQ +
Sbjct: 563 GAIALAEIVFGDHNPGGRLPMTWYPQEF 590
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 587 bits (1512), Expect = e-165, Method: Compositional matrix adjust.
Identities = 301/585 (51%), Positives = 378/585 (64%), Gaps = 56/585 (9%)
Query: 17 FLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
F T LQ S+ P F+CD SNPS + FC T+L I R DLV RLTL+EK
Sbjct: 11 FTIFTVIVLQVDSTQ----PPFSCDS-SNPSTKTFPFCKTTLPISQRANDLVSRLTLEEK 65
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
I+ LV+SA + RLGIP Y+WWSEALHGV+Y GPG F+ + ATSFPQVIL+AASF+A
Sbjct: 66 ISQLVNSAQPIPRLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDA 125
Query: 137 SLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAT 195
+ + I + + EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KYA
Sbjct: 126 NQWYRISQAIGKEARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAV 185
Query: 196 GYVKGLQQTD---GGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YV+GLQ G L+ +ACCKH+TAYDL+NW GT RY F+A V + Y
Sbjct: 186 SYVRGLQGDSFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQP 245
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
FK YI SDCD+V +++++
Sbjct: 246 PFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDA 305
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
Q Y KTPE+A + AG+D+NCGS+L +HT+AAV + S ID A+ N F+ MRLG
Sbjct: 306 QGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLG 365
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
F+G+P+ Q +G +GP VC+Q NQ LALDAAR GIVLLKN+AG LPLS + +LAVIG
Sbjct: 366 LFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIG 425
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASA 445
PNAN +T++GNY G PCK TPLQ L + + TI GC +VQC +A + A A A
Sbjct: 426 PNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVGAVNVAKGA 485
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
D VL+MG D + E E DR DL+LPG+QQ LI VAK AK PV+L+++SGG DISFAK
Sbjct: 486 DHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAK 545
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ND I SILW GYPG+AG A+A++ FG +NP G+LPMTWYPQ +
Sbjct: 546 NDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEF 590
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 316/584 (54%), Positives = 380/584 (65%), Gaps = 83/584 (14%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A+L FC SL R DLV RLT EK+ LV++A V RLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKV----------------------- 145
GPG F PGAT+FPQVI TAASFNA+L++ IG+V
Sbjct: 99 GPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVF 158
Query: 146 ---------VSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
VS E RAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP +A++YA
Sbjct: 159 MYVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAA 218
Query: 197 YVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK- 251
YV+GLQQ S RLK+AACCKH+TAYDLDNW GTDR+HFNA+V + + + F+
Sbjct: 219 YVRGLQQQQPSS-GRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRS 277
Query: 252 -----------------------------------------YIVSDCDSVDVLYNSQHYT 270
YIVSDCDSVDV Y+ QHYT
Sbjct: 278 CVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYT 337
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
+T E+A A ++ AGLDL+CG FL ++TE AV G V + ID A++N MRLG FDG
Sbjct: 338 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 397
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP-TAIKNLAVIGPNA 389
P+ QP+G LGP+ VCT A+Q+LA++AARQGIVLLKN +LPLSP TA + +AV+GP+A
Sbjct: 398 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 457
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQC-GTAQ-VDDAKKAAASAD 446
T MIGNY G PC+YTTPLQG+A A +Q GC++V C G+ Q + A AA AD
Sbjct: 458 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 517
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
AT++V G DQ IEAE DR LLLPG+Q LI+ VAK +KGPVIL++MSGG DI FA+N
Sbjct: 518 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 577
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
DPKI ILW GYPGQAGG AIADV FG +NP G+LP+TWYPQ Y
Sbjct: 578 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDY 621
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 297/567 (52%), Positives = 374/567 (65%), Gaps = 52/567 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
SSP FACD S+P S FCNT+L I R DL+ RLTL EKI+ L+ SA S+ RLGIP
Sbjct: 694 SSPPFACDS-SDPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGIP 752
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
YEWWSEALHG+ G F+ + ATSFPQVILTAASF+A L+ IG+ + E RAM
Sbjct: 753 AYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRAM 811
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGSPN 210
YN G A G+TFW+PNINIFRDPRWGRGQETPGEDP++A KYA YV+GLQ +GG +
Sbjct: 812 YNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKVD 871
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------- 251
L+ +ACCKH+TAYDLDNW DRY F+A V + Y F+
Sbjct: 872 VLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCAY 931
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YIVSDCD+V ++++ Q Y K+PE+A A + AG
Sbjct: 932 NLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTAG 991
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+ CG +L KH ++AV + ES ID A+ N F MRLG F+G+P K P+G +GP
Sbjct: 992 MDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPDQ 1051
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+ +Q LAL+AAR GIVLLKN+ LPLS +LAVIGPNAN T T++GNY G PC
Sbjct: 1052 VCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPPC 1111
Query: 405 KYTTPLQGLAAVV-ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
K+ +PLQGL + V T+Y AGC++V C +A +++A A AD VLVMG DQ+ E E +
Sbjct: 1112 KFISPLQGLQSYVNNTMYHAGCNDVACSSASIENAVDVAKQADYVVLVMGLDQTQEREKY 1171
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DRLDL+LPG+Q+ LIT VAK AK PV+L+++ GG DISFAK I SILW GYPG+AG
Sbjct: 1172 DRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEAG 1231
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
GAAIA+ FG +NP GRLP+TWYP+ +
Sbjct: 1232 GAAIAETIFGDHNPGGRLPVTWYPKDF 1258
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 301/588 (51%), Positives = 377/588 (64%), Gaps = 52/588 (8%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
S+L + L Y ++ + SP F+CD SNPS S FC T+L I RV DLV RLTL
Sbjct: 5 SLLLINLIYVTVILVGVESTQSPPFSCDS-SNPSTKSYHFCKTTLPIPDRVRDLVSRLTL 63
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
EKI+ LV+SA ++ RLGIP YEWWSEALHGV+ GPG F+ + ATSFPQVILTAAS
Sbjct: 64 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAAS 123
Query: 134 FNASLFQAIGKVVSTEARAMYNVG-LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
F+ L+ IG+ + EARA+YN G G+TFW+PNINIFRDPRWGRGQETPGEDPL+
Sbjct: 124 FDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGS 183
Query: 193 YATGYVKGLQQTDGGSPNR---LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTY 245
YA YV+G+Q R L+ +ACCKH+TAYDLD+WKG DR+ F+A V +
Sbjct: 184 YAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADT 243
Query: 246 YLIKFK------------------------------------------YIVSDCDSVDVL 263
Y F YI SDCD+V ++
Sbjct: 244 YQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLI 303
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
++S + KTPE+A + AG+D+NCG++L HT++AV + ES +D A+ N FA M
Sbjct: 304 HDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRM 363
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
RLG F+G+P QPYG +GP VC+ +Q LALDAAR GIVLLKN+ LPL +LA
Sbjct: 364 RLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLA 423
Query: 384 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCSNVQCGTAQVDDAKKAA 442
VIGPNAN KT+IGNY G PCK+ TPLQ L + V +T+Y GC V C + ++ A + A
Sbjct: 424 VIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIEKAVEIA 483
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
AD VLVMG DQ+ E E+HDRLDL+LPG+QQ LI VA AK PV+L+++SGG DIS
Sbjct: 484 QKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDIS 543
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FAK I SILW GYPG AGGAAIA+ FG +NP GRLP+TWYPQ +
Sbjct: 544 FAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF 591
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/561 (52%), Positives = 366/561 (65%), Gaps = 76/561 (13%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+L FC SL + R DLV RLT EK+ LV++A V+RLG+ YEWWSEALHGVS G
Sbjct: 44 TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDTG 103
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNIN 169
PG F PGAT+FPQVI AA+ NA+L++ IG+ VS EARAMYN G AGLTFWSPN+N
Sbjct: 104 PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFWSPNVN 163
Query: 170 IFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
IFRDPRWGRGQETPGEDP ++S+YA YV+GLQQ NRLK+AACCKH+TAYDLD+W
Sbjct: 164 IFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPY--DHNRLKLAACCKHFTAYDLDSW 221
Query: 230 KGTDRYHFNAMV----IYTYYLIKFK---------------------------------- 251
GTDR+HFNA+V + + + F+
Sbjct: 222 GGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPTCADQGFLRGTI 281
Query: 252 --------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
YIVSDCDSVDV + QHYT+T E+A A ++ AGLDL+CG FL +TE AV
Sbjct: 282 RKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLDCGPFLALYTENAVAR 341
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
V+++ +D A+ N MRLG FDG P+ P+G LG DVCT+A+QDLALDAARQ +V
Sbjct: 342 KKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQDLALDAARQSVV 401
Query: 364 LLKNTAGS-------LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
LLKN G LPL P A + +AV+GP+A+ T MIGNY G PC+YTTPLQG+AA
Sbjct: 402 LLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAY 461
Query: 417 VA-TIYQAGCSNVQCG------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
A ++QAGC++V C A VD A++ + + L LL
Sbjct: 462 AARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTR--------------SLL 507
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LPG+Q LI+ VAK AKGPVIL++MSGG DI+FA+NDP+I ILWVGYPGQAGG AIAD
Sbjct: 508 LPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQAGGQAIAD 567
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
V FG++NP G+LP+TWYPQ Y
Sbjct: 568 VIFGQHNPGGKLPVTWYPQDY 588
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 307/560 (54%), Positives = 387/560 (69%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACD SN +LA+ FCNTSL I RV D V RLTL+EKI+ L+++A + RLG+PKY+W
Sbjct: 30 YACDQ-SNATLAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQW 88
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHGV+ PG F VP ATSFP I TAASFN SLF IG+ VSTEARAM+N+G
Sbjct: 89 WQEALHGVA-SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLG 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
+GLTFWSPNINI+RDPRWGRGQETPGEDPLL+S +AT YV+GLQ++ GS ++LKV+AC
Sbjct: 148 QSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGS-DKLKVSAC 206
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH TAYD+DNW GTDRYHFNA+V + Y FK
Sbjct: 207 CKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVP 266
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDS+ V +++ +Y T E+AAA ++LAGL+LNCG+
Sbjct: 267 TCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNLNCGT 326
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL KHT +A++ V E+ I+ A++ MRLG +DG P Q YG LG DVCT +Q
Sbjct: 327 FLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQ 386
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL+AARQG+VLLKN G+LPLS + IK+LAV+GP+AN T+ MIGNY G PCKYT+PLQ
Sbjct: 387 TLALEAARQGMVLLKNL-GALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQ 445
Query: 412 GLAAVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
Y GC+NV C + + + A AAA+ADA V+ +G D +IEAES DR LLL
Sbjct: 446 AFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLL 505
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+++V + AKGPV+++I+S G DI FA +D +I ILW GYPGQAGGAAIA+V
Sbjct: 506 PGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEV 565
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG +NPSG+LP TWYPQ++
Sbjct: 566 IFGDHNPSGKLPATWYPQNF 585
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 291/455 (63%), Positives = 348/455 (76%), Gaps = 47/455 (10%)
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
IGKVVSTE RAMYNVG AGLT+WSPN+NI+RDPRWGRGQET GEDP L+S+Y YVKGL
Sbjct: 2 IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------- 241
QQ D G + LKVA+CCKHYTAYD+D+WKG RY+FNA V
Sbjct: 62 QQRDDGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSCVLDG 121
Query: 242 -----IYTYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEE 275
+ +Y + K YIV+DCDS++ +Y +QHYTKTPEE
Sbjct: 122 NVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTKTPEE 181
Query: 276 AAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
AA S+ AGL LNCGS+LGK+T+ AV GLVNES ID A++NNFATLMRLGFFDG+P Q
Sbjct: 182 TAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGNPKNQ 241
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
YG LGPKD+CT+ +Q+LA +AARQGIVLLKNTAGSLPLSP +IK+LAVIGPNAN+ TM
Sbjct: 242 LYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANLAYTM 301
Query: 396 IGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
+G+YEG+PCKYTTPL GL A V+T+YQ GC ++ C TAQVD+AKK AA+ADA VLVMG+D
Sbjct: 302 VGSYEGSPCKYTTPLDGLGASVSTVYQQGC-DIACATAQVDNAKKVAAAADAVVLVMGSD 360
Query: 456 QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
Q+IE ES DR ++ LPGQQ LL+TEVA V+KGPVIL+IMSGGG D+ FA ++PK+TSILW
Sbjct: 361 QTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKVTSILW 420
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
VG+PG+AGGAA+ADV FG +NP GRLPMTWYPQSY
Sbjct: 421 VGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSY 455
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 307/560 (54%), Positives = 387/560 (69%), Gaps = 51/560 (9%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACD SN +LA+ FCNTSL I RV D V RLTL+EKI+ L+++A + RLG+PKY+W
Sbjct: 30 YACDQ-SNATLAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQW 88
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHGV+ PG F VP ATSFP I TAASFN SLF IG+ VSTEARAM+N+G
Sbjct: 89 WQEALHGVA-SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLG 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
+GLTFWSPNINI+RDPRWGRGQETPGEDPLL+S +AT YV+GLQ++ GS ++LKV+AC
Sbjct: 148 QSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGS-DKLKVSAC 206
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH TAYD+DNW GTDRYHFNA+V + Y FK
Sbjct: 207 CKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVP 266
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YIVSDCDS+ V +++ +Y T E+AAA ++LAGL+LNCG+
Sbjct: 267 TCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNLNCGT 326
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
FL KHT +A++ V E+ I+ A++ MRLG +DG P Q YG LG DVCT +Q
Sbjct: 327 FLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQ 386
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL+AARQG+VLLKN G+LPLS + IK+LAV+GP+AN T+ MIGNY G PCKYT+PLQ
Sbjct: 387 TLALEAARQGMVLLKNL-GALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQ 445
Query: 412 GLAAVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
Y GC+NV C + + + A AAA+ADA V+ +G D +IEAES DR LLL
Sbjct: 446 AFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLL 505
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQ L+++V + AKGPV+++I+S G DI FA +D +I ILW GYPGQAGGAAIA+V
Sbjct: 506 PGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEV 565
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG +NPSG+LP TWYPQ++
Sbjct: 566 IFGDHNPSGKLPATWYPQNF 585
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 295/461 (63%), Positives = 348/461 (75%), Gaps = 50/461 (10%)
Query: 138 LFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGY 197
++ I VVSTEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GY
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263
Query: 198 VKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV---------------- 241
V GLQ GGS + LKVAACCKHYTAYD+DNWKG +RY F+A+V
Sbjct: 264 VTGLQDAGGGS-DALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSC 322
Query: 242 ---------IYTYYLIKFK---------------------YIVSDCDSVDVLYNSQHYTK 271
+ +Y + K YIVSDCDSVDVLYN+QHYTK
Sbjct: 323 VIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTK 382
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
PE+AAA +I +GLDLNCG+FL +HT AAV+AG ++ES +D AI+NNF LMRLGFFDG
Sbjct: 383 NPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGD 442
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
P K P+G LGPKDVCT +NQ+LA +AARQGIVLLKNT G+LPLS +IK++AVIGPNAN
Sbjct: 443 PRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANA 501
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATV 449
+ TMIGNYEGTPCKYTTPLQGL A VAT+YQ GC+NV C + Q+ A +AAASAD TV
Sbjct: 502 SFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTV 561
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
LV+GADQS+E ES DR LLLPGQQ L++ VA ++GPVIL++MSGG FDISFAK+ K
Sbjct: 562 LVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDK 621
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I++ILWVGYPG+AGGAA+AD+ FG +NP GRLP+TWYP S+
Sbjct: 622 ISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASF 662
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/110 (65%), Positives = 86/110 (78%), Gaps = 1/110 (0%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+PVFACD SN +++ GFC+ + R DL+ RLTL EK+ FLV+ ++ RLGIP
Sbjct: 28 TPVFACDA-SNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPA 86
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+
Sbjct: 87 YEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 291/567 (51%), Positives = 371/567 (65%), Gaps = 51/567 (8%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P ++CD S+PS FC T L I RV DLV RLTL EK++ LVD+A ++ RLGIP
Sbjct: 28 TQPPYSCDS-SDPSTKLYPFCQTKLPISQRVEDLVSRLTLDEKVSQLVDTAPAIPRLGIP 86
Query: 94 KYEWWSEALHGVSY---VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
YEWWSEALHGV+ V G F+ + ATSFPQVILTAASF+A L+ IG+V+ EA
Sbjct: 87 AYEWWSEALHGVALQTTVRQGIRFNGTIRFATSFPQVILTAASFDAHLWYRIGQVIGKEA 146
Query: 151 RAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGS 208
R +YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+A KYA YV+G+Q + GG
Sbjct: 147 RGIYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVAGKYAVSYVRGVQGDSFGGG 206
Query: 209 P--NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
+L+ +ACCKH+TAYDLD WKG +R+ F+A + Y F+
Sbjct: 207 TLGEQLQASACCKHFTAYDLDKWKGMNRFVFDAQDLADTYQPPFQSCIQEGKASGIMCAY 266
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YI SDCD+V ++++ Q Y K+PE+A A + AG
Sbjct: 267 NRVNGVPNCADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAKSPEDAVADVLKAG 326
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCG +L +T++AVK + ES ID A+ N F+ MRLG F+G+P+KQPYG + P
Sbjct: 327 MDVNCGDYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQ 386
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+Q +Q LAL AA+ GIVLLKN LPLS K+LAVIGPNAN + ++GNY G PC
Sbjct: 387 VCSQEHQALALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPC 446
Query: 405 KYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
K TPLQGL + T Y GCS V C +A ++ A K A AD +LVMG DQ+ E E
Sbjct: 447 KTVTPLQGLQNYIKNTRYHPGCSRVACSSASINQAVKIAKGADQVILVMGLDQTQEKEEQ 506
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+DL+LPG+Q+ LIT VAK AK PV+L++ GG D+SFAK D I SI+W GYPG+AG
Sbjct: 507 DRVDLVLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAG 566
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+A + FG +NP GRLPMTWYPQ +
Sbjct: 567 GTALAQIIFGDHNPGGRLPMTWYPQDF 593
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 569 bits (1467), Expect = e-159, Method: Compositional matrix adjust.
Identities = 296/572 (51%), Positives = 371/572 (64%), Gaps = 52/572 (9%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
+ + S P +ACD SNP +L FC T L I LR DLV RLTL EK+ LV++ + R
Sbjct: 30 AGSSSQPPYACDS-SNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPR 88
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
LGIP YEWWSEALHGV+ VG G + + ATSFPQVILTAASF+ +L+ IG+ + TE
Sbjct: 89 LGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTE 148
Query: 150 ARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDG 206
ARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KY+ YV+G+Q +G
Sbjct: 149 ARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEG 208
Query: 207 GS-PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT--------------------- 244
G N+LK +ACCKH+TAYDLD W G RY F+A V
Sbjct: 209 GKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASG 268
Query: 245 ----------------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
++L+ KF YI SDCD+V +++++Q Y K PE+A A
Sbjct: 269 IMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVAD 328
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
+ AG+D+NCG++L +HT++AV+ V ID A+ N F+ MRLG FDG+P+K P+GQ
Sbjct: 329 VLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQ 388
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
+G VC+Q +Q+LAL AAR+GIVLLKN+A LPLS + +LAVIG N N KT+ GNY
Sbjct: 389 IGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNY 448
Query: 400 EGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
G PCK TP QGL V T+Y GC+ C A + A K A S D VLVMG DQ+
Sbjct: 449 AGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVKIAKSVDYVVLVMGLDQTQ 508
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E E DR +L LPG+Q LI EVAK AK PVIL+I+SGG DIS AK + KI SILW GY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGY 568
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQAGG AIA++ FG +NP GRLP+TWYP +
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDF 600
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 296/572 (51%), Positives = 371/572 (64%), Gaps = 52/572 (9%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
+ + S P +ACD SNP +L FC T L I LR DLV RLTL EK+ LV++ + R
Sbjct: 30 AGSSSQPPYACDS-SNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPR 88
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
LGIP YEWWSEALHGV+ VG G + + ATSFPQVILTAASF+ +L+ IG+ + TE
Sbjct: 89 LGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTE 148
Query: 150 ARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDG 206
ARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+ KY+ YV+G+Q +G
Sbjct: 149 ARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEG 208
Query: 207 GS-PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT--------------------- 244
G N+LK +ACCKH+TAYDLD W G RY F+A V
Sbjct: 209 GKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASG 268
Query: 245 ----------------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
++L+ KF YI SDCD+V +++++Q Y K PE+A A
Sbjct: 269 IMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVAD 328
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
+ AG+D+NCG++L +HT++AV+ V ID A+ N F+ MRLG FDG+P+K P+GQ
Sbjct: 329 VLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQ 388
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
+G VC+Q +Q+LAL AAR+GIVLLKN+A LPLS + +LAVIG N N KT+ GNY
Sbjct: 389 IGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNY 448
Query: 400 EGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
G PCK TP QGL V T+Y GC+ C A + A K A S D VLVMG DQ+
Sbjct: 449 AGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVKIAKSVDYVVLVMGLDQTQ 508
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E E DR +L LPG+Q LI EVAK AK PVIL+I+SGG DIS AK + KI SILW GY
Sbjct: 509 EREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGY 568
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQAGG AIA++ FG +NP GRLP+TWYP +
Sbjct: 569 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDF 600
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 291/567 (51%), Positives = 370/567 (65%), Gaps = 53/567 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P +CD SNP+ FC T L I R DLV RLT+ EKI+ LV++A + RLG+P Y
Sbjct: 22 PPHSCDP-SNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAY 80
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
EWWSEALHGV+Y GPG F+ V ATSFPQVILTAASF++ + I +V+ EAR +YN
Sbjct: 81 EWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYN 140
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS--PN 210
G A G+TFW+PNINIFRDPRWGRGQETPGEDP++ YA YV+GLQ DG N
Sbjct: 141 AGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTLSN 200
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------- 251
L+ +ACCKH+TAYDLD WKG RY FNA V + Y FK
Sbjct: 201 HLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAY 260
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YI SDCD+V ++Y++Q Y K+PE+A A + AG
Sbjct: 261 NRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADVLKAG 320
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCGS+L KHT++A++ V+E+ ID A+ N F+ +RLG F+G P+K PYG + P +
Sbjct: 321 MDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNE 380
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+ A+Q LALDAAR GIVLLKN LP S ++ +LAVIGPNA+V KT++GNY G PC
Sbjct: 381 VCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGPPC 440
Query: 405 KYTTPLQGLAAVVAT-IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
K TPL L + V +Y GC +V C A +D A A +AD VL+MG DQ+ E E
Sbjct: 441 KTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVVLIMGLDQTQEKEDF 500
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+DL LPG+QQ LIT VA AK PV+L+++ GG DISFA N+ KI SI+W GYPG+AG
Sbjct: 501 DRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPGEAG 560
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AI+++ FG +NP GRLP+TWYPQS+
Sbjct: 561 GIAISEIIFGDHNPGGRLPVTWYPQSF 587
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 289/590 (48%), Positives = 376/590 (63%), Gaps = 58/590 (9%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
++ + + L+ S+ S+ P F+CD SNP SL FC T L I +RV DLV RLT
Sbjct: 9 ITTILISLSLVSIVQSTQ-----PPFSCDS-SNPQTKSLKFCQTGLPISVRVLDLVSRLT 62
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L EKI+ LV+SA ++ RLGIP YEWWSE+LHGV G G F+ + GATSFPQVILTAA
Sbjct: 63 LDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQVILTAA 122
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
+F+ +L+ IG+V+ EAR +YN G A G+TFW+PNINIFRDPRWGRGQETPGEDP++
Sbjct: 123 TFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTG 182
Query: 192 KYATGYVKGLQ--QTDGGSPNR--LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYL 247
KYA YV+G+Q +GG + L+ +ACCKH+TAYDLD WK DR+ FNA+V
Sbjct: 183 KYAIRYVRGVQGDSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMA 242
Query: 248 IKFK----------------------------------------------YIVSDCDSVD 261
F+ YI SDCD+V
Sbjct: 243 DTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQ 302
Query: 262 VLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFAT 321
V++++ Y TPE++ A ++ AG+D++CG +L K+T++AV V++ ID A+ N F+
Sbjct: 303 VMHDNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSI 362
Query: 322 LMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKN 381
MRLG F+G P KQ YG + P VC +Q LAL+AAR GIVLLKNT LPLS +
Sbjct: 363 RMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNS 422
Query: 382 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKK 440
LAVIG NAN + GNY+G PCKY L+ L ++ YQ GC+ C +A +D A
Sbjct: 423 LAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTSANIDQAVN 482
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A +AD VL+MG DQ+ E E DR DL+LPGQQ+ LI VAK AK PVIL+I+SGG D
Sbjct: 483 IARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVD 542
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ISFAK +PKI SILW GYPG+AGG A+A++ FG +NP G+LP+TWYPQ++
Sbjct: 543 ISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAF 592
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 285/560 (50%), Positives = 366/560 (65%), Gaps = 60/560 (10%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
+S FC+ SL + RV DLV R+ L EKI +V +A + RLGIP Y+WW EALHGV+
Sbjct: 31 SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNI 168
PG F+ VP ATSFPQVILT ASFN+SL+ I + +S EA AMYN G +GLTFWSPNI
Sbjct: 90 SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149
Query: 169 NIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSPNRLKVAACC 218
NIFRDPRWGRGQETPGEDPLL+SKYA +V+GLQ+ D GSP RLKV++CC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPTRLKVSSCC 209
Query: 219 KHYTAYDLDNWKGTDRYHFNAMVI------------------------------------ 242
KH+TAYD++ +GTD +HFNA V
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269
Query: 243 ---YTYYLIKFK-------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
YT+ + YIVSDCD+V +LY +YT T E+A A + AG+DLNCG+F
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMDLNCGTF 329
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
L +HT AA++ G V E+A+D A+SN MRLG FDG+ S + Y +GP VCT ++
Sbjct: 330 LLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVCTPEHRQ 388
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
L+L+AA QGIVLLKN+ LP + +AVIGP+ N T+TM+GNY G PC+Y TP QG
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448
Query: 413 LAAVV-ATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
L +++ GC ++ C T A +AA ++DA V+V+G D+ E E DR LLL
Sbjct: 449 LQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+ EV+KVAKGPVIL++MSGG D++FAK + KI+++LWVGYPG+AGG AIA V
Sbjct: 509 PGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGEAGGKAIARV 568
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG +NP+GRLPMTWYPQ++
Sbjct: 569 IFGDHNPAGRLPMTWYPQAF 588
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 284/560 (50%), Positives = 365/560 (65%), Gaps = 60/560 (10%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
+S FC+ SL + RV DLV R+ L EKI +V +A + RLGIP Y+WW EALHGV+
Sbjct: 31 SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNI 168
PG F+ VP ATSFPQVILT ASFN+SL+ I + +S EA AMYN G +GLTFWSPNI
Sbjct: 90 SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149
Query: 169 NIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS----------PNRLKVAACC 218
NIFRDPRWGRGQETPGEDPLL+SKYA +V+GLQ+ D P RLKV++CC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPTRLKVSSCC 209
Query: 219 KHYTAYDLDNWKGTDRYHFNAMVI------------------------------------ 242
KH+TAYD++ +GTD +HFNA V
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269
Query: 243 ---YTYYLIKFK-------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
YT+ + YIVSDCD+V +LY +YT T E+A A + AG+DLNCG+F
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMDLNCGTF 329
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
L +HT AA++ G V E+A+D A+SN MRLG FDG+ S + Y +GP VCT+ ++
Sbjct: 330 LLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVCTREHRQ 388
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
L+L+AA QGIVLLKN+ LP + +AVIGP+ N T+TM+GNY G PC+Y TP QG
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448
Query: 413 LAAVV-ATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
L +++ GC ++ C T A +AA ++DA V+V+G D+ E E DR LLL
Sbjct: 449 LQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+ EV+KVAKGPVIL++MSGG D++FAK + KI+S+LWVGYPG+AGG AIA V
Sbjct: 509 PGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEAGGKAIARV 568
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG +NP+GRLPMTWYPQ++
Sbjct: 569 IFGDHNPAGRLPMTWYPQAF 588
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 295/593 (49%), Positives = 380/593 (64%), Gaps = 58/593 (9%)
Query: 13 VSVLFLFLT-YCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
++ +FLFLT Y L H+ S A + P ++CD SNP S FCN +L I R D+V RL
Sbjct: 8 ITFIFLFLTRYHRLVHADSLATNVPPYSCDT-SNPLTKSFPFCNLNLTITQRAKDIVSRL 66
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TL EKI+ LV++A ++ RLGIP Y+WW+EALHGVSYVG G + + ATSFPQ+IL A
Sbjct: 67 TLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILIA 126
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLA 190
ASF+ L+ I KV+ TEAR +YN G A G+TFW+PNINIFRDPRWGRGQET GEDPL+
Sbjct: 127 ASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 186
Query: 191 SKYATGYVKGLQ--QTDGGS--PNRLKVAACCKHYTAYDLDNWKGTDRYHFNA------- 239
SKY YV+GLQ +GG RLK +ACCKH+TAYDL+NWKG +RY F+A
Sbjct: 187 SKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDL 246
Query: 240 --------------------MVIYT----------YYLI--------KFK-YIVSDCDSV 260
M Y Y L+ F YI SDCD+V
Sbjct: 247 ADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAV 306
Query: 261 DVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
+Y Q Y KTPE+ A + AG+D+ CG+++ KH ++AV + S ID A+ N F
Sbjct: 307 RFIYEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFT 366
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
+RLG FDG+P+K YG++GP VC++ N DLAL+AAR GIVLLKNTA LPL +
Sbjct: 367 IRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVN 424
Query: 381 NLAVIGPNANVTK-TMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSN-VQCGTAQVDD 437
L VIGPNAN + ++GNY G PCK + L+G + T Y++GC++ V+C +A++D
Sbjct: 425 TLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEIDR 484
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A + A +D +LVMG DQS E E+ DR L LPG+QQ LI VAK +K PVIL+I+ GG
Sbjct: 485 AVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCGG 544
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
DI+FAKN+ KI I+W GYPG+ GG A+A V FG YNP GRLPMTWYP+ +
Sbjct: 545 PVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDF 597
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 289/567 (50%), Positives = 369/567 (65%), Gaps = 53/567 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P +CD SNP+ FC T L I R DLV RL + EKI+ L ++A + RLG+P Y
Sbjct: 21 PPHSCDP-SNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVPAY 79
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
EWWSEALHGV+Y GPG F+ V ATSFPQVILTAASF++ + I +V+ EAR +YN
Sbjct: 80 EWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYN 139
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS--PN 210
G A G+TFW+PNINIFRDPRWGRGQETPGEDP++ YA YV+GLQ DG
Sbjct: 140 AGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRKTLSI 199
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV------------------------IYTYY 246
L+ +ACCKH+TAYDLD WKG RY FNA V I Y
Sbjct: 200 HLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAY 259
Query: 247 ---------------------LIKFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
L +F+ YI SDCD+V +++++Q Y KTPE+A A + AG
Sbjct: 260 NRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADVLKAG 319
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCGS+L KHT++A++ V+E+ ID A+ N F+ +RLG F+G P+K PYG + P D
Sbjct: 320 MDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPND 379
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+ A+Q LAL+AAR GIVLLKN LP S ++ +LAVIGPNA+V KT++GNY G PC
Sbjct: 380 VCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAGPPC 439
Query: 405 KYTTPLQGLAAVVAT-IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
K TPL L + V +Y GC +V C A +D A A +AD VL+MG DQ+ E E
Sbjct: 440 KTVTPLDALRSYVKNAVYHNGCDSVACSNAAIDQAVAIARNADHVVLIMGLDQTQEKEDM 499
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+DL LPG+QQ LIT VA AK PV+L+++ GG DISFA N+ KI SI+W GYPG+AG
Sbjct: 500 DRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYPGEAG 559
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+A++ FG +NP GRLP+TWYPQS+
Sbjct: 560 GIALAEIIFGDHNPGGRLPVTWYPQSF 586
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 556 bits (1433), Expect = e-155, Method: Compositional matrix adjust.
Identities = 278/521 (53%), Positives = 356/521 (68%), Gaps = 32/521 (6%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P +CD SNP+ FC T L I R DLV RLT+ EKI+ LV++A + RLG+P Y
Sbjct: 22 PPHSCDP-SNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAY 80
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
EWWSEALHGV+Y GPG F+ V ATSFPQVILTAASF++ + I +V+ EAR +YN
Sbjct: 81 EWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYN 140
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS--PN 210
G A G+TFW+PNINIFRDPRWGRGQETPGEDP++ YA YV+GLQ DG N
Sbjct: 141 AGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTLSN 200
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYT 270
L+ +ACCKH+TAYDLD WK DCD+V ++Y++Q Y
Sbjct: 201 HLQASACCKHFTAYDLDRWK-------------------------DCDAVSIIYDAQGYA 235
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
K+PE+A A + AG+D+NCGS+L KHT++A++ V+E+ ID A+ N F+ +RLG F+G
Sbjct: 236 KSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNG 295
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P+K PYG + P +VC+ A+Q LALDAAR GIVLLKN LP S ++ +LAVIGPNA+
Sbjct: 296 DPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAH 355
Query: 391 VTKTMIGNYEGTPCKYTTPLQGLAAVVAT-IYQAGCSNVQCGTAQVDDAKKAAASADATV 449
V KT++GNY G PCK TPL L + V +Y GC +V C A +D A A +AD V
Sbjct: 356 VVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVV 415
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
L+MG DQ+ E E DR+DL LPG+QQ LIT VA AK PV+L+++ GG DISFA N+ K
Sbjct: 416 LIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNK 475
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I SI+W GYPG+AGG AI+++ FG +NP GRLP+TWYPQS+
Sbjct: 476 IGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSF 516
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 301/588 (51%), Positives = 377/588 (64%), Gaps = 52/588 (8%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
S+L + L Y ++ + SP F+CD SNPS S FC T+L I RV DLV RLTL
Sbjct: 5 SLLLINLIYVTVILVGVESTQSPPFSCDS-SNPSTKSYHFCKTTLPIPDRVRDLVSRLTL 63
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
EKI+ LV+SA ++ RLGIP YEWWSEALHGV+ GPG F+ + ATSFPQVILTAAS
Sbjct: 64 DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTAAS 123
Query: 134 FNASLFQAIGKVVSTEARAMYNVG-LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
F+ L+ IG+ + EARA+YN G G+TFW+PNINIFRDPRWGRGQETPGEDPL+
Sbjct: 124 FDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGS 183
Query: 193 YATGYVKGLQQTDGGSPNR---LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTY 245
YA YV+G+Q R L+ +ACCKH+TAYDLD+WKG DR+ F+A V +
Sbjct: 184 YAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADT 243
Query: 246 YLIKFK------------------------------------------YIVSDCDSVDVL 263
Y F YI SDCD+V ++
Sbjct: 244 YQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLI 303
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
++S + KTPE+A + AG+D+NCG++L HT++AV + ES +D A+ N FA M
Sbjct: 304 HDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRM 363
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
RLG F+G+P QPYG +GP VC+ +Q LALDAAR GIVLLKN+ LPL +LA
Sbjct: 364 RLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLA 423
Query: 384 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCSNVQCGTAQVDDAKKAA 442
VIGPNAN KT+IGNY G PCK+ TPLQ L + V +T+Y GC V C + ++ A + A
Sbjct: 424 VIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIEKAVEIA 483
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
AD VLVMG DQ+ E E+HDRLDL+LPG+QQ LI VA AK PV+L+++SGG DIS
Sbjct: 484 QKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDIS 543
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FAK I SILW GYPG AGGAAIA+ FG +NP GRLP+TWYPQ +
Sbjct: 544 FAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDF 591
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 282/570 (49%), Positives = 365/570 (64%), Gaps = 54/570 (9%)
Query: 32 AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
A S P + C + P++ FC+T+L ID RV DLV RLT+ EKI+ L D + ++ RLG
Sbjct: 34 AASEPPYTCGAGAPPNIP---FCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLG 90
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
+P Y+WWSEALHGV+ G G H + ATSFPQVILTAASFN L+ IG+V+ EAR
Sbjct: 91 VPAYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEAR 150
Query: 152 AMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP- 209
A+YN G A GLTFW+PNIN+FRDPRWGRGQETPGEDP + KYA +V+G+Q P
Sbjct: 151 AVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPV 210
Query: 210 --NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
L+ +ACCKH+TAYDL+NWKG RY ++A V + Y FK
Sbjct: 211 NSTDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIM 270
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YI SDCD+V +++++Q Y KT E+A A +
Sbjct: 271 CSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVL 330
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
AG+D+NCG ++ K+ +A++ G + E I+ A+ N F MRLG F+G P + YG +G
Sbjct: 331 KAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIG 390
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
P VCTQ +QDLAL+AA+ GIVLLKN G+LPLS + + +LAVIG NAN +++GNY G
Sbjct: 391 PDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFG 450
Query: 402 TPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
PC TPLQ L V T + AGC++ C + +A +AA+SAD+ VL MG DQ+ E
Sbjct: 451 PPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQNQER 510
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DRLDL LPGQQQ LI VA AK PVIL+++ GG D+SFAK +PKI +ILW GYPG
Sbjct: 511 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 570
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+AGG AIA V FG +NP GRLP+TWYPQ +
Sbjct: 571 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDF 600
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 284/601 (47%), Positives = 363/601 (60%), Gaps = 66/601 (10%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSP-----VFACDVVSNPSL----ASLGFCNTSLGIDLR 63
+S+ F+F + S P F CD L +S GFC++SL R
Sbjct: 26 LSLFFIFTANARVFPRRSLLDDPPAVNNFTFVCDPSRYDKLGLDFSSFGFCDSSLSFPER 85
Query: 64 VGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATS 123
DL+ R+TL EK L A V RLG+P Y WWSEALHGVS VGPGT F VVPGATS
Sbjct: 86 AKDLIDRMTLSEKAAQLGHVASGVDRLGLPPYNWWSEALHGVSNVGPGTQFDKVVPGATS 145
Query: 124 FPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETP 183
FP VI TA+SFN L++ IG+ VSTEARAMYN+G AGLT+WSP IN+ RDPRWGR ETP
Sbjct: 146 FPNVITTASSFNEDLWKTIGQAVSTEARAMYNLGRAGLTYWSPTINVIRDPRWGRTVETP 205
Query: 184 GEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
GEDP + KYA YV+GLQ +G + LKV++CCKHY AYD+DNW G +RY F
Sbjct: 206 GEDPFVVGKYAKNYVRGLQDVEGSENVTDLNSRPLKVSSCCKHYAAYDVDNWLGVERYSF 265
Query: 238 NAMVIYTYYLIKFK---------------------------------------------- 251
+A V L F
Sbjct: 266 DARVTEQDMLETFNKPFEMCVKEGDVSSVMCSYNRVNGIPTCADPVLLKDTIRGNWGLHG 325
Query: 252 YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
YIVSDCDSV V+ HY + T E+A A+++ AGLDL+CG +TE+ V+ G V
Sbjct: 326 YIVSDCDSVKVMVEDAHYLQDTNEDAVAQTLKAGLDLDCGQIYPNYTESTVRQGKVGMRN 385
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
ID+A++N + LMRLG+FDG+ + LG D+C+ + +LA +AARQG VLLKN
Sbjct: 386 IDNALNNLYVVLMRLGYFDGNTG---FESLGKPDICSDEHIELATEAARQGTVLLKNDND 442
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC 430
+LP P+ K LAV+GP+AN T M+GNY G PC+ +P+ GL+ YQ GC +V C
Sbjct: 443 TLPFDPSNYKTLAVVGPHANATSAMLGNYAGVPCRMNSPMDGLSEYAKVKYQMGCDSVAC 502
Query: 431 GTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
+ A +AA ++DATV+ +G D SIEAES DR+DLLLPG Q L+ +VA V+KGPV
Sbjct: 503 KNDTFIFGAMEAARTSDATVIFVGIDLSIEAESLDRVDLLLPGYQTQLVQQVATVSKGPV 562
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+L+I+S GG D+SFAKN+ I +I+W GYPG+ GG AIADV FG++NP GRLP+TWY
Sbjct: 563 VLVILSAGGIDVSFAKNNSNIKAIIWAGYPGEEGGRAIADVIFGKFNPGGRLPLTWYEND 622
Query: 550 Y 550
Y
Sbjct: 623 Y 623
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 542 bits (1397), Expect = e-151, Method: Compositional matrix adjust.
Identities = 286/586 (48%), Positives = 372/586 (63%), Gaps = 65/586 (11%)
Query: 24 SLQHSSSSAQ-SSPVFACDVVSNPSLA----SLGFCNTSLGIDLRVGDLVKRLTLQEKIT 78
+L H ++ + SS + CD +L + GFC++SL ++R DLV ++TL+EK+
Sbjct: 34 TLNHDDANPRGSSFTYVCDSSRYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKVQ 93
Query: 79 FLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASL 138
L D A V RLGIPKYEWWSEALHGVS VGPGT F ++VPGATSFP ILT ASFN SL
Sbjct: 94 QLGDLAYGVPRLGIPKYEWWSEALHGVSDVGPGTFFDDLVPGATSFPTTILTTASFNESL 153
Query: 139 FQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYV 198
++ IG+ S +ARAMYN+G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +YA YV
Sbjct: 154 WKNIGQ-ASAKARAMYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYV 212
Query: 199 KGLQQTDGG------SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
+GLQ +G + LKV++CCKHY AYD++ W+G +R F+A V + +L
Sbjct: 213 RGLQDVEGTENYTDLNTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLR 272
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDS++V+ ++
Sbjct: 273 PFEMCVKEGDVSSVMCSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDN 332
Query: 267 QHYT-KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
+ T E+A A+ + AGLDL+CG + TE +VK G E ID ++ + LMRL
Sbjct: 333 HKFLGDTNEDAVAQVLKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRL 392
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
GFFDG P Y +LG KD+CT+ N +LA AAR+GIVLLKN +LPLS +KNLAV+
Sbjct: 393 GFFDGTPQ---YQKLGKKDICTKENVELAKQAAREGIVLLKNN-DTLPLSMDKVKNLAVV 448
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAAS 444
GP+AN T+ MIGNY G PC+Y +P+ G + Y+ GC +V C + V A AA +
Sbjct: 449 GPHANATRVMIGNYAGVPCRYVSPIDGFSIYSNVTYEIGC-DVPCKNESLVFPAVHAAKN 507
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
ADAT++V G D +IEAE DR DLLLPG Q LI +VA A GPVIL+IM+ GG DISFA
Sbjct: 508 ADATIIVAGLDLTIEAEGLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFA 567
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+++ KI +ILWVGYPGQ GG AIADV FG+YNP GRLP+TWY +
Sbjct: 568 RDNEKIKAILWVGYPGQEGGHAIADVVFGKYNPGGRLPITWYEADF 613
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 294/588 (50%), Positives = 370/588 (62%), Gaps = 62/588 (10%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
LFL LTY L AQ P FACD SNPS S FCN L I R DLV RLTL E
Sbjct: 15 LFLTLTYSVL------AQLPP-FACDY-SNPSTRSYPFCNPKLPITQRTKDLVSRLTLDE 66
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFN 135
K+ LV+SA + RLGIP YEWWSEALHG+ VG G F+ + ATSFPQVILTAASF+
Sbjct: 67 KLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATSFPQVILTAASFD 126
Query: 136 ASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
+ L+ IG+ + EARA+YN G A G+TFW+PNINIFRDPRWGRGQET GEDP++ S YA
Sbjct: 127 SHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNYA 186
Query: 195 TGYVKGLQQTD---GGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYL 247
YV+GLQ G L+ +ACCKH+TAYDLDNWKG +R+HF+A V + Y
Sbjct: 187 VSYVRGLQGDSFQGGKLRGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQ 246
Query: 248 IKFK------------------------------------------YIVSDCDSVDVLYN 265
F+ YIVSDC +V ++++
Sbjct: 247 PPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHD 306
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
Q Y K+ E+A A + AG+DL CGS+L H ++AV+ + ID A+ N F+ +RL
Sbjct: 307 EQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRL 366
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G FDG+P+K P+G +GP VC++ + LAL+AAR GIVLLKNTA LPL T+I +LAVI
Sbjct: 367 GQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTSI-SLAVI 425
Query: 386 GPNANVTK-TMIGNYEGTPCKYTTPLQGLAAVVAT-IYQAGCSN-VQCGTAQVDDAKKAA 442
GPNAN + T++GNY G PCK T LQG V ++ GC +C +A +D A K A
Sbjct: 426 GPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVA 485
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
+AD VLVMG DQS+E E DR+ L LPG+Q LI VAK +K PVIL+++ GG DIS
Sbjct: 486 KNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDIS 545
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AKN+ KI I+W GYPG+ GG A+A + FG +NP GRLP+TWYP+ Y
Sbjct: 546 SAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDY 593
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 277/591 (46%), Positives = 367/591 (62%), Gaps = 55/591 (9%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
K +V+F+F+ L + S + P F+CD SNP+ +S FCN +L I RV DLV RL
Sbjct: 8 KFTVMFIFVILVLLFRRTESTK--PPFSCDS-SNPNTSSYPFCNAALPIPQRVNDLVSRL 64
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
T+ EKI LV+ A + RLGI YEWWSE LHG+S G GT F+ + AT FPQ+ILTA
Sbjct: 65 TVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIILTA 124
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVG-LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLA 190
+SF+ +L+ I + + EARA+YN G L G+T W+PNINI RDPRWGRGQETPGEDP++
Sbjct: 125 SSFDENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMV 184
Query: 191 SKYATGYVKGLQ--QTDGGS--PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT-- 244
KY YV+GLQ +GG L+ +ACCKH+ A D+DNW RY F+A V+
Sbjct: 185 GKYGVAYVRGLQGDSFEGGKLKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQDL 244
Query: 245 --YYLIKFK------------------------------------------YIVSDCDSV 260
Y FK YIVSDCD+V
Sbjct: 245 ADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCDAV 304
Query: 261 DVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
D +Y+ QHY K PE+A A ++ AG+D+NCGS L +T++A++ V ES ID A+ N F+
Sbjct: 305 DKMYSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNLFS 364
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
MRLG F+G PSK YG + +VC++ ++ LA++AAR G VLLKN+ LPLS
Sbjct: 365 VRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMKTA 424
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAK 439
+LAVIGP AN ++ ++GNYEG CK T QGL VA T+Y GC + C + +D+A
Sbjct: 425 SLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAIDEAV 484
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
A AD VLVMG DQ++E E DR +L LPG Q+ LIT +A+ A PVIL++M GG
Sbjct: 485 NIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGGPV 544
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D++FAK++PKI ILWVGYPG+ G AA+A + FG +NP GR P+TWYP+ +
Sbjct: 545 DVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEF 595
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 293/595 (49%), Positives = 376/595 (63%), Gaps = 60/595 (10%)
Query: 13 VSVLFLFLT-YCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
+S +FLFLT Y L H+ S P ++CD+ +NP S FCN +L R D+V RL
Sbjct: 11 ISFIFLFLTRYNQLVHADSPTLVPP-YSCDI-TNPLTKSYTFCNLNLTTIQRAKDIVSRL 68
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN--VVPGATSFPQVIL 129
TL EK+ LV++A ++ RLGI Y+WWSEALHGV+ G G + + AT FPQVIL
Sbjct: 69 TLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGKGIRLNGNVTIKAATIFPQVIL 128
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPL 188
TAASF++ L+ I KV+ TEARA+YN G A G+TFW+PNINIFRDPRWGRGQET GEDPL
Sbjct: 129 TAASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPL 188
Query: 189 LASKYATGYVKGLQ--QTDGGSPN--RLKVAACCKHYTAYDLDNWKGTDRYHFNA----- 239
+++KYA +V+GLQ +GG N RLK +ACCKH+TAYDLDNWKG DR+ F+A
Sbjct: 189 VSAKYAVSFVRGLQGDSFEGGKLNEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQ 248
Query: 240 ----------------------MVIYT----------YYLI--------KFK-YIVSDCD 258
M Y Y L+ F YI SDC
Sbjct: 249 DLADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCS 308
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
+VD++++ Q Y K PE+A A + AG+D+ CG + H+++AV V S ID A+ N
Sbjct: 309 AVDIIHDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNL 368
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
F+ +RLG FDGHP+K YG++GP VC++ N ++AL+AAR GIVLLKN A LPL P +
Sbjct: 369 FSIRIRLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPL-PKS 427
Query: 379 IKNLAVIGPNANVT-KTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSN-VQCGTAQV 435
++ VIGPNAN + + ++GNY G PC T LQG + Y GCS+ +C +A++
Sbjct: 428 TDSIVVIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEI 487
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D A + A D VLVMG DQS E+E HDR DL LPG+QQ LI VAK +K PVIL++
Sbjct: 488 DRAVEVAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFC 547
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG DISFAK D KI ILW GYPG+ GG A+A V FG YNP GRLPMTWYP+ +
Sbjct: 548 GGPVDISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDF 602
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 284/567 (50%), Positives = 359/567 (63%), Gaps = 58/567 (10%)
Query: 35 SPVFACDVVSNPSLASLG--FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
P ++C P SLG FC+ L R DLV RLT EK+ L D AG V+RLG+
Sbjct: 24 EPPYSC----GPRSPSLGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGV 79
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA 152
P Y+WWSE LHG+SY G G HF+ V TSFPQV+LTAA+F+ L+ IG+ + TEARA
Sbjct: 80 PPYKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARA 139
Query: 153 MYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
+YN+G A GLT WSPN+NI+RDPRWGRGQETPGEDP ASKYA +VKGLQ G +P
Sbjct: 140 LYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPGT 196
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------- 251
L+ +ACCKH TAYDL+ W G RY+FNA V F
Sbjct: 197 LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYT 256
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
Y+ SDCD+V +L ++Q Y TPE+ A +I AGL
Sbjct: 257 DINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGL 316
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKD 344
DLNCG++ H AA++ G + ES +D A++N FA MRLG FDG P S YG LG D
Sbjct: 317 DLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAAD 376
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VCTQA++DLAL+AA+ GIVLLKN AG+LPL +++ AVIGPNAN + GNY G PC
Sbjct: 377 VCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPC 436
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
+ TTPLQG+ ++++ + AGC + CG A A A+S+D ++ MG Q E E
Sbjct: 437 ETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGL 496
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR LLLPG+QQ LIT VA A+ PVIL++++GG D++FAKN+PKI +ILW GYPGQAG
Sbjct: 497 DRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 556
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AIA V FG +NPSGRLP+TWYP+ +
Sbjct: 557 GLAIAKVLFGDHNPSGRLPVTWYPEEF 583
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 286/599 (47%), Positives = 367/599 (61%), Gaps = 56/599 (9%)
Query: 5 QNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
+ R R V + L + S P + C + P++ FC+ L ID RV
Sbjct: 2 EGRARLSLVHAAAALVVLLQLHGGAVVVASEPPYTCGAGAPPNIP---FCDAGLPIDRRV 58
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLV R+T+ EKI+ L D + ++ RLG+P Y+WWSEALHG+S G G H + ATSF
Sbjct: 59 DDLVSRMTVAEKISQLGDQSPAIPRLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSF 118
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETP 183
PQVILTAASFN L+ IG+V+ EARA+YN G A GLTFW+PNIN+FRDPRWGRGQETP
Sbjct: 119 PQVILTAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETP 178
Query: 184 GEDPLLASKYATGYVKGLQQTDGGSP---NRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
GEDP + KYA +V+G+Q P L+ +ACCKH+TAYDL+NWKG RY F+A
Sbjct: 179 GEDPTMTGKYAAVFVRGVQGYGLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAK 238
Query: 241 VIYT-------------------------------------YYLIK---------FKYIV 254
V Y L+ + YI
Sbjct: 239 VTAQDLADTYNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYIT 298
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+V +++++Q Y KT E+A A + AG+D+NCGS++ H +A++ G + E I+ A
Sbjct: 299 SDCDAVAIIHDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRA 358
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT--AGSL 372
+ N FA MRLG F+G P + YG +GP VCTQ +QDLAL+AA+ GIVLLKN AG+L
Sbjct: 359 LHNLFAVRMRLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGAL 418
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCG 431
PLS + +LAVIG NAN + GNY G PC TPLQ L V T + AGC++ C
Sbjct: 419 PLSKPNVASLAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACN 478
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
+ +A +AA+SAD+ VL MG DQ E E DRLDL LPGQQQ LI VA AK PVIL
Sbjct: 479 VTTIPEAVQAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVIL 538
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+++ GG D+SFAK +PKI +ILW GYPG+AGG AIA V FG +NP GRLP+TWYPQ +
Sbjct: 539 VLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDF 597
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 284/565 (50%), Positives = 361/565 (63%), Gaps = 54/565 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
P ++C S PS + FC+ L R DLV RLT EK+ L D AG V RLG+P
Sbjct: 24 EPPYSCGPRS-PS-SGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPP 81
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
Y+WWSE LHG+SY G G HF+ V TSFPQV+LTAA+F+ L+ IG+ + TEARA+Y
Sbjct: 82 YKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALY 141
Query: 155 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
N+G A GLT WSPN+NI+RDPRWGRGQETPGEDP ASKYA +VKGLQ G +P L+
Sbjct: 142 NLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPGTLQ 198
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------ 251
+ACCKH TAYDL+ W G RY+FNA V + + FK
Sbjct: 199 TSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDI 258
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
Y+ SDCD+V +L ++Q Y TPE+ A +I AGLDL
Sbjct: 259 NGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDL 318
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVC 346
NCG++ H AA++ G + ES +D A++N FA MRLG FDG P S YG LG DVC
Sbjct: 319 NCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVC 378
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
TQA++DLAL+AA+ GIVLLKN AG+LPL +++ AVIGPNAN + GNY G PC+
Sbjct: 379 TQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCET 438
Query: 407 TTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
TTPLQG+ ++++ + AGC + CG A A A+S+D ++ MG Q E E DR
Sbjct: 439 TTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDR 498
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
LLLPG+QQ LIT VA A+ PVIL++++GG D++FAKN+PKI +ILW GYPGQAGG
Sbjct: 499 TSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 558
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIA V FG +NPSGRLP+TWYP+ +
Sbjct: 559 AIAKVLFGDHNPSGRLPVTWYPEEF 583
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 277/578 (47%), Positives = 366/578 (63%), Gaps = 58/578 (10%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSV 87
S+S + ++P F C P+ + FCN SL I R L+ LTLQEKI L D+A +
Sbjct: 20 SNSKSVANPQFPC---KPPTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGI 76
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP--GATSFPQVILTAASFNASLFQAIGKV 145
RLGIP YEWWSE+LHG+S GPG F N P AT FPQVI++AASFN +L+ IG
Sbjct: 77 PRLGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSA 136
Query: 146 VSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--- 202
++ EARAMYNVG AGLTFW+PNINIFRDPRWGRGQETPGEDP++AS YA +VKG Q
Sbjct: 137 IAIEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGH 196
Query: 203 -QTDGGSPN--RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---- 251
+ + G N +L ++ACCKH TAYDL+ W RY FNA+V + Y F+
Sbjct: 197 WKNEDGEINDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQ 256
Query: 252 -------------------------------------YIVSDCDSVDVLYNSQHYTKTPE 274
YI SDCD+V ++ Q+Y+K+PE
Sbjct: 257 KGKASCLMCSYNEVNGVPACAREDLLQKPRTEWGFKGYITSDCDAVATIFEYQNYSKSPE 316
Query: 275 EAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK 334
+A A ++ AG+D+NCG+++ ++ ++AV+ G + E ID A+ N F+ +RLG FDG P K
Sbjct: 317 DAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRK 376
Query: 335 QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKT 394
+G+LGPK+VCT+ ++ LAL+AARQGIVLLKN LPL+ A+ +LA+IGP AN+ +
Sbjct: 377 GQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANS 436
Query: 395 MIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVM 452
+ G+Y G PC + +GL A V T Y GC +V C Q A A AD ++V
Sbjct: 437 LGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVA 496
Query: 453 GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
G D S E E HDR+ LLLPG+Q L++ VA +K PVIL++ GG D+SFAK DP+I S
Sbjct: 497 GLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIAS 556
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ILW+GYPG+AG A+A++ FG YNP GRLPMTWYP+S+
Sbjct: 557 ILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESF 594
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 281/582 (48%), Positives = 359/582 (61%), Gaps = 53/582 (9%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
LF+ L L S S + P F C P+ + FCNTSL I R LV LTL E
Sbjct: 8 LFICLFLQVLPLFSISESTHPQFPC---MPPTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFN 135
KI L D A ++ RL IP YEWWSE+LHG++ GPG F+ V ATSFPQV+LTAASFN
Sbjct: 65 KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124
Query: 136 ASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAT 195
SL+ +IG ++ EARAMYNVG AGLTFW+PNINIFRDPRWGRGQETPGEDP++AS YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAV 184
Query: 196 GYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK 251
+V+G Q G + L ++ACCKH TAYDL+ W RY F+A+V + Y F+
Sbjct: 185 EFVRGFQGDSDG--DGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFR 242
Query: 252 -----------------------------------------YIVSDCDSVDVLYNSQHYT 270
YI SDCD+V +Y QHY
Sbjct: 243 SCVQQGKASCLMCSYNRVNGVPACARQDLFQKAKTEWGFKGYITSDCDAVATVYEYQHYA 302
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
+PE+A A + AG D+NCGS++ +HT++A+ G V E ID A+ N F+ MRLG FDG
Sbjct: 303 NSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDG 362
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P+ YG LGPKDVCT+ ++ LAL+AARQGIVLLKN LPL + I +LA+IGP A+
Sbjct: 363 DPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQAD 422
Query: 391 VTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQC-GTAQVDDAKKAAASADAT 448
+ G Y G PCK + ++GL V T + AGC +V C D+A A AD
Sbjct: 423 -QPFLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIV 481
Query: 449 VLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDP 508
V+V G D S E E HDR+ LLLPG+Q LI+ VA + P++L++ GG D+SFA+ DP
Sbjct: 482 VVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDP 541
Query: 509 KITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+I SILW+GYPG+AG A+A++ FG +NP GRLPMTWYP+S+
Sbjct: 542 RIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 583
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 292/550 (53%), Positives = 358/550 (65%), Gaps = 57/550 (10%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
+ FC+TSL ++RV DLV RLTL+EK+T LV++A ++ RL IP YEWW E LHGV++V
Sbjct: 1 MRFCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS- 59
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
F +P ATSFP ILT ASFN L+ IG+ STEARA YN G+AGLT+WSP INI
Sbjct: 60 ---FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINI 116
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR QET GEDP S YAT +V+G+Q+ D S RLK++ACCKH+TAYD+DNW+
Sbjct: 117 ARDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANS-KRLKLSACCKHFTAYDVDNWE 175
Query: 231 GTDRYHFNA------------------------MVIYT----------YYLIKFK----- 251
G DRYHF+A M Y Y ++
Sbjct: 176 GIDRYHFDAKANLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLENTVRRAW 235
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
YIVSDCDSV V++ S +Y T E+AAA ++ AGLDLNCG +L +TE AV G VN
Sbjct: 236 GLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDLNCGDYLASYTEGAVAMGKVN 295
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
S +D+A+ N F MRLG FDG+P+ Q +G +G DVCT A+Q+LA++AARQGIVLLKN
Sbjct: 296 ASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAARQGIVLLKN 355
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQ----- 422
LPLS N AVIGPNAN T TM+GNYEG PC+Y TPLQGL + Y
Sbjct: 356 DGNILPLSKNI--NTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSGDYHKVWFS 413
Query: 423 AGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
GC N C Q+ A AA ADA VLV+G Q E+E+ DR LLLPG QQ LI EV
Sbjct: 414 EGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGYQQTLIDEV 473
Query: 482 AKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
A A G PV+L++M G DI+FAKND +I SILWVGYPGQ+GG AIA+V FG +NP G+
Sbjct: 474 AGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIFGAHNPGGK 533
Query: 541 LPMTWYPQSY 550
LPM+WYP+ Y
Sbjct: 534 LPMSWYPEDY 543
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 278/568 (48%), Positives = 360/568 (63%), Gaps = 56/568 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
S P F+C S S FCN L I+ R DLV RLTL+EKI+ L D + +V RLG+P
Sbjct: 26 SGPPFSCGAPS-----SAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVP 80
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHGVS G G H + ATSFPQVILTAASFN L+ IG+V+ TEARA+
Sbjct: 81 AYKWWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAV 140
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR 211
YN G A GLTFW+PNIN+FRDPRWGRGQETPGEDP + KYA +V+G+Q G+ N
Sbjct: 141 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINS 200
Query: 212 --LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT------------------------- 244
L+ +ACCKH+TAYDL+NWKG RY F+A V
Sbjct: 201 TDLEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCS 260
Query: 245 ------------YYLIK---------FKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y L+ + YI SDCD+V ++++ Q Y KT E+A A + A
Sbjct: 261 YNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKA 320
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G+D+NCGS++ +H +A++ G + E I+ A+ N FA MRLG F+G+P YG +GP
Sbjct: 321 GMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPD 380
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
VCTQ +Q+LAL+AA+ G+VLLKN A +LPLS + + ++AVIG NAN ++GNY G P
Sbjct: 381 QVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPP 440
Query: 404 CKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C TPLQ L V T + AGC++ C + + +A + A+S D VL MG DQ E E
Sbjct: 441 CISVTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREE 500
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DRL+L LPG Q+ LI VA AK PVIL+++ GG D++FAK +PKI +ILW GYPG+A
Sbjct: 501 VDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEA 560
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIA V FG +NP GRLP+TWYP+ +
Sbjct: 561 GGIAIAQVLFGEHNPGGRLPVTWYPKEF 588
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 278/568 (48%), Positives = 360/568 (63%), Gaps = 56/568 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
S P F+C S S FCN L I+ R DLV RLTL+EKI+ L D + +V RLG+P
Sbjct: 32 SGPPFSCGAPS-----SAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVP 86
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHGVS G G H + ATSFPQVILTAASFN L+ IG+V+ TEARA+
Sbjct: 87 AYKWWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAV 146
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR 211
YN G A GLTFW+PNIN+FRDPRWGRGQETPGEDP + KYA +V+G+Q G+ N
Sbjct: 147 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINS 206
Query: 212 --LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT------------------------- 244
L+ +ACCKH+TAYDL+NWKG RY F+A V
Sbjct: 207 TDLEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCS 266
Query: 245 ------------YYLIK---------FKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y L+ + YI SDCD+V ++++ Q Y KT E+A A + A
Sbjct: 267 YNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKA 326
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G+D+NCGS++ +H +A++ G + E I+ A+ N FA MRLG F+G+P YG +GP
Sbjct: 327 GMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPD 386
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
VCTQ +Q+LAL+AA+ G+VLLKN A +LPLS + + ++AVIG NAN ++GNY G P
Sbjct: 387 QVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPP 446
Query: 404 CKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C TPLQ L V T + AGC++ C + + +A + A+S D VL MG DQ E E
Sbjct: 447 CISVTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREE 506
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DRL+L LPG Q+ LI VA AK PVIL+++ GG D++FAK +PKI +ILW GYPG+A
Sbjct: 507 VDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEA 566
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIA V FG +NP GRLP+TWYP+ +
Sbjct: 567 GGIAIAQVLFGEHNPGGRLPVTWYPKEF 594
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 284/566 (50%), Positives = 361/566 (63%), Gaps = 58/566 (10%)
Query: 36 PVFACDVVSNPSLASLG--FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
P F+C P A+ G FC+ +L ++ R DLV RLTL EK++ L D A +V RLG+P
Sbjct: 34 PPFSC----GPGSATQGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVP 89
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSE LHG+S+ G G HF V TSFPQV+LTAASF+ ++ IG+ + TEARA+
Sbjct: 90 AYKWWSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARAL 149
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
YN+G A GLT WSPN+NI+RDPRWGRGQETPGEDP ASKYA +VKGLQ G S L
Sbjct: 150 YNLGQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GTSATTL 206
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNA---------------------------MVIYTY 245
+ +ACCKH TAYDL++W G RY+FNA M YT
Sbjct: 207 QTSACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTN 266
Query: 246 Y----------LIK--FK-------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
LI FK Y+ SDCD+V +L ++Q Y TPE+ A ++ AGLD
Sbjct: 267 INGVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLD 326
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP-YGQLGPKDV 345
LNCG++ H +A++ G + E +D+A+ N FA MRLG FDG P YG LG DV
Sbjct: 327 LNCGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADV 386
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
C+ A+++LAL+AA+ GIVLLKN AG LPL P+A+ + A IG NAN + GNY G PC+
Sbjct: 387 CSPAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCE 446
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
TTPLQGL V + + AGC + CG A A A+S+D +L MG Q E E D
Sbjct: 447 TTTPLQGLQGYVKNVKFLAGCDSAACGFAATGQAVTLASSSDYVILFMGLSQKEEQEGID 506
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R LLLPG+QQ LIT VA +K PVIL++++GG DI+FAK++PKI +ILW GYPGQAGG
Sbjct: 507 RTSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGG 566
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AIA V FG +NPSGRLP+TWYP+ +
Sbjct: 567 LAIARVLFGDHNPSGRLPVTWYPEEF 592
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 270/557 (48%), Positives = 357/557 (64%), Gaps = 59/557 (10%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ +C++S ++R DLV R+TL EK+ D A V R+G+PKY WWSEALHGVS
Sbjct: 65 MKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSN 124
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G F VVPGATSFP VIL+AASFN SL++ +G+ VSTEARAMYN G AGLTFWSPN
Sbjct: 125 FGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPN 184
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHY 221
IN+ RDPRWGR ETPGEDP L YA YV+GLQ G + LKV++CCKHY
Sbjct: 185 INVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDLNSRPLKVSSCCKHY 244
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
AYDLDNWKG DR HF+A V + +++ F+
Sbjct: 245 AAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCAD 304
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP-EEAAAKSILAGLDLNCGSFLG 294
YIVSDCDSV+V+ Q + + ++AA+++ AG++L+CG+F
Sbjct: 305 SRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNN 364
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+ AV G N++ +DH++ + LMR+GFFDG P+ + LG D+C+ + +LA
Sbjct: 365 RSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELA 421
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
+AARQGIVLLKN +LPL ++KN+A++GP+AN T MIGNY G PC Y +PL +
Sbjct: 422 REAARQGIVLLKNDNATLPLK--SVKNIALVGPHANATDAMIGNYAGIPCYYVSPLDAFS 479
Query: 415 AVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQ 473
++ Y+ GC++VQC + +A +AA ADAT++ G D SIEAE+ DR+DLLLPG
Sbjct: 480 SMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGY 539
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q LI +VA ++ GPV+L+IMSGGG DISFA+++PKI +ILW GYPG+ GG AIADV G
Sbjct: 540 QTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILG 599
Query: 534 RYNPSGRLPMTWYPQSY 550
+YNP GRLP+TWY Y
Sbjct: 600 KYNPGGRLPITWYEADY 616
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 283/601 (47%), Positives = 361/601 (60%), Gaps = 70/601 (11%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
LF+ L L S S + P F C P+ + FCNTSL I R LV LTL E
Sbjct: 8 LFICLFLQVLPLFSISESTHPQFPC---MPPTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFN 135
KI L D A ++ RL IP YEWWSE+LHG++ GPG F+ V ATSFPQV+LTAASFN
Sbjct: 65 KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124
Query: 136 ASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAT 195
SL+ +IG ++ EARAMYNVG AGLTFW+PNINIFRDPRWGRGQETPGEDP++AS YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAV 184
Query: 196 GYVKGLQQTD--GGSPNR-----------------LKVAACCKHYTAYDLDNWKGTDRYH 236
+V+G Q + GG R L ++ACCKH TAYDL+ W RY
Sbjct: 185 EFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYS 244
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
F+A+V + Y F+
Sbjct: 245 FDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKAKTEWGFKG 304
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
YI SDCD+V +Y QHY +PE+A A + AG D+NCGS++ +HT++A+ G V E I
Sbjct: 305 YITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDI 364
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
D A+ N F+ MRLG FDG P+ YG LGPKDVCT+ ++ LAL+AARQGIVLLKN
Sbjct: 365 DRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKF 424
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQC 430
LPL + I +LA+IGP A+ + G Y G PCK + ++GL V T + AGC +V C
Sbjct: 425 LPLDKSRISSLAIIGPQAD-QPFLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPC 483
Query: 431 -GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
D+A A AD V+V G D S E E HDR+ LLLPG+Q LI+ VA + P+
Sbjct: 484 LSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPL 543
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+L++ GG D+SFA+ DP+I SILW+GYPG+AG A+A++ FG +NP GRLPMTWYP+S
Sbjct: 544 VLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 603
Query: 550 Y 550
+
Sbjct: 604 F 604
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 278/571 (48%), Positives = 357/571 (62%), Gaps = 55/571 (9%)
Query: 32 AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
A P F+C + FC+ +L + R DLV RLT EK+ L D A V RLG
Sbjct: 31 AGGEPPFSCGGAAAGGGQGYAFCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLG 90
Query: 92 IPKYEWWSEALHGVSYVGPGTHFS---NVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
+P Y+WWSEALHG++ G G HF + ATSFPQV+LTAA+F+ L+ IG+ + T
Sbjct: 91 VPAYKWWSEALHGLATSGRGLHFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGT 150
Query: 149 EARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARA+YN+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +ASKYA +VKG+Q G
Sbjct: 151 EARALYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQ---GN 207
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
S L+ +ACCKH TAYDL++W G RY+FNA V + Y F+
Sbjct: 208 SSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIM 267
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YI SDCD+V ++ ++Q YT+TPE+A A ++
Sbjct: 268 CAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVAL 327
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP-SKQPYGQL 340
AGLD+NCG+++ +H AA++ G + E ID A+ N FA MRLG FDG P S YG L
Sbjct: 328 KAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGL 387
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
G D+CT ++ LAL+AA GIVLLKN AG LPL TA+ + AVIGPNAN +IGNY
Sbjct: 388 GAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYF 447
Query: 401 GTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIE 459
G PC+ TTPL G+ + + + AGC++ C A D A A+S+D L MG Q E
Sbjct: 448 GPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVASSSDYVFLFMGLSQKQE 507
Query: 460 AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
+E DR LLLPG+QQ LIT VA AK PVIL++++GG D++FA+ +PKI +ILW GYP
Sbjct: 508 SEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYP 567
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GQAGG AIA V FG +NP GRLP+TWYP+ +
Sbjct: 568 GQAGGLAIARVLFGDHNPGGRLPVTWYPEEF 598
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 283/567 (49%), Positives = 358/567 (63%), Gaps = 59/567 (10%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C P A FCN +L + R DLV +LTL+EK++ L D A V R G+P Y
Sbjct: 12 PAFSC---GPPQQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGY 68
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
WWSE LHGVS G G HF+ V G T+FPQV+LT ASF+ S++ IG+ + TEARAM+N
Sbjct: 69 NWWSEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFN 128
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
+G A GLT WSPN+NI+RDPRWGRGQETPGEDP ASKYA +V+GLQ G S L+
Sbjct: 129 LGQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQ---GTSTTTLQT 185
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
+ACCKH TAYDLD+W RY+FNA V + + FK
Sbjct: 186 SACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVN 245
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
YI SDCD+V +LY ++ Y+ TPE+A A +I AGLD+N
Sbjct: 246 GIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLDMN 304
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP-YGQLGPKDVCT 347
CG+F H AA++ ++E +D A+ N FA MRLG FDG P + P YG+LG +DVC+
Sbjct: 305 CGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCS 364
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLS-PTAIK-NLAVIGPNANVTKTMIGNYEGTPCK 405
A++DLAL+AA+ GIVLLKN A +LPLS PTA + AVIGPNAN ++GNY G PC+
Sbjct: 365 PAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCE 424
Query: 406 YTTPLQGLAAVVA--TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TTPLQ L + + GC + C A A AA++D T+L MG Q E E
Sbjct: 425 TTTPLQALQKFYSKNVRFVPGCDSAACNVADTYQASGLAATSDYTILFMGLSQKQEQEGL 484
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR LLLPG+Q+ LIT VA AK P+IL++++GG DI+FAK +PKI +ILW GYPGQAG
Sbjct: 485 DRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQAG 544
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AIA V FG +NPSGRLP+TWYP+ Y
Sbjct: 545 GLAIAKVLFGEHNPSGRLPVTWYPEEY 571
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 271/568 (47%), Positives = 351/568 (61%), Gaps = 56/568 (9%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P F+C S S FC+ L I+ R DLV +LTL+EKI+ L D + +V RLG+P
Sbjct: 30 ADPPFSCGAPS-----SAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVP 84
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHGV+ G G H + ATSFPQVILTAASFN L+ IG+V+ TEAR +
Sbjct: 85 AYKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGV 144
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---QTDGGSP 209
YN G A GLTFW+PNIN+FRDPRWGRGQETPGEDP + KYA +V+G+Q + +
Sbjct: 145 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAINS 204
Query: 210 NRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
+ L+ +ACCKH+TAYDL+NWKG R+ F+A V + Y FK
Sbjct: 205 SDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCS 264
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YI SDCD+V ++++ Q Y K PE+A A + A
Sbjct: 265 YNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKA 324
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G+D+NCG ++ H +A + G + ID A+ N FA MRLG FDG+P YG +G
Sbjct: 325 GMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGAD 384
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
VC++ +QDLAL AAR GIVLLKN +LPLS + + +LAVIGPN N ++GNY G P
Sbjct: 385 QVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPP 444
Query: 404 CKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C TPLQ L V + GC+ C + + +A AA SAD VL MG DQ+ E E
Sbjct: 445 CISVTPLQALQGYVKDARFVQGCNAAVCNVSNIGEAVHAAGSADYVVLFMGLDQNQEREE 504
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DRL+L LPG Q+ L+ VA AK PVIL+++ GG D++FAKN+PKI +I+W GYPGQA
Sbjct: 505 VDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQA 564
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIA V FG +NP GRLP+TWYP+ +
Sbjct: 565 GGIAIAQVLFGDHNPGGRLPVTWYPKEF 592
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 276/588 (46%), Positives = 361/588 (61%), Gaps = 58/588 (9%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
++ FL H+ S+ P ++CD SN S FCNT L I R DLV RLTL
Sbjct: 11 IISFFLLNLHHHHAEST---RPPYSCDSSSNSPYYS--FCNTKLPITKRAQDLVSRLTLD 65
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EK+ LV++A ++ RLGIP Y+WWSEALHGV+ G G F+ + ATSFPQVILTAASF
Sbjct: 66 EKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASF 125
Query: 135 NASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
+ +L+ I K + EARA+YN G A G+TFW+PNIN+FRDPRWGRGQET GEDPL+ +KY
Sbjct: 126 DPNLWYQISKTIGREARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKY 185
Query: 194 ATGYVKGLQ--QTDGGS-PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYY 246
YV+GLQ +GG RL+ +ACCKH+TAYDLD WKG DR+ F+A V + Y
Sbjct: 186 GVAYVRGLQGDSFEGGKLAERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTY 245
Query: 247 LIKFK------------------------------------------YIVSDCDSVDVLY 264
F+ YI SDC +V +++
Sbjct: 246 QPPFQSCIEQGRASGIMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIH 305
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
Q Y KT E+A A AG+D+ CG ++ KH ++AV + S ID A+ N F+ +R
Sbjct: 306 EKQGYAKTAEDAIADVFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIR 365
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LG FDG+P+K P+G +GP +VC++ + LAL+AAR GIVLLKNT LPL P +A+
Sbjct: 366 LGLFDGNPTKLPFGTIGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPL-PKTNPTIAL 424
Query: 385 IGPNANVT-KTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSN-VQCGTAQVDDAKKAA 442
IGPNAN + K +GNY G PC T LQG T+Y GC + QC AQ+++A + A
Sbjct: 425 IGPNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQIEEAVEVA 484
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
D VLVMG DQS E ESHDR L LPG+Q+ LI VA+ AK PV+++++ GG DI+
Sbjct: 485 KKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDIT 544
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AK D K+ ILW GYPG+ GG A+A V FG +NP G+LP+TWYP+ +
Sbjct: 545 SAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDF 592
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 270/538 (50%), Positives = 345/538 (64%), Gaps = 69/538 (12%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
DLV ++TL EK+ L + A V RLG+ +Y+WWSEALHGVS VGPGT F +++PG+TSFP
Sbjct: 2 DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGE 185
VI TAA+FN SL++ IG+ VSTEARAMYN+G AGLT+WSPNIN+ RDPRWGR ETPGE
Sbjct: 62 TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGE 121
Query: 186 DPLLASKYATGYVKGLQQTDGGS----PNR--LKVAACCKHYTAYDLDNWKGTDRYHFNA 239
DP L +YA YV+GLQ +G PN LKV++CCKHY AYD+DNWKG +RY F+A
Sbjct: 122 DPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDA 181
Query: 240 MV----IYTYYLIKFK------------------------------------------YI 253
V + +L F+ YI
Sbjct: 182 RVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYI 241
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
VSDCDS+ V+ + + GLDL+CG++ ++ EAAV+ G V E+ ID
Sbjct: 242 VSDCDSLQVMVENHKWL-------------GLDLDCGAYYTENVEAAVRQGKVREADIDK 288
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
+++ + LMRLGFFDG P Y G DVC++ N +LA +AAR+G VLLKN SLP
Sbjct: 289 SLNFLYVVLMRLGFFDGIPQ---YNSFGKNDVCSKENIELATEAAREGAVLLKNENDSLP 345
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
LS +K LAVIGP++N T MIGNY G PC+ TP++GL+ YQ GCS++ C
Sbjct: 346 LSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKVDYQMGCSDIACKDE 405
Query: 434 Q-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
+ A ++A ADAT+++ G D SIEAES DR DLLLPG Q LI +VA V+ GPV+L+
Sbjct: 406 SFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVVLV 465
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+MS GG DISFAK++ I SILWVGYPG+ GG AIADV FG+YNP GRLP+TW+ Y
Sbjct: 466 LMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADY 523
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 274/565 (48%), Positives = 352/565 (62%), Gaps = 52/565 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C S + L FC+ +L R DLV RLT EKI L D A V RLG+P Y
Sbjct: 26 PPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGY 85
Query: 96 EWWSEALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
+WW+EALHG++ G G HF V V ATSFPQV+LTAA+F+ L+ IG+ + EARA+
Sbjct: 86 KWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARAL 145
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
+NVG A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+G+Q S + L
Sbjct: 146 FNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNS--SSSLL 203
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----------YTYYLIKFK---------- 251
+ +ACCKH TAYDL++W G RY F A V + +++ K
Sbjct: 204 QTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAYTA 263
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
Y+ SDCD+V ++ ++Q Y TPE+A A S+ AGLD
Sbjct: 264 INGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLD 323
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
++CGS++ +H AA++ G + E ID A+ N FA MRLG FDG P K YG L D+C
Sbjct: 324 IDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAADIC 383
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T ++ LAL+AA+ GIVLLKN G LPL + + + AVIGPN+N +I NY G PC+
Sbjct: 384 TPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPCES 443
Query: 407 TTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
TTPLQGL + V + + AGCS+ C A D A + S D L MG Q E+E DR
Sbjct: 444 TTPLQGLQSYVNNVRFLAGCSSAACDVAVTDQAVVLSGSEDYVFLFMGLSQQQESEGKDR 503
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
LLLPG QQ LIT VA +K PVIL+++SGG DI+FA+++PKI +ILW GYPGQAGG
Sbjct: 504 TSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGL 563
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIA V FG +NPSGRLPMTWYP+ +
Sbjct: 564 AIAKVLFGDHNPSGRLPMTWYPEDF 588
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 285/574 (49%), Positives = 357/574 (62%), Gaps = 56/574 (9%)
Query: 30 SSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
SS +P FACD SNPS S FCN L I R DL+ RLTL EK++ LV++A + R
Sbjct: 19 SSKPEAP-FACDF-SNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPR 76
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSN--VVPGATSFPQVILTAASFNASLFQAIGKVVS 147
LGIP Y+WWSEALHGVS VGPG F N + ATSFPQVILTAASF++ L+ IG +
Sbjct: 77 LGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIG 136
Query: 148 TEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EARA++N G A GLTFW+PNINIFRDPRWGRGQET GEDPLL S+YA +V+GLQ D
Sbjct: 137 IEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQ-GDS 195
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
L +ACCKH+TAYDLDNWKG DR+ F+A V + Y F+
Sbjct: 196 FKGAHLLASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGI 255
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
YI SDC +V +++ Q Y K+PE+ A
Sbjct: 256 MCAYNRVNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVVADV 315
Query: 281 ILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
+ AG+DL CGS+L H ++AV + S ID A+ N F+ MRLG FDG+P++ +G +
Sbjct: 316 LRAGMDLECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLI 375
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK-NLAVIGPNANVTK-TMIGN 398
G VC++ +Q LAL+AAR GIVLLKN+ LPL T+ +LAVIGPNAN + T++GN
Sbjct: 376 GSNHVCSKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGN 435
Query: 399 YEGTPCKYTTPLQGLAAVVAT-IYQAGCSN-VQCGTAQVDDAKKAAASADATVLVMGADQ 456
Y G PCKY T LQG V Y GC +C +AQ+D A + A D VLVMG DQ
Sbjct: 436 YAGPPCKYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMGLDQ 495
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
S E E DR+ L LPG+Q LI VA+ +K PVIL+++SGG DI+ AK + KI ILW
Sbjct: 496 SEEREERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWA 555
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPG+ GG A+A + FG +NP GRLP TWYP+ Y
Sbjct: 556 GYPGELGGIALAQIIFGDHNPGGRLPTTWYPKDY 589
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 265/374 (70%), Positives = 300/374 (80%), Gaps = 46/374 (12%)
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
MYNVGLAGLTFWSPN+NIFRDPRWGRGQETPGEDPLL+SKYA+GYV+GLQQ+D GSP+RL
Sbjct: 1 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 60
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYL 247
K+AACCKHYTAYDLDNWKG DR+HFNA+V + +Y
Sbjct: 61 KIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 120
Query: 248 IKFK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
+ K YIVSDCDSVDV YNSQHYTKTPEEAAAK+ILAGLD
Sbjct: 121 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 180
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
LNCGSFLG+HTEAAVK GLV+ESA+D A+SNNFATLMRLGFFDG+PSK YG+LGPKDVC
Sbjct: 181 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 240
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T +Q+LA +AARQGI+LLKN+ GSLPLSPTAIK LA+IGPNANVTKTMIGNYEGTPCKY
Sbjct: 241 TLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKY 300
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
TTPLQGL A+VAT Y +GCSNV C TAQ+D+AKK AA+ADATVL++G DQSIEAE DR+
Sbjct: 301 TTPLQGLMALVATTYLSGCSNVACSTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 360
Query: 467 DLLLPGQQQLLITE 480
++ LPGQQ LLITE
Sbjct: 361 NIQLPGQQPLLITE 374
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 270/565 (47%), Positives = 351/565 (62%), Gaps = 49/565 (8%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+ P F+C S + FC+ +L + R DLV RLT EK+ L D A V RLG+P
Sbjct: 34 ADPPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGVP 93
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WW+EALHG++ G G HF+ V ATSFPQV LTAA+F+ L+ IG+ + EARA+
Sbjct: 94 AYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARAL 153
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
YNVG A GLT WSPN+NI+RDPRWGRGQETPGEDP AS+Y +VKGLQ + S + L
Sbjct: 154 YNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQ-GNSTSSSLL 212
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
+ +ACCKH TAYDL++W G RY+F+A V + Y F+
Sbjct: 213 QTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYTA 272
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
Y+ SDCD+V ++ ++Q Y TPE+A A ++ AGLD
Sbjct: 273 INGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALKAGLD 332
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
++CG+++ +H AA++ G + E +D A+ N FA MRLG FDG P YG L +C
Sbjct: 333 IDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHIC 392
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T ++ LAL+AA+ GIVLLKN AG LPL AI + AVIGPNAN +IGNY G PC+
Sbjct: 393 TPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCES 452
Query: 407 TTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
TPL+G+ V + + AGC + C A D A A S+D +L MG Q E+E DR
Sbjct: 453 VTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAGSSDYVLLFMGLSQQQESEGRDR 512
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
LLLPGQQQ LIT VA AK PVIL++++GG D++FAKN+PKI +ILW GYPGQAGG
Sbjct: 513 TSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 572
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AIA V FG +NP GRLP+TWYP+ +
Sbjct: 573 AIARVLFGDHNPGGRLPVTWYPEEF 597
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 260/557 (46%), Positives = 351/557 (63%), Gaps = 59/557 (10%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ S FC+ SL R DLV R+TLQEK+ V +A V RLG+P+Y WWSEALHG+S
Sbjct: 61 MKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISN 120
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
+GPG F +PGATS P VIL+ A+FN +L++ +G+VVSTE RAMYN+G AGLTFWSPN
Sbjct: 121 LGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPN 180
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHY 221
IN+ RD RWGR QET GEDP + ++A YV+GLQ +G + LKV++CCKHY
Sbjct: 181 INVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDLNSRPLKVSSCCKHY 240
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
AYD+D+W DR+ F+A V + ++ F+
Sbjct: 241 AAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCSFNKINGIPPCSD 300
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCGSFLG 294
YIVSDC ++V+ ++Q+Y + +A AK++ AGLDL CG +
Sbjct: 301 PRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQAGLDLECGHYYT 360
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+V G V++ +D A+ N + LMR+G+FDG P+ Y LG KD+C + +LA
Sbjct: 361 DALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELA 417
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
+AARQGIVLLKN LPL P K +A++GP+AN T+ MIGNY G PCKY +PL+ +
Sbjct: 418 REAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLPCKYVSPLEAFS 475
Query: 415 AVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQ 473
A+ Y GC + C +AK+AA SA+ T++ +G D SIEAE DR+D LLPG
Sbjct: 476 AIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAEFVDRVDFLLPGN 535
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q LI +VA+V+ GPVIL+++SG DI+FAKN+P+I++ILWVG+PG+ GG AIADV FG
Sbjct: 536 QTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFG 595
Query: 534 RYNPSGRLPMTWYPQSY 550
+YNP GRLP+TWY Y
Sbjct: 596 KYNPGGRLPVTWYEADY 612
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 267/417 (64%), Positives = 305/417 (73%), Gaps = 49/417 (11%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P FACD SN +LAS GFCN S R DLV RLTL EK+ FLVD ++ RLG+P
Sbjct: 37 TPAFACDA-SNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPL 95
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNA+LF+AIG+VVS EARAM+
Sbjct: 96 YEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMH 155
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL SKYA GYV GLQ G+ LKV
Sbjct: 156 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA-GALKV 214
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAM-------------------------VIYTYYLIK 249
AACCKHYTAYD+DNWKG +RY F+A+ V+ +Y +
Sbjct: 215 AACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVN 274
Query: 250 FK---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
K YI SDCDSVDVLYN+QHYTKTPE+AAA SI AGLDLN
Sbjct: 275 GKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLN 334
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+FL +HT AAV+AG ++ES +D A++NN TLMRLGFFDG P + P+G LGP DVCT
Sbjct: 335 CGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTP 394
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+NQ+LA +AARQGIVLLKNT G LPLS T+IK++AVIGPNAN + TMIGNYEGT C
Sbjct: 395 SNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTSCN 450
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 520 bits (1338), Expect = e-144, Method: Compositional matrix adjust.
Identities = 272/563 (48%), Positives = 354/563 (62%), Gaps = 50/563 (8%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C S + L FC+ +L R DLV RLT EKI L D A V RLG+P Y
Sbjct: 29 PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
+WW+EALHG++ G G HF V ATSFPQV+LTAA+F+ L+ IG+ + EARA++N
Sbjct: 89 KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
VG A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+G+Q S + L+
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNS--SSSLLQT 206
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVI-----------YTYYLIKFK------------ 251
+ACCKH TAYDL++W G RY F A V + +++ K
Sbjct: 207 SACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAIN 266
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
Y+ SDCD+V ++ ++Q Y TPE+A A S+ AGLD++
Sbjct: 267 GVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLDID 326
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CGS++ +H AA++ G + E ID A++N +A MRLG FDG P K YG LG D+CT
Sbjct: 327 CGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADICTP 386
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+++LAL+AA+ GIVLLKN G LPL + + + AVIGPNAN +I NY G PC+ TT
Sbjct: 387 EHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCESTT 446
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL+GL + V + + AGC++ C A D A A S D L MG Q E+E DR
Sbjct: 447 PLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSEDYVFLFMGLSQKQESEGKDRTS 506
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
LLLPG QQ LIT VA +K PVIL+++SGG DI+FA+++PKI +ILW GYPGQAGG AI
Sbjct: 507 LLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGLAI 566
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
A V FG +NPSGRLP+TWYP+ +
Sbjct: 567 AKVLFGDHNPSGRLPVTWYPEEF 589
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 264/449 (58%), Positives = 311/449 (69%), Gaps = 50/449 (11%)
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
EARAMYN G+AGLTFWSPN+NIFRDPRWGRGQETPGEDP+LA YA YV+GLQ DG
Sbjct: 2 EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDG-- 59
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
RLKVAACCKHYTAYDLDNW+G DR+HFNA V I + I F+
Sbjct: 60 -ERLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVASVMC 118
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCDSV V + +QHYT PEEA A +I
Sbjct: 119 SYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVAAAIK 178
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FL HTEAAV+ GLV++ I+ A++N MRLG FDG PS YG LGP
Sbjct: 179 AGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYGNLGP 238
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+DVCT A+Q LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT TMIGNY G
Sbjct: 239 RDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGV 298
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAE 461
C YTTPLQG+ TI+QAGC++V C Q+ A+ AA ADATVLVMG DQSIEAE
Sbjct: 299 ACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAE 358
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
DR LLPG QQ L++ VA+ ++GP IL++MSGG D++FAKNDP+I++I+WVGYPGQ
Sbjct: 359 FVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQ 418
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AGG AIADV FG NP G+LPMTWYPQ+Y
Sbjct: 419 AGGTAIADVLFGTTNPGGKLPMTWYPQNY 447
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 269/563 (47%), Positives = 350/563 (62%), Gaps = 55/563 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C S+ S FC+ +L + R DLV RLT EK+ L D A V RLG+P Y
Sbjct: 25 PPFSCGQASS----SYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGY 80
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
+WW+EALHG++ G G HF V ATSFPQV LTAA+F+ L+ IG+ + EARA+YN
Sbjct: 81 KWWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYN 140
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
+G A GLT WSPN+NI+RDPRWGRGQETPGEDP AS+YA +V+G+Q G S + L+
Sbjct: 141 LGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQ---GNSTSLLQA 197
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----------------------- 251
+ACCKH TAYDL++W G RY+F+A V F
Sbjct: 198 SACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGIN 257
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
Y SDCD+V ++ ++Q Y ++PE+A A ++ AGLD++
Sbjct: 258 GVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLDID 317
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+++ +H AA++ G + E ID A+ N FA MRLG FDG P YG LG D+CT
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ LALDAA+ GIVLLKN AG LPL A+ + AVIGPNAN +I NY G PC+ TT
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL+G+ V + AGCS+ C A D A A+++D L MG Q E+E DR
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAALASTSDYVFLFMGLGQRQESEGRDRTS 497
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
LLLPG+QQ LIT VA A+ PVIL+++SGG D++FA+ +PKI +ILW GYPGQAGG AI
Sbjct: 498 LLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAI 557
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
A V FG +NPSGRLP+TWYP+ +
Sbjct: 558 ARVLFGDHNPSGRLPVTWYPEEF 580
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 277/591 (46%), Positives = 362/591 (61%), Gaps = 61/591 (10%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
+ + FL LT L H + S Q P ++CD SN FCNT L I R DLV RLT
Sbjct: 10 IFISFLLLT---LHHHAESTQ--PPYSCDSSSNSPYYP--FCNTRLPISKRAQDLVSRLT 62
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L EK+ LV++A ++ RLGIP Y+WWSEALHGV+ G G F+ + ATSFPQVILTAA
Sbjct: 63 LDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAA 122
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
SF+ +L+ I K + EARA+YN G A G+TFW+PNIN+FRDPRWGRGQET GEDPL+ +
Sbjct: 123 SFDPNLWYQISKTIGKEARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNA 182
Query: 192 KYATGYVKGLQ--QTDGGS-PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYT 244
KY YV+GLQ +GG RL+ +ACCKH+TAYDLD+WKG DR+ ++A V +
Sbjct: 183 KYGVAYVRGLQGDSFEGGKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLAD 242
Query: 245 YYLIKFK------------------------------------------YIVSDCDSVDV 262
Y F+ YI SDC +V +
Sbjct: 243 TYQPPFQSCIEQGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSI 302
Query: 263 LYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATL 322
+++ Q Y KT E+A A AG+D+ CG ++ KH ++AV + S ID A+ N F+
Sbjct: 303 IHDEQGYAKTAEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIR 362
Query: 323 MRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNL 382
+RLG DG+P+K P+G +GP VC++ + LAL+AAR GIVLLKNT LPL P +
Sbjct: 363 IRLGLLDGNPTKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPL-PKTNPTI 421
Query: 383 AVIGPNANVT-KTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSN-VQCGTAQVDDAK 439
A+IGPNAN + K +GNY G PC T LQG T+Y GC + QC AQ++ A
Sbjct: 422 ALIGPNANASSKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAV 481
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
+ A D VLVMG DQS E ESHDR L LPG+Q+ LI VA+ +K PV+L+++ GG
Sbjct: 482 EVAKKVDYVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPV 541
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
DI+ AK D K+ ILW GYPG+ GG A+A V FG +NP G+LP+TWYP+ +
Sbjct: 542 DITSAKFDDKVGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDF 592
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 269/552 (48%), Positives = 347/552 (62%), Gaps = 59/552 (10%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+C++SL D+RV DLV R+TL+EK ++D A V R+G+P Y+WWSEALHGV+ VG T
Sbjct: 67 YCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGSAT 126
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
F VVPGATSFP VIL+AASFN SL++ +G+VVSTEARAMYN+G AGLTFWSPNIN+ R
Sbjct: 127 FFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINVAR 186
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHYTAYDL 226
DPRWGR ETPGEDPL Y YV+GLQ +G + LK+A+ CKH+ AYDL
Sbjct: 187 DPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAAYDL 246
Query: 227 DNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------- 251
D W DR HF+A V + +L F+
Sbjct: 247 DQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPRFLK 306
Query: 252 -----------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++D + Q + T EE A S+ AGLDL CG +
Sbjct: 307 GVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDSLAT 366
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+ G V+E +D ++S + LMR+GFFDG PS LG KD+C + +LA +AAR
Sbjct: 367 AVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAREAAR 423
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
QGIVLLKN +LPL P +K LA++GP+AN T MIGNY G PC Y +PL + +
Sbjct: 424 QGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAFSELGDV 481
Query: 420 IYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLI 478
Y+ GC++V+C V A +AA +ADAT++++G D SIEAE DR DLLLPG Q ++
Sbjct: 482 TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQTEMV 541
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
+V ++ GPVIL++M GG DISFAKN+PKI +ILW G+PG+ GG AIAD+ FG+YNP
Sbjct: 542 NQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGKYNPG 601
Query: 539 GRLPMTWYPQSY 550
GR P+TWY Y
Sbjct: 602 GRSPITWYENGY 613
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 269/552 (48%), Positives = 347/552 (62%), Gaps = 59/552 (10%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+C++SL D+RV DLV R+TL+EK ++D A V R+G+P Y+WWSEALHGV+ VG T
Sbjct: 67 YCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGSAT 126
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
F VVPGATSFP VIL+AASFN SL++ +G+VVSTEARAMYN+G AGLTFWSPNIN+ R
Sbjct: 127 FFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINVAR 186
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHYTAYDL 226
DPRWGR ETPGEDPL Y YV+GLQ +G + LK+A+ CKH+ AYDL
Sbjct: 187 DPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAAYDL 246
Query: 227 DNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------- 251
D W DR HF+A V + +L F+
Sbjct: 247 DQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPRFLK 306
Query: 252 -----------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++D + Q + T EE A S+ AGLDL CG +
Sbjct: 307 GVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDSLAT 366
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+ G V+E +D ++S + LMR+GFFDG PS LG KD+C + +LA +AAR
Sbjct: 367 AVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LASLGKKDICNDEHIELAREAAR 423
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
QGIVLLKN +LPL P +K LA++GP+AN T MIGNY G PC Y +PL + +
Sbjct: 424 QGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAFSELGDV 481
Query: 420 IYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLI 478
Y+ GC++V+C V A +AA +ADAT++++G D SIEAE DR DLLLPG Q ++
Sbjct: 482 TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQTEMV 541
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
+V ++ GPVIL++M GG DISFAKN+PKI +ILW G+PG+ GG AIAD+ FG+YNP
Sbjct: 542 NQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGKYNPG 601
Query: 539 GRLPMTWYPQSY 550
GR P+TWY Y
Sbjct: 602 GRSPITWYENGY 613
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 275/566 (48%), Positives = 349/566 (61%), Gaps = 55/566 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P F C S + FCN +L + R DLV RLT EK+ L D A V RLGIP
Sbjct: 34 NPGFTCGPAS--AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPV 91
Query: 95 YEWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHG++ G G HF N ATSFPQVI TAA+F+ L+ IG+ + E RA
Sbjct: 92 YKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAF 151
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
YN+G A GL WSPN+NIFRDPRWGRGQETPGEDP ASKY +VKGLQ G S L
Sbjct: 152 YNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ---GSSLTNL 208
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYL 247
+ +ACCKH TAYD++ WKG RY+FNA V + Y L
Sbjct: 209 QTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTL 268
Query: 248 I------------------KFK---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
I ++K Y SDCD+V +L+ S+H+T+T EEA A ++ AGLD
Sbjct: 269 INGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLD 328
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS-KQPYGQLGPKDV 345
+NCG ++ ++ +A++ G + E +D A+ N FA MRLG FDG P + YG+LG DV
Sbjct: 329 INCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADV 388
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
CT ++ LAL+AAR+G+VLLKN A LPL + + AVIG NAN ++GNY G PC+
Sbjct: 389 CTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCE 448
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
TTP G+ V + + GCS+ C A D A A S+D LVMG Q E E D
Sbjct: 449 TTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLD 508
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R LLLPG+QQ LIT VA +K PVILI+++GG DI+FA+ +PKI +ILW GYPGQAGG
Sbjct: 509 RTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGG 568
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG +NPSG+LP+TWYP+ +
Sbjct: 569 QAIADVLFGEFNPSGKLPVTWYPEEF 594
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 273/596 (45%), Positives = 366/596 (61%), Gaps = 68/596 (11%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
S + +FL L H ++P + C P + FCN SL I R L+ LTL
Sbjct: 8 STIIIFLFSLLLIHLPKFF-TTPDYPC----KPPHSHYPFCNISLPISTRTTSLISLLTL 62
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
+KI L ++A S+S LGIP Y+WWSEALHG++ GPG +F+ V AT+FPQVI++AA+
Sbjct: 63 SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 122
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
FN SL+ IG V E RAM+NVG AGL+FW+PN+N+FRDPRWGRGQETPGEDP++ S Y
Sbjct: 123 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 182
Query: 194 ATGYVKGLQQTDG---------GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIY- 243
A +V+G+Q DG + L V+ACCKH+TAYDL+ W RY+FNA+V
Sbjct: 183 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQ 242
Query: 244 ----TYY-----------------------------------LIKFK-----YIVSDCDS 259
TY L++ K YI SDCD+
Sbjct: 243 DLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLGLVRNKWGFEGYIASDCDA 302
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
V ++ Q Y K+ E+A A + AG+D+NCG+F+ +HTE+A++ GLV E +D A+ N F
Sbjct: 303 VATVFEYQKYAKSAEDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLF 362
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
+ MRLG F+G P K +G+LGP+DVCT ++ LAL+AARQGIVLLKN LPL
Sbjct: 363 SVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDR 422
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDD- 437
+LA+IGP A T + G Y G PC + GL V TI Y GCS+V+C + DD
Sbjct: 423 VSLAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDS---DDG 478
Query: 438 ---AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
A A AD V+V G D ++E E DR+ LLLPG+Q L++ VA +K PVIL++
Sbjct: 479 FAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLT 538
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG D+SFA+++ ITSILW+GYPG+AGG A+A++ FG +NP+GRLPMTWYP+S+
Sbjct: 539 GGGPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESF 594
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 269/552 (48%), Positives = 349/552 (63%), Gaps = 55/552 (9%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+TSL R LV LTL EKI L ++A S+ RLGIP Y+WWSE+LHG++ GPG
Sbjct: 41 FCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLALNGPGV 100
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
F+ VP ATSFPQVIL+AASFN SL+ ++ EARAM+NVG AGLTFW+PNIN+FR
Sbjct: 101 SFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWAPNINLFR 160
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG-------GSPNRLKVAACCKHYTAYD 225
DPRWGRGQETPGEDP+LAS YA YV+GLQ G + L V+ACCKH+TAYD
Sbjct: 161 DPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCKHFTAYD 220
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
LD W RY+FNA+V + Y F+
Sbjct: 221 LDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPACASEELL 280
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAA 300
YI SDCD+V +Y Q Y K+ E+A A + AG+D+NCG+F+ +HTE+A
Sbjct: 281 GLARDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVADVLKAGMDINCGTFMLRHTESA 340
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
++ G V E +D A+ N F+ +RLG FDG P + +G+LGPKDVCTQ ++ LALDAARQ
Sbjct: 341 IEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTLALDAARQ 400
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
GIVLLKN LPL +LAVIGP A TK + G Y G PC ++ +GL I
Sbjct: 401 GIVLLKNDKKFLPLDRDIGASLAVIGPLATTTK-LGGGYSGIPCSSSSLYEGLGEFAERI 459
Query: 421 -YQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLI 478
Y GC +V C + +A A AD V+V G D + E E HDR+ LLLPG+Q L+
Sbjct: 460 SYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLPGKQMNLV 519
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
+ VA +K PVIL+++ GG D+SFA+ +P+I SI+W+GYPG+AGG A+A++ FG +NP+
Sbjct: 520 SSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEIIFGEFNPA 579
Query: 539 GRLPMTWYPQSY 550
GRLPMTWYP+++
Sbjct: 580 GRLPMTWYPEAF 591
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 274/566 (48%), Positives = 348/566 (61%), Gaps = 55/566 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P F C S + FCN +L + R DLV RLT EK+ L D A V RLGIP
Sbjct: 108 NPGFTCGPAS--AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPV 165
Query: 95 YEWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHG++ G G HF N ATSFPQVI TAA+F+ L+ IG+ + E RA
Sbjct: 166 YKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAF 225
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
YN+G A GL WSPN+NIFRDPRWGRGQETPGEDP ASKY +VKGLQ G S L
Sbjct: 226 YNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ---GSSLTNL 282
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYL 247
+ +ACCKH TAYD++ WKG RY+FNA V + Y L
Sbjct: 283 QTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTL 342
Query: 248 I------------------KFK---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
I ++K Y SDCD+V +L+ S+H+T+T EEA A ++ AGLD
Sbjct: 343 INGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLD 402
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS-KQPYGQLGPKDV 345
+NCG ++ ++ +A++ G + E +D A+ N FA MRLG FDG P + YG+L DV
Sbjct: 403 INCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADV 462
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
CT ++ LAL+AAR+G+VLLKN A LPL + + AVIG NAN ++GNY G PC+
Sbjct: 463 CTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCE 522
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
TTP G+ V + + GCS+ C A D A A S+D LVMG Q E E D
Sbjct: 523 TTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLD 582
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R LLLPG+QQ LIT VA +K PVILI+++GG DI+FA+ +PKI +ILW GYPGQAGG
Sbjct: 583 RTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGG 642
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG +NPSG+LP+TWYP+ +
Sbjct: 643 QAIADVLFGEFNPSGKLPVTWYPEEF 668
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 272/568 (47%), Positives = 347/568 (61%), Gaps = 56/568 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C PSL L FCNT L R DLV R+T EK + L D A V RLG+P Y
Sbjct: 84 PPFSCG--GGPSLG-LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSY 140
Query: 96 EWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
+WW+EALHGV+ G G H V ATSFPQV+LTAASFN +L+ IG+ EARA Y
Sbjct: 141 KWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFY 200
Query: 155 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS---PN 210
N+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+GLQ + + P
Sbjct: 201 NIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPP 260
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNA---------------------------MVIY 243
L +ACCKH TAYDL++WKG RY F A M Y
Sbjct: 261 VLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAY 320
Query: 244 TY--------------------YLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
T + + +Y+ +DCD+V ++ NSQ Y T E+ A ++ A
Sbjct: 321 TSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKA 380
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
GLD++CG ++ +H AA++ G + + +D A+ N F T MRLG FDG P YG LG
Sbjct: 381 GLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAA 440
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+CTQ +++LAL+AA GIVLLKN+AG LPL ++ + AVIG NAN ++GNY G P
Sbjct: 441 HICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPP 500
Query: 404 CKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
C TTPLQG+ V + + AGC C A A A+++D+ +L MG Q E+E
Sbjct: 501 CAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEG 560
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR LLLPG QQ LIT VA AK PVIL++++GG DI+FA+ +PKI +ILW GYPGQA
Sbjct: 561 KDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQA 620
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIA V FG NPSGRLP+TWYP+ +
Sbjct: 621 GGLAIAKVLFGEKNPSGRLPVTWYPEEF 648
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 274/566 (48%), Positives = 348/566 (61%), Gaps = 55/566 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P F C S + FCN +L + R DLV RLT EK+ L D A V RLGIP
Sbjct: 34 NPGFTCGPAS--AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPV 91
Query: 95 YEWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+WWSEALHG++ G G HF N ATSFPQVI TAA+F+ L+ IG+ + E RA
Sbjct: 92 YKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAF 151
Query: 154 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
YN+G A GL WSPN+NIFRDPRWGRGQETPGEDP ASKY +VKGLQ G S L
Sbjct: 152 YNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ---GSSLTNL 208
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTYYL 247
+ +ACCKH TAYD++ WKG RY+FNA V + Y L
Sbjct: 209 QTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTL 268
Query: 248 I------------------KFK---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
I ++K Y SDCD+V +L+ S+H+T+T EEA A ++ AGLD
Sbjct: 269 INGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLD 328
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS-KQPYGQLGPKDV 345
+NCG ++ ++ +A++ G + E +D A+ N FA MRLG FDG P + YG+L DV
Sbjct: 329 INCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADV 388
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
CT ++ LAL+AAR+G+VLLKN A LPL + + AVIG NAN ++GNY G PC+
Sbjct: 389 CTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCE 448
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
TTP G+ V + + GCS+ C A D A A S+D LVMG Q E E D
Sbjct: 449 TTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLD 508
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R LLLPG+QQ LIT VA +K PVILI+++GG DI+FA+ +PKI +ILW GYPGQAGG
Sbjct: 509 RTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGG 568
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AIADV FG +NPSG+LP+TWYP+ +
Sbjct: 569 QAIADVLFGEFNPSGKLPVTWYPEEF 594
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 269/607 (44%), Positives = 363/607 (59%), Gaps = 71/607 (11%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
+++++ L ++ + + S P F C P +S FCN SL I R LV L
Sbjct: 4 QLTLISLVFFTSAIAETFKNLDSHPQFPC---KPPHFSSYPFCNVSLSIKQRAISLVSLL 60
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TL EKI L +A SV RLGIP YEWWSE+LHG++ GPG F+ + ATSFPQVI++A
Sbjct: 61 TLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSA 120
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN +L+ IG V+ EARAMYN G AGLTFW+PNIN+FRDPRWGRGQETPGEDP + S
Sbjct: 121 ASFNRTLWYEIGSAVAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVS 180
Query: 192 KYATGYVKGLQQ--------TDGGSPN-------------RLKVAACCKHYTAYDLDNWK 230
+Y +V+G Q+ T GS N +L ++ACCKH+TAYDL+ W
Sbjct: 181 EYGVEFVRGFQEKKKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLEKWG 240
Query: 231 GTDRYHFNAMV----IYTYYLIKFK----------------------------------- 251
RY FNA+V + Y F+
Sbjct: 241 NFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLLQKARV 300
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
YI SDCD+V ++ Q YTK+PEEA A +I AG+D+NCG+++ ++T++A++ G
Sbjct: 301 EWGFDGYITSDCDAVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQSAIEQGK 360
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
V+E +D A+ N FA +RLG FDG P YG+LG D+C+ ++ LAL+AARQGIVLL
Sbjct: 361 VSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQGIVLL 420
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL-QGLAAVVATIYQAG 424
KN LPL+ + +LA++GP AN M G Y G PC+ T + L V T Y +G
Sbjct: 421 KNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASG 480
Query: 425 CSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
CS+V C +A A AD ++V G D S E E DR L LPG+Q+ L++ VA
Sbjct: 481 CSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLVSSVAA 540
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
V+K PVIL++ GG D++FAK DP+I SI+W+GYPG+ GG A+A++ FG +NP GRLP+
Sbjct: 541 VSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPI 600
Query: 544 TWYPQSY 550
TWYP+S+
Sbjct: 601 TWYPESF 607
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 264/605 (43%), Positives = 359/605 (59%), Gaps = 69/605 (11%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
+++++ L ++ + + S P F C P +S FCN SL I R LV L
Sbjct: 4 QLTLISLLFFTSAIAETFKNLDSHPQFPC---KPPHFSSYPFCNVSLSIKQRAISLVSLL 60
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
L EKI L ++A SV RLGIP YEWWSE+LHG++ GPG F+ + ATSFPQVI++A
Sbjct: 61 MLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSA 120
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN +L+ IG V+ E RAMYN G AGLTFW+PNIN+FRDPRWGRGQETPGEDP + S
Sbjct: 121 ASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVS 180
Query: 192 KYATGYVKGLQQTDGGSP-------------------NRLKVAACCKHYTAYDLDNWKGT 232
+Y +V+G Q+ +L ++ACCKH+TAYDL+ W
Sbjct: 181 EYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNF 240
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
RY FNA+V + Y F+
Sbjct: 241 TRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKARVEW 300
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
YI SDCD+V ++ Q YTK+PEEA A +I AG+D+NCG+++ +HT++A++ G V+
Sbjct: 301 GFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVS 360
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ N FA +RLG FDG P + YG+LG D+C+ ++ LAL+A RQGIVLLKN
Sbjct: 361 EELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKN 420
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCS 426
LPL+ + +LA++GP AN M G Y G PC+ T L V T Y +GCS
Sbjct: 421 DHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCS 480
Query: 427 NVQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
+V C + +A A AD ++V G D S E E DR+ L LPG+Q+ L++ VA V+
Sbjct: 481 DVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVS 540
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PVIL++ GG D++FAKNDP+I SI+W+GYPG+ GG A+A++ FG +NP GRLP TW
Sbjct: 541 KKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTW 600
Query: 546 YPQSY 550
YP+S+
Sbjct: 601 YPESF 605
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 271/567 (47%), Positives = 343/567 (60%), Gaps = 55/567 (9%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C PSL L FCNT L R DLV R+T EK + L D A V RLG+P Y
Sbjct: 84 PPFSCG--GGPSLG-LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSY 140
Query: 96 EWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
+WW+EALHGV+ G G H V ATSFPQV+LTAASFN +L+ IG+ EARA Y
Sbjct: 141 KWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFY 200
Query: 155 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS---PN 210
N+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+GLQ + + P
Sbjct: 201 NIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPP 260
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------- 251
L +ACCKH TAYDL++WKG RY F A V F
Sbjct: 261 VLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAY 320
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+ +DCD+V ++ NSQ Y T E+ A ++ AG
Sbjct: 321 TSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAG 380
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
LD++CG ++ +H AA++ G + + +D A+ N F T MRLG FDG P YG LG
Sbjct: 381 LDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAH 440
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+CTQ +++LAL+AA GIVLLKN+AG LPL ++ + AVIG NAN ++GNY G PC
Sbjct: 441 ICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPC 500
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TTPLQG+ V + + AGC C A A A+++D+ +L MG Q E+E
Sbjct: 501 APTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEGK 560
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR LLLPG QQ LIT VA AK PVIL++++GG DI+FA+ +PKI +ILW GYPGQAG
Sbjct: 561 DRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAG 620
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AIA V FG NPSGRLP+TWYP+ +
Sbjct: 621 GLAIAKVLFGEKNPSGRLPVTWYPEEF 647
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 269/548 (49%), Positives = 346/548 (63%), Gaps = 56/548 (10%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ L + R DLV LTL EK++ L D A V+RLG+P YEWWSE LHG+S G G
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIF 171
F+ V TSFPQVILTAA+F+A L++ +G+ V EARA+YN+G A GLT WSPN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQETPGEDP+ AS+YA +V GLQ G + +ACCKH TAYDLD W
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------EASACCKHATAYDLDYWNN 204
Query: 232 TDRYHFNAMV----IYTYYLIKFK------------------------------------ 251
RY++++ V + Y FK
Sbjct: 205 VVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQ 264
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
Y+ SDCD+V + ++ HYT +PE+ A SI G+D+NCG++ H AAV+ G
Sbjct: 265 EWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGN 324
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
+ E ID A+ N FA MRLG FDG P S YG LG DVC+ A++ LAL+AA+ GIVL
Sbjct: 325 LTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVL 384
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--TIYQ 422
LKN AG+LPL P+A+ +LAVIGPNA+ + GNY G PC+ TTPLQG+ + +
Sbjct: 385 LKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFL 444
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
AGC + C A ++A A+S+D VL MG Q E + DR LLLPG+QQ LIT VA
Sbjct: 445 AGCDSPACAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVA 504
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
A+ PVIL++++GG D++FAK++PKI +ILW GYPGQAGG AIA V FG +NPSGRLP
Sbjct: 505 NAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLP 564
Query: 543 MTWYPQSY 550
+TWYP+ +
Sbjct: 565 VTWYPEEF 572
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 259/523 (49%), Positives = 353/523 (67%), Gaps = 17/523 (3%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
++P+ ++ FCN +L R LV LTL EKI L ++A RLG+P +EWWSE+LH
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90
Query: 104 GVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
GV GPG +FS+ V AT FPQVIL+AA+FN SL++A + ++ EARAM+N G AGLT
Sbjct: 91 GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLT 150
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PNIN+FRDPRWGRGQETPGEDP + S Y+ YVKG Q+ D G R+ ++ACCKHY
Sbjct: 151 FWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQR-DYGEEGRMMLSACCKHYI 209
Query: 223 AYDLDNWKGTDRYHFNAMV------IYTYYLIKFK-------YIVSDCDSVDVLYNSQHY 269
AYDL+ W+G RY FNA V L + + YI SDCD+V +++ +Q Y
Sbjct: 210 AYDLEKWRGFTRYTFNAKVNGVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTY 269
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
T + E++ A + AG+D+NCGSFL +HT++A++ G V E I+HA+ N F+ +RLGFFD
Sbjct: 270 TASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFD 329
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
Q + QLGP +VCT +++LA +A RQG VLLKN G LPL + + ++A+IGP A
Sbjct: 330 KTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAA 389
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCSNVQCG-TAQVDDAKKAAASADA 447
N + G+Y G PC TT ++G+ A V T + AGC +V C T +A +AA AD
Sbjct: 390 NDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADV 449
Query: 448 TVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
VL+ G + + E E HDR+ LLLPG+Q LI VA V K PV+L++M GG D+SFAK+D
Sbjct: 450 VVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHD 509
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
P+I SILW+GYPG+ GG + ++ FG+YNP G+LP+TWYP+S+
Sbjct: 510 PRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESF 552
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 270/588 (45%), Positives = 356/588 (60%), Gaps = 70/588 (11%)
Query: 25 LQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA 84
L +SSS Q F C+ +PS FCN SL R LV LTL EKI L ++A
Sbjct: 11 LCNSSSDFQ----FPCNSSLHPSYP---FCNRSLSFTARAQSLVSLLTLDEKIQQLSNNA 63
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
S+ RLGIP Y+WWSE LHG++ GPG F+ + ATSFPQV++TAASFN +L+ IG
Sbjct: 64 SSIPRLGIPSYQWWSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGS 123
Query: 145 VVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-- 202
++ EARAM+NVG GLT W+PNINIFRDPRWGRGQETPGEDP++AS Y+ +V+GLQ
Sbjct: 124 AIAVEARAMFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSG 183
Query: 203 -------------QTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTY 245
+ D G L V+ACCKH+TAYDL+ W RY F+++V +
Sbjct: 184 NWMKEHEIRNEVLEEDNGM-GSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDT 242
Query: 246 YLIKFK-----------------------------------------YIVSDCDSVDVLY 264
Y F+ YI SDCD+V +Y
Sbjct: 243 YQPPFRSCIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYITSDCDAVATVY 302
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
Q YT TPE+A A + AG+D+NCG+F+ + T++A+ G V E +D A+ N F+ R
Sbjct: 303 EYQKYTDTPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQAR 362
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LGFFDG+P + +G+LG +DVCT ++ LAL+AARQGIVLLKN LPL AI +L V
Sbjct: 363 LGFFDGNPREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTV 422
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQA-GCSNVQCGT-AQVDDAKKAA 442
IG AN + ++G Y G PC + ++G TI+ A GC +V C + + +DA A
Sbjct: 423 IGSLANDSSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIA 482
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
AD + V G D S E E DR+ LLLPG+Q L++ VA V+K P+IL+++ GG DIS
Sbjct: 483 KKADFVIAVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDIS 542
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FAK D ++ SILW+G PG+AGG A+A+V FG YNP GRLP+TWYPQS+
Sbjct: 543 FAKKDSRVASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSF 590
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 270/588 (45%), Positives = 356/588 (60%), Gaps = 70/588 (11%)
Query: 25 LQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA 84
L +SSS Q F C+ +PS FCN SL R LV LTL EKI L ++A
Sbjct: 1 LCNSSSDFQ----FPCNSSLHPSYP---FCNRSLSFTARAQSLVSLLTLDEKIQQLSNNA 53
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
S+ RLGIP Y+WWSE LHG++ GPG F+ + ATSFPQV++TAASFN +L+ IG
Sbjct: 54 SSIPRLGIPSYQWWSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGS 113
Query: 145 VVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-- 202
++ EARAM+NVG GLT W+PNINIFRDPRWGRGQETPGEDP++AS Y+ +V+GLQ
Sbjct: 114 AIAVEARAMFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSG 173
Query: 203 -------------QTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTY 245
+ D G L V+ACCKH+TAYDL+ W RY F+++V +
Sbjct: 174 NWMKEHEIRNEVLEEDNGM-GSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDT 232
Query: 246 YLIKFK-----------------------------------------YIVSDCDSVDVLY 264
Y F+ YI SDCD+V +Y
Sbjct: 233 YQPPFRSCIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYITSDCDAVATVY 292
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
Q YT TPE+A A + AG+D+NCG+F+ + T++A+ G V E +D A+ N F+ R
Sbjct: 293 EYQKYTDTPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQAR 352
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LGFFDG+P + +G+LG +DVCT ++ LAL+AARQGIVLLKN LPL AI +L V
Sbjct: 353 LGFFDGNPREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTV 412
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQA-GCSNVQCGT-AQVDDAKKAA 442
IG AN + ++G Y G PC + ++G TI+ A GC +V C + + +DA A
Sbjct: 413 IGSLANDSSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIA 472
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
AD + V G D S E E DR+ LLLPG+Q L++ VA V+K P+IL+++ GG DIS
Sbjct: 473 KKADFVIAVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDIS 532
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
FAK D ++ SILW+G PG+AGG A+A+V FG YNP GRLP+TWYPQS+
Sbjct: 533 FAKKDSRVASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSF 580
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 264/530 (49%), Positives = 346/530 (65%), Gaps = 46/530 (8%)
Query: 34 SSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
S P ++CD S+PS + FC T+L I R DLV RLT +EK T LVD++ ++ RLGIP
Sbjct: 26 SQPPYSCDS-SDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIP 84
Query: 94 KYEWWSEALHGVSY---VGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
YEWWSE LHG+ + V G F N + ATSFPQVILTAASF+A ++ IG+V E
Sbjct: 85 AYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQV-GKE 143
Query: 150 ARAMYNVG-LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ----QT 204
ARA+YN G + GL FW+PN+NIFRDPRWGRGQETPGEDPL+ KY +V+G+Q +
Sbjct: 144 ARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEG 203
Query: 205 DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLY 264
+ + L+ +ACCKHYTA+DLDNW DCD+V+VL+
Sbjct: 204 ESTLGDHLQASACCKHYTAHDLDNW--------------------------DCDAVNVLH 237
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
Q Y KTPE+A A ++ +G+ S+L +T++AV+ V S ID A+ N F+T MR
Sbjct: 238 VEQKYAKTPEDAVADALKSGI-----SYLRNYTKSAVEKKKVTVSEIDRALHNLFSTRMR 292
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LG F+G P+KQ Y +GP VC+Q +Q LAL+AA GIVLLKN LPLS + I +LAV
Sbjct: 293 LGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLSKSGISSLAV 352
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAA 443
IGPNA+ + ++GNY G CK T L+GL V++ Y+ GC+NV C +A + A
Sbjct: 353 IGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSAAKKKPVEMAQ 412
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
+ D +LVMG DQS E E DR+DL+LPG+Q LIT VAK AK P++L+++ G D++F
Sbjct: 413 TEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVLLGGSPMDVTF 472
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP---SGRLPMTWYPQSY 550
AKN+ KI SILW GYPGQAG A+A + FG +NP GRLPMTWYPQ +
Sbjct: 473 AKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQDF 522
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 507 bits (1305), Expect = e-141, Method: Compositional matrix adjust.
Identities = 266/567 (46%), Positives = 345/567 (60%), Gaps = 53/567 (9%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+P F+C S+PS L FCN L R DLV R+T EK + L D A V RLG+P
Sbjct: 46 APPFSCGP-SSPS-KGLPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVPRLGVPS 103
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
Y+WW+EALHGV+ G G H + V ATSFPQV+ TAASFN +L+ IG+ EARA Y
Sbjct: 104 YKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGKEARAFY 163
Query: 155 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS---PN 210
N+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+Y +V+GLQ + + P
Sbjct: 164 NIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQGSSSNTKSVPP 223
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-------------------------IYTY 245
L+ +ACCKH TAYDL++WKG RY F A V + Y
Sbjct: 224 VLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDGKASCVMCAY 283
Query: 246 YLIK--------------FK-------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
++ F+ Y+ +DCD+V ++ NSQ Y T E+ A ++ AG
Sbjct: 284 TIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIMRNSQFYRPTAEDTVAATLKAG 343
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
LD++CG ++ ++ AA++ G + + +D A+ N T MRLG FDG P YG LG
Sbjct: 344 LDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKTNVYGNLGAGH 403
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+CT +++LAL+AA GIVLLKN+AG LPL + + AVIG NAN ++GNY G PC
Sbjct: 404 ICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLALLGNYWGPPC 463
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TTPLQG+ V + + AGC+ C A A A+S+DA +L MG Q E+E
Sbjct: 464 APTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATALASSSDAVILFMGLSQEQESEGK 523
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR LLLPG QQ LI VA AK PVIL++++GG DI+FA+ +PKI +ILW GYPGQAG
Sbjct: 524 DRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAG 583
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AIA V FG NPSG+LP TWYP+ +
Sbjct: 584 GLAIAKVLFGEKNPSGKLPNTWYPEEF 610
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 274/573 (47%), Positives = 358/573 (62%), Gaps = 63/573 (10%)
Query: 34 SSPVFACDVVSNPSLASLG--FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLG 91
S P+F+C PS AS FC+ SL R DLV RLT+ EK++ L D A V RLG
Sbjct: 26 SDPMFSC----GPSSASRAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLG 81
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
+P Y+WWSE LHG+++ G G F+ V TSFPQV+LT ASF+ SL+ IG+ + EAR
Sbjct: 82 VPPYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREAR 141
Query: 152 AMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN 210
A+YN+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +ASKYA +V+G+Q G +P
Sbjct: 142 ALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQ---GSNPA 198
Query: 211 RLKVAA-----CCKHYTAYDLDNWKGTDRYHFNAMV------------------------ 241
A CCKH TAYDL++W G RY+F+A V
Sbjct: 199 GAAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASC 258
Query: 242 -IYTYYLIK--------------FK-------YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
+ Y +I F+ Y+ SDCD+V ++ ++Q Y TPE+ A
Sbjct: 259 VMCAYTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAV 318
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP-YG 338
++ AGLDLNCG++ +H AA++ G + E +D A++N FA MRLG FDG P YG
Sbjct: 319 ALKAGLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYG 378
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
+LG DVCT +++LAL+AA+ GIVLLKN AG LPL +A+ + AVIG NAN + GN
Sbjct: 379 RLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGN 438
Query: 399 YEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQS 457
Y G C+ TTPL+GL + V + + AGCS+ CG A A A+SA+ L MG Q
Sbjct: 439 YFGPACETTTPLEGLQSYVRNVRFLAGCSSAACGYAATGQAAALASSAEYVFLFMGLSQD 498
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
E E DR LLLPG+QQ L+T VA AK PV+L++++GG DI+FA+++PKI +ILW G
Sbjct: 499 QEKEGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAG 558
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
YPGQAGG AIA V FG +NPSGRLP+TWY + +
Sbjct: 559 YPGQAGGLAIARVLFGDHNPSGRLPVTWYTEDF 591
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 275/575 (47%), Positives = 357/575 (62%), Gaps = 64/575 (11%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACD P+ FCNTS+ D RV DL+ RLT+QEKI LV++A +VSRLGIP Y+W
Sbjct: 20 YACDP-DGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIPPYQW 78
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W E LHGV+ + P +F P ATSFP L+ S+N +L+ IG+VVSTE RAMYN G
Sbjct: 79 WGEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAMYNQG 137
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD---------GGS 208
+GLT+WSPNINI RDPRWGR QETPGEDP L+S YA +VKGLQ+ D
Sbjct: 138 RSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQAVSRG 197
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
P RLK++ACCKH+TA+DLD WK DR HF++ V + Y FK
Sbjct: 198 PRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSSSVMC 257
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCD+V ++++ +Y T E+A + +L
Sbjct: 258 SYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSEDAVSYVML 317
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AG+DLNCGS H AA+ L+ E ID + N F MRLG FDG+PS PYG LGP
Sbjct: 318 AGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPYGSLGP 377
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+D+CT+ NQ LAL+AARQ +VLLKN +LP T LAVIG +A+ T+ M+GNYEG
Sbjct: 378 EDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLGNYEGY 437
Query: 403 PCKYTTPLQGLAAVVA-----TIYQAGCSNVQCGTA-QVDDAKKAAASADATVLVMGADQ 456
PCK+ +PLQG A V++ ++ GCS+ C + AK+AAA ADA VLV+G Q
Sbjct: 438 PCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLVLGISQ 497
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILW 515
+ E E DR LLLPG+Q L++ V + + G PV+L+++SG D+SFA +DP+I SI+W
Sbjct: 498 AQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRIQSIIW 557
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPGQ+GG AIA+ FG NP GRL +WY ++Y
Sbjct: 558 AGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENY 592
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 259/551 (47%), Positives = 342/551 (62%), Gaps = 75/551 (13%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ +C++S ++R DLV R+TL EK+ D A V R+G+PKY WWSEALHGVS
Sbjct: 65 MKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSN 124
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G F VVPGATSFP VIL+AASFN SL++ +G+ VSTEARAMYN G AGLTFWSPN
Sbjct: 125 FGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPN 184
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
IN+ RDPRWGR ETPGEDP L YA Y HY AYDLD
Sbjct: 185 INVVRDPRWGRILETPGEDPHLVGLYAVNY----------------------HYAAYDLD 222
Query: 228 NWKGTDRYHFNAMV----IYTYYLIKFK-------------------------------- 251
NWKG DR HF+A V + +++ F+
Sbjct: 223 NWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGIPSCADSRLLKQ 282
Query: 252 ----------YIVSDCDSVDVLYNSQHYTKTP-EEAAAKSILAGLDLNCGSFLGKHTEAA 300
YIVSDCDSV+V+ Q + + ++AA+++ AG++L+CG+F + A
Sbjct: 283 TIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDCGTFNNRSLTEA 342
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
V G N++ +DH++ + LMR+GFFDG P+ + LG D+C+ + +LA +AARQ
Sbjct: 343 VNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGKDDICSAEHIELAREAARQ 399
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
GIVLLKN +LPL ++KN+A++GP+AN T MIGNY G PC Y +PL +++
Sbjct: 400 GIVLLKNDNATLPLK--SVKNIALVGPHANATDAMIGNYAGIPCYYVSPLDAFSSMGEVR 457
Query: 421 YQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
Y+ GC++VQC + +A +AA ADAT++ G D SIEAE+ DR+DLLLPG Q LI
Sbjct: 458 YEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLPGYQTQLIN 517
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
+VA ++ GPV+L+IMSGGG DISFA+++PKI +ILW GYPG+ GG AIADV G+YNP G
Sbjct: 518 QVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVILGKYNPGG 577
Query: 540 RLPMTWYPQSY 550
RLP+TWY Y
Sbjct: 578 RLPITWYEADY 588
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 261/555 (47%), Positives = 354/555 (63%), Gaps = 49/555 (8%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
++P+ ++ FCN +L R LV LTL EKI L ++A RLG+P +EWWSE+LH
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90
Query: 104 GVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
GV GPG +FS+ V AT FPQVIL+AA+FN SL++A + ++ EARAM+N G AGLT
Sbjct: 91 GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLT 150
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PNIN+FRDPRWGRGQETPGEDP + S Y+ YVKG Q+ D G R+ ++ACCKHY
Sbjct: 151 FWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQR-DYGEEGRMMLSACCKHYI 209
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
AYDL+ W+G RY FNA V + Y FK
Sbjct: 210 AYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARK 269
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT 297
YI SDCD+V +++ +Q YT + E++ A + AG+D+NCGSFL +HT
Sbjct: 270 DILQRARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHT 329
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
++A++ G V E I+HA+ N F+ +RLGFFD Q + QLGP +VCT +++LA +A
Sbjct: 330 KSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEA 389
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
RQG VLLKN G LPL + + ++A+IGP AN + G+Y G PC TT ++G+ A V
Sbjct: 390 VRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYV 449
Query: 418 -ATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQ 475
T + AGC +V C T +A +AA AD VL+ G + + E E HDR+ LLLPG+Q
Sbjct: 450 PKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQM 509
Query: 476 LLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRY 535
LI VA V K PV+L++M GG D+SFAK+DP+I SILW+GYPG+ GG + ++ FG+Y
Sbjct: 510 DLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKY 569
Query: 536 NPSGRLPMTWYPQSY 550
NP G+LP+TWYP+S+
Sbjct: 570 NPGGKLPITWYPESF 584
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 254/566 (44%), Positives = 337/566 (59%), Gaps = 69/566 (12%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FCN +L I RV L+ LT+ EKI L D+ S+ RLG+P YEWWSE+LHG++ GP
Sbjct: 43 FCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGPAV 102
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
+F+ + G TSFPQVILTAA+FN +L+ +I ++ EARAMYN+G AGLTFW+PNINI R
Sbjct: 103 NFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINILR 162
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQ---------------------QTDGGSPNR 211
DPRWGRGQETPGEDP++ S YA YV G Q + D R
Sbjct: 163 DPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLKEDDNDGER 222
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------- 251
L ++ACCKH+TAYDL+ W RY FNA+V F+
Sbjct: 223 LMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCLMCSYN 282
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
YI SDCD+V +Y +Q YTKTPE+A A ++ AG +
Sbjct: 283 SVNGVPACADKELLDKVRTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVALKAGTN 342
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
+NCG+++ +H ++A + G V E +D A+ F+ RLG FDG+P+ + G +DVC
Sbjct: 343 INCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANFGAQDVC 402
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T + +LALDAARQGIVLLKN LPL T++ LA++GP ANV+ G Y G PCK
Sbjct: 403 TSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVSSPG-GTYSGVPCKL 461
Query: 407 TTPLQGLAAVV-ATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
+ +G + T+Y AGC +V C TA DA AD ++V G+D S E E HD
Sbjct: 462 KSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSEETEDHD 521
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R LLLPGQQ L+T +A +K P+IL++ GG D+SFA+ DP+I SILWV YPG+ GG
Sbjct: 522 RYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAYPGETGG 581
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
A++++ FG NP G+LPMTWY +S+
Sbjct: 582 KALSEIIFGYQNPGGKLPMTWYLESF 607
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 272/593 (45%), Positives = 359/593 (60%), Gaps = 63/593 (10%)
Query: 14 SVLF-LFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
S++F + L C + ++ + + P F+C S +S FC+ L I R DL RLT
Sbjct: 5 SIIFHVVLPLCLVLQATMA--TDPPFSCG-----SPSSYPFCDRKLPIGQRAADLASRLT 57
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV---GPGTHFSN-VVPGATSFPQVI 128
++EK++ L D + V RLG+P Y+WWSEALHGV+ G F + V ATSFPQV+
Sbjct: 58 VEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSFPQVL 117
Query: 129 LTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDP 187
+TAASFN L+ IG+V+ EAR +YN G A GLTFW+PNIN+FRDPRWGRGQETPGEDP
Sbjct: 118 VTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGEDP 177
Query: 188 LLASKYATGYVKGLQQTDG-GSPNR--LKVAACCKHYTAYDLDNWKGTDRYHFNAMV--- 241
+ KYA +V+G+Q G+ N L+ +ACCKH+TAYDL+NW G R+ FNA V
Sbjct: 178 TMTGKYAAVFVRGVQGYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAKVSEQ 237
Query: 242 -IYTYYLIKFK------------------------------------------YIVSDCD 258
+ Y F+ YI SDCD
Sbjct: 238 DLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYITSDCD 297
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
+V ++++ Q Y K PE+A A + AG+D+NCG ++ KH +A G + E ID A+ N
Sbjct: 298 AVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDRALQNL 357
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
FA MRLG FDG+P YG +G VC + +QDLAL+AA+ GIVLLKN AG+LPL
Sbjct: 358 FAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQK 417
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAGCSNVQCGTAQVDD 437
I +LAVIG NAN + + GNY G PC +PLQ L V T + AGC+ C + +
Sbjct: 418 ISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNVSDIAG 477
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A KAA+ A+ VL MG DQ E E DR++L LPG Q+ L+ VA AK PV+L+++ GG
Sbjct: 478 AAKAASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVLVLLCGG 537
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D++FAK +PKI +I+W GYPGQAGG AIA V FG +NP GRLP+TWYP+ Y
Sbjct: 538 PVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEY 590
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 251/456 (55%), Positives = 311/456 (68%), Gaps = 58/456 (12%)
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT----DGGS 208
MYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP ++++YA YV+GLQQ +GG
Sbjct: 1 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
NRLK+AACCKH+TAYDLD W GTDR+HFNA+V + + + F+
Sbjct: 61 RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMC 120
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCDSVDV + QHYT+TPE+AAA ++
Sbjct: 121 SYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLR 180
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FL + +AV AG V ++ +D A+ N MRLG FDG P+ P+G+LGP
Sbjct: 181 AGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGP 240
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGS------LPLSPTAIKNLAVIGPNANVTKTMI 396
DVCT+ +QDLALDAARQG+VLLKN G+ LPL P A + +AV+GP+A+ T MI
Sbjct: 241 ADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMI 300
Query: 397 GNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGA 454
GNY G PC+YTTPLQG+AA A + +QAGC++V C G + A +AA ADATV+V G
Sbjct: 301 GNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGL 360
Query: 455 DQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSIL 514
DQ +EAE DR LLLPG+Q LI+ VAK +KGPVIL++MSGG DI+FA+NDP+I IL
Sbjct: 361 DQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGIL 420
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
WVGYPGQAGG AIADV FG +NP +LP+TWY Q Y
Sbjct: 421 WVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDY 456
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 252/449 (56%), Positives = 304/449 (67%), Gaps = 45/449 (10%)
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
+VVS EARAM+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP++ KYA YV+GLQ
Sbjct: 2 QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61
Query: 204 TDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS 259
+DG NRLKVAACCKH+TAYDLDNW G DR+HFNA V + + + F+ V +
Sbjct: 62 SDG---NRLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVKEGKV 118
Query: 260 VDVL--YNSQHYTKT---------------------------------PEEAAAKSIL-- 282
V+ YN + T P + +L
Sbjct: 119 ASVMCSYNQVNGIPTCADPNLLKKTVRGTLFQTVTLLEFIMGSNTILQPRRKQPRMLLKQ 178
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
A LDL+CG FLG+HTE AVK GL+NE+ I++A+ N MRLG FDG PS Q YG LGP
Sbjct: 179 ASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEPSSQLYGNLGP 238
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
DVCT A+Q+LAL+AARQGIVLLKN SLPLS ++A++GPN+NVT TMIGNY G
Sbjct: 239 NDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVTATMIGNYAGL 298
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAE 461
C YTTPLQG+ TI++ GC++V C Q A AA ADATVLVMG DQSIEAE
Sbjct: 299 ACGYTTPLQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATVLVMGLDQSIEAE 358
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
DR LLLPG+QQ L+++VA +KGP IL++MSGG D+SFA+NDPKI SI+W GYPGQ
Sbjct: 359 FRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKIGSIVWAGYPGQ 418
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AGGAAI+DV FG NP G+LPMTWYPQ Y
Sbjct: 419 AGGAAISDVLFGITNPGGKLPMTWYPQDY 447
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 250/550 (45%), Positives = 336/550 (61%), Gaps = 78/550 (14%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ S FC+ SL R DLV R+TLQEK+ V +A V RLG+P+Y WWSEALHG+S
Sbjct: 61 MKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISN 120
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
+GPG F +PGATS P VIL+ A+FN +L++ +G+VVSTE RAMYN+G AGLTFWSPN
Sbjct: 121 LGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPN 180
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
IN+ RD RWGR QET GEDP + ++A YV+GLQ +G V++CCKHY AYD+D
Sbjct: 181 INVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTE----NVSSCCKHYAAYDID 236
Query: 228 NWKGTDRYHFNAMV----IYTYYLIKFK-------------------------------- 251
+W DR+ F+A V + ++ F+
Sbjct: 237 SWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCSFNKINGIPPCSDPRLLKG 296
Query: 252 ----------YIVSDCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCGSFLGKHTEAA 300
YIVSDC ++V+ ++Q+Y + +A AK++ AGLDL CG + +
Sbjct: 297 VIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQAGLDLECGHYYTDALNES 356
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
V G V++ +D A+ N + LMR+G+FDG P+ Y LG KD+C + +LA +AARQ
Sbjct: 357 VLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQ 413
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
GIVLLKN LPL P K +A++GP+AN T+ MIGNY G PCKY +PL+ +A+
Sbjct: 414 GIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLPCKYVSPLEAFSAIGNVT 471
Query: 421 YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
Y G T++ +G D SIEAE DR+D LLPG Q LI +
Sbjct: 472 YATGF----------------------TIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQ 509
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
VA+V+ GPVIL+++SG DI+FAKN+P+I++ILWVG+PG+ GG AIADV FG+YNP GR
Sbjct: 510 VAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPGGR 569
Query: 541 LPMTWYPQSY 550
LP+TWY Y
Sbjct: 570 LPVTWYEADY 579
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 264/572 (46%), Positives = 345/572 (60%), Gaps = 77/572 (13%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+A +C+ SL RV DLV R+TL+EK+ L D AG R+G+P+Y WW EALHGVS
Sbjct: 59 MAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRYLWWGEALHGVSD 118
Query: 108 VGPG-THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSP 166
VGPG T F + VPGATSFP VI +AASFN +L++AIG VVSTE RAMYN+G A LT+WSP
Sbjct: 119 VGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMYNLGHAELTYWSP 178
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS------------PNRLKV 214
NIN+ RDPRWGR ETPGEDP + +YA +V+G+Q DG + +KV
Sbjct: 179 NINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASAAAATDAFSRPIKV 238
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----------------------- 251
++CCKHY AYD+D W GTDR F+A V + F+
Sbjct: 239 SSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDGDASCVMCSYNRIN 298
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDL 287
YIVSDCDSV V+ + T EA A ++ AGLDL
Sbjct: 299 GVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDL 358
Query: 288 NCG-------SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
+CG F + AV+ G + ESA+D+A++N + TLMRLGFFDG P + L
Sbjct: 359 DCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE---SL 415
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGP--NANVTKTMIGN 398
G DVCT+ +++LA DAARQG+VLLKN A LPLSP + ++A+ G + N T M+G+
Sbjct: 416 GAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGD 475
Query: 399 YEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
Y G PC+ TP G+ VV++ C C D A AA + DAT++V G + S+
Sbjct: 476 YRGKPCRVVTPYDGVRKVVSSTSVHACDKGSC-----DTAAAAAKTVDATIVVAGLNMSV 530
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E ES+DR DLLLP Q I VA+ + P++L+IMS GG D+SFA+++PKI +++W GY
Sbjct: 531 ERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGY 590
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PG+ GG AIADV FG+YNP GRLP+TWY Y
Sbjct: 591 PGEEGGTAIADVLFGKYNPGGRLPLTWYKNEY 622
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 239/299 (79%), Positives = 259/299 (86%), Gaps = 1/299 (0%)
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
YIVSDCDSVDVLY +QHYTKTP EAAA SILAGLDLNCGSFLG+HTE AVK+GLVNE+AI
Sbjct: 43 YIVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAI 102
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
D AISNNF TLMRLGFFDG+P Q YG LGP DVCT ANQ+LA DAARQGIVLLKNT G
Sbjct: 103 DKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GC 161
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCG 431
LPLSP +IK LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA V+T Y GCSNV C
Sbjct: 162 LPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACA 221
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
A V A K AA+AD +VLV+GADQSIEAES DR+DL LPGQQQ L+ +VAK AKGPV+L
Sbjct: 222 VADVAGATKLAATADVSVLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLL 281
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+IMSGGGFDI+FAKNDPKI ILWVGYPG+AGG AIAD+ FGRYNPSG+LPMTWYPQSY
Sbjct: 282 VIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSY 340
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/302 (77%), Positives = 264/302 (87%), Gaps = 1/302 (0%)
Query: 250 FKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNES 309
F+YIVSDCDS+ +LY SQHYTKTPEEAAAKSILAGLDLNCGSFLG HTE AVK GL++E+
Sbjct: 43 FRYIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEA 102
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
AI+ AISNNFATLMRLGFFDG+P QPYG LGPKDVCT N++LA++ ARQGIVLLKN+A
Sbjct: 103 AINKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSA 162
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQAGCSNV 428
GSLPLSP+AIK LAVIGPNANVTKTMIGNYEG CKYTTPLQGL V+ T Y GC NV
Sbjct: 163 GSLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNV 222
Query: 429 QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGP 488
C A +D AK AASADATVLVMGADQ+IE E+ DR+DL LPG+QQ L+T+VAK A+GP
Sbjct: 223 TCTEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGP 282
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
V+L+IMSGGGFDI+FAKND KITSI+WVGYPG+AGG AIADV FGR+NPSG+LPMTWYPQ
Sbjct: 283 VVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQ 342
Query: 549 SY 550
SY
Sbjct: 343 SY 344
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 257/546 (47%), Positives = 350/546 (64%), Gaps = 49/546 (8%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ SL I R LV LTL EKI L ++AG V RLGIP Y+WWSE+LHG++ GPG
Sbjct: 36 FCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLADNGPGV 95
Query: 113 HFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
+FS+ V AT+FPQVIL+ A+FN SL++A+ + V+TEA M+N G AGLT+W+PNINIF
Sbjct: 96 NFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWAPNINIF 155
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQET GEDP +A+ Y+ YVKG Q + G R++++ACCKHYTAYD++ W+G
Sbjct: 156 RDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ-GEQGEEGRIRLSACCKHYTAYDMEKWEG 214
Query: 232 TDRYHFNAMV----IYTYYLIKFK------------------------------------ 251
RY FNA V + Y FK
Sbjct: 215 FSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCANKDLLQKTRDE 274
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
YI SDCD+V +++ +Q YTK+ E++ A + AG+D+NCGSFL +HT++AV+ G V
Sbjct: 275 WGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDINCGSFLVRHTKSAVEKGKV 334
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E ID A+ N F+ +RLG FD + Q QLGP +VCT+ +++LA +A RQG VLLK
Sbjct: 335 QEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKEHRELAAEAVRQGAVLLK 394
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA-VVATIYQAGC 425
N LPL + ++++A+IGP+AN M G+Y G C TT L+G+ A T + AGC
Sbjct: 395 NDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTFLKGIQAYATQTTFAAGC 454
Query: 426 SNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKV 484
+V C + ++ +A AA AD V+V G + + E E DR+ LLLPG+Q LI VA V
Sbjct: 455 KDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVSLLLPGKQMSLIHAVASV 514
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
AK P++L+++ GG D+SFAK DP+I SILW+GYPG+ GG + ++ FG YNP G+L MT
Sbjct: 515 AKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPGGKLAMT 574
Query: 545 WYPQSY 550
WYP+S+
Sbjct: 575 WYPESF 580
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 268/572 (46%), Positives = 350/572 (61%), Gaps = 64/572 (11%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
SS+ S P+F+C S+PS A FC+ SL R DLV RLT+ EK++ L D A V
Sbjct: 10 SSAVASDPLFSCGP-SSPSRA-YPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVP 67
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLG+P Y+WWSE LHG+++ G G F+ V G TSFPQV+LT ASF+ L+ IG+ +
Sbjct: 68 RLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLWFRIGQAIGR 127
Query: 149 EARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARA+YN+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +ASKYA +V+G+Q +
Sbjct: 128 EARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSSAA 187
Query: 208 SPNR-LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
L+ +ACCKH TAYDL++W G RY+F+A V F
Sbjct: 188 GAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCV 247
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
Y+ SDCD+V +++++Q Y TPE+ A +
Sbjct: 248 MCAYTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVA 307
Query: 281 ILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP-YGQ 339
L +H AA++ G + E +D A++N FA MRLG FDG P YG
Sbjct: 308 ------------LKEHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGH 355
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
LG DVCT +++LAL+AA+ GIVLLKN AG LPL +A+ + AVIG NAN + GNY
Sbjct: 356 LGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNY 415
Query: 400 EGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
G C+ TTPLQG+ + V+ + + AGCS+ CG A A A+S++ L MG Q
Sbjct: 416 FGPACETTTPLQGVQSYVSNVRFLAGCSSAACGYAATGQAAALASSSEYVFLFMGLSQDQ 475
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E E DR LLLPG+QQ LIT VA AK PVIL++++GG DI+FA+++PKI +ILW GY
Sbjct: 476 EKEGLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGY 535
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQAGG AIA V FG +NPSGRLP+TWYP+ +
Sbjct: 536 PGQAGGLAIARVLFGDHNPSGRLPVTWYPEEF 567
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 247/544 (45%), Positives = 333/544 (61%), Gaps = 66/544 (12%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L EKI L ++A SV RLGIP YEWWSE+LHG++ GPG F+ + ATSFPQVI++AA
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFN +L+ IG V+ E RAMYN G AGLTFW+PNIN+FRDPRWGRGQETPGEDP + S+
Sbjct: 62 SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121
Query: 193 YATGYVKGLQQTDGGSP-------------------NRLKVAACCKHYTAYDLDNWKGTD 233
Y +V+G Q+ +L ++ACCKH+TAYDL+ W
Sbjct: 122 YGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFT 181
Query: 234 RYHFNAMV----IYTYYLIKFK-------------------------------------- 251
RY FNA+V + Y F+
Sbjct: 182 RYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKARVEWG 241
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
YI SDCD+V ++ Q YTK+PEEA A +I AG+D+NCG+++ +HT++A++ G V+E
Sbjct: 242 FEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSE 301
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+D A+ N FA +RLG FDG P + YG+LG D+C+ ++ LAL+A RQGIVLLKN
Sbjct: 302 ELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKND 361
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL-QGLAAVVATIYQAGCSN 427
LPL+ + +LA++GP AN M G Y G PC+ T + L V T Y +GCS+
Sbjct: 362 HKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSD 421
Query: 428 VQCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK 486
V C + +A A AD ++V G D S E E DR+ L LPG+Q+ L++ VA V+K
Sbjct: 422 VSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSK 481
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PVIL++ GG D++FAKNDP+I SI+W+GYPG+ GG A+A++ FG +NP GRLP TWY
Sbjct: 482 KPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTTWY 541
Query: 547 PQSY 550
P+S+
Sbjct: 542 PESF 545
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 258/457 (56%), Positives = 313/457 (68%), Gaps = 51/457 (11%)
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
+ VS E RAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP +A++YA YV+GLQQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 204 TDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------- 251
S RLK+AACCKH+TAYDLDNW GTDR+HFNA+V + + + F+
Sbjct: 287 QQPSS-GRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRA 345
Query: 252 ----------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAA 277
YIVSDCDSVDV Y+ QHYT+T E+A
Sbjct: 346 ASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAV 405
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A ++ AGLDL+CG FL ++TE AV G V + ID A++N MRLG FDG P+ QP+
Sbjct: 406 AATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPF 465
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP-TAIKNLAVIGPNANVTKTMI 396
G LGP+ VCT A+Q+LA++AARQGIVLLKN +LPLSP TA + +AV+GP+A T MI
Sbjct: 466 GHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMI 525
Query: 397 GNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQC-GTAQ-VDDAKKAAASADATVLVMG 453
GNY G PC+YTTPLQG+A A +Q GC++V C G+ Q + A AA ADAT++V G
Sbjct: 526 GNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAG 585
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
DQ IEAE DR LLLPG+Q LI+ VAK +KGPVIL++MSGG DI FA+NDPKI I
Sbjct: 586 LDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGI 645
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYPGQAGG AIADV FG +NP G+LP+TWYPQ Y
Sbjct: 646 LWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDY 682
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/100 (58%), Positives = 69/100 (69%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A+L FC SL R DLV RLT EK+ LV++A V RLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
GPG F PGAT+FPQVI TAASFNA+L++ IG+ S+
Sbjct: 99 GPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSS 138
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 253/552 (45%), Positives = 352/552 (63%), Gaps = 49/552 (8%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S ++ FC+ SL I R LV LTL EKI L ++AG V RLGIP Y+WWSE+LHG++
Sbjct: 33 STSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA 92
Query: 107 YVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
GPG +FS+ V AT FPQVIL+ A+FN SL++A+ + V+TEA M+N G AGLT+W+
Sbjct: 93 DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWA 152
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
PNINIFRDPRWGRGQET GEDP +A+ Y+ YVKG Q +G R++++ACCKHYTAYD
Sbjct: 153 PNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEE-GRIRLSACCKHYTAYD 211
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
++ W+G RY FNA V + Y FK
Sbjct: 212 MEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAHKDLL 271
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAA 300
YI SDCD+V +++ +Q YTK+ E++ A + AG+D+NCGSFL +HT++A
Sbjct: 272 QKTRDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDINCGSFLVRHTKSA 331
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
++ G + E ID A+ N F+ +RLG FD + Q + QLGP VCT+ +++LA +A RQ
Sbjct: 332 IEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELAAEAVRQ 391
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA-VVAT 419
G VLLKN LPL + ++++A+IGP+AN M G+Y G PC TT L+G+ A T
Sbjct: 392 GAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQAYATQT 451
Query: 420 IYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLI 478
+ GC + C + + +A +AA AD V++ G + + E E DR+ LLLPG+Q LI
Sbjct: 452 SFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPGKQMGLI 511
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
+A VAK P++L+++ GG D+SFAK DP+I SILW+GYPG+ GG + ++ FG YNP
Sbjct: 512 HAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPG 571
Query: 539 GRLPMTWYPQSY 550
G+LP+TWYP+S+
Sbjct: 572 GKLPITWYPESF 583
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 258/457 (56%), Positives = 313/457 (68%), Gaps = 51/457 (11%)
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
+ VS E RAMYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP +A++YA YV+GLQQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 204 TDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------- 251
S RLK+AACCKH+TAYDLDNW GTDR+HFNA+V + + + F+
Sbjct: 287 QQPSS-GRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRA 345
Query: 252 ----------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAA 277
YIVSDCDSVDV Y+ QHYT+T E+A
Sbjct: 346 ASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAV 405
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A ++ AGLDL+CG FL ++TE AV G V + ID A++N MRLG FDG P+ QP+
Sbjct: 406 AATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPF 465
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP-TAIKNLAVIGPNANVTKTMI 396
G LGP+ VCT A+Q+LA++AARQGIVLLKN +LPLSP TA + +AV+GP+A T MI
Sbjct: 466 GHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMI 525
Query: 397 GNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQC-GTAQ-VDDAKKAAASADATVLVMG 453
GNY G PC+YTTPLQG+A A +Q GC++V C G+ Q + A AA ADAT++V G
Sbjct: 526 GNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAG 585
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
DQ IEAE DR LLLPG+Q LI+ VAK +KGPVIL++MSGG DI FA+NDPKI I
Sbjct: 586 LDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGI 645
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYPGQAGG AIADV FG +NP G+LP+TWYPQ Y
Sbjct: 646 LWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDY 682
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 57/100 (57%), Positives = 69/100 (69%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A+L FC SL R DLV R+T EK+ LV++A V RLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
GPG F PGAT+FPQVI TAASFNA+L++ IG+ S+
Sbjct: 99 GPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSS 138
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 255/547 (46%), Positives = 343/547 (62%), Gaps = 49/547 (8%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ SL +R LV LTL EKI L ++A V RLGIP YEWWSE+LHG++ GPG
Sbjct: 39 FCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGPGV 98
Query: 113 HFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
+FS+ V AT FPQVIL+AASFN SL++A+ + V+ EARAM+N G AGLT+W+PNIN+F
Sbjct: 99 NFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNINVF 158
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG-GSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGRGQETPGEDP + + Y+ YVKG Q G G R+ ++ACCKHY AYDL+ W
Sbjct: 159 RDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEGRMMLSACCKHYVAYDLEKWG 218
Query: 231 GTDRYHFNAMV----IYTYYLIKFK----------------------------------- 251
RY FNA V Y FK
Sbjct: 219 NFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLLQKVRD 278
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
Y+VSDCD+V ++Y Q+YT + E++ A + AG+D+NCGSFL +HT++A++ G
Sbjct: 279 EWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGMDINCGSFLIRHTKSAIQKGK 338
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E I+HA+ N F+ +RLG FD Q + QLGP ++CT+ +++LA +AARQG VLL
Sbjct: 339 ITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAEAARQGTVLL 398
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIYQAG 424
KN LPL + + ++A+IGP AN M G+Y G PC TT L+G+ AVV T AG
Sbjct: 399 KNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAVVPQTTIAAG 458
Query: 425 CSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
C ++ C T +A + A AD VL+ G + + E E DR+ LLLPG+Q LI +A
Sbjct: 459 CKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQMDLINSIAS 518
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
V K P++L+I GG D+SFAK D +I S+LW+GYPG+ GG + ++ FG YNP G+LP+
Sbjct: 519 VTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGEYNPGGKLPI 578
Query: 544 TWYPQSY 550
TWYP+S+
Sbjct: 579 TWYPESF 585
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 256/570 (44%), Positives = 341/570 (59%), Gaps = 70/570 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
++ +C+ SL RV DLV RL+L+EK+ L D A R+G+P Y+WW EALHGVS
Sbjct: 63 MSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSD 122
Query: 108 VGPG-THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSP 166
VGPG T F +VVPGATSFP VI +AA+FN SL++AIG VVSTE RAMYN+G A LT+WSP
Sbjct: 123 VGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHAELTYWSP 182
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT-------DGGSP--NRLKVAAC 217
NIN+ RDPRWGR ETPGEDP + +YA +V+G+Q P +KV++C
Sbjct: 183 NINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAATADPFSRPIKVSSC 242
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------- 251
CKH+ AYD+D W DR F+A V + F+
Sbjct: 243 CKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIP 302
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCG 290
YIVSDCDSV V+ + T EA A ++ AGLDL+CG
Sbjct: 303 ACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCG 362
Query: 291 -------SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
F + AV+ G + E+ +D+A+ N + TLMRLGFFDG P + LG
Sbjct: 363 MFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDGMPE---FESLGAD 419
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG--PNANVTKTMIGNYEG 401
DVCT+ +++LA DAARQG+VLLKN A LPL P+ I +++++G + N T M+G+Y G
Sbjct: 420 DVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINATDVMLGDYRG 479
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEA 460
PC+ TP + VV Y C + C TA+ + A + A ADAT+++ G + S+E
Sbjct: 480 KPCRIVTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADATIVIAGLNMSVER 539
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
ES+DR DLLLP Q I VA+ + P++L+IMS GG D+SFA+N+ KI +I+W GYPG
Sbjct: 540 ESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKIGAIVWAGYPG 599
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ GG AIADV FG+YNP GRLP+TW+ Y
Sbjct: 600 EEGGTAIADVLFGKYNPGGRLPLTWFKNEY 629
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 269/578 (46%), Positives = 346/578 (59%), Gaps = 76/578 (13%)
Query: 43 VSNPS-LASLGF-------CNTSLGI-DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
V +P+ A+LGF CN+SL I D+RV DLV R+TL+EK T ++ A V R+G+P
Sbjct: 16 VCDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLP 75
Query: 94 KYEWWSEALHGVSYV---GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
Y+WWSEALHGVS V GP T F VPGATSFP VIL+AASFN SL++ I +VVS EA
Sbjct: 76 PYQWWSEALHGVSSVSINGP-TFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEA 134
Query: 151 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG--- 207
RA YN+G AGLTFW PN+N+ RDPRWGR QET GEDP S YA YV+GLQ +G
Sbjct: 135 RATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENT 194
Query: 208 ---SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------- 251
+ LKV++ KH+ AYDLDNW DR HFNA V + +L F+
Sbjct: 195 TDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVS 254
Query: 252 ---------------------------------YIVSDCDSVDVLYNSQHYTK-TPEEAA 277
YIVSDC S++ + Q + T EEA
Sbjct: 255 GVMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAV 314
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A ++ AGLDL CG + +AV AG V + +D ++SN + LMRLGFFDG P+
Sbjct: 315 ALNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPA---L 371
Query: 338 GQLGPKDVCTQANQ-DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMI 396
LG D+C A +LA +AARQGIVLLKN +LPL ++KNLA++GPNA+ M+
Sbjct: 372 ASLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPLK--SVKNLALVGPNADAYGAMM 429
Query: 397 GNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMG-A 454
GNY G PC+ +P +A+ Y+ GC +V C V A +AA AD T++V+G
Sbjct: 430 GNYAGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGIT 489
Query: 455 DQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS--GGGFDISFAKNDPKITS 512
D SI E DR+DLLLPG Q L+ ++AK P+IL++ GG DISFA+++P I
Sbjct: 490 DVSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEP 549
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ILW G+PG+ GG AIADV +G+YNP GRLP+TWY Y
Sbjct: 550 ILWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGY 587
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 256/569 (44%), Positives = 336/569 (59%), Gaps = 69/569 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
++ +C+ SL RV DLV RL L+EK+ L D A R+G+P Y+WW EALHGVS
Sbjct: 75 MSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSD 134
Query: 108 VGPG-THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSP 166
VGPG T F +VVPGATSFP VI +AA+FN SL++AIG VVSTE RAMYN+G A LT+WSP
Sbjct: 135 VGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHAELTYWSP 194
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD------GGSP--NRLKVAACC 218
NIN+ RDPRWGR ETPGEDP + +YA +V+G+Q D P +KV++CC
Sbjct: 195 NINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAADPFSRPIKVSSCC 254
Query: 219 KHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------- 251
KH+ AYD+D W DR F+A V + F+
Sbjct: 255 KHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIPA 314
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCG- 290
YIVSDCDSV V+ + T EA A ++ AGLDL+CG
Sbjct: 315 CADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCGM 374
Query: 291 ------SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
F + AV+ G + E +D+A+SN + TLMRLGFFDG P + LG +
Sbjct: 375 FWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGMPE---FESLGASN 431
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG--PNANVTKTMIGNYEGT 402
VCT +++LA DAARQG+VLLKN A LPL P I +++++G + N T M+G+Y G
Sbjct: 432 VCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHINATDVMLGDYRGK 491
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAE 461
PC+ TP + +V Y C + C TA+ + A A ADAT+++ G + S+E E
Sbjct: 492 PCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADATIVIAGLNMSVERE 551
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
S+DR DLLLP Q I VA + P++L+IMS GG D+SFA N+ KI +I+W GYPG+
Sbjct: 552 SNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNTKIGAIVWAGYPGE 611
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIADV FG+YNP GRLP+TW+ Y
Sbjct: 612 EGGTAIADVLFGKYNPGGRLPLTWFKNEY 640
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 250/550 (45%), Positives = 344/550 (62%), Gaps = 49/550 (8%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
S FC+ SL +R LV LTL EKI L ++A V RLG+P YEWWSE+LHG++ G
Sbjct: 37 SYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGLADNG 96
Query: 110 PGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNI 168
PG +FS+ V AT FPQVIL+AA+FN SL++A+ + V+ EARAM+N G AGLT+W+PNI
Sbjct: 97 PGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNI 156
Query: 169 NIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG-GSPNRLKVAACCKHYTAYDLD 227
N+FRDPRWGRGQETPGEDP + + Y+ YVKG Q G G R+ ++ACCKHY AYDL+
Sbjct: 157 NVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDGREGRMMLSACCKHYIAYDLE 216
Query: 228 NWKGTDRYHFNAMV----IYTYYLIKFK-------------------------------- 251
W RY FNA V Y FK
Sbjct: 217 KWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLLQK 276
Query: 252 ---------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVK 302
YIVSDCD+V +++ +Q YT + E++ A + AG+D+NCGSFL +HT++A++
Sbjct: 277 IRDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGMDVNCGSFLIRHTKSAIE 336
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGI 362
G + E I+HA+ N F+ +RLG F+ Q + +LGP +VCT+ +++LA +A RQG
Sbjct: 337 KGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTKEHRELAAEAVRQGT 396
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-ATIY 421
VLLKN LPL + + ++A+IG AN M G+Y G PC T L+G+ A V T
Sbjct: 397 VLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPITFLKGMQAFVPQTTV 456
Query: 422 QAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
AGC +V C + +A +AA AD V++ G + + E+E DR+ LLLPG+QQ L+
Sbjct: 457 AAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRVTLLLPGRQQDLVNI 516
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
+A V K P++L+I GG D++FAK DP+I S+LW+GYPG+ GG + ++ FG YNP G+
Sbjct: 517 IASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQVLPEILFGEYNPGGK 576
Query: 541 LPMTWYPQSY 550
LPMTWYP+S+
Sbjct: 577 LPMTWYPESF 586
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 261/464 (56%), Positives = 315/464 (67%), Gaps = 67/464 (14%)
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
+L + +G +V+ + A+ +G+ +WS ETPGEDPLLASKYA G
Sbjct: 70 TLAEKVGFLVNKQP-ALGRLGIPAYEWWS---------------ETPGEDPLLASKYAVG 113
Query: 197 YVKGLQQTDGG--SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV------------- 241
YV GLQ G + LKVAACCKHYTAYD+DNWKG +RY F+A V
Sbjct: 114 YVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPF 173
Query: 242 ------------IYTYYLIKFK---------------------YIVSDCDSVDVLYNSQH 268
+ +Y + K YIVSDCDSVDVLY QH
Sbjct: 174 KSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQH 233
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
YTKTPEEAAA +I +GLDLNCG+FL +HT AAV+AG ++E +D AI+NNF LMRLGFF
Sbjct: 234 YTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFF 293
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
DG P + +G LGPKDVCT +N++LA + ARQGIVLLKN+ G+LPLS +IK++AVIGPN
Sbjct: 294 DGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPN 352
Query: 389 ANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASAD 446
AN + TMIGNYEGTPCKYTTPLQGL A V T+YQ GC+NV C + Q+ A AAASAD
Sbjct: 353 ANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASAD 412
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
TVLV+GADQSIE ES DR LLLPGQQ L++ VA + GPVIL++MSGG FDISFAK
Sbjct: 413 VTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKA 472
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
KI +ILWVGYPG+AGGAA+AD+ FG +NPSGRLP+TWYP SY
Sbjct: 473 SDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASY 516
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 49/74 (66%), Gaps = 1/74 (1%)
Query: 27 HSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS 86
++ ++ +PVFACD SN +LA+ GFCN R DLV RLTL EK+ FLV+ +
Sbjct: 26 NAGTAEAQAPVFACDA-SNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPA 84
Query: 87 VSRLGIPKYEWWSE 100
+ RLGIP YEWWSE
Sbjct: 85 LGRLGIPAYEWWSE 98
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 252/569 (44%), Positives = 332/569 (58%), Gaps = 74/569 (13%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
+A +C+ SL RV DLV RLTL+EK+ L D A +R+G+P Y WW EALHGVS
Sbjct: 57 EMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPYLWWGEALHGVS 116
Query: 107 YVGPG-THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
GPG T F +VVPGATSFP VI +AA+FN +L+ AIG VSTE RAMYN+G A LT+WS
Sbjct: 117 DTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMYNLGHAELTYWS 176
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--------LKVAAC 217
PNIN+ RDPRWGR ETPGEDP + +YA +V+ +Q DG P +KV++C
Sbjct: 177 PNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGADPFARPIKVSSC 236
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------- 251
CKHY AYD+D W DR F+A V + F+
Sbjct: 237 CKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASCVMCSYNRINGVP 296
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCG 290
YIVSDCDSV V+ + EA A ++ AGLDL+CG
Sbjct: 297 ACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATAAAMKAGLDLDCG 356
Query: 291 SFLGKHTE-------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
F + AV+ G + ES +D+A+ N + TLMRLGFFDG P + LG
Sbjct: 357 MFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDGIPELE---SLGAN 413
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG--PNANVTKTMIGNYEG 401
DVCT+ +++LA DAARQG+VL+KN G LPL + + +L+++G + N T M+G+Y G
Sbjct: 414 DVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQHINATDVMLGDYRG 473
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE 461
PC+ TP + VV+ C + C TA + DAT+++ G + S+E E
Sbjct: 474 KPCRVVTPYDAIRKVVSATSMQVCDHGACSTA------ANGKTVDATIVIAGLNMSVEKE 527
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
+DR DLLLP Q I VA+ + P+IL+I+S GG D+SFA+N+PKI +I+W GYPG+
Sbjct: 528 GNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNNPKIGAIVWAGYPGE 587
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG AIADV FG+YNP GRLP+TWY Y
Sbjct: 588 EGGTAIADVLFGKYNPGGRLPLTWYKSEY 616
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 252/571 (44%), Positives = 335/571 (58%), Gaps = 72/571 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA-GSVSRLGIPKYEWWSEALHGVS 106
+A +C+ SL RV DLV RLTL+EK+ L D A G+ R+G+P+Y WW EALHGVS
Sbjct: 58 MAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPRYMWWGEALHGVS 117
Query: 107 YVGPG-THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
PG T F +VVPGATSFP V+ +AA+FN +L++AIG STE RAMYN+G A LT+WS
Sbjct: 118 DTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRAMYNLGHAELTYWS 177
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS----------PNRLKVA 215
PNIN+ RDPRWGR ETPGEDP L ++A +V+ +Q D G+ RLKV+
Sbjct: 178 PNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGAGAADPFARRLKVS 237
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------ 251
+CCKHY AYD+D W G DR F+A V + F+
Sbjct: 238 SCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGDASCVMCSYNRING 297
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLN 288
YIVSDCDSV V+ + +A A ++ AGLDL+
Sbjct: 298 VPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQATAAAMKAGLDLD 357
Query: 289 CG-------SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
CG F + AV+ G + E+ +D A+ + + TLMRLGFFDG P Q LG
Sbjct: 358 CGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFFDGSPEFQ---SLG 414
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG--PNANVTKTMIGNY 399
DVCT+ ++++A +AARQG+VLLKN LPL + +LA++G + N T M+G+Y
Sbjct: 415 ASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQHINATDVMLGDY 474
Query: 400 EGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIE 459
G PC+ TP + + VV+ C CGT + A AA + DAT+++ G + S+E
Sbjct: 475 RGKPCRVVTPYEAIRKVVSGTSMQACDKGACGTTALG-AAIAAKTVDATIVITGLNMSVE 533
Query: 460 AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E +DR DLLLP Q I VA+ ++ P+ L+I+S GG DISFA+N+PKI +ILW GYP
Sbjct: 534 REGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQNNPKIGAILWAGYP 593
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G+ GG IADV FG+YNP GRLP+TWY Y
Sbjct: 594 GEEGGTGIADVLFGKYNPGGRLPLTWYKNEY 624
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 253/548 (46%), Positives = 330/548 (60%), Gaps = 67/548 (12%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ L + R DLV LTL EK++ L D A V+RLG+P YEWWSE LHG+S G G
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIF 171
F+ V TSFPQVILTAA+F+A L++ +G+ V EARA+YN+G A GLT WSPN+NIF
Sbjct: 91 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDP R PG D ++ +Q GG + +ACCKH TAYDLD W
Sbjct: 151 RDPSGTR----PG-DARRGPRHG-------EQGIGG-----EASACCKHATAYDLDYWNN 193
Query: 232 TDRYHFNAMV----IYTYYLIKFK------------------------------------ 251
RY++++ V + Y FK
Sbjct: 194 VVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQ 253
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
Y+ SDCD+V + ++ HYT +PE+ A SI G+D+NCG++ H AAV+ G
Sbjct: 254 EWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGN 313
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
+ E ID A+ N FA MRLG FDG P S YG LG DVC+ A++ LAL+AA+ GIVL
Sbjct: 314 LTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVL 373
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--TIYQ 422
LKN AG+LPL P+A+ +LAVIGPNA+ + GNY G PC+ TTPLQG+ + +
Sbjct: 374 LKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFL 433
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
AGC + C ++A A+S+D VL MG Q E + DR LLLPG+QQ LIT VA
Sbjct: 434 AGCDSPACAVDATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVA 493
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
A+ PVIL++++GG D++FAK++PKI +ILW GYPGQAGG AIA V FG +NPSGRLP
Sbjct: 494 NAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLP 553
Query: 543 MTWYPQSY 550
+TWYP+ +
Sbjct: 554 VTWYPEEF 561
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 229/377 (60%), Positives = 260/377 (68%), Gaps = 49/377 (12%)
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVST 148
RLG+ YEWWSEALHGVS VGPGT F P ATSFPQVI TAASFNASL+QAIG+ VS
Sbjct: 1 RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60
Query: 149 EARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
EARAMYN G AGLT+WSPN+NIFRDPRWGRGQETPGEDP L+++YA YV GLQ G
Sbjct: 61 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQ---GNY 117
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
NRLKVAACCKHYTAYDLDNW G DR+HFNA V + Y + FK
Sbjct: 118 GNRLKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVASVMC 177
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
YIVSDCDSV VLY+ QHYT+TPEEAAA +I
Sbjct: 178 SYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTIN 237
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDL+CG FL HTE A++ GLV E+A++ A++N MRLG FDG PS QP+G LGP
Sbjct: 238 AGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFGNLGP 297
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+DVCT A+QDLAL AAR+GIVLLKN GSLPLS +N+AVIGPNA T TMIGNY G
Sbjct: 298 RDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGNYAGI 357
Query: 403 PCKYTTPLQGLAAVVAT 419
C YT+PLQG++ T
Sbjct: 358 ACGYTSPLQGISRYART 374
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 241/494 (48%), Positives = 308/494 (62%), Gaps = 56/494 (11%)
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNIN 169
G + +P ATSFPQVILTAASF+ L+ I KV+ TEAR +YN G A G+ FW+PNIN
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 170 IFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS--PNRLKVAACCKHYTAYD 225
IFRDPRWGRGQET GEDPL+ SKY YV+GLQ +GG RLK +ACCKH+TAYD
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYD 121
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
L+NWKG +RY F+A V + Y F
Sbjct: 122 LENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLL 181
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YI SDCD+V +Y Q Y KTPE+ A + AG+DL CG+++ KH ++
Sbjct: 182 TNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVLRAGMDLECGNYMTKHAKS 241
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV + S ID A+ N F +RLG FDG+P+K YG++GP VC++ N DLAL+AAR
Sbjct: 242 AVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAAR 301
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTK-TMIGNYEGTPCKYTTPLQGLAAVVA 418
GIVLLKNTA LPL + L VIGPNAN + ++GNY G PCK + L+G +
Sbjct: 302 SGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYTYAS 359
Query: 419 -TIYQAGCSN-VQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
T Y +GC++ +C +A++D A + A +D +LVMG DQS E E+ DR L LPG+QQ
Sbjct: 360 QTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQK 419
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI VAK +K PVIL+++ GG DI+FAKN+ KI I+W GYPG+ GG A+A V FG YN
Sbjct: 420 LINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYN 479
Query: 537 PSGRLPMTWYPQSY 550
P GRLPMTWYP+ +
Sbjct: 480 PGGRLPMTWYPKDF 493
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 247/563 (43%), Positives = 337/563 (59%), Gaps = 64/563 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR---------LGIPK 94
++P+ ++ FCN +L R LV LTL EKI L+ +G+P
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGVPS 90
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
+ S GP V AT FPQVIL+AA+FN SL++A + ++ EARAM+
Sbjct: 91 TASATTGPGSTSPRGP-------VRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMH 143
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
N G AGLTFW+PNIN+FRDPRWGRGQETPGEDP + S Y+ YVKG Q+ D G R+ +
Sbjct: 144 NAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQR-DYGEEGRMML 202
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
+ACCKHY AYDL+ W+G RY FNA V + Y FK
Sbjct: 203 SACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVN 262
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
YI SDCD+V +++ +Q YT + E++ A + AG+D+NC
Sbjct: 263 GVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINC 322
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
GSFL +HT++A++ G V E I+HA+ N F+ +RLGFFD Q + QLGP +VCT
Sbjct: 323 GSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTE 382
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+++LA +A RQG VLLKN G LPL + + ++A+IGP AN + G+Y G PC TT
Sbjct: 383 HRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTF 442
Query: 410 LQGLAAVV-ATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
++G+ A V T + AGC +V C T +A +AA AD VL+ G + + E E HDR+
Sbjct: 443 VKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVS 502
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
LLLPG+Q LI VA V K PV+L++M GG D+SFAK+DP+I SILW+GYPG+ GG +
Sbjct: 503 LLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVL 562
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
++ FG+YNP G+LP+TWYP+S+
Sbjct: 563 PEILFGKYNPGGKLPITWYPESF 585
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 247/577 (42%), Positives = 326/577 (56%), Gaps = 103/577 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+C+ L RV DL+ +T++EK++ L D A R+G+P Y+WWSEALHG+S GP T
Sbjct: 62 YCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGPTT 121
Query: 113 HFSNV-----------VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
F ++ V T F VI +AASFN SL+++IG+ +STEARAMYN+G GL
Sbjct: 122 KFDDLKKPRLHSGRAAVFNGTVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKGGL 181
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD------GGSP--NRLK 213
T+WSPNIN+ RDPRWGR ETPGEDP + +YA +V+G+Q D G P LK
Sbjct: 182 TYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFNGDPLSRPLK 241
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK---------------------- 251
+ACCKHY AYD+D+W G R+ F+A V + F+
Sbjct: 242 TSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASAVMCSYNRV 301
Query: 252 ------------------------YIVSDCDSVDVLY-NSQHYTKTPEEAAAKSILAGLD 286
YIVSDCD+V V+ N+ TP EA+A S+ AGLD
Sbjct: 302 NGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASAASLKAGLD 361
Query: 287 LNCGS------------FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK 334
L+CG FL + AAV+ G + ES ID+A+ N + TLMRLG+FDG P
Sbjct: 362 LDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLGYFDGMPR- 420
Query: 335 QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV-TK 393
Y L KD+C++A++ LALD ARQ +VLLKN G LPL + + ++AV GP+A K
Sbjct: 421 --YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVRGPHAEAPEK 478
Query: 394 TMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG 453
M G+Y G PC+Y TP +G++ V Q G D T+ + G
Sbjct: 479 VMDGDYTGPPCRYITPREGISKDVNISQQGG---------------------DVTIYMGG 517
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
+ IE E +DR DLLLP Q I VA + P++L+I+SGGG D+SFA++ PKI +I
Sbjct: 518 INMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSFAQSHPKIGAI 577
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYPG GG AIADV FGRYNP GRLP+TW+ Y
Sbjct: 578 LWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKY 614
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 273/573 (47%), Positives = 335/573 (58%), Gaps = 93/573 (16%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
+SVLFL + SLQ ++++ FACD + + L FC +L I RV DL+ R+T
Sbjct: 9 LSVLFLGV---SLQ----TSKALDPFACDP-KDGTNRDLPFCQVNLPIHTRVNDLIGRMT 60
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
LQEK+ LV++A +V RLGI YEWWSEALHGVS VGPGT F P ATSFPQVI TAA
Sbjct: 61 LQEKVGLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAA 120
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASK 192
SFNA+L++AIG+VVS EARAM+N G+AGLT+WSPN+ PRWGRGQETPGEDP++ K
Sbjct: 121 SFNATLWEAIGRVVSDEARAMFNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGK 180
Query: 193 YATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLI 248
YA YV+GLQ +DG RLKVAACCKH+TAYDLDNW G DR+HFNA V + + +
Sbjct: 181 YAASYVRGLQGSDG---IRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDV 237
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ YIVSDCDS V Y
Sbjct: 238 PFRMCVKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQ 297
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
QH+T +P ++ AGLDL+CG FL H +A KA E+ I++A + LG
Sbjct: 298 QHFT-SPRRSSLGCYKAGLDLDCGPFLVTHRDAVKKAA--EEAEINNAWLKTLTFQISLG 354
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
FDG P Q G + P + NQDLA++A ++ + + KN A L SP I G
Sbjct: 355 IFDGSP-LQAVGDVVPT-MGPPTNQDLAVNAPKR-LFIFKNRA-FLLYSPRHI-----FG 405
Query: 387 PNANVTKT--MIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAAS 444
P A M+GNYEG PCKY PLQGLA V+ +Y GCSNV C A V A AAS
Sbjct: 406 PVALFKSLPFMLGNYEGLPCKYLFPLQGLAGFVSLLYLPGCSNVICAVADVGSAVDLAAS 465
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM----SGGGFD 500
ADA VLV+GADQSIE E HDR+D LPG+QQ L+T VA AKGPV+L+IM SGGG
Sbjct: 466 ADAVVLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIMDLAISGGGCS 525
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
+ Q G I+DVC G
Sbjct: 526 YN------------------QVNGIPISDVCEG 540
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 250/558 (44%), Positives = 324/558 (58%), Gaps = 56/558 (10%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CDV SL FCNT+L D R+ DL+ R+ + LV+SA V L +P Y+WWS
Sbjct: 174 CDVEPGKSLP---FCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHLNLPAYQWWS 230
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
EALHGV + PG HF VP ATSFPQVI T A+FN +L++ IG V+STEARAM NV A
Sbjct: 231 EALHGVGH-SPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEARAMNNVQRA 289
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
G TFW+PNINI RDPRWGRGQETPGEDP +YA +V G Q DG N +K ++CCK
Sbjct: 290 GNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQ--DGEDMNYIKASSCCK 347
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
H+ Y+L+NW G DR+H+NA+ I YL F+
Sbjct: 348 HFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGLMCSYNAVNGVPSC 407
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
YI SDC +V + NS +T+ E + AG+D +CGSF+
Sbjct: 408 ANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRAVLEAGMDTDCGSFV 467
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
++ A++ G+V ++ A+ F RLG FD SKQPY V T ANQ L
Sbjct: 468 QQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFDP-VSKQPYTNYSVARVNTPANQQL 526
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL+AA+QGIVLLKNT LPL T + ++A+IGPNA+ T M GNY+GT +P++G
Sbjct: 527 ALEAAQQGIVLLKNTNARLPLK-TGL-HVALIGPNADATTVMQGNYQGTAPFLISPVRGF 584
Query: 414 AAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
A + Y GC T+ D A AA ADA V+V+G DQ E+E HDR + LPG
Sbjct: 585 KNYSAAVTYAKGCDVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQESEGHDRTSITLPG 644
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q+ L+ +VA AK P+++ +M+GG D+S K + + ILW GYPGQ+GG A+ADV F
Sbjct: 645 HQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYPGQSGGQAMADVVF 704
Query: 533 GRYNPSGRLPMTWYPQSY 550
G +P GRLP T YP SY
Sbjct: 705 GAVSPGGRLPYTIYPGSY 722
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 248/583 (42%), Positives = 326/583 (55%), Gaps = 104/583 (17%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
++++ +C+ SL RV DL+ +T++EK+ L D + R+G+P Y+WWSEALHGVS
Sbjct: 57 NMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWSEALHGVS 116
Query: 107 YVGPGTHFSNV------------VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
GP F ++ V AT F VI +AASFN +L+++IG+ VSTEARAMY
Sbjct: 117 STGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVSTEARAMY 176
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT---DGG---- 207
N+G GLT+WSPNIN+ RDPRWGR ETPGEDP +A +YA +V+G+Q DGG
Sbjct: 177 NLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGDDP 236
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------ 251
S +K +ACCKHY AYD+D+W R+ F+A V + +L F+
Sbjct: 237 STRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVM 296
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKS 280
YIVSDCD+V V+ ++ + T E++A S
Sbjct: 297 CSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSAAS 356
Query: 281 ILAGLDLNCGS------------FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
I AGLDL+C FL ++ +AAV G + ES ID A+ N + TLMRLG+F
Sbjct: 357 IRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYF 416
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
D P Y L D+CT ++ LA D ARQG+VLLKN G LPL P I +AV GP+
Sbjct: 417 DNIPR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPH 473
Query: 389 ANV-TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADA 447
A K M G+Y G PC+Y TP QG++ V K + A+
Sbjct: 474 ARAPEKIMDGDYTGPPCRYVTPRQGISKDV----------------------KISHRANT 511
Query: 448 TVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
T+ + G + IE E +DR DLLLP Q I AK + P+IL+I+SGGG DISFA
Sbjct: 512 TIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKH 571
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PKI +ILW GYPG GG AIADV FGRYNP GRLP+TW+ Y
Sbjct: 572 PKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKY 614
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 246/561 (43%), Positives = 322/561 (57%), Gaps = 58/561 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D P + +CN S I RV DL+ RLT+ EK++ +A ++ RL IP Y+WWSE
Sbjct: 46 DYCEKPPFNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSE 105
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
LHG++ PG F N + ATSFPQVI A+FN SL A+G+V+STEARA N G +G
Sbjct: 106 CLHGLAQ-SPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQSG 164
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAACC 218
LTF++PNINI+RDPRWGRGQETPGEDP L S+YA +VKG+Q+ GS +R LK A C
Sbjct: 165 LTFFAPNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQE---GSEDRRYLKAIATC 221
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KHY AY+L+ + R +FNA+V + YL FK
Sbjct: 222 KHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPN 281
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
YIVSDC ++ + +YT A ++ G DLNCG F
Sbjct: 282 CANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTNITVADALKGGCDLNCGHF 341
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
K+ E A + E ID +++ F + MRLG FD P QP+ Q KDV T QD
Sbjct: 342 YEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFDP-PEIQPFRQYSVKDVNTPEAQD 400
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LAL+AAR+GIVLL+N LPL N+A IGPNA+ T M GNY G +PLQG
Sbjct: 401 LALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQG 460
Query: 413 LAAV-VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
+ + + YQ GC T DA KA DA + V+G + + E ESHDR + LP
Sbjct: 461 FSNLGINATYQIGCPVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIALP 520
Query: 472 GQQQLLITEVAK-VAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
G Q+ L+ E+ K AKG P+I+++MSGG D++ K+ +ILW GYPGQ+GG AIA+
Sbjct: 521 GHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKDIAD--AILWAGYPGQSGGQAIAE 578
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
V +G+ NPSGRLP+T+YP SY
Sbjct: 579 VIYGKVNPSGRLPVTFYPASY 599
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 235/567 (41%), Positives = 337/567 (59%), Gaps = 65/567 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
+V +P ++SL FC+ SL ID RV DLV R+ L++ + LV+ A + + +P YEWW+E
Sbjct: 22 NVCDDPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNE 81
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
ALHGV+ + PG F + ATSFPQV+ TAASFN +LF I + +STEARA YN AG
Sbjct: 82 ALHGVA-LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAG 140
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR-----LKV 214
LTFW+PN+NIFRDPRWGRGQETPGEDP L +YA +V+GLQ + G N+ LK+
Sbjct: 141 LTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDDNKFLKI 200
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVIY-----TYY----------------------- 246
++CCKH++AY + R+ +A+V TY+
Sbjct: 201 SSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVN 256
Query: 247 -------------LIKFK-----YIVSDCDSV-DVLYNSQHYTKTPEEAAAKSILAGLDL 287
L++ + YI SDC++V DV+Y H+T++PE+ A ++ AG+DL
Sbjct: 257 GIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYR-HHFTQSPEQTCATTLDAGMDL 315
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
NCG FL +H +A++ G+V+ + +A+ N F +MRLG F+ QP+ + V T
Sbjct: 316 NCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFE--KGTQPFSNITKDAVDT 373
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIK---NLAVIGPNANVTKTMIGNYEGTPC 404
A++ LAL+AARQ +VLLKN +LPL+ +LA+IGP+ N + ++GNY G P
Sbjct: 374 AAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPS 433
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TPL+G+++ V + Y GC D+A + AD V+ MG DQS E E
Sbjct: 434 HIVTPLKGVSSYVPNVAYSLGCKVSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEI 493
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LPG Q L+ + A P++L+++SGG D+S KN PK+ +I++ GY GQAG
Sbjct: 494 DRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAG 553
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+AD+ FG+Y+P+GRL T+Y Y
Sbjct: 554 GQALADMLFGKYSPAGRLTQTFYDSDY 580
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 244/581 (41%), Positives = 326/581 (56%), Gaps = 102/581 (17%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
++++ +C+ SL RV DL+ +T++EK+ L D + R+G+P Y+WWSEALHGVS
Sbjct: 64 NMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVS 123
Query: 107 YVGPGTHFSNV------------VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
GP F ++ V AT F VI +AASFN +L+ +IG+ VSTEARAMY
Sbjct: 124 STGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMY 183
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---GSPNR 211
N+G GLT+WSPNIN+ RDPRWGR ETPGEDP +A +YA +V+G+Q G G P+
Sbjct: 184 NLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDPSA 243
Query: 212 --LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------- 251
+K +ACCKH+ AYD+DNW R+ ++A V + +L F+
Sbjct: 244 RPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVMCS 303
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSIL 282
YIVSDCD+V V+ ++ + T E++A S+
Sbjct: 304 YNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVSLR 363
Query: 283 AGLDLNCGS------------FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
AG+DL+C +L ++ AAV G + ES ID+A++N + TLMRLG+FD
Sbjct: 364 AGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYFDN 423
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P Y L DVCT ++ LALD ARQGIVLLKN G LPL P +AV GP+A
Sbjct: 424 IPR---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPHAR 480
Query: 391 V-TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATV 449
K M G+Y G PC+Y TP QG++ V K + A T+
Sbjct: 481 APEKIMDGDYTGPPCRYVTPRQGISRDV----------------------KISHKAKMTI 518
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
+ G + IE E +DR DLLLP Q I A+ + P+IL+I+SGGG DISFA+ PK
Sbjct: 519 YLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQKHPK 578
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I +ILW GYPG GG AIADV FGRYNP GRLP+TW+ Y
Sbjct: 579 IGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKY 619
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 255/635 (40%), Positives = 340/635 (53%), Gaps = 124/635 (19%)
Query: 15 VLFLF-LTYCSLQHSSSSAQSSP-------------VFACDVVSNPSLA----SLGFCNT 56
+LF F + SL +++S ++SP CD L +C+
Sbjct: 14 ILFFFAIAASSLLSTATSTRASPADTAVGGIAAKVYTKVCDATRFAGLGLNMTEFRYCDA 73
Query: 57 SLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
SL RV DL+ R+T++EK+ L D +R+G+P Y WWSEALHG+S GP T F +
Sbjct: 74 SLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGPTTKFDD 133
Query: 117 V-----------VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
+ V AT F VI +AASFN +L+++IG+ VSTEARAMYN+G GLT+WS
Sbjct: 134 LATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKGGLTYWS 193
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG-------GSPNR--LKVAA 216
PNIN+ RDPRWGR ETPGEDP + +YA +V+G+Q G G PN LK +A
Sbjct: 194 PNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNTRPLKTSA 253
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------- 251
CCKHY AYDLD+W R+ F+A V + F+
Sbjct: 254 CCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCSYNRVNGI 313
Query: 252 ---------------------YIVSDCDSVDVLY-NSQHYTKTPEEAAAKSILAGLDLNC 289
YIVSDCD+V V+ N+ T EA+A ++ AGLDL+C
Sbjct: 314 PACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALKAGLDLDC 373
Query: 290 GS-------------FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP 336
G FL + AV G + ES ID+A++N + TLMRLG+FD
Sbjct: 374 GESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD---DIAQ 430
Query: 337 YGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV-TKTM 395
Y LG +D+CT ++ LALD ARQGIVLLKN LPL + + V GP+ K M
Sbjct: 431 YSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHVQAPEKIM 490
Query: 396 IGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
G+Y G PC+Y TP QG++ V + + A+ T+ G +
Sbjct: 491 DGDYTGPPCRYVTPRQGVSKYV----------------------RFSHRANTTIYFGGLN 528
Query: 456 QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
+IE E +DR D+LLP Q I VAK + P+IL+I+SGGG D+SFA+N+PKI +ILW
Sbjct: 529 LNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNNPKIGAILW 588
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPG GG AIADV FG++NPSGRLP+TW+ Y
Sbjct: 589 AGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKY 623
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 244/585 (41%), Positives = 324/585 (55%), Gaps = 106/585 (18%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
++ +C+ SL RV DL+ R+T++EK+ L D +R+G+P Y WWSEALHG+S
Sbjct: 63 NMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLS 122
Query: 107 YVGPGTHFSNV-----------VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
GP T F ++ V AT F VI +AASFN +L+++IG+ VSTEARAMYN
Sbjct: 123 STGPTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYN 182
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG-------GS 208
+G GLT+WSPNIN+ RDPRWGR ETPGEDP + +YA +V+G+Q G G
Sbjct: 183 MGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGD 242
Query: 209 PNR--LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
PN LK +ACCKHY AYDLD+W R+ F+A V + F+
Sbjct: 243 PNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSV 302
Query: 252 -------------------------------YIVSDCDSVDVLY-NSQHYTKTPEEAAAK 279
YIVSDCD+V V+ N+ T EA+A
Sbjct: 303 MCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAA 362
Query: 280 SILAGLDLNCGS-------------FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
++ AGLDL+CG FL + AV G + ES ID+A++N + TLMRLG
Sbjct: 363 ALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLG 422
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
+FD Y LG +D+CT ++ LALD ARQGIVLLKN LPL + + V G
Sbjct: 423 YFD---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRG 479
Query: 387 PNANV-TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASA 445
P+ K M G+Y G PC+Y TP QG++ V + + A
Sbjct: 480 PHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYV----------------------RFSHRA 517
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
+ T+ G + +IE E +DR D+LLP Q I VAK + P+IL+I+SGGG D+SFA+
Sbjct: 518 NTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQ 577
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
N+PKI +ILW GYPG GG AIADV FG++NPSGRLP+TW+ Y
Sbjct: 578 NNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKY 622
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 245/592 (41%), Positives = 331/592 (55%), Gaps = 73/592 (12%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
K +VL +FL S+ D + +C+ + I RV DL+ R+
Sbjct: 4 KFTVLLVFLFASSV--------------ADYCEKAPFNTYKYCDYTQSIPERVNDLLSRM 49
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
T+ +KI L+ SA ++ L IP Y+WWSE LHGV+ PG HF P ATSFPQVI
Sbjct: 50 TILDKIPQLITSAPAIPSLDIPAYQWWSEGLHGVA-GSPGVHFGGNFPNATSFPQVIGLG 108
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
A+FN SL A+ +V+STEARA N G AGLT+++PNINIFRDPRWGRGQETPGEDP L+S
Sbjct: 109 ATFNMSLVLAMAQVISTEARAFANGGQAGLTYFAPNINIFRDPRWGRGQETPGEDPYLSS 168
Query: 192 KYATGYVKGLQQTDGGSPNR-LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYY 246
+YA +VKG+Q+ G R LK A CKHY AYDL+N+ R+ FNA+V Y
Sbjct: 169 QYAANFVKGMQE--GADDTRYLKTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETY 226
Query: 247 LIKFK------------------------------------------YIVSDCDSVDVLY 264
F+ Y+VSDC ++ +
Sbjct: 227 FPAFRSCVEEGKVGSIMCSYNAVNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDII 286
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
NS YT ++ A + G DLNCG F H +AA G + + ID A++ F MR
Sbjct: 287 NSHKYTSNTDDTVAAGLRGGCDLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMR 346
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LG FD PS QP+ V T+ ++ LALDA+R+ IVLL+N LPLS T + +A+
Sbjct: 347 LGMFD-PPSMQPFRDYTNDKVDTKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIAL 405
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV-VATIYQAGCSNVQCGT-AQVDDAKKAA 442
+GP+ M GNY+GT +P+QGL + ++ + AGC+ V C T A + K
Sbjct: 406 VGPHGQAQGAMQGNYKGTAPYLISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLV 465
Query: 443 A--SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK--VAKGPVILIIMSGGG 498
S +A + V+G D+S E+E HDR L LPGQQ L+ ++ K V P I+++MSGG
Sbjct: 466 EEHSIEAIIAVIGLDESQESEGHDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGP 525
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D+S K+ +ILW GYPGQ+GG AIA+V +G+ NPSGRLP+T+YP SY
Sbjct: 526 VDLSGVKD--IADAILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASY 575
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/301 (71%), Positives = 251/301 (83%), Gaps = 3/301 (0%)
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
YI SDCDSVDVLYN+QHYTKTPE+AAA SI AGLDLNCG+FL +HT AAV+AG ++ES +
Sbjct: 32 YISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDV 91
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
D A++NN TLMRLGFFDG P + P+G LGP DVCT +NQ+LA +AARQGIVLLKNT G
Sbjct: 92 DRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNT-GK 150
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC- 430
LPLS T+IK++AVIGPNAN + TMIGNYEGTPCKYTTPLQGL A VAT+YQ GC+NV C
Sbjct: 151 LPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCS 210
Query: 431 -GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
+ Q+D A KAAASAD TVLV+GADQSIE ES DR LLLPGQQ L++ VA + GP
Sbjct: 211 GNSLQLDAATKAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPC 270
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
IL++MSGG FDISFAK+ KI +ILWVGYPG+AGGAAIADV FG +NPSGRLP+TWYP+S
Sbjct: 271 ILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPES 330
Query: 550 Y 550
+
Sbjct: 331 F 331
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/564 (41%), Positives = 325/564 (57%), Gaps = 63/564 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+ +L +L FCN +L + R DLV RLTLQEKI+ +A V+RLG+ YEWWSEALH
Sbjct: 28 EDAALRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALH 87
Query: 104 GVSYVGPGTHFSNVVPGATSFPQVI--------LTAASFNASLFQAIGKVVSTEARAMYN 155
GV+ PG +F+ P +T FPQ+I A+FN A+ +V+STEARA N
Sbjct: 88 GVAE-SPGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAFAN 146
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
G AGLT+++PNINIFRDPRWGRGQETPGEDP L S+Y V+ LQ +G LKV
Sbjct: 147 AGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQ--NGEDARYLKVV 204
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------- 251
A CKHYTAYD+++W G DR+HFNA+V + ++ F+
Sbjct: 205 ATCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNG 264
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
YIVSDC ++D + + +YT T + A I G DL+C
Sbjct: 265 IPSCADDFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGIQGGCDLDC 324
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G F H A+ ++E+ +D ++ F +RLG FD S QPY Q+ + +Q
Sbjct: 325 GDFYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDA-ASIQPYRQIPVSAINSQE 383
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+Q+LAL AR+ IVLL N +LP S ++ LA+IGPNA+ +T++GNY G TP
Sbjct: 384 HQELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITP 443
Query: 410 LQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
L+G + T+ + GC T+ A AA +ADAT++V+G +Q++E+E+ DR
Sbjct: 444 LKGFQQLDPTLSITFVKGCDVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRT 503
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
L+LPG Q LI + A+GPVIL++MSG D+S + + + LW+GYPGQAGG A
Sbjct: 504 TLVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIH--PVRAALWIGYPGQAGGRA 561
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+A+ FG ++P+GRLP T YP Y
Sbjct: 562 LAEAVFGVFSPAGRLPFTVYPADY 585
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 225/540 (41%), Positives = 311/540 (57%), Gaps = 67/540 (12%)
Query: 68 VKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQV 127
+ R+ L + + LV+ A + IP YEWW+EALHGV+ + PG F + ATSFPQV
Sbjct: 12 LTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGVTFKGSITAATSFPQV 70
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 187
+ TAASFN SLF I V+STEARA +N AGLTFW+PN+NIFRDPRWGRGQETPGEDP
Sbjct: 71 LSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGRGQETPGEDP 130
Query: 188 LLASKYATGYVKGLQQTDGGSPNR-------LKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
L +YA +V+GLQ G R LK+++CCKH++AY + R+ NAM
Sbjct: 131 YLTGEYAVAFVRGLQGE--GMEGREVENSKFLKISSCCKHFSAYSQE----VPRHRNNAM 184
Query: 241 VIY-----TYY------------------------------------LIKFK-----YIV 254
V TY+ L++ + YI
Sbjct: 185 VTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRGQWKFDGYIA 244
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDC++V + + HYT++PE+ A ++ AG+DLNCG FL +H A++ G+V I +A
Sbjct: 245 SDCEAVADVIDHHHYTQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQGIVTTEMIHNA 304
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ N F LMRLG F+ +P+ + V T ++ LAL+AARQ IVLLKN +LPL
Sbjct: 305 LKNQFRVLMRLGMFE---KVEPFANITKDSVDTTMHRQLALEAARQSIVLLKNDGNTLPL 361
Query: 375 SP---TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQC 430
+ T ++LA+IGP+ N + ++GNY G P TPL+G++ V + + GC
Sbjct: 362 ATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHSLGCKVSGE 421
Query: 431 GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
DDA A AD ++ +G DQS E E DR + LP Q L+ V +VA P++
Sbjct: 422 VLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLLKRVLEVASHPIV 481
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+++SGG D+S KN PK+ +I++ GY GQAGG A+ADV FG+YNPSG+LP T+Y Y
Sbjct: 482 FVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPSGKLPQTFYDSEY 541
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/436 (48%), Positives = 270/436 (61%), Gaps = 51/436 (11%)
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
WSPN+NIFRDPRWGRGQETPGEDP ASKY +VKGLQ G S L+ +ACCKH T
Sbjct: 1 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ---GSSLTNLQTSACCKHIT 57
Query: 223 AYDLDNWKGTDRYHFNAMV-------------------------IYTYYLI--------- 248
AYD++ WKG RY+FNA V + Y LI
Sbjct: 58 AYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPACASS 117
Query: 249 ---------KFK---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
++K Y SDCD+V +L+ S+H+T+T EEA A ++ AGLD+NCG ++ ++
Sbjct: 118 DLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYMQQN 177
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS-KQPYGQLGPKDVCTQANQDLAL 355
+A++ G + E +D A+ N FA MRLG FDG P + YG+L DVCT ++ LAL
Sbjct: 178 AASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKALAL 237
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+AAR+G+VLLKN A LPL + + AVIG NAN ++GNY G PC+ TTP G+
Sbjct: 238 EAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGGIQK 297
Query: 416 VVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQ 474
V + + GCS+ C A D A A S+D LVMG Q E E DR LLLPG+Q
Sbjct: 298 YVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQ 357
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
Q LIT VA +K PVILI+++GG DI+FA+ +PKI +ILW GYPGQAGG AIADV FG
Sbjct: 358 QALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVLFGE 417
Query: 535 YNPSGRLPMTWYPQSY 550
+NPSG+LP+TWYP+ +
Sbjct: 418 FNPSGKLPVTWYPEEF 433
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 225/542 (41%), Positives = 309/542 (57%), Gaps = 73/542 (13%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FCN SL +D R+ D++ RL++Q+K+ ++AG+ + G+P Y+WWSEALHGV + PG
Sbjct: 926 FCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEALHGVGF-SPGV 984
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
F V ATSFPQVI T+ASFN +L+ IG +STEARAM NV AGLTFW+PNINI R
Sbjct: 985 TFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGLTFWAPNINIIR 1044
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGRGQETPGEDP YA +V G+Q+ G +K ++CCKH+ Y+L++W
Sbjct: 1045 DPRWGRGQETPGEDPYATGLYAANFVPGMQE--GEDTRYIKASSCCKHFFDYNLEDWHNV 1102
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
DR+HFNA+ I YL F+
Sbjct: 1103 DRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVPSCANADIMTTLAREA 1162
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
YI SDC +V+ +Y++ Y T + AG+D++CGSFL +H A+ +G V
Sbjct: 1163 WGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGVLSAGMDVDCGSFLSQHLADAIDSGDV 1222
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ +D A+ N F RLG FD P++ QPY L V T +Q LAL+AARQG+ LL
Sbjct: 1223 TNATVDQALYNLFRVQFRLGMFD--PAEDQPYLNLTTDAVNTPEHQQLALEAARQGMTLL 1280
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGC 425
+N LPL ++IK LA+IGPNAN T M GNY G +P QG+ V+ +
Sbjct: 1281 ENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQGVQQYVSNVA---- 1336
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
++ G A AA +AD V+V+G DQ+ E+E HDR + LPG Q L+ +VA +
Sbjct: 1337 --LELG------AVTAAKAADTVVMVIGLDQTQESEGHDREIIALPGMQAELVAQVANAS 1388
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
P+++++M+GG D++ K+ + GQAGG A+A+ FG NP GRLP T
Sbjct: 1389 SSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAETLFGDNNPGGRLPYTL 1439
Query: 546 YP 547
YP
Sbjct: 1440 YP 1441
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 238/552 (43%), Positives = 301/552 (54%), Gaps = 63/552 (11%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+CN L RV DL+ RLTL+EKI+ L +SA ++ RLGIP Y+WWSE LHGV+ V PG
Sbjct: 37 YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVA-VSPGL 95
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
H + TSFPQ+I TA+SFN SLF IG+ VSTEAR + G GLT+++PNINI R
Sbjct: 96 HLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTPNINIVR 155
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGRGQET GEDP L S+YA V+G Q G K+ A CKH+ AYDL+++
Sbjct: 156 DPRWGRGQETAGEDPYLTSQYAVNLVRGAQ---GNDSEYKKIIATCKHFAAYDLESYING 212
Query: 233 D-RYHFNAMV----IYTYYLIKFK------------------------------------ 251
D R FNA V + Y F+
Sbjct: 213 DVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFNNKIARN 272
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
Y+VSDC ++D + N HYT TP + A + G DLNCGSF H A G
Sbjct: 273 KWKFDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKGGTDLNCGSFYQTHAMDAFLNGS 332
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E ID A+ F MRLG FD P QPY V T+ +QDLAL AAR+ IVLL
Sbjct: 333 ITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQDLALQAARESIVLL 391
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV-VATIYQAG 424
+N G LPLS +AV+GPN TM G + +P+ G + + Y G
Sbjct: 392 QNN-GKLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDGFKSKGLHVTYSLG 450
Query: 425 CSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
C +V+C T DA K A A V VMG DQ IE E+ DR D+ LPG Q + +
Sbjct: 451 C-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDKFLLGLRD 509
Query: 484 VAKG-----PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
P+I++IMSG D+S +K+ +ILWVGYPGQ+GG AIA+V +G NPS
Sbjct: 510 TLTNLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSGGQAIAEVIYGEVNPS 567
Query: 539 GRLPMTWYPQSY 550
GRLP+T+YP Y
Sbjct: 568 GRLPLTFYPGEY 579
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/445 (44%), Positives = 278/445 (62%), Gaps = 48/445 (10%)
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
M+N G AGLT+W+PNINIFRDPRWGRGQET GEDP +A+ Y+ YVKG Q +G R+
Sbjct: 1 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEE-GRI 59
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
+++ACCKHYTAYD++ W+G RY FNA V + Y FK
Sbjct: 60 RLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQ 119
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
YI SDCD+V +++ +Q YTK+ E++ A + AG+D+
Sbjct: 120 VNGVPMCAHKDLLQKTRDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDI 179
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
NCGSFL +HT++A++ G + E ID A+ N F+ +RLG FD + Q + QLGP VCT
Sbjct: 180 NCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCT 239
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+ +++LA +A RQG VLLKN LPL + ++++A+IGP+AN M G+Y G PC T
Sbjct: 240 KEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPT 299
Query: 408 TPLQGLAA-VVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQSIEAESHDR 465
T L+G+ A T + GC + C + + +A +AA AD V++ G + + E E DR
Sbjct: 300 TFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDR 359
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ LLLPG+Q LI +A VAK P++L+++ GG D+SFAK DP+I SILW+GYPG+ GG
Sbjct: 360 VSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQ 419
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
+ ++ FG YNP G+LP+TWYP+S+
Sbjct: 420 VLPEILFGEYNPGGKLPITWYPESF 444
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 227/563 (40%), Positives = 305/563 (54%), Gaps = 69/563 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
S +CN SL I RV DL+ R+TL EKIT L ++AGS+ RL IP Y+WWSE LHGV+
Sbjct: 29 FQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVA- 87
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS-- 165
PG HF+ + ATSFPQVI TA+SFN +L+ I V+STEARA N G+
Sbjct: 88 DSPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFANQGIVYFKQHQQL 147
Query: 166 -PNINIF--RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
N +F RDPRWGR QETPGEDP L S+YA +V G Q G LKV CKH+
Sbjct: 148 LSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQ----GDSKYLKVVTTCKHFA 203
Query: 223 AYDLDNW-KGTDRYHFNAMVI-----YTYYLIKFK------------------------- 251
YDL+++ G R+ FNA + TYY FK
Sbjct: 204 GYDLEDYVDGETRHSFNAKITPQDFEETYY-PAFKACVEEANVASIMCSYNEVNGVPSCA 262
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
+I SDC ++D + N HYT ++ A ++ G DLNCGS+
Sbjct: 263 DGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNCGSYYQ 322
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
H ++A G + I+ A++ F M+LG FD P QPY + P V + +Q LA
Sbjct: 323 SHAQSAFLNGTITIGEINLALTRLFTARMKLGMFDP-PELQPYNAISPDVVNSLEHQALA 381
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L+AAR+ IVLL+N LPL+ +AV+GP+A T M GNY G +P++G
Sbjct: 382 LNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISPVEGFE 441
Query: 415 AV-VATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
+ + ++ A +V C T DA A ADA + V+G DQS E+E HDR DL LP
Sbjct: 442 NLGIDSVLTASGCDVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDREDLFLPN 501
Query: 473 QQQLLITEVAKVAKG-----PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
Q + ++ K P+I+++MSG D++ K +ILW GYPGQ+GG AI
Sbjct: 502 LQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTKKHAD--AILWAGYPGQSGGQAI 559
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
A++ +G+ NPSGRLP+T+YP SY
Sbjct: 560 AEIIYGKVNPSGRLPVTFYPGSY 582
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 201/452 (44%), Positives = 273/452 (60%), Gaps = 59/452 (13%)
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG----- 207
MYN+G AGLTFWSPNIN+ RD RWGR QET EDP + ++A YV+GLQ +G
Sbjct: 1 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60
Query: 208 -SPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
+ LKV++CCKHY AYD+D+W DR+ F+A V + ++ F+
Sbjct: 61 LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPE-EAAAK 279
YIVSDC ++V+ ++Q+Y + +A AK
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
++ AGLDL CG + V G V++ +D A+ N + LMR+G+FDG P+ Y
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YES 237
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
LG KD+C + +LA +AARQGIVLLKN PL P K LA++GP+AN T+ MIGNY
Sbjct: 238 LGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--KKLALVGPHANATEVMIGNY 295
Query: 400 EGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMGADQSI 458
G P KY +PL+ +A+ Y GC + C +AK+AA SA+ T++ +G D SI
Sbjct: 296 AGLPRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSI 355
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
EAE DR+D LLPG Q LI +VA+V+ GPVIL+++SG DI+FAKN+P+I++ILWVG+
Sbjct: 356 EAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGF 415
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PG+ GG AIADV FG+YNP GRLP+TWY Y
Sbjct: 416 PGEQGGHAIADVVFGKYNPGGRLPVTWYEADY 447
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 220/535 (41%), Positives = 292/535 (54%), Gaps = 62/535 (11%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV LTL EK++ + +A V RL IPKY+WWSEALHGV+ PG F P AT
Sbjct: 4 RARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-ASPGVVFQEPTPFAT 62
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQET 182
+FPQV LTA SF+ LF I ++STEAR M N A LT+WSPN+N++RDPRWGRGQET
Sbjct: 63 AFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQET 122
Query: 183 PGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV- 241
PGEDP L + YA +V+GLQ+ G P LKV+ACCKHY+AYDL+NW G +R+ F+A+V
Sbjct: 123 PGEDPFLVATYAVEFVRGLQE--GEDPRYLKVSACCKHYSAYDLENWHGVERFEFDAIVS 180
Query: 242 ---IYTYYLIKFK------------------------------------------YIVSD 256
+ + + F+ YI SD
Sbjct: 181 DRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFEGYITSD 240
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
C ++D + + HYT + A + A DL+CG F +H +V++G + E+ +D A++
Sbjct: 241 CGAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGFYQQHILHSVESGRLKEAEVDDALA 300
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
N F MRLG FD +Q Y G + T+ +Q +AL AAR+GI LLKN LPLS
Sbjct: 301 NLFKVQMRLGLFDP-VEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQNDFLPLS- 358
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
K++ V+GP A M+GNY G P T QGL V C +V +
Sbjct: 359 LKDKHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQGLRNV--------CDHVD--VVKSL 408
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A D V+ +G +Q IE E DR DLLLP Q+ L+ + PV+L ++SG
Sbjct: 409 EALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPVVLTLLSG 468
Query: 497 GG-FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG DIS + + + +L VGY G GG AIA+V G NPSGRL T Y Y
Sbjct: 469 GGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYNDY 523
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 216/549 (39%), Positives = 300/549 (54%), Gaps = 56/549 (10%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
CNT+ R L+ T +E + V+ + V RLG+P Y+WWSEALHGV+ PG H
Sbjct: 37 CNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVH 96
Query: 114 FSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
F+ ATSFPQ IL A+F+ L + V+STEARA N G +G+ F++PNIN
Sbjct: 97 FAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFSGIDFFTPNINP 156
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
FRDPRWGRGQETPGEDPL S+Y V LQ G SP K+ A CKH+ YDL++W+
Sbjct: 157 FRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGPSP-YYKIVADCKHFAGYDLESWE 215
Query: 231 GTDRYHFNAMV-------IYT--------------------------------------- 244
G DR+HF+A++ YT
Sbjct: 216 GIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNGVPACASSYLLQDIVR 275
Query: 245 -YYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
+Y + +I SDCD+V ++ + ++T T A+A S+ AG D++CG+ + A+
Sbjct: 276 DFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAISLKAGTDVDCGNVYAQSLGDALDQ 335
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
GLV E + A+ + +L+R G+FD P +QP+ QLG DV T A++ LAL AA +GIV
Sbjct: 336 GLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTPASRRLALLAAEEGIV 394
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVVATIY 421
LLKN G LPLS + N+ ++GP N T M GNY G +P QG A T +
Sbjct: 395 LLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVSPRQGFVDAGFNVTFF 453
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
T+ D+A AA D V V G D +E ES DR+++ PG Q LI E+
Sbjct: 454 NGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESRDRINITWPGVQLDLIKEL 513
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
A V K P+I++ M G D ++ K I +++W GYPGQ+GG A+A++ G+ P+ RL
Sbjct: 514 AGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSGGTALANIVTGKTAPAARL 572
Query: 542 PMTWYPQSY 550
P+T YP+ Y
Sbjct: 573 PITQYPEDY 581
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 222/544 (40%), Positives = 301/544 (55%), Gaps = 63/544 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+A +C+ SL RV DLV R+TL+EK+ L D AG R+G+P+Y
Sbjct: 59 MAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY------------ 106
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA------------MYN 155
G G + A A +VV +A A MYN
Sbjct: 107 CGGGRRCTACPTSARRDVVWRRRARRHQLPARHQQRRVVQRDAVARHRRRGVDGDQGMYN 166
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS------- 208
+G A LT+WSPNIN+ RDPRWGR ETPGEDP + +YA +V+G+Q DG +
Sbjct: 167 LGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASAAA 226
Query: 209 -----PNRLKVAACCKHYTAYDLDNWKG-------TDRYHFNAMVIYTYYLIKFKYIVSD 256
+KV++CCKHY A + ++ D V + L YIVSD
Sbjct: 227 ATDAFSRPIKVSSCCKHYAACVMCSYNRINGVPACADARLLTETVRRDWQL--HGYIVSD 284
Query: 257 CDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCG-------SFLGKHTEAAVKAGLVNE 308
CDSV V+ + T EA A ++ AGLDL+CG F + AV+ G + E
Sbjct: 285 CDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKE 344
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
SA+D+A++N + TLMRLGFFDG P + LG DVCT+ +++LA DAARQG+VLLKN
Sbjct: 345 SAVDNALTNLYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKND 401
Query: 369 AGSLPLSPTAIKNLAVIGP--NANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCS 426
A LPLSP + ++A+ G + N T M+G+Y G PC+ TP G+ VV++ C
Sbjct: 402 AALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACD 461
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK 486
C D A AA + DAT++V G + S+E ES+DR DLLLP Q I VA+ +
Sbjct: 462 KGSC-----DTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASP 516
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P++L+IMS GG D+SFA+++PKI +++W GYPG+ GG AIADV FG+YNP GRLP+TWY
Sbjct: 517 SPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWY 576
Query: 547 PQSY 550
Y
Sbjct: 577 KNEY 580
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 225/560 (40%), Positives = 300/560 (53%), Gaps = 61/560 (10%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L + CNTS R LV TL+EKI + V RLGIP Y+WWSE LHG
Sbjct: 26 NGPLQTETICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEGLHG 85
Query: 105 VSYVGPGTHFSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
++ GP T+FS +TSFPQ IL A+F+ L + KV+STEARA N GL
Sbjct: 86 IA--GPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRTGL 143
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
FW+PNIN FRDPRWGRGQETPGED S Y + GLQ + P + +V A CKH+
Sbjct: 144 DFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQ-GETTDPYK-RVVATCKHF 201
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
YD+++W G RY F+A + + YYL F+
Sbjct: 202 AGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACVQANVGAFMCSYNAVNGVPTCADP 261
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
++ SDCD+V +Y ++ T E+A A +++AG DL+CG+++
Sbjct: 262 YLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDLDCGTYM 321
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+H A GLVNE+ +D A+ +++L+RLG+FD + QPY Q G V T A+Q L
Sbjct: 322 QEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVATDASQAL 380
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
A AA +GIVLLKN G LPLS + +L V G AN T ++GNY G P +PL L
Sbjct: 381 ARRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLWAL 439
Query: 414 AAVVATIYQAGCSNVQCG---TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
TI AG + G T + A A++D + + G D SIE E HDR L
Sbjct: 440 QQENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHDRTSLAW 499
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
G Q +I ++A K P I+++M GG D + N+ I++ILW GYPGQ GG AI D+
Sbjct: 500 TGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIVDI 558
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G+ P+GRLP T YP SY
Sbjct: 559 LTGKSPPAGRLPQTQYPASY 578
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 230/561 (40%), Positives = 312/561 (55%), Gaps = 58/561 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P LAS CNTS R LV +TL+E + V+++ V RLG+P YEWWSE
Sbjct: 24 DCVNGP-LASNLVCNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSE 82
Query: 101 ALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F S GATSFP+ IL +A+F+ L ++ +STEARA N
Sbjct: 83 ALHGVA-SSPGVTFETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNH 141
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL F++PNIN F+DPRWGRGQETPGEDPL S+Y + GLQ G SP K+ A C
Sbjct: 142 SGLDFFTPNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGPSP-YYKIIADC 200
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYDL+NW+G +R FNA+V + +Y F+
Sbjct: 201 KHFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPA 260
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
+I SDCD+V +++ +YT T A+A ++LAG D++CG
Sbjct: 261 CGSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDVDCG 320
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
+ + AV GLV++S ++ A+ + +L+RLG+FD S PY LG DV T A
Sbjct: 321 TSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSV-PYRALGASDVNTPAA 379
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
Q LA AA +GIVLLKN G LPLS + + ++A+IGP AN T M GNYEG +PL
Sbjct: 380 QTLAYTAAVEGIVLLKND-GLLPLS-SNVSHIALIGPWANATTQMQGNYEGIAPLLISPL 437
Query: 411 QGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
G + + + G + T+ DA A++AD V + G D ++EAE DR +
Sbjct: 438 DGFTSAGFNVSFTNGTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSIT 497
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
PG Q LI E+ K P ++I M GG D + K + + ++LW GYPGQAGG A+AD
Sbjct: 498 WPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALAD 556
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
+ G P+GRL T YP SY
Sbjct: 557 IITGVQAPAGRLTTTQYPASY 577
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 218/564 (38%), Positives = 317/564 (56%), Gaps = 62/564 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P +L C++S R LVK LTL+EK+ +++ V RLGIP+Y+WW+E
Sbjct: 29 DCVNGPLKDNL-VCDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNE 87
Query: 101 ALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ PG S ATSFPQ IL A+F+ L + VVSTEARA N G
Sbjct: 88 ALHGVAFTYPGQPMTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGR 147
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGSPNRLKVAA 216
+GL +W+PNIN ++DPRWGRGQETPGEDP + Y ++GL+ Q D P + K+ A
Sbjct: 148 SGLDYWTPNINPYKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQND---PYK-KIVA 203
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------- 251
CKH+T YD++NW G RY F+A + + YY+ F+
Sbjct: 204 TCKHFTGYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGV 263
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
++VSDCD++ +Y + ++ E+A A ++ AG DL
Sbjct: 264 PTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDL 323
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
NCG++ ++ A + GL+N++ +D A++ +++L++LG+FD S QPY Q+G +DV +
Sbjct: 324 NCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADS-QPYRQIGWQDVNS 382
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
Q Q+LAL AA++GIVLLKN G LPLS + ++A+IG AN T+ M GNY G
Sbjct: 383 QHAQELALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLH 441
Query: 408 TPLQGLAAV-VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
+PL + V Y G S T Q AA ++D ++V G D IE+E DR+
Sbjct: 442 SPLYAAEQLGVKVNYAEGASQSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRV 501
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ G Q +IT++A K PVI++ M G D + ++ I+++LW GYPGQ GG A
Sbjct: 502 AIAWSGPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTA 560
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G P+GRLP+T YP Y
Sbjct: 561 LFDIITGAVAPAGRLPITQYPARY 584
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 241/564 (42%), Positives = 314/564 (55%), Gaps = 65/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ L C+TSL + R LV +T +EK+ ++ V RLG+P Y WWSE
Sbjct: 31 DCSSGP-LSKLAVCDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSE 89
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F++ P ATSFPQ IL A+F+ L + + VVSTE RA N G
Sbjct: 90 ALHGVA-GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGR 148
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
AGL FW+PNIN FRD RWGRGQETPGEDPL S+Y V GLQ +G P KV A C
Sbjct: 149 AGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQ--NGIGPANPKVVATC 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYDL++W G R+ FNA V + +YL FK
Sbjct: 207 KHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGVPA 266
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+I DC ++D +YN +YTKTP EAAA ++ AG DL+C
Sbjct: 267 CADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDLDC 326
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQ 348
G+ K+ A GL +D A+ +++L++LG+FD P++ QPY +G KDV +
Sbjct: 327 GTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFD--PAEDQPYRSIGWKDVDSP 384
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A + LA AA +GIVLLKN +LPL A LA+IGP AN TK M GNYEG P KY
Sbjct: 385 AAEALAHKAAVEGIVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEGPP-KYIR 440
Query: 409 PLQGLA--AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
L A A Y AG + TA D A AA AD V G D +IEAE HDR
Sbjct: 441 TLLWAATQAGYDVKYVAGTAINANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGHDRT 500
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
++ PG Q LI +++K+ K P++++ GG D S ++P + ++LW GYP Q GG+A
Sbjct: 501 TIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGGSA 559
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I D+ G+ P+GRLP+T YP Y
Sbjct: 560 IFDILTGKTAPAGRLPVTQYPADY 583
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/270 (70%), Positives = 223/270 (82%), Gaps = 3/270 (1%)
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AGLDLNCG+FL +HT AAV+AG ++ES +D A++NN TLMRLGFFDG P + P+G LGP
Sbjct: 31 AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
DVCT +NQ+LA +AARQGIVLLKNT G LPLS T+IK++AVIGPNAN + TMIGNYEGT
Sbjct: 91 SDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGT 149
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAASADATVLVMGADQSIEA 460
PCKYTTPLQGL A VAT+YQ GC+NV C + Q+D A KAAASAD TVLV+GADQSIE
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
ES DR LLLPGQQ L++ VA + GP IL++MSGG FDISFAK+ KI +ILWVGYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+AGGAAIADV FG +NPSGRLP+TWYP+S+
Sbjct: 270 EAGGAAIADVLFGYHNPSGRLPVTWYPESF 299
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/300 (62%), Positives = 232/300 (77%), Gaps = 1/300 (0%)
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
YIVSDCDSV VLY+SQH+T TPEEAAA +I AGLDL+CG FL HT AV GL+ E +
Sbjct: 53 YIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDL 112
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
++A++N + MRLG FDG P+ QPYG LGPKDVCT A++ LAL+AARQGIVLL+N AG+
Sbjct: 113 NNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGA 172
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC- 430
LPLSPT + +AVIGPN++ T TMIGNY G C+YTTP+QG++ V TI+ GC+NV C
Sbjct: 173 LPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACV 232
Query: 431 GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
G + +A+ AA ADA V+V+G DQSIEAES DR +LLPG+Q+ L+ + KGP +
Sbjct: 233 GDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTV 292
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+++MSGG D+SFAKND KI+ ILWVGYPGQAGGAAIADV FG NP G+LPMTWYPQSY
Sbjct: 293 VVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSY 352
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 235/563 (41%), Positives = 309/563 (54%), Gaps = 63/563 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ L C+TS + R LV ++ EK+ A V RLG+P Y WWSE
Sbjct: 31 DCTSGP-LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSE 89
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG HF++ P ATSF Q IL ASF+ L + + VV TE RA N G
Sbjct: 90 ALHGVA-GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGR 148
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
AGL +W+PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ G P R ++AA C
Sbjct: 149 AGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--GGIGPARPQIAATC 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYD+++W G R+ F+A V + +YL FK
Sbjct: 207 KHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPT 266
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
++VSDC ++D +Y +YTKT EAAA ++ AG DL+C
Sbjct: 267 CADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDC 326
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G+ KH A + GL +D A+ +++L++LG+FD KQPYG +G KDV T A
Sbjct: 327 GTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPA 385
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+ LA AA +GIVLLKN +LPL A LA+IGP AN TK M GNY+G P KY
Sbjct: 386 AEQLAHKAAVEGIVLLKNDQ-TLPLK--AKGTLALIGPYANATKQMQGNYQGPP-KYIRT 441
Query: 410 LQGLAAV--VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
L+ A Y G + TA A AA AD + G D +IE+E+ DR
Sbjct: 442 LEWAATQHGYQVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTT 501
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ PG Q LI+E++ + K P+I+I GG D + +P + ++LW GYP Q GGAAI
Sbjct: 502 ITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAI 560
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
D+ G+ P+GRLP+T YP +Y
Sbjct: 561 FDILTGKAAPAGRLPITQYPAAY 583
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 235/563 (41%), Positives = 309/563 (54%), Gaps = 63/563 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ L C+TS + R LV ++ EK+ A V RLG+P Y WWSE
Sbjct: 63 DCTSGP-LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSE 121
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG HF++ P ATSF Q IL ASF+ L + + VV TE RA N G
Sbjct: 122 ALHGVA-GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGR 180
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
AGL +W+PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ G P R ++AA C
Sbjct: 181 AGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--GGIGPARPQIAATC 238
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYD+++W G R+ F+A V + +YL FK
Sbjct: 239 KHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPT 298
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
++VSDC ++D +Y +YTKT EAAA ++ AG DL+C
Sbjct: 299 CADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDC 358
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G+ KH A + GL +D A+ +++L++LG+FD KQPYG +G KDV T A
Sbjct: 359 GTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPA 417
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+ LA AA +GIVLLKN +LPL A LA+IGP AN TK M GNY+G P KY
Sbjct: 418 AEQLAHKAAVEGIVLLKNDQ-TLPLK--AKGTLALIGPYANATKQMQGNYQGPP-KYIRT 473
Query: 410 LQGLAAV--VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
L+ A Y G + TA A AA AD + G D +IE+E+ DR
Sbjct: 474 LEWAATQHGYQVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTT 533
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ PG Q LI+E++ + K P+I+I GG D + +P + ++LW GYP Q GGAAI
Sbjct: 534 ITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAI 592
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
D+ G+ P+GRLP+T YP +Y
Sbjct: 593 FDILTGKAAPAGRLPITQYPAAY 615
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/548 (40%), Positives = 308/548 (56%), Gaps = 57/548 (10%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
C+TSL R LV+ T++E I V+ + V RLG+P Y+WWSE LHGV+ PG +
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVA-DSPGVN 95
Query: 114 FSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
FS P ATSFPQ I+ +A+F+ +L +A+G VV E R+ N G AGL FW+PNIN F
Sbjct: 96 FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
+DPRWGRGQETPGEDP ++Y ++GLQ P +V A CKH+ YDL++W+
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEP-YFQVVATCKHFAGYDLEDWEN 214
Query: 232 TDRYHFNAMV----IYTYYLIKF------------------------------------- 250
RY F+A++ + +YL F
Sbjct: 215 NFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYLLQDILRD 274
Query: 251 -------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
+++ SDCD+V+ +YN +YT P++AAA ++ AG DL+CG+F ++ A
Sbjct: 275 YWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDLDCGTFYTEYLPLAYNQ 334
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
L+ E+ + A++ +A+L+RLG+FD ++QPY Q G +V T Q LA AA +GI
Sbjct: 335 SLITETELRAALTRQYASLVRLGYFD-PAAQQPYRQYGWSNVDTPYAQQLAYTAATEGIT 393
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG-LAAVVATIYQ 422
LLKN G+LPL P+ +KN+A+IGP AN T M GNY G +PLQG LAA Y
Sbjct: 394 LLKND-GTLPL-PSTLKNIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAAGYNVTYV 451
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
G + TA A AA ADA V G D ++EAE+ DR ++ PG Q LI E+A
Sbjct: 452 FGTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGNQLQLIGELA 511
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
+ K P ++ GG D + K + + S++W GYPGQ+GG A+ D+ G+ P+GRL
Sbjct: 512 ALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISGKVAPAGRLV 570
Query: 543 MTWYPQSY 550
T YP Y
Sbjct: 571 TTQYPADY 578
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 369 bits (946), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 236/564 (41%), Positives = 311/564 (55%), Gaps = 65/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ L C+TSL + R LV +T +EK+ ++ V RLG+P Y WWSE
Sbjct: 31 DCSSGP-LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSE 89
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F++ P ATSFPQ IL A+F+ L + + VVSTE RA N G
Sbjct: 90 ALHGVA-GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGR 148
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL FW+PNIN FRD RWGRGQETPGEDPL S+Y V GLQ +G P KV A C
Sbjct: 149 SGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQ--NGIGPANPKVVATC 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AY L++W G R+ FNA V + +YL FK
Sbjct: 207 KHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPA 266
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+I SDC ++D +YN ++T TP EAAA ++ AG DL+C
Sbjct: 267 CADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDC 326
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQ 348
G+ K+ A GL + +D A+ +++ ++LG+FD P++ QPY +G DV T
Sbjct: 327 GTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFD--PAEDQPYRSIGWTDVDTP 384
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A + LA AA +GIVLLKN +LPL A LA+IGP AN TK M GNYEG P KY
Sbjct: 385 AVEALAHKAAGEGIVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEG-PAKYIR 440
Query: 409 PLQGLA--AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
L A A Y AG + TA D A AA AD V G D +IEAE DR
Sbjct: 441 TLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRT 500
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI +++K+ K P++++ GG D S ++P++ ++LW GYP Q GG+A
Sbjct: 501 TIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSA 559
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I D+ G+ P+GRLP+T YP Y
Sbjct: 560 IFDILTGKTAPAGRLPVTQYPADY 583
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 367 bits (943), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 224/591 (37%), Positives = 301/591 (50%), Gaps = 102/591 (17%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L L FCNTSL R DLV RLTL + I + A +V LGIP Y WW+E LHGV
Sbjct: 10 LKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGVL- 68
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
T P TSFP A+FN L + + +S EARA+ N G+ GL FW+PN
Sbjct: 69 ----TKCGTNCP--TSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGLDFWAPN 122
Query: 168 I-----------------------NIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT 204
I +I RDPRWGR E PGEDP + ++Y +++GLQ+
Sbjct: 123 IKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRGLQE- 181
Query: 205 DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------- 251
G +V CKH+ AY L+ WK DR+ F+A+V YL FK
Sbjct: 182 -GEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCIVEGRAR 240
Query: 252 ---------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
Y+VSDCD+VD +YN+ H+TKTPE A A
Sbjct: 241 SIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHFTKTPEGACA 300
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
++ AG DLNCG F KH A G V E + A+ F M LG +D P++QPY
Sbjct: 301 VALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWDP-PAEQPYK 359
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
Q P V ++ + DLAL AAR+ +VLL+N G LPL +++ +AVIGPNAN T+TM+GN
Sbjct: 360 QYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLR-KSVRRVAVIGPNANATETMLGN 418
Query: 399 YEGTPCK---YTTPLQGLAAVVATIYQA------GCSNVQCGTAQVDDAKKAAASADATV 449
Y G+ C Y + A+ A + QA GC T + +A KAA +AD +
Sbjct: 419 YYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGCDVDSTNTTGIPEAVKAAQAADVAI 478
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
+V+G + S+E+E DR+ + LPG Q LI + P ++++M GG I + K+ +
Sbjct: 479 VVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHGGAVAIEWIKD--Q 535
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPS----------GRLPMTWYPQSY 550
+ I+ YPG+ GG AIADV FG YNP GRLP+T P +Y
Sbjct: 536 VDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANY 586
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 367 bits (942), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 229/564 (40%), Positives = 319/564 (56%), Gaps = 64/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P +L CN S R LV LTL+E + V+++ V RLG+P Y WWSE
Sbjct: 26 DCVNGPLKDNL-VCNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSE 84
Query: 101 ALHGVSYVGPGTHFSNVVPG-----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
ALHGV+ + PGT+FS VPG ATSFPQ I+ A+F+ L +I V+STEARA N
Sbjct: 85 ALHGVA-LSPGTNFS--VPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNN 141
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN-RLKV 214
G AGL F++PNIN F+DPRWGRGQETPGEDP ++Y V GLQ G SP+ KV
Sbjct: 142 AGRAGLDFFTPNINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQ--GGLSPDPYYKV 199
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
A CKH+ YDL+NW+G R FNA++ + YY F+
Sbjct: 200 IADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVN 259
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
+I SDCD+V +++ YT T A+A ++ AG D+
Sbjct: 260 GIPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDV 319
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
+CG+ + AV LV E I +++ + +L+RLG+FD P++QP+ QLG DV T
Sbjct: 320 DCGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNT 378
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
++Q LAL AA +G+ LLKN G+LPLS +AIK +A++GP AN T M GNY+G
Sbjct: 379 PSSQALALTAAEEGVTLLKND-GTLPLS-SAIKRIALVGPWANATTQMQGNYQGIAPFLV 436
Query: 408 TPLQGLA-AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
+PLQ L A + G + + A A ADA + G D++IE+E +DR
Sbjct: 437 SPLQALQDAGFQVTFANGTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDRE 496
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q L++++A V K P +++ M GG D S K++ + +++W GYPGQ+GGAA
Sbjct: 497 IITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAA 555
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I ++ G+ P+GRLP+T YP Y
Sbjct: 556 IVNILTGKIAPAGRLPITQYPADY 579
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 367 bits (942), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 174/229 (75%), Positives = 197/229 (86%), Gaps = 1/229 (0%)
Query: 323 MRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNL 382
MRLGFFDG+P QPYG LGPKDVCT N++LA++ ARQGIVLLKN+AGSLPLSP+AIK L
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQAGCSNVQCGTAQVDDAKKA 441
AVIGPNANVTKTMIGNYEG CKYTTPLQGL V+ T Y GC NV C A +D AK
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120
Query: 442 AASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDI 501
AASADATVLVMGADQ+IE E+ DR+DL LPG+QQ L+T+VAK A+GPV+L+IMSGGGFDI
Sbjct: 121 AASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFDI 180
Query: 502 SFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+FAKND KITSI+WVGYPG+AGG AIADV FGR+NPSG+LPMTWYPQSY
Sbjct: 181 TFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSY 229
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 221/561 (39%), Positives = 297/561 (52%), Gaps = 65/561 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L + C+ S R LV TL+EKI +SA V+RLG+P Y+WW+E LHG
Sbjct: 28 NGPLKNETICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHG 87
Query: 105 VSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
++ GP T F+ +TSFPQ IL A+F+ L + KV+STEARA N GL
Sbjct: 88 IA--GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLD 145
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PNIN FRDPRWGRGQETPGED S Y + GLQ + P R +V A CKHY
Sbjct: 146 FWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQ-GNATDPYR-RVVATCKHYA 203
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
YD++NW G RY + + + YYL F+
Sbjct: 204 GYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACVQANVGAFMCSYNAVNGAPPCADPY 263
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
++ SDCD++ +Y ++ T E AAA S+ AG DL+CG++L
Sbjct: 264 LLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQ 323
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
H AVK GL +E+ +D A+ +++L++LG+FD P QPY QLG V T A+Q LA
Sbjct: 324 THLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFDA-PENQPYRQLGFDAVATSASQALA 382
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L AA +GIVLLKN G LP++ K + + G AN T + GNY G T+PL L
Sbjct: 383 LKAAEEGIVLLKND-GVLPIN-LGSKQVGIYGDWANATSQLQGNYFGVAKFLTSPLMALQ 440
Query: 415 AVVATIYQAGCSNVQCG-----TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
+ + AG N+ G T ++D + V G D +E+E DR L
Sbjct: 441 NLGVDVKYAG--NLPGGQGDPTTGAWSSLSGVITTSDVHIWVGGIDNGVESEDRDRSWLT 498
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
L G Q +I ++A K PVI++IM GG D S +PKI+++LW GYPGQ GG AI +
Sbjct: 499 LTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDGGTAIVN 557
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
+ G+ P+GRLP T YP Y
Sbjct: 558 ILTGKAAPAGRLPQTQYPSKY 578
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 223/545 (40%), Positives = 294/545 (53%), Gaps = 69/545 (12%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPG 120
R LV TL+EKI +SA V+RLGIP Y+WW+E LHG++ GP T F+
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
+TSFPQ IL A+F+ +L + V+STEARA NV GL FW+PNIN FRDPRWGRGQ
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231
Query: 181 ETPGEDPLLASKYATGYVKGLQ--QTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
ETPGED S Y + GLQ +TD P R +V A CKHY YD++NW G RY +
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQGNETD---PYR-RVVATCKHYAGYDIENWNGNLRYQND 287
Query: 239 AMV----IYTYYLIKFK------------------------------------------- 251
+ + YYL F+
Sbjct: 288 VQISQQDLVEYYLAPFEACVQANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWGWSSDE 347
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++ SDCDS+ +Y ++ T E AAA S+ AG DL+CG++L H AVK GL NE+
Sbjct: 348 HWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQGLTNETT 407
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D+A+ +++L++LG+FD P QPY QLG V T A+Q LAL AA +GIVLLKN G
Sbjct: 408 LDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVATSASQALALKAAEEGIVLLKND-G 465
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC 430
LP++ KN+ + G AN T + GNY G T+P L + + AG N+
Sbjct: 466 VLPIN-FGSKNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAG--NLPG 522
Query: 431 G-----TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
G T ++D + V G D IE+E DR L L G Q +I ++A
Sbjct: 523 GQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQLADTG 582
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PVI+IIM GG D S +PKI+++LW GYPGQ GG AI ++ G+ P+GRLP T
Sbjct: 583 K-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQ 641
Query: 546 YPQSY 550
Y Y
Sbjct: 642 YLYKY 646
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 221/564 (39%), Positives = 305/564 (54%), Gaps = 57/564 (10%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
F+ +N L S C+++ R L++ T E I +++ V RLG+P YEW
Sbjct: 22 FSFPDCTNGPLKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEW 81
Query: 98 WSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
WSEALHGV + PG F S ATSFPQ I+ A+F+ L +A+ VVSTEARA N
Sbjct: 82 WSEALHGVGH-SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNN 140
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
G AGL +++PNIN F+DPRWGRGQETPGEDP S+Y V GLQ P +KVA
Sbjct: 141 FGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWP-YIKVA 199
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF--------------------- 250
A CKH+ AYDL+NW+G DR+HF+A V + YYL F
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259
Query: 251 -----------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
+++ SDC ++D +++S ++T++ EAAA S+ AG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
+CGS H AA+ L++ + A + +L+RLG+FD S Q Y Q DV T
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDS-QTYRQFDWSDVNT 378
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
Q L+ AA +G+VLLKN G LPL+P K +A+IGP N T +M GNY G T
Sbjct: 379 PEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPIIT 436
Query: 408 TPLQGLAAVVATIYQAGCSNVQ-CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
+P QG V + A + V +A +A A +AD V V G D ++E E DR
Sbjct: 437 SPFQGAQDVGFKVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRS 496
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q L+ ++A + K P+I++ GG D + + K+ +I+W GYPGQ+GG A
Sbjct: 497 SISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTA 555
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I D+ G P+GRLP+T YP Y
Sbjct: 556 IFDIIVGSTAPAGRLPVTQYPADY 579
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 364 bits (935), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 221/567 (38%), Positives = 295/567 (52%), Gaps = 74/567 (13%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L + C+T+ R L+ TL EKI ++ V RLG+ YEWW EALH
Sbjct: 29 TNGPLKNNTVCDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALH 88
Query: 104 GVSYVGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
GV+ PG +FS V G ATSFPQ IL A+F+ L + V+STEARA N A
Sbjct: 89 GVA-SSPGVNFS--VSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRA 145
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GL FW+PNIN F+DPRWGRGQETPGEDP S Y ++GLQ G +P+ KV A CK
Sbjct: 146 GLDFWTPNINPFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQ---GDNPSYKKVVATCK 202
Query: 220 HYTAYDLDNWKGTDRY----HFNAMVIYTYYLIKFK------------------------ 251
H+ AYD++NW G RY H N+ + YY+ F+
Sbjct: 203 HFVAYDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTC 262
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
++ SDCDSV ++ +Y + EEAAA S+ AG D+NCG
Sbjct: 263 ADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDINCG 322
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
++ +H A + GL+NE+ +D ++ + +L+RLG+FDG + PY L DV T
Sbjct: 323 TYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDG--DRVPYRNLTWNDVSTPYA 380
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
QDLAL AA GI LLKN G LPL T +A+IG AN T M+GNY G P + +PL
Sbjct: 381 QDLALKAATSGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPL 439
Query: 411 QGLAAVVATIYQAGCSNVQCGTAQVDDAK-------KAAASADATVLVMGADQSIEAESH 463
A + + VQ Q D AA +D + + G D+ +EAE
Sbjct: 440 WAAQQTGAEV-----TYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEK 494
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+ + G Q +I ++A P I++ M GG D S +P I ++LW GYPGQ G
Sbjct: 495 DRVSIAWSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDG 554
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AI D+ G P+GRLP+T Y Y
Sbjct: 555 GKAIFDILQGISAPAGRLPITQYRADY 581
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 238/564 (42%), Positives = 313/564 (55%), Gaps = 65/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ L C+TSL + R LV +T +EK+ ++ V RLG+P Y WWSE
Sbjct: 31 DCSSGP-LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSE 89
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F++ P ATSFPQ IL A+F+ L + + VVSTE RA N G
Sbjct: 90 ALHGVA-GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGR 148
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL FW+PNIN FRD RWGRGQETPGEDPL S+Y V GLQ +G P KV A C
Sbjct: 149 SGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQ--NGIGPANPKVVATC 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYDL++W G R+ FNA V + +YL FK
Sbjct: 207 KHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPA 266
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+I SDC ++D +YN ++T TP EAAA ++ AG DL+C
Sbjct: 267 CADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDC 326
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQ 348
G+ K+ A GL + +D A+ +++L++LG+FD P++ QPY +G DV T
Sbjct: 327 GTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFD--PAEDQPYRSIGWTDVDTP 384
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A + LA AA +GIVLLKN +LPL A LA+IGP AN TK M GNYEG P KY
Sbjct: 385 AAEALAHKAAGEGIVLLKNDK-TLPLK--AKGTLALIGPYANATKQMQGNYEG-PAKYIR 440
Query: 409 PLQGLA--AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
L A A Y AG + TA D A AA AD V G D +IEAE DR
Sbjct: 441 TLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRT 500
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI +++K+ K P++++ GG D S ++P++ ++LW GYP Q GG+A
Sbjct: 501 TIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSA 559
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I D+ G+ P+GRLP+T YP Y
Sbjct: 560 IFDILTGKTAPAGRLPVTQYPADY 583
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 216/561 (38%), Positives = 300/561 (53%), Gaps = 61/561 (10%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L ++ C+ S R LV TL+EKI A V RLG+P Y+WW+E LH
Sbjct: 27 TNGPLKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLH 86
Query: 104 GVSYVGPGTHFSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
G++ GP T+FS+ +TSFPQ IL A+F+ L + KV+STEARA N G
Sbjct: 87 GIA--GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTG 144
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L FW+PNIN FRDPRWGRGQETPGED S Y + GLQ + P + +V A CKH
Sbjct: 145 LDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQ-GEATDPYK-RVVATCKH 202
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
+ YD+++W G RY + + + YYL F+
Sbjct: 203 FAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACVQANVGAFMCSYNAVNGAPPCAD 262
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
++ DCD+V +Y ++ T AAA S++AG D+ CG++
Sbjct: 263 PYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDITCGTY 322
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ +H AA + L+NES++D A+ +++L+RLG+FD P QPY QLG V T A+Q
Sbjct: 323 MQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVATNASQA 381
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AA +GIVLLKN G+LPLS + + + G AN T ++GNY G +PL
Sbjct: 382 LARRAAAEGIVLLKND-GTLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHSPLYA 440
Query: 413 LAAVVATIYQAGCSNVQCG---TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
L I AG G T + + A +++D + V G D +E E HDR L
Sbjct: 441 LKQTGVKINYAGGKPGGQGDPTTNRWSNLYGAYSTSDVLIYVGGIDNGVEEEGHDRGYLT 500
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
G Q +I ++A+ K PVI+++ GG D S N+P I++I+W GYPGQ GG+AI D
Sbjct: 501 WTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIID 559
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
+ G+ P+GRLP T YP SY
Sbjct: 560 IISGKTAPAGRLPQTQYPASY 580
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 222/565 (39%), Positives = 306/565 (54%), Gaps = 59/565 (10%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
F+ +N L S C+++ R L++ T E I +++ V RLG+P YEW
Sbjct: 22 FSFPDCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEW 81
Query: 98 WSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
WSEALHGV + PG F S ATSFPQ I+ A+F+ L +A+ VVSTEARA N
Sbjct: 82 WSEALHGVGH-SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNN 140
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
G AGL +++PNIN F+DPRWGRGQETPGEDP S+Y V GLQ P +KVA
Sbjct: 141 FGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWP-YIKVA 199
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF--------------------- 250
A CKH+ AYDL+NW+G DR+HF+A V + YYL F
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259
Query: 251 -----------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
+++ SDC ++D +++S ++T++ EAAA S+ AG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVC 346
+CGS H AA+ L++ + A + +L+RLG+FD PS Q Y Q DV
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFD--PSHSQTYRQFDWSDVN 377
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T Q L+ AA +G+VLLKN G LPL+P K +A+IGP N T +M GNY G
Sbjct: 378 TPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPFI 435
Query: 407 TTPLQGLAAVVATIYQAGCSNVQ-CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
T+P QG V + A + V +A +A A +AD V V G D ++E E DR
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDR 495
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ PG Q L+ ++A + K P+I++ GG D + + K+ +I+W GYPGQ+GG
Sbjct: 496 SSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGT 554
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
AI D+ G P+GRLP+T YP Y
Sbjct: 555 AIFDIIVGATAPAGRLPVTQYPADY 579
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 228/570 (40%), Positives = 305/570 (53%), Gaps = 73/570 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P +L CNT+ R L++ L + + V+ + V RLG+P Y+WW+E
Sbjct: 27 DCVNGPLRDNL-VCNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNE 85
Query: 101 ALHGVSYV-------GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
ALHGV GP +F++ ATSFP IL A+FN SL +I V+STEARA
Sbjct: 86 ALHGVGSSPGVNWGSGPDANFTS----ATSFPAPILLGATFNDSLIASIADVISTEARAF 141
Query: 154 YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN-RL 212
N AGLTF++PNIN FRDPRWGRGQETPGEDP S+Y YV GLQ G SP+
Sbjct: 142 NNFNYAGLTFFTPNINPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQ--GGLSPDPYY 199
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
KV A CKH AYD++NW+G DR FNA+V + +Y F+
Sbjct: 200 KVLANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNA 259
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
+I DC +V +Y YT T A A ++ A
Sbjct: 260 VNGVPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDA 319
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G DL+CG + AV GL+ I A+ + +L+RLG+FD +QPY
Sbjct: 320 GTDLDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWS 378
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+V T ++QDLA +AA QGIVLL+N G LPLS T +KN+A+IGP AN T ++ GNY G
Sbjct: 379 NVNTPSSQDLAYNAAVQGIVLLEND-GLLPLS-TNVKNIALIGPMANATLSLQGNYAGIA 436
Query: 404 CKYTTPLQGLAAV---VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
+P Q V + G SN + +A +AA AD V V G D SIEA
Sbjct: 437 PFVISPQQAFETAGYNVTFAFGTGISNSD--NSGYSEALEAAQGADVVVFVGGIDNSIEA 494
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E DR + PG Q LI ++ ++ K P++++ M GG D S K + + ++LW GYPG
Sbjct: 495 EGQDRTSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPG 553
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
Q+GG A+ D+ G+ +PSGRLP+T YP SY
Sbjct: 554 QSGGTALVDIISGKQSPSGRLPVTQYPSSY 583
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 220/559 (39%), Positives = 309/559 (55%), Gaps = 59/559 (10%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N LA C+TSL R +V T+ E I V ++ V RLG+P Y+WWSE LH
Sbjct: 27 TNGPLAQNAICDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLH 86
Query: 104 GVSYVGPGTHFS--NVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
GV+ PG +FS ATSFPQ I+ A+F+ L +++G +V E R+ N G AGL
Sbjct: 87 GVA-DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGL 145
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
FW+PNIN F+DPRWGRGQETPGEDP ++Y V+GLQ P +V + CKH+
Sbjct: 146 DFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKP-YYQVISTCKHF 204
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKF--------------------------- 250
AYDL++W G RY F+A+V + YYL F
Sbjct: 205 AAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCAN 264
Query: 251 -----------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+++ SDCD+VD +Y+ +YTKTPEEA A ++ AG D++CG+F
Sbjct: 265 TYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFY 324
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQD 352
++ A L+ E+ + A+ +A+L+RLG+FD P+ QPY Q +V T Q
Sbjct: 325 SEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD--PTDIQPYRQYNWNNVDTPQAQQ 382
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AA +GIVLLKN G+LPLS + IKN+A+IGP N T M GNY G +PL G
Sbjct: 383 LAYQAAAEGIVLLKND-GTLPLS-SDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMG 440
Query: 413 LAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
A + Y G + T+ A AA AD + G D+++E+E +DR + P
Sbjct: 441 AVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWP 500
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q L+ E+A V K P++++ GG D + K + + ++LW GYPGQ+GG+A+ D+
Sbjct: 501 GNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDII 559
Query: 532 FGRYNPSGRLPMTWYPQSY 550
G+ P+GRLP+T YP Y
Sbjct: 560 SGKVAPAGRLPVTQYPADY 578
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 220/559 (39%), Positives = 309/559 (55%), Gaps = 59/559 (10%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N LA C+TSL R +V T+ E I V ++ V RLG+P Y+WWSE LH
Sbjct: 12 TNGPLAQNAICDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLH 71
Query: 104 GVSYVGPGTHFS--NVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
GV+ PG +FS ATSFPQ I+ A+F+ L +++G +V E R+ N G AGL
Sbjct: 72 GVA-DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGL 130
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
FW+PNIN F+DPRWGRGQETPGEDP ++Y V+GLQ P +V + CKH+
Sbjct: 131 DFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKP-YYQVISTCKHF 189
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKF--------------------------- 250
AYDL++W G RY F+A+V + YYL F
Sbjct: 190 AAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCAN 249
Query: 251 -----------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+++ SDCD+VD +Y+ +YTKTPEEA A ++ AG D++CG+F
Sbjct: 250 TYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFY 309
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQD 352
++ A L+ E+ + A+ +A+L+RLG+FD P+ QPY Q +V T Q
Sbjct: 310 SEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD--PTDIQPYRQYNWNNVDTPQAQQ 367
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AA +GIVLLKN G+LPLS + IKN+A+IGP N T M GNY G +PL G
Sbjct: 368 LAYQAAAEGIVLLKND-GTLPLS-SDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMG 425
Query: 413 LAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
A + Y G + T+ A AA AD + G D+++E+E +DR + P
Sbjct: 426 AVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWP 485
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q L+ E+A V K P++++ GG D + K + + ++LW GYPGQ+GG+A+ D+
Sbjct: 486 GNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDII 544
Query: 532 FGRYNPSGRLPMTWYPQSY 550
G+ P+GRLP+T YP Y
Sbjct: 545 SGKVAPAGRLPVTQYPADY 563
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 228/577 (39%), Positives = 313/577 (54%), Gaps = 68/577 (11%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
+S A + +F D + P L+ C+TSL R LV +TL+EKI +
Sbjct: 18 TSLASTESIFP-DCSTGP-LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAP 75
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVV 146
RLG+P Y WW+EALHGV+ G G FS+ ATSFP IL A+F+ L + + V+
Sbjct: 76 RLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVI 134
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARA N G AGL +W+PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ DG
Sbjct: 135 STEARAFANGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ--DG 192
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
P R KV A CKH+ AYDL+NW+G +RY F+A+V + YYL FK
Sbjct: 193 IGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAV 252
Query: 252 ----------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAA 277
++ DC ++D +Y HY AA
Sbjct: 253 MCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAA 312
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A ++ AG DL+CGS ++ +A++ GL N +++A+ +++L++LG+FD QPY
Sbjct: 313 AAALNAGTDLDCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPY 371
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG 397
+G +V T A ++LA A +GIV+LKN G+LPL +A+IGP AN T + G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQG 428
Query: 398 NYEGTPCKYTTPLQGLAAVVATIYQA----GCSNVQCGTAQVDDAKKAAASADATVLVMG 453
NYEG P KY L + A V Y+ G +A +A AA AD + G
Sbjct: 429 NYEGPP-KYIRTL--IWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGG 485
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
D +IE ES DR ++ PG Q LI +++ + K P+I++ GG D S + + ++
Sbjct: 486 IDNTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGAL 544
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYP QAGGAA+ D+ G+ P+GRLP+T YP SY
Sbjct: 545 LWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASY 581
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 360 bits (924), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 238/587 (40%), Positives = 314/587 (53%), Gaps = 63/587 (10%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
L FLT ++ S S P D V+ P L++ C+TS R L+ T +E
Sbjct: 8 LSAFLTALAVGTSCVSGYGFP----DCVNGP-LSNNTVCDTSKDPITRATALIDLWTDEE 62
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAAS 133
V+++ V RLG+P Y WWSE LHGV+ PG F S ATSFPQ IL A+
Sbjct: 63 LTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVTFAPSGNFSYATSFPQPILMGAA 121
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
F+ L QA+ VVSTE RA NVG AGL +W+PNIN F+DPRWGRGQETPGEDP Y
Sbjct: 122 FDDPLIQAVASVVSTEGRAFNNVGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLQGY 181
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
+ GLQ +P KV A CKH+ AYD+DNW+G RY FNA+V + YYL
Sbjct: 182 VYNLILGLQGGLDPTP-YFKVVADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPS 240
Query: 250 F--------------------------------------------KYIVSDCDSVDVLYN 265
F +++ SDCD+V +Y
Sbjct: 241 FQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYT 300
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
+YT P +AAA ++LAG D++CG+F + A+ GLVN + + A +A+L+RL
Sbjct: 301 PHNYTDNPAQAAADALLAGTDIDCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRL 360
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G+FD P QPY QLG DV T Q LA AA +G+VLLKN G+LPLS ++ LA+I
Sbjct: 361 GYFD-PPESQPYRQLGWSDVNTPEAQQLAHTAAVEGMVLLKND-GTLPLS-KHVRKLALI 417
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKKAAA 443
GP AN T M GNY G +PL G + +NV T+ A AA
Sbjct: 418 GPWANATTLMQGNYAGIAPYLISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAK 477
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
ADA + G D+++E E DRL++ PG Q L+ E+A V K P+I+ GG D S
Sbjct: 478 RADAVIFAGGLDETVEREEVDRLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSA 536
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
K+ + +I+W GYPGQ+GG A+ D+ G+ P+GRLP+T YP Y
Sbjct: 537 LKSKRSVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLPITQYPAEY 583
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 228/577 (39%), Positives = 313/577 (54%), Gaps = 68/577 (11%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
+S A + +F D + P L+ C+TSL R LV +TL+EKI +
Sbjct: 18 TSLASTESIFP-DCSTGP-LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAP 75
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVV 146
RLG+P Y WW+EALHGV+ G G FS+ ATSFP IL A+F+ L + + V+
Sbjct: 76 RLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVI 134
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARA N G AGL +W+PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ DG
Sbjct: 135 STEARAFANGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ--DG 192
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
P R KV A CKH+ AYDL+NW+G +RY F+A+V + YYL FK
Sbjct: 193 IGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAV 252
Query: 252 ----------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAA 277
++ DC ++D +Y HY AA
Sbjct: 253 MCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAA 312
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A ++ AG DL+CGS ++ +A++ GL N +++A+ +++L++LG+FD QPY
Sbjct: 313 AAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPY 371
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG 397
+G +V T A ++LA A +GIV+LKN G+LPL +A+IGP AN T + G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQG 428
Query: 398 NYEGTPCKYTTPLQGLAAVVATIYQA----GCSNVQCGTAQVDDAKKAAASADATVLVMG 453
NYEG P KY L + A V Y+ G +A +A AA AD + G
Sbjct: 429 NYEGPP-KYIRTL--IWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGG 485
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
D +IE ES DR ++ PG Q LI +++ + K P+I++ GG D S + + ++
Sbjct: 486 IDNTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGAL 544
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYP QAGGAA+ D+ G+ P+GRLP+T YP SY
Sbjct: 545 LWAGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASY 581
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 223/558 (39%), Positives = 304/558 (54%), Gaps = 66/558 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L+ C+TSL R LV +TL+EKI + RLG+P Y WW+EALHGV+
Sbjct: 56 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114
Query: 108 VGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
G G FS+ ATSFP IL A+F+ L + + V+STEARA N G AGL +W+
Sbjct: 115 EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWT 174
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ DG P R KV A CKH+ AYD
Sbjct: 175 PNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ--DGIGPERPKVVATCKHFAAYD 232
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
L+NW+G +RY F+A+V + YYL FK
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
++ DC ++D +Y HY AAA ++ AG DL+CGS ++
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEY 352
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+A++ GL N +++A+ +++L++LG+FD QPY +G +V T A ++LA
Sbjct: 353 LGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHK 411
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
A +GIV+LKN G+LPL +A+IGP AN T + GNYEG P KY L + A
Sbjct: 412 ATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPP-KYIRTL--IWAA 465
Query: 417 VATIYQA----GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
V Y+ G +A +A AA AD + G D +IE ES DR ++ PG
Sbjct: 466 VHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPG 525
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q LI +++ + K P+I++ GG D S + + ++LW GYP QAGGAA+ D+
Sbjct: 526 NQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILT 584
Query: 533 GRYNPSGRLPMTWYPQSY 550
G+ P+GRLP+T YP SY
Sbjct: 585 GKSAPAGRLPVTQYPASY 602
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 218/553 (39%), Positives = 300/553 (54%), Gaps = 61/553 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL I RV LVK LTL+EKI LVD++ +RLG+P YEWWSEA HGV
Sbjct: 74 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 132
Query: 108 VGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PG F++ ATSFP ILTAASF+ +L + I +V+ E RA N G +G FW
Sbjct: 133 SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFW 192
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN FRDPRWGRGQETPGEDPL+A Y +V GLQ G P +V A CKHY Y
Sbjct: 193 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---GDDPKNKQVIATCKHYAVY 249
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
DL+ + + Y+ + Y+L FK
Sbjct: 250 DLETGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEV 309
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
Y+VSDC +V ++ ++T T E AA+ ++ AG+DL CGS K E
Sbjct: 310 LRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNE- 368
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
++ A + +D +++ ++ L +GFFDG Y +L DV T Q LA +AA
Sbjct: 369 SLAANQTSVKVMDQSLARLYSALFTVGFFDG----GKYDKLDFSDVSTPDAQALAYEAAV 424
Query: 360 QGIVLLKNTAGSLPL-SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
+G+ LLKN LPL SP K++AVIGP AN T M G+Y G +PL+
Sbjct: 425 EGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRW 483
Query: 419 TI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
+ Y G + T+ ++A AA +D + + G D S+E+E+ DR L PG Q L
Sbjct: 484 KVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDL 543
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
IT ++K++K P++++ GG D S + I +++W GYP Q+GG A+ DV G+ +P
Sbjct: 544 ITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSP 602
Query: 538 SGRLPMTWYPQSY 550
+GRLP+T YP SY
Sbjct: 603 AGRLPVTQYPASY 615
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 358 bits (918), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 218/553 (39%), Positives = 300/553 (54%), Gaps = 61/553 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL I RV LVK LTL+EKI LVD++ +RLG+P YEWWSEA HGV
Sbjct: 134 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 192
Query: 108 VGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PG F++ ATSFP ILTAASF+ +L + I +V+ E RA N G +G FW
Sbjct: 193 SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFW 252
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN FRDPRWGRGQETPGEDPL+A Y +V GLQ G P +V A CKHY Y
Sbjct: 253 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---GDDPKNKQVIATCKHYAVY 309
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
DL+ + + Y+ + Y+L FK
Sbjct: 310 DLETGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLSEV 369
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
Y+VSDC +V ++ ++T T E AA+ ++ AG+DL CGS K E
Sbjct: 370 LRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNE- 428
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
++ A + +D +++ ++ L +GFFDG Y +L DV T Q LA +AA
Sbjct: 429 SLAANQTSVKVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQALAYEAAV 484
Query: 360 QGIVLLKNTAGSLPL-SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
+G+ LLKN LPL SP K++AVIGP AN T M G+Y G +PL+
Sbjct: 485 EGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRW 543
Query: 419 TI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
+ Y G + T+ ++A AA +D + + G D S+E+E+ DR L PG Q L
Sbjct: 544 KVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDL 603
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
IT ++K++K P++++ GG D S + I +++W GYP Q+GG A+ DV G+ +P
Sbjct: 604 ITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSP 662
Query: 538 SGRLPMTWYPQSY 550
+GRLP+T YP SY
Sbjct: 663 AGRLPVTQYPASY 675
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 358 bits (918), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 223/558 (39%), Positives = 303/558 (54%), Gaps = 66/558 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L+ C+TSL R LV +TL+EKI + RLG+P Y WW+EALHGV+
Sbjct: 56 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114
Query: 108 VGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
G G FS+ ATSFP IL A+F+ L + + V+STEARA N G AGL +W+
Sbjct: 115 EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWT 174
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
PNIN FRDPRWGRGQETPGEDPL S+Y V GLQ DG P R KV A CKH+ AYD
Sbjct: 175 PNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ--DGIGPERPKVVATCKHFAAYD 232
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
L+NW+G +RY F+A+V + YYL FK
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
++ DC ++D +Y HY AAA ++ AG DL+CGS ++
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEY 352
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+A++ GL N + +A+ +++L++LG+FD QPY +G +V T A ++LA
Sbjct: 353 LGSALQQGLYNNQTLYNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFTPAAEELAHK 411
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
A +GIV+LKN G+LPL +A+IGP AN T + GNYEG P KY L + A
Sbjct: 412 ATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPP-KYIRTL--IWAA 465
Query: 417 VATIYQA----GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
V Y+ G +A +A AA AD + G D +IE ES DR ++ PG
Sbjct: 466 VHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPG 525
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q LI +++ + K P+I++ GG D S + + ++LW GYP QAGGAA+ D+
Sbjct: 526 NQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILT 584
Query: 533 GRYNPSGRLPMTWYPQSY 550
G+ P+GRLP+T YP SY
Sbjct: 585 GKSAPAGRLPVTQYPASY 602
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 357 bits (916), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 217/562 (38%), Positives = 304/562 (54%), Gaps = 60/562 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D + P LA+ C+TS R L+ TL+EKI +++ V RLG+P Y+WW E
Sbjct: 22 DCTTGP-LANNTVCDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQE 80
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F+ ATSFPQ IL A+F+ L + +VSTEARA N
Sbjct: 81 ALHGVA-ESPGVIFAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANR 139
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL FW+PNIN F+DPRWGRGQETPGEDP Y + GLQ G P ++ A C
Sbjct: 140 SGLDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQ--GGLDPEYKRIVATC 197
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF------------------------ 250
KHY YDL+NW+G RY F+A++ + +Y F
Sbjct: 198 KHYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPS 257
Query: 251 ---------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
++I SDCD++ +Y +Y T E A ++ AG DL+C
Sbjct: 258 CANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADLDC 317
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G++ ++ AA GL ES +D A+ +A+L++LG+FD QPY Q+G +V T
Sbjct: 318 GTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPA-ENQPYRQIGWANVSTPE 376
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
++LA AA +GI L+KN G+LPLSP+ IK+LA+IGP AN T M GNY G P +P
Sbjct: 377 AEELAYRAAVEGITLIKND-GTLPLSPS-IKSLALIGPWANATTQMQGNYYGQPPYLISP 434
Query: 410 LQGLAAVVATIYQAGCSNV-QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
L A+ T+Y + V T+ A AA +ADA + + G D ++EAE+ DR L
Sbjct: 435 LMAAEALNYTVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTL 494
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
PG Q I ++++ K P++++ M GG D S + + +++W GYPGQ+GG A+
Sbjct: 495 DWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALM 553
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
D+ G P+GRLP T YP Y
Sbjct: 554 DIIVGNAAPAGRLPTTQYPLDY 575
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 357 bits (916), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 236/592 (39%), Positives = 325/592 (54%), Gaps = 72/592 (12%)
Query: 16 LFLFLTYCSLQHSSSS-AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
+ L +T +L ++ S AQ P D V+ P LAS C+TSLG R LV+ LT+
Sbjct: 1 MALLITSFALSIAAGSRAQGFP----DCVNGP-LASNAVCDTSLGHVERARALVEELTVA 55
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVS------YVGPGTHFSNVVPGATSFPQVI 128
E I V +A V RLG+P Y WW+EALHGV+ + PG FS+ ATSFP I
Sbjct: 56 EMINNTVHTAPGVPRLGLPPYNWWNEALHGVAASPGVVFTSPGEEFSS----ATSFPMPI 111
Query: 129 LTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 188
++F+ +L A+G V STEARA N GLAGL +W+PNIN F+DPRWGRG ETPGEDPL
Sbjct: 112 NMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPNINPFKDPRWGRGAETPGEDPL 171
Query: 189 LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYT 244
A++Y V+GLQ G P LKVAA CKH+ AYDL++W G RY F+A+V +
Sbjct: 172 HAARYVRTLVEGLQG--GIDPPSLKVAADCKHWAAYDLEDWGGVARYAFDAVVTPQDLAE 229
Query: 245 YYLIKFK--------------------------------------------YIVSDCDSV 260
YY FK ++ SDCD+V
Sbjct: 230 YYSPPFKSCVRDARAASVMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCDAV 289
Query: 261 DVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
+Y+ YT+ +A S+ AG DL+CG+ ++ A GL++E + A++ +A
Sbjct: 290 GNVYDPHGYTEDFVNGSAVSLKAGSDLDCGTTYSQYLPEAYDRGLIDEDDLKAALTRLYA 349
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
+L+ LG+FD P QPY Q+ DV T A Q LA AA + VLLKN G+LPL+ +++
Sbjct: 350 SLVWLGYFDA-PEDQPYRQISWADVNTPAAQALAYTAAIESFVLLKND-GTLPLTDSSL- 406
Query: 381 NLAVIGPNANVTKTMI-GNYEGTPCKYTTPLQG-LAAVVATIYQAGCSNVQCGTAQVDDA 438
++A+IGP AN + + GNY G P PLQG L A Y G + +D A
Sbjct: 407 SIALIGPMANASAVQLQGNYNGIPPFAIAPLQGFLDAGFNVTYVLGTNVTGNDADDIDGA 466
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
AA +AD + V G D ++E E+ DR ++ P Q L++ + + K P++++ M GG
Sbjct: 467 VAAAEAADVVIYVGGIDSTVEEEAKDRTEISWPDNQLALLSALEEAGK-PLVVVQMGGGQ 525
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D + K + +ILW GYPGQ+GG AIAD G+ P+GRL +T YP SY
Sbjct: 526 LDDTPLKESDAVNAILWAGYPGQSGGTAIADTVMGKVAPAGRLSITQYPASY 577
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/553 (39%), Positives = 299/553 (54%), Gaps = 61/553 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL I RV LVK LTL+EKI LVD++ +RLG+P YEWWSEA HGV
Sbjct: 27 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 85
Query: 108 VGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PG F++ ATSFP ILTAASF+ +L + I +V+ E R N G +G FW
Sbjct: 86 SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFSGFDFW 145
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN FRDPRWGRGQETPGEDPL+A Y +V GLQ G P +V A CKHY Y
Sbjct: 146 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---GDDPKNKQVIATCKHYAVY 202
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
DL+ + + Y+ + Y+L FK
Sbjct: 203 DLETGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEV 262
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
Y+VSDC +V ++ ++T T E AA+ ++ AG+DL CGS K E
Sbjct: 263 LRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNE- 321
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
++ A + +D +++ ++ L +GFFDG Y +L DV T Q LA +AA
Sbjct: 322 SLAANQTSVKVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQALAYEAAV 377
Query: 360 QGIVLLKNTAGSLPL-SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
+G+ LLKN LPL SP K++AVIGP AN T M G+Y G +PL+
Sbjct: 378 EGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRW 436
Query: 419 TI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
+ Y G + T+ ++A AA +D + + G D S+E+E+ DR L PG Q L
Sbjct: 437 KVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDL 496
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
IT ++K++K P++++ GG D S + I +++W GYP Q+GG A+ DV G+ +P
Sbjct: 497 ITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSP 555
Query: 538 SGRLPMTWYPQSY 550
+GRLP+T YP SY
Sbjct: 556 AGRLPVTQYPASY 568
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/553 (39%), Positives = 299/553 (54%), Gaps = 61/553 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL I RV LVK LTL+EKI LVD++ +RLG+P YEWWSEA HGV
Sbjct: 74 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 132
Query: 108 VGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PG F++ ATSFP ILTAASF+ +L + I +V+ E RA N G +G FW
Sbjct: 133 SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFW 192
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN FRDPRWGRGQETPGEDPL+A Y +V GLQ G P +V A CKHY Y
Sbjct: 193 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---GDDPKNKQVIATCKHYAVY 249
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
DL+ + + Y+ + Y+L FK
Sbjct: 250 DLETGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEV 309
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
Y+VSDC +V ++ ++T T E AA+ ++ AG+DL CGS K E
Sbjct: 310 LRKHWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSYLKLNE- 368
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
++ A + +D +++ ++ L +GFFDG Y +L DV T Q LA +AA
Sbjct: 369 SLAANQTSVKVMDRSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQALAYEAAV 424
Query: 360 QGIVLLKNTAGSLPLS-PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
+G+ LLKN LPL P K++AVIGP AN T M G+Y G +PL+
Sbjct: 425 EGMTLLKND-DLLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLEAFGDSRW 483
Query: 419 TI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
+ Y G + T+ ++A AA +D + + G D S+E+E+ DR L PG Q L
Sbjct: 484 KVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDL 543
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
IT ++K++K P++++ GG D S + I +++W GYP Q+GG A+ DV G+ +P
Sbjct: 544 ITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSP 602
Query: 538 SGRLPMTWYPQSY 550
+GRLP+T YP SY
Sbjct: 603 AGRLPVTQYPASY 615
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 221/567 (38%), Positives = 299/567 (52%), Gaps = 69/567 (12%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CD N L C+T+ R L+ TLQEK+ ++ V RLG+P Y WW
Sbjct: 28 CD---NGPLKDNTVCDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQ 84
Query: 100 EALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
EALHGV+ PG +FS+ P ATSFPQ IL A+F+ L + + V+STEARA N
Sbjct: 85 EALHGVA-SSPGVNFSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDK 143
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL--KVA 215
AGL FW+PNIN F+D RWGRGQETPGEDP S Y ++GLQ GSP+ +V
Sbjct: 144 RAGLDFWTPNINPFKDSRWGRGQETPGEDPYHLSSYVAALIEGLQ----GSPDDKYKRVV 199
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------- 251
A CKH+ AYD+++W G RY F+A V + YY+ F+
Sbjct: 200 ATCKHFVAYDMESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNG 259
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
++ SDCD+V ++ Y T EEAAA S+ AG D
Sbjct: 260 VPTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTD 319
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
+NCG++ H AA GL+N + +D ++ +++L+RLG+FDG PY L DV
Sbjct: 320 INCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGL--AVPYRNLTWNDVS 377
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T Q LA AA +GI LLKN G LPL+ + ++A+IG AN T M+GNY+G P +
Sbjct: 378 TPHAQQLAYKAAAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFF 436
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQVDD---AKKAAASADATVLVMGADQSIEAESH 463
+PL AT+ A Q G D AA +D + G D S+E+E
Sbjct: 437 HSPLYAAQQTGATVNFATGPGGQ-GDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGM 495
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+ L G Q +I ++A K PVI++ M GG D S N+P +++++W GYPGQ G
Sbjct: 496 DRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDG 554
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+ D+ G P+GRLP T YP Y
Sbjct: 555 GVALFDIIRGITAPAGRLPTTQYPAKY 581
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 211/541 (39%), Positives = 291/541 (53%), Gaps = 60/541 (11%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGA- 121
R LV TL+EKI A V RLG+P Y+WWSE LHG++ GP T+FS+ +
Sbjct: 10 RAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFSDSGEWSY 67
Query: 122 -TSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
TSFPQ IL A+F+ L + KV+STEARA N GL FW+PNIN FRDPRWGRGQ
Sbjct: 68 STSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQ 127
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ETPGED S Y + GLQ + P + +V A CKH+ YD+++W G RY +
Sbjct: 128 ETPGEDAYHLSSYVQALIHGLQ-GESTDPYK-RVVATCKHFAGYDVEDWNGNLRYQNDVQ 185
Query: 241 V----IYTYYLIKFK--------------------------------------------Y 252
+ + YYL F+ +
Sbjct: 186 ITQQELVEYYLAPFQACVQANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGWTNEEQW 245
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
+ DCD+V +Y ++ T AAA S++AG D+ CG+++ +H AA + L+NES++D
Sbjct: 246 VTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDVTCGTYMQEHLPAAFQQKLLNESSLD 305
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
A+ +++L+RLG+FD QPY QLG V T A+Q LA AA +GIVLLKN G+L
Sbjct: 306 QALIRQYSSLVRLGYFDAS-ENQPYRQLGFDAVATNASQALARRAAAEGIVLLKND-GTL 363
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCG- 431
PLS + + + G AN T ++GNY G +PL L I AG + G
Sbjct: 364 PLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGVKINYAGGNPGGQGD 423
Query: 432 --TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
T + + A +++D + V G D S+E E DR L G Q +I ++A K PV
Sbjct: 424 PTTNRWSNLYGAYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGAQLDVIGQLADTGK-PV 482
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+++ GG D S N+P I++I+W GYPGQ GG+AI D+ G+ P+GRLP T YP +
Sbjct: 483 IVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGGKTAPAGRLPQTQYPAN 542
Query: 550 Y 550
Y
Sbjct: 543 Y 543
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 224/563 (39%), Positives = 300/563 (53%), Gaps = 63/563 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P LA+ C+T+ R LV TL EKI +++ V R+G+P Y+WW+E
Sbjct: 22 DCVNGP-LANNTVCDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNE 80
Query: 101 ALHGVSYVGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
ALHG++Y GTHF+ ATSFPQ IL A+F+ +L + +STEARA N
Sbjct: 81 ALHGIAY---GTHFAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNAN 137
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL FW+PNIN ++DPRWGRGQETPGEDP S Y V GLQ P + K A
Sbjct: 138 RYGLNFWTPNINPYKDPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYK-KGVAT 196
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHY YDL+N G RY F+A++ + YYL F+
Sbjct: 197 CKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVP 256
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
++ SDCD+V +++S +YT TPE+AAA ++ AG DL+
Sbjct: 257 TCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTDLD 316
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG F + +A L N S +D +++ +A+L+RLG+FD S QPY QLG DV T
Sbjct: 317 CGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFD-PASIQPYRQLGWSDVSTP 375
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+ + LAL AA GIVLLKN G LPL P+ I N+A+IGP AN T M GNY G +
Sbjct: 376 SAEQLALQAAEDGIVLLKND-GILPL-PSNITNVALIGPWANATTQMQGNYYGQAPYLHS 433
Query: 409 PL-QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL A Y G T + A AA AD + + G D SIEAE+ DR
Sbjct: 434 PLIAAQNAGFHVTYVQGADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKT 493
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ P Q L+ ++A ++ P+I+ M G D S + + I+W GYPGQ GG AI
Sbjct: 494 IAWPSSQISLVNQLANLSI-PLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAI 551
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
++ G+ P+GRLP+T YP Y
Sbjct: 552 FNILTGKTAPAGRLPITQYPSDY 574
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 355 bits (910), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 216/554 (38%), Positives = 294/554 (53%), Gaps = 69/554 (12%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-------- 105
C+TSL R L+ T+ E I V+ A V RLG+P YEWWSE LHGV
Sbjct: 37 CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
S GP ++ ATSFPQ IL +A+F+ +L +A+G V+ E RA N G AGL FW+
Sbjct: 97 SITGPFSY-------ATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWT 149
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
PNIN F+DPRWGRGQETPGEDP ++Y ++GLQ P +V A CKH+ YD
Sbjct: 150 PNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGLDPEP-YFQVVATCKHFAGYD 208
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKF------------------------------- 250
L++W RY +NA++ + YYL F
Sbjct: 209 LEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCADTYLL 268
Query: 251 -------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT 297
+++ DCDSV+ +Y+ HYT P++AAA ++ AG D++CG F
Sbjct: 269 QDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADALKAGSDIDCGIFYTTWL 328
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
A L+ E + A++ +A+L+RLG+FD S+QPY Q +V T Q+LA A
Sbjct: 329 PLAYTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYNWSNVDTSYAQELAYTA 387
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A +GI LLKN G+LP S +AIKN+A+IGP T M GNY G +P QG
Sbjct: 388 AVEGITLLKND-GTLPFS-SAIKNIALIGPWTFATTQMQGNYYGNAPYLISPYQGAQLAG 445
Query: 418 ATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
I +NV T A AA ADA V V G D ++EAE+ DR D+ P Q
Sbjct: 446 YNISYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITWPAFQLW 505
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI E+ K+ K P++++ GG D + +P + ++LW GYPGQ+GG A+ D+ G+
Sbjct: 506 LIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDIISGKVA 564
Query: 537 PSGRLPMTWYPQSY 550
P+GRL T YP Y
Sbjct: 565 PAGRLVSTQYPADY 578
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 354 bits (908), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 233/577 (40%), Positives = 313/577 (54%), Gaps = 64/577 (11%)
Query: 17 FLFLTYCSLQHSSSS-AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
FL LT +L ++ S AQ P D V+ P LAS C+ SLG R LV+ T+ E
Sbjct: 3 FLLLTSIALSLAAGSRAQGFP----DCVNGP-LASNPVCDASLGHVERARALVEEFTVPE 57
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG-ATSFPQVILTAASF 134
I V++A V RLG+P YEWW+EALHGV + PG F P ATSFP I ++F
Sbjct: 58 MINNTVNAAFGVPRLGLPPYEWWNEALHGVG-LSPGVVFFEPEPAVATSFPMPINMGSAF 116
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
+ +L A+G V+STEARA N G AGL +W+PNIN F+DPRWGRG ETPGEDPL A++Y
Sbjct: 117 DDALMLAMGDVISTEARAFSNAGRAGLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYV 176
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF 250
V+GLQ G P LKVAA CKH+ AYDL+NW G RY F+A+V + YY F
Sbjct: 177 RSLVEGLQG--GIDPPSLKVAAACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPF 234
Query: 251 --------------------------------------------KYIVSDCDSVDVLYNS 266
+++ SDC +V +Y+
Sbjct: 235 RSCVRDARAASAMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDP 294
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
YT+ A+ S+ AG DLNCG+ ++ A GL++E + A++ +A+L+ LG
Sbjct: 295 HGYTEDLVNASTVSLKAGTDLNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLG 354
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
+FD P QPY Q+ DV T Q LA AA + VLLKN G+LPL+ + + +LA+IG
Sbjct: 355 YFDA-PEDQPYRQITWADVNTPEAQALAYTAAIKSFVLLKND-GTLPLTDSTL-SLALIG 411
Query: 387 PNANVTK-TMIGNYEGTPCKYTTPLQG-LAAVVATIYQAGCSNVQCGTAQVDDAKKAAAS 444
P AN + M+GNY G P PLQG L A Y G + D A AA +
Sbjct: 412 PMANASALQMLGNYFGIPPFVIAPLQGFLDAGFNVTYVLGTNVTGNDAGSFDAAVAAAEA 471
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
AD + V G D ++E E DR ++ P Q L++ + V K P++++ M GG D +
Sbjct: 472 ADVVIYVGGIDNTLEMEEKDRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPL 530
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K + +ILW GYPGQ+GG AIAD G+ P+GRL
Sbjct: 531 KESDAVNAILWAGYPGQSGGTAIADTVTGKVAPAGRL 567
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 353 bits (907), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 191/410 (46%), Positives = 250/410 (60%), Gaps = 51/410 (12%)
Query: 189 LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYT 244
+ASKYA +VKG+Q G S L+ +ACCKH TAYDL++W G RY+FNA V +
Sbjct: 1 MASKYAVAFVKGMQ---GNSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLED 57
Query: 245 YYLIKFK------------------------------------------YIVSDCDSVDV 262
Y F+ YI SDCD+V +
Sbjct: 58 TYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAI 117
Query: 263 LYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATL 322
+ ++Q YT+TPE+A A ++ AGLD+NCG+++ +H AA++ G + E ID A+ N FA
Sbjct: 118 MRDAQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIR 177
Query: 323 MRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKN 381
MRLG FDG P S YG LG D+CT ++ LAL+AA GIVLLKN AG LPL TA+ +
Sbjct: 178 MRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVAS 237
Query: 382 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKK 440
AVIGPNAN +IGNY G PC+ TTPL G+ + + + AGC++ C A D A
Sbjct: 238 AAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAA 297
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A+S+D L MG Q E+E DR LLLPG+QQ LIT VA AK PVIL++++GG D
Sbjct: 298 VASSSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVD 357
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++FA+ +PKI +ILW GYPGQAGG AIA V FG +NP GRLP+TWYP+ +
Sbjct: 358 VTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEF 407
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 232/587 (39%), Positives = 317/587 (54%), Gaps = 63/587 (10%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
L FLT S+ ++ A P D V+ P L + C+ + R L+ T +E
Sbjct: 8 LHAFLTAFSVGAATVRAYGFP----DCVNGP-LKNNAVCDVTKDPITRATALISIWTDEE 62
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAAS 133
V+++ V RLG+P Y WWSE LHGV+ PG F S ATSFPQ IL A+
Sbjct: 63 LTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVTFAPSGNFSYATSFPQPILMGAA 121
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
F+ L QAI +VSTE RA N G AGL +W+PNIN F+DPRWGRGQETPGEDP S+Y
Sbjct: 122 FDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQY 181
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
+ GLQ P KV A CKH+ AYD+DNW+G RY FNA+V + +YL
Sbjct: 182 VYNLILGLQGGLDPKP-YFKVVADCKHFAAYDMDNWEGVVRYGFNAVVSQQDLSEFYLPP 240
Query: 250 F--------------------------------------------KYIVSDCDSVDVLYN 265
F +++ SDCD+V ++
Sbjct: 241 FQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVQNIFT 300
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
+YT P +AAA ++LAG D++CG+F + A++ GLVN + + A +A+L+RL
Sbjct: 301 PHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRL 360
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G+FD P+ QPY QLG DV T Q LA AA +G+VLLKN G LPLS ++ LA+I
Sbjct: 361 GYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAVEGMVLLKND-GLLPLS-KRVRKLALI 417
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNV--QCGTAQVDDAKKAAA 443
GP AN T+ + GNY G +P+QG + +NV + T+ A AA
Sbjct: 418 GPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFEVEYVFGTNVTTRNDTSGFAAAVAAAK 477
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
ADA V G D+++E E DRL++ PG Q L+ E+ +V K P+I+ GG D +
Sbjct: 478 RADAVVFAGGLDETVEREEIDRLNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDNTA 536
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
K + +I+W GYPGQ+GG A+ D+ G+ P+GRLP+T YP +Y
Sbjct: 537 LKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLPITQYPAAY 583
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 220/560 (39%), Positives = 302/560 (53%), Gaps = 61/560 (10%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L + C+TS R L+ TL+EKI ++A V RLG+P Y+WW EALH
Sbjct: 24 ANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALH 83
Query: 104 GVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
GV+ PG F S ATSFPQ IL A+F+ +L + +VSTEARA N +G+
Sbjct: 84 GVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGI 142
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
FW+PNIN F+DPRWGRGQETPGEDP Y + GLQ G P ++ A CKH+
Sbjct: 143 DFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQ--GGLDPEYKRIVATCKHF 200
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
AYDL+NW+G RY F+A+V + +Y F+
Sbjct: 201 AAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCAN 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
YI SDCD++ +Y +YT T E A ++ AG DL+CG +
Sbjct: 261 SYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTDLDCGEY 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQ 351
++ AA GL ES ++ A+ +A L++LG+FD P+ QPY Q+G +V T +
Sbjct: 321 YPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD--PADIQPYRQIGWANVSTPEAE 378
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+LA AA +GI LLKN G+LPLSP+ IK +A+IGP AN T M GNY G +PL
Sbjct: 379 ELAYTAAVEGITLLKND-GTLPLSPS-IKTIALIGPWANATTQMQGNYYGVAPYLISPLM 436
Query: 412 GLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
+ T+ Y AG T+ A AA +ADA + G D ++EAE+ DR L
Sbjct: 437 AAEELGFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRYTLDW 496
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q I +++ + K P+I++ GG D S +P + +++W GYPGQ+GG AI D+
Sbjct: 497 PGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKAIMDI 555
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G P+GRLP+T YP Y
Sbjct: 556 IVGNAAPAGRLPITQYPLDY 575
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 171/269 (63%), Positives = 203/269 (75%), Gaps = 1/269 (0%)
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
A LDL+CG FLG+HTE AV+ GL+ E+ I++A+ N MRLG FDG PS +PYG LGP
Sbjct: 6 ASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLGP 65
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
DVCT A+Q+LAL+AARQGIVLLKN LPLS +++A+IGPN+NVT TMIGNY G
Sbjct: 66 TDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAGV 125
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAE 461
C YTTPLQG+ TIYQ GC++V C Q A AA ADATVLVMG DQSIEAE
Sbjct: 126 ACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEAE 185
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
S DR +LLLPG+QQ LI++VA +KGP IL++MSGG D+SFA+NDPKI I+W GYPGQ
Sbjct: 186 SRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPGQ 245
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AGGAAI+DV FG NP G+LPMTWYPQ Y
Sbjct: 246 AGGAAISDVLFGTTNPGGKLPMTWYPQDY 274
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 217/569 (38%), Positives = 302/569 (53%), Gaps = 98/569 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + S + R DLV R+T++EK++ ++ ++ ++ RLGIP Y WW+EALHGV+ G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA+F+ L + V+STE RA Y+ GLTFW
Sbjct: 65 --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +VKGLQ G P LK AAC KH+ +
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQ---GNHPKYLKAAACAKHFAVH 173
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ + R+ FNA+V +Y YL FK
Sbjct: 174 ---SGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTL 230
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + H T T E+AA ++ G DLNCG+ G +
Sbjct: 231 LSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLL 289
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDA 357
A+K GL+ E ID A++ T M+LG FD P Q PY + V + +++LALD
Sbjct: 290 IALKEGLITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISYDFVDCKEHRELALDV 347
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV- 416
A++ IVLLKN G LPL I+++AVIGPNA+ + +IGNYEGT +Y T L G+ +
Sbjct: 348 AKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMA 406
Query: 417 ---VATIYQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE----- 461
V Y GC N+ ++ +A A AD ++ +G D +IE E
Sbjct: 407 GDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEEMHES 466
Query: 462 ----SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
S D+ DL LPGQQQ L+ V K P++L++++G +++A D I +IL
Sbjct: 467 NIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAW 523
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
YPG GG AIA V FG NPSG+LP+T+Y
Sbjct: 524 YPGALGGRAIASVLFGETNPSGKLPVTFY 552
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 217/564 (38%), Positives = 303/564 (53%), Gaps = 63/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P L+S C+ + R L++ T +E + + + V RLG+P Y+WWSE
Sbjct: 26 DCVNGP-LSSTAVCDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSE 84
Query: 101 ALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F S ATSFPQ I+ ++F+ L +A+ V+STEARA N
Sbjct: 85 ALHGVA-GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHR 143
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAA 216
AGL +++PNIN F+DPRWGRGQETPGEDP S+Y + GLQ GG R KVAA
Sbjct: 144 AGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQ---GGIDPRPYFKVAA 200
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF---------------------- 250
CKHY AYDLD+W+G DR+HF+A V + YYL F
Sbjct: 201 DCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGI 260
Query: 251 ----------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
+++ SDCD++ ++ + ++T T EA A ++ AG D++
Sbjct: 261 PACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVD 320
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+ H A+ L+ ++ A++ + +LMRLG+FD P QP QL DV
Sbjct: 321 CGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKP 379
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
Q LA AA +G+VLLKN G LP+S + K +A+IGP AN TK M GNY GT T
Sbjct: 380 DAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAPFIVT 437
Query: 409 PLQGL--AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
P QG A + AG S A A A S+D + G + SIE+E+ DRL
Sbjct: 438 PFQGAVDAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRL 497
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ G Q L+ ++A + K PV+++ GG D S ++ + +++W GYPGQ+GG A
Sbjct: 498 TIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTA 556
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I DV G P+GRL +T YP+ +
Sbjct: 557 IFDVITGAVAPAGRLSVTQYPEDF 580
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 217/564 (38%), Positives = 303/564 (53%), Gaps = 63/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P L+S C+ + R L++ T +E + + + V RLG+P Y+WWSE
Sbjct: 26 DCVNGP-LSSTAVCDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSE 84
Query: 101 ALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F S ATSFPQ I+ ++F+ L +A+ V+STEARA N
Sbjct: 85 ALHGVA-GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHR 143
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAA 216
AGL +++PNIN F+DPRWGRGQETPGEDP S+Y + GLQ GG R KVAA
Sbjct: 144 AGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQ---GGIDPRPYFKVAA 200
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF---------------------- 250
CKHY AYDLD+W+G DR+HF+A V + YYL F
Sbjct: 201 DCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGI 260
Query: 251 ----------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
+++ SDCD++ ++ + ++T T EA A ++ AG D++
Sbjct: 261 PACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVD 320
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+ H A+ L+ ++ A++ + +LMRLG+FD P QP QL DV
Sbjct: 321 CGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKP 379
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
Q LA AA +G+VLLKN G LP+S + K +A+IGP AN TK M GNY GT T
Sbjct: 380 DAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAPFIVT 437
Query: 409 PLQGL--AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
P QG A + AG S A A A S+D + G + SIE+E+ DRL
Sbjct: 438 PFQGAVDAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRL 497
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ G Q L+ ++A + K PV+++ GG D S ++ + +++W GYPGQ+GG A
Sbjct: 498 TIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTA 556
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
I DV G P+GRL +T YP+ +
Sbjct: 557 IFDVITGAVAPAGRLSVTQYPEDF 580
>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 167/233 (71%), Positives = 186/233 (79%), Gaps = 31/233 (13%)
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
+VVSTEARAMYNVGLAGLTFWSPN+NIF+DPRWGRGQETPGEDPLL+SKYA+GYV+GLQQ
Sbjct: 77 QVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQ 136
Query: 204 TDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV---------------------- 241
+D GSP+RLKVAACCKHYTAYDLDNWKG D +HFNA+V
Sbjct: 137 SDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNV 196
Query: 242 ---------IYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y Y +Y+VSDCDSVDV YNSQHYTKTPEEA AK+ILAGLDLNCGSF
Sbjct: 197 ASVMCSYNQFYLIYKYHLQYLVSDCDSVDVFYNSQHYTKTPEEAVAKAILAGLDLNCGSF 256
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
LG+HTEAAVK GLV+ESA+D A+SNNFATLMRLGFFDG+PSK YG+LGPKDV
Sbjct: 257 LGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDV 309
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 216/569 (37%), Positives = 304/569 (53%), Gaps = 66/569 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D ++ P LA+ C+++L R LV LT+ EK V+++ V RLG+P Y WWSE
Sbjct: 30 DCLAGP-LANTTVCDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSE 88
Query: 101 ALHGVSYVGPGTHFSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
LHGV+ PG F+ ATSFP+ IL A+F+ +L I ++STEARA N
Sbjct: 89 GLHGVA-SSPGVTFAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFN 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
+GL FW+PNIN RDPRWGR ETPGEDP + Y V GLQ G P K+ A
Sbjct: 148 HSGLDFWTPNINPVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ-FGGDDPKYQKLVAT 206
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKHY YDL+NW G RY F+A++ + Y+L F+
Sbjct: 207 CKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIP 266
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y+ SDCD+V +Y +YT TPE+A A S+ A
Sbjct: 267 SCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSLKA 326
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGP 342
G DL+CG+F + ++ + GL +++ ID A+ ++A L LG+FD P++ Q Y Q
Sbjct: 327 GTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFD--PAEGQIYRQYNW 384
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
++ T Q LA AA +GI LLKN LPL P+ + N+A+IGP AN T M GNY+G
Sbjct: 385 ANINTDYAQQLAYTAAWEGITLLKNIDDMLPL-PSTMTNIALIGPWANATTQMQGNYQGI 443
Query: 403 PCKYTTPLQGLAAV-VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE 461
+PL L + Y G + TA A AA +AD T+ + G D ++EAE
Sbjct: 444 APFLHSPLYALQQRGINVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAE 503
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
+ DR+++ PG Q LI ++A V+ +I+ M GG D + +PK+ +LW GYPGQ
Sbjct: 504 AMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQ 562
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+ D+ +G P+GRLP++ YP ++
Sbjct: 563 DGGTAMIDILYGSRAPAGRLPLSQYPANF 591
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 216/595 (36%), Positives = 310/595 (52%), Gaps = 84/595 (14%)
Query: 27 HSSSSAQSSPVFACDVVSNPSLASLGF------------------CNTSLGIDLRVGDLV 68
+SS+S +++ + AC V SL LGF C+T+L R L+
Sbjct: 2 YSSNSRRAASILACIV----SLTQLGFAQSPFPDCENGPLSKNAVCDTTLDPVTRAQALL 57
Query: 69 KRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPGATSFPQ 126
+TL+EKI ++ V RLG+P Y WWSEALHGV+ PG HF++ ATSFP
Sbjct: 58 AAMTLEEKINNTQYNSPGVPRLGLPAYNWWSEALHGVA-GSPGVHFADSGNFSYATSFPS 116
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGED 186
I A+F+ L + I V+ TE RA N G AGL +W+PNIN +RDPRWGRGQETPGED
Sbjct: 117 PITLGAAFDDDLVKQIATVIGTEGRAFGNAGHAGLDYWTPNINPYRDPRWGRGQETPGED 176
Query: 187 PLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----I 242
P S+Y + GLQ DG P + K+ A CKH+ YD+++W+G +RY F+A++ +
Sbjct: 177 PFHTSRYVYHLIDGLQ--DGIGPEKPKIVATCKHFAGYDIEDWEGNERYAFDAVISDQDM 234
Query: 243 YTYYLIKFK---------------------------------------------YIVSDC 257
YY FK ++ SDC
Sbjct: 235 AEYYFPPFKTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVLREHWEWEGVGHWVTSDC 294
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++D +Y Y AAA ++ AG DL+CGS + +A+ GL+ +D A++
Sbjct: 295 GAIDNIYKDHKYVADGAHAAAVAVNAGTDLDCGSVYPQFLGSAISQGLLGNRTLDRALTR 354
Query: 318 NFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
+++L++LG+FD + QPY +G DV T + LA AA +G VLLKN G+LPL
Sbjct: 355 LYSSLVKLGYFD-PAADQPYRSIGWSDVATPDAEQLAHTAAVEGTVLLKND-GTLPLKKN 412
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV--VATIYQAGCSNVQCGTAQV 435
+A++GP AN T + GNYEGT KY + AA Y G T+
Sbjct: 413 G--TVAIVGPYANATTQLQGNYEGT-AKYIHTMLSAAAQQGYKVKYAPGTGINSNSTSGF 469
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+ A AA +D + G D +EAE+ DR + PG Q LI +++ + K P++++
Sbjct: 470 EQALNAAKGSDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDLIQQLSDLKK-PLVVVQFG 528
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG D S ++ + +LW GYP QAGGAA+ D+ G+ P+GRLP+T YP+ Y
Sbjct: 529 GGQVDDSSLLSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAPAGRLPVTQYPEEY 583
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 172/347 (49%), Positives = 227/347 (65%), Gaps = 6/347 (1%)
Query: 208 SPNRLKVAACCKHY---TAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLY 264
S NR+ C Y +A +W V+ Y + YIVSDCDS+ +
Sbjct: 251 SYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLY--LPCSYIVSDCDSLQTFF 308
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
++ +Y KT E+ A ++LAGL+L+CG FL HT++A+ G + E+ ++ A+ + MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LG +DG+P QPYG LGP+ VCT NQ LALDAA++GIVLLKN LP S + I+ +A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAA 443
IGP+A T+ MIGNY+G PCKYTTP GL+A +Y AGCS+V C + + A A+
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTAS 488
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
ADA VL +G D + EAE DR LLLPG+QQ L+TEV K AKGPV+L+I SGG D+SF
Sbjct: 489 QADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSF 548
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AK D K+ +LW GYPG+AGGAAIA V FG +NP GRLP+TWYP+S+
Sbjct: 549 AKYDKKVQGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF 595
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 130/204 (63%), Positives = 154/204 (75%), Gaps = 12/204 (5%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACD N +L GFCNT L RV DL+ R+TLQEKI LV++A + RLG+P+YEW
Sbjct: 32 YACDQ-RNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEW 90
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHGV+ V PG F PGATSFP ILTAASF+A VSTEARAM+N
Sbjct: 91 WQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARAMHNYQ 140
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NI+RDPRWGRGQETPGEDPLL+SKYAT YV+GLQ T+ G ++LKV+AC
Sbjct: 141 RAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGG-DKLKVSAC 199
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV 241
CKH TAYD+DNWKGT R+ FNA+V
Sbjct: 200 CKHMTAYDVDNWKGTTRFKFNAIV 223
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 230/587 (39%), Positives = 315/587 (53%), Gaps = 63/587 (10%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
L FLT S+ +++ A P D V+ P L S C+ + R L+ T +E
Sbjct: 8 LHAFLTTFSVGAATAWAYGFP----DCVNGP-LKSNAVCDITKDPITRATALIGLWTDEE 62
Query: 76 KITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAAS 133
+ V+++ V RLG+P Y WWSE LHGV+ PG F S ATSFPQ IL A+
Sbjct: 63 LTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVTFAPSGNFSHATSFPQPILMGAA 121
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
F+ +L QAI +VSTE RA N G AGL +W+PNIN F+DPRWGRGQETPGEDP S+Y
Sbjct: 122 FDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQY 181
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
+ GLQ P KV A CKH+ AYDL+NW+G R F+A+V + +YL
Sbjct: 182 VYNLILGLQGGLDPKP-YFKVVADCKHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPP 240
Query: 250 F--------------------------------------------KYIVSDCDSVDVLYN 265
F +++ SDCD+V+ +
Sbjct: 241 FQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILT 300
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
YT P +AAA ++LAG D++CG+F + A++ GLVN + + A +A+L+RL
Sbjct: 301 PHKYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRL 360
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
G+FD P+ QPY QLG DV T Q LA AA +GIVLLKN G LP S ++ LA+I
Sbjct: 361 GYFD-DPAAQPYRQLGWSDVNTPQAQQLAHTAAVEGIVLLKND-GVLPFS-KHVRKLALI 417
Query: 386 GPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNV--QCGTAQVDDAKKAAA 443
GP AN T + G+Y G +PLQG + +NV Q + A A
Sbjct: 418 GPWANATSLLQGSYIGVAPYLVSPLQGAQEAGFEVEYVLGTNVTTQNDMSGFAAAVAAVR 477
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
ADA V G D+++E E DRL++ PG Q L+ E+ +V K P+I+ GG D +
Sbjct: 478 RADAVVFAGGLDETVECEGTDRLNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTA 536
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
K+ + +I+W GYPGQ+GG A+ D+ G+ P+GRLP+T YP +Y
Sbjct: 537 LKHSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLPITQYPAAY 583
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 217/556 (39%), Positives = 304/556 (54%), Gaps = 60/556 (10%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL R LV +T +EK+ + + +RLG+P Y+WW+EALHGV+
Sbjct: 159 LCSNPICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVA- 217
Query: 108 VGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWS 165
PG F S ATSFPQ IL +A+F+ +L + +G VVS E RA N G AGL FW+
Sbjct: 218 GSPGVTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGLDFWT 277
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
PNIN FRDPRWGRGQETPGEDP ++Y V GLQ +G +P +V A CKH+ YD
Sbjct: 278 PNINPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQ--NGIAPANPRVVATCKHFAGYD 335
Query: 226 LDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------------ 251
+++W+G RY FNA++ + YYL FK
Sbjct: 336 IEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLL 395
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
++ SDCD+VD +Y+ YT + AAA ++ AG +L+CG+ + +
Sbjct: 396 DTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNLDCGTTMSNN 455
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
AA L + ++ A+ +++L+RLG+FD S+ Y LG DV T A+Q LA
Sbjct: 456 LAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGTTASQQLANR 513
Query: 357 AARQGIVLLKNTAGS-LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AA +GIVLLKN LPLS + +A+IGP AN T + GNY GTP T + G
Sbjct: 514 AAVEGIVLLKNDHKKVLPLSQHG-QTIALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQ 572
Query: 416 VVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQ 474
+ T+ Y+AG T+ A AA +AD + G D SIEAE+ DR + G Q
Sbjct: 573 MGYTVQYEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQ 632
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
LI ++++V K P++++ GG D S + + ++LW GYP Q GG A+ D+ G+
Sbjct: 633 LQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQ 691
Query: 535 YNPSGRLPMTWYPQSY 550
P+GRLP+T YP +Y
Sbjct: 692 SAPAGRLPVTQYPANY 707
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 220/584 (37%), Positives = 304/584 (52%), Gaps = 72/584 (12%)
Query: 20 LTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITF 79
+T+ ++ S Q+ P+ DV CN L + R +V T+ E I
Sbjct: 38 ITFQAVHFSFPDCQNGPLKDNDV-----------CNPYLPNNQRAKAVVDLFTVDELIAN 86
Query: 80 LVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNAS 137
+ +++ V RLG+P Y+WWSE LHG++ ++F S ATSFPQ IL +FN+
Sbjct: 87 MGNTSPGVERLGLPPYQWWSEGLHGIAR----SNFTASGEYSHATSFPQPILMGGAFNSD 142
Query: 138 LFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGY 197
L++ +G V+ TEARA NVG AGL ++SPNIN F+DPRWGRGQE E P+L YA Y
Sbjct: 143 LYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPFKDPRWGRGQEVASESPVLVGNYALNY 202
Query: 198 VKGLQQTDGGSPN--RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK 251
V+GLQ +PN L+VAA CKH+ YD+++WK R +NA++ + YY F+
Sbjct: 203 VQGLQGGIDSNPNDDTLQVAATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQ 262
Query: 252 YIV-------------------------------------------SDCDSVDVLYNSQH 268
V SDCDS+ ++N
Sbjct: 263 SCVRDAKAAGAMCSYNAINGIPVCASEFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHL 322
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
Y + AAA I AG+D+NCG + A+ +NE I +++ ++ L+RLG+F
Sbjct: 323 YVQDLGAAAADGIKAGVDVNCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYF 382
Query: 329 DGHPSKQPYGQLGPKDVCT-QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGP 387
D P Y DV T QANQ LA AA +GI LLKN G+LP + +KN+AVIGP
Sbjct: 383 DS-PQTNKYRTYNWSDVSTSQANQ-LAYQAAVEGITLLKND-GTLPFNKDKVKNVAVIGP 439
Query: 388 NANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQAGCSNVQCGTAQVDDAKKAAASAD 446
AN T M+G+Y GTP +PLQG + Y G T A AA AD
Sbjct: 440 WANATTDMLGDYAGTPPYLISPLQGAQDSGFKVQYAYGTQINTTLTTNYTAALNAAKGAD 499
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
A V G D SIE E+ DR L PG Q L+++++ + K P++++ G D + KN
Sbjct: 500 AIVYFGGIDNSIENEALDRESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKN 558
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ + SI++ GYPGQ+GG AI DV G Y P+GRL T YP SY
Sbjct: 559 NNNVNSIVYAGYPGQSGGTAIWDVLNGIYAPAGRLSTTQYPASY 602
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 171/347 (49%), Positives = 226/347 (65%), Gaps = 6/347 (1%)
Query: 208 SPNRLKVAACCKHY---TAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLY 264
S NR+ C Y +A +W V+ Y + YIVSDCDS+ +
Sbjct: 251 SYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLY--LPCSYIVSDCDSLQTFF 308
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
++ +Y KT E+ A ++LAGL+L+CG FL HT++A+ G + E+ ++ A+ + MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAV 384
LG +DG+P QPYG LGP+ VCT NQ LALDAA++GIVLLKN LP S + I+ +A
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428
Query: 385 IGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAA 443
IGP+A T+ MIGNY+G PCKYTTP GL+A +Y AGCS+V C + + A A+
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTAS 488
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
ADA VL +G D + EAE DR LLLPG+QQ L+TEV K AKGP +L+I SGG D+SF
Sbjct: 489 QADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSF 548
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AK + K+ ILW GYPG+AGGAAIA V FG +NP GRLP+TWYP+S+
Sbjct: 549 AKYNNKVQGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESF 595
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 130/204 (63%), Positives = 154/204 (75%), Gaps = 12/204 (5%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+ACD N +L GFCNT L RV DL+ R+TLQEKI LV++A + RLG+P+YEW
Sbjct: 32 YACDQ-RNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEW 90
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHGV+ V PG F PGATSFP ILTAASF+A VSTEARAM+N
Sbjct: 91 WQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFDA---------VSTEARAMHNYQ 140
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
AGLT+WSPN+NI+RDPRWGRGQETPGEDPLL+SKYAT YV+GLQ T+ G ++LKV+AC
Sbjct: 141 RAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGG-DKLKVSAC 199
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV 241
CKH TAYD+DNWKGT R+ FNA+V
Sbjct: 200 CKHMTAYDVDNWKGTTRFKFNAIV 223
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 213/541 (39%), Positives = 280/541 (51%), Gaps = 62/541 (11%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPG 120
R L+ T++EKI +A V RLG+P Y WW EALHGV+ PG +FS+
Sbjct: 48 RATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVAQ-SPGVNFSDSGDFRY 106
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
ATSFPQ IL A+F+ L + + V+STEARA N +GL +W+PNIN F+D RWGRGQ
Sbjct: 107 ATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWTPNINPFKDSRWGRGQ 166
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ETPGEDP S Y + GLQ G KV A CKH+ AYDL+ W G RY F+
Sbjct: 167 ETPGEDPYHLSSYVKSLIAGLQ----GDGKYKKVVATCKHFVAYDLETWNGNFRYQFDPH 222
Query: 241 V----IYTYYLIKFK--------------------------------------------- 251
V + YY+ F+
Sbjct: 223 VGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLLQTILREHWNWTSEEQ 282
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
++ SDCDS+ +Y YT T EEA A S+ AG D+NCG++ + A+ GLV E I
Sbjct: 283 WVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLPGALSLGLVTEKDI 342
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
D A+ +++L+RLG+FDG + Y L KDV T Q LAL AA +GI LLKN G
Sbjct: 343 DMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAAVEGITLLKND-GI 399
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCG 431
LPL+ T +AVIG AN T+ M+GNY+G P +PL A + +G Q
Sbjct: 400 LPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGANVTYSGNPGGQGD 459
Query: 432 --TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
T A AD + G D +EAE DR+ + G Q +I ++A K PV
Sbjct: 460 PTTNNWLHIWTAVDEADVILFAGGIDNGVEAEGMDRVSIAWTGAQLDVIGQLASRGK-PV 518
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+ M G D + N+ I+++LW GYPGQ GG A+ D+ G+ P+GRLP T YP S
Sbjct: 519 IVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKSAPAGRLPTTQYPAS 578
Query: 550 Y 550
Y
Sbjct: 579 Y 579
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 223/565 (39%), Positives = 301/565 (53%), Gaps = 67/565 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L++L C+TSL R LV + T E I V+ A V RLGIP+Y+WW+EALHGV+
Sbjct: 29 LSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGVAR 88
Query: 108 VGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF 163
PG +F+ G ATSFPQVI A+F+ +L++A+ ++ E RA N G AGL
Sbjct: 89 -SPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRAGLNM 147
Query: 164 WSP-NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---QTDGGSPNRLKVAACCK 219
+SP NIN FRDPRWGRGQET GEDPL S+YA V+GLQ D +P RL +AA CK
Sbjct: 148 YSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEANP-RLTLAATCK 206
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
HY AYDL+ G +RY F+A+V + +L +F+
Sbjct: 207 HYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVNGVPPS 266
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
Y+ SDCD+V +Y++ HY AAA S+ AG DL+CG
Sbjct: 267 ASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLNAGTDLDCG 326
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
+ AA+ L + + I A++ + +L+RLG+FD QP QLG KDV A
Sbjct: 327 ATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGWKDVNAPAA 385
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
Q LA +AA I LLKN +LPL TA K +A+IGP N T + GNY G TP
Sbjct: 386 QKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPLVITPF 445
Query: 411 QGLAAVV--ATIYQAGCSNVQ--CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
A I A +++ TA A A SAD V G D ++E ES DR
Sbjct: 446 DAARRTFSDAHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEGESLDRR 505
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGG-FDISFAKNDPKITSILWVGYPGQAGGA 525
D+ P Q LI E+A + K V++++ GGG D + K D + +++W GYPGQ+G
Sbjct: 506 DIAWPANQLRLIQELAALGK--VLVVVQFGGGQVDGALLKGDDGVGALVWAGYPGQSGAL 563
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ D+ G+ P+GRLP+T YP +Y
Sbjct: 564 ALMDILAGKRAPAGRLPITQYPANY 588
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 214/569 (37%), Positives = 299/569 (52%), Gaps = 98/569 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + S + R DLV R+T++EK++ ++ ++ ++ RLGIP Y WW+EALHGV+ G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA+F+ L + V+STE RA Y+ GLTFW
Sbjct: 65 --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +VKGLQ G P LK CK+ +
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQ---GNHPKYLKAGGMCKNILPF 173
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ + R+ FNA+V +Y YL FK
Sbjct: 174 TV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGSKTL 231
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + H T T E+AA ++ G DLNCG+ G +
Sbjct: 232 LSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNMFG-NLL 290
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDA 357
A+K GL+ E ID A++ T M+LG FD P Q PY + C + +++LALD
Sbjct: 291 IALKEGLITEEEIDRAVTRLMITRMKLGMFD--PEDQVPYASISSFVDCKE-HRELALDV 347
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV- 416
A++ IVLLKN G LPL I+++AVIGPNA+ + +IGNYEGT +Y T L G+ +
Sbjct: 348 AKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGIREMA 406
Query: 417 ---VATIYQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE----- 461
V Y GC N+ ++ +A A AD ++ +G D +IE E
Sbjct: 407 GDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEEMHES 466
Query: 462 ----SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
S D+ DL LPGQQQ L+ V K P++L++++G +++A D I +IL
Sbjct: 467 NIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIPAILNAW 523
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
YPG GG AIA V FG NPSG+LP+T+Y
Sbjct: 524 YPGALGGRAIASVLFGETNPSGKLPVTFY 552
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 212/562 (37%), Positives = 300/562 (53%), Gaps = 67/562 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L CN L + R +V T+ E I + +++ V RLG+P Y+WWSEALHG
Sbjct: 52 NGPLKHNAVCNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHG 111
Query: 105 VSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
++ ++F S ATSFPQ IL +FN L++ +G V+ TEARA NVG AGL
Sbjct: 112 IAR----SNFTASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLD 167
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG-----SPNRLKVAAC 217
F+SPNIN FRD RWGRGQE E P+L YA YV+GLQ GG + + L+VAA
Sbjct: 168 FYSPNINPFRDARWGRGQEVASESPVLVGNYALNYVQGLQ---GGLDSNQNDDTLQVAAT 224
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFKYIV------------------- 254
CKH+ YD+++W R +NA++ + +YL F+ V
Sbjct: 225 CKHFVGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVP 284
Query: 255 ------------------------SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
SDCD++ ++N Y + AAA +I AG+D+NCG
Sbjct: 285 ACASEFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGVDVNCG 344
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT-QA 349
+ A+ +NE+ I +++ ++ L+RLG+FD P Y + DV T QA
Sbjct: 345 DTYQNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFDS-PQTNKYRKYDWNDVSTPQA 403
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
NQ LA AA +GI LLKN G+LP + ++ +AVIGP AN T M+G+Y GTP +P
Sbjct: 404 NQ-LAYQAAVEGIALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISP 461
Query: 410 LQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
LQG + + Y G T+ A AA ADA V G D S+E E+ DR L
Sbjct: 462 LQGAQSEGFQVEYALGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESL 521
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
PG Q L+++++ + K P++++ GG D + KN+ + +I++ GYPGQ+GG AI
Sbjct: 522 AWPGNQLDLVSKLSGLKK-PLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIW 580
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
D+ G+Y P+GRL T YP SY
Sbjct: 581 DILSGKYAPAGRLTTTQYPASY 602
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 221/580 (38%), Positives = 313/580 (53%), Gaps = 68/580 (11%)
Query: 24 SLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDS 83
S+ + S++ +P F D +P L S C+T+L + R +VK +TL EK+ + S
Sbjct: 116 SVTNPSANPPLTPSFP-DCTRDP-LCSNAICDTTLSMAERAAAIVKPMTLDEKVANVGSS 173
Query: 84 AGSVSRLGIPKYEWWSEALHGVS------YVGP-GTHFSNVVPGATSFPQVILTAASFNA 136
A +RLG+P Y+W +EALHGV+ + P G +FS ATSFP IL +A+F+
Sbjct: 174 ASGSARLGLPAYQWQNEALHGVAGSTGVQFQSPLGANFS----AATSFPMPILLSAAFDD 229
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
+L Q + +STEARA N G AGL FW+PNIN FRDPRWGRG ETPGED Y
Sbjct: 230 ALVQNVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLA 289
Query: 197 YVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----- 251
+ GLQ G +P+ ++ A CKH+ AYD++N + + + + YYL F+
Sbjct: 290 LISGLQ--GGINPDFFRIIATCKHFAAYDIENGRTGNNLNPTQQDMADYYLPMFETCVRD 347
Query: 252 ----------------------------------------YIVSDCDSVDVLYNSQHYTK 271
Y+VSDCD+VD +++ HY
Sbjct: 348 AKVGSVMCAYNAVDGIPACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYAS 407
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
EAAA S+ AG DL+CGS A+V+A L +E+A++ ++ ++ L+++G+FD
Sbjct: 408 NLTEAAALSLNAGTDLDCGSSY-NVLNASVEAALTSEAALNQSLVRLYSALIKVGYFD-Q 465
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
PS+ Y L +V T NQ LA DAA G+ LLKN G+LPLS T + N+A+IGP N
Sbjct: 466 PSE--YKSLSWANVNTTQNQALAHDAATGGMTLLKND-GTLPLSRT-LSNVAIIGPWVNA 521
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVL 450
T M GNY GT PL + Y G + T+ A AA+S+D V
Sbjct: 522 TTQMQGNYAGTAPFLVNPLDVFQQKWGNVKYAQGTAINSQDTSGFSAALSAASSSDVIVY 581
Query: 451 VMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKI 510
+ G D ++E E DR ++ PG Q LI+++A + K P++++ GG D S ++P +
Sbjct: 582 LGGIDITVENEGFDRGSIVWPGNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNV 640
Query: 511 TSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
SILW GYPGQ GG A+ DV G P+GRLP+T YP SY
Sbjct: 641 RSILWAGYPGQDGGNAVFDVLTGANPPAGRLPITQYPASY 680
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 213/555 (38%), Positives = 297/555 (53%), Gaps = 98/555 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T+ EKI+ ++ + ++ RLGIP+Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWGR 178
I AA+F+A L + IG VVSTE R +N G+ GLTFW+PN+NIFRDPRWGR
Sbjct: 62 AIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L K Y++GLQ G P+ LK AAC KH+ + + R+ F+
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQ---GDDPDHLKSAACAKHFAVH---SGPEAIRHEFD 175
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A +Y YL FK +
Sbjct: 176 AKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGH 235
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS-FLGKHTEAAVKAGLVNESAI 311
+VSDC ++ + H T T EE+AA ++ G DLNCGS FL H + A GLV++ AI
Sbjct: 236 VVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDKGLVSDEAI 293
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A+ +RLG +PS PY + + V + + +L+++AAR+ +VLLKN
Sbjct: 294 TAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNF 351
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC-- 425
LPL +K +AVIGPNAN +IGNY GT +Y TPL+GL + +Y GC
Sbjct: 352 LPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRVLYAEGCHL 411
Query: 426 --SNVQCGTAQVDDAKKA---AASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
VQ + D K+A A +D V+ +G D +IE E S D+L L+LP
Sbjct: 412 YKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLP 471
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q+ L+ VA V K PVIL++ +G D+S+A+ + +I+ YPG GG A+A+
Sbjct: 472 GLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAI 528
Query: 532 FGRYNPSGRLPMTWY 546
FG Y+PSG+LP+T+Y
Sbjct: 529 FGEYSPSGKLPVTFY 543
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 211/565 (37%), Positives = 300/565 (53%), Gaps = 71/565 (12%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
C+ ++ R LV + T +E + ++ A V RLGIP Y+WW+EALHGV+ PG
Sbjct: 36 ICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVA-KSPGV 94
Query: 113 HFSNVVP-----GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSP- 166
+F P AT FPQ I A+F+ L+Q I V+++E RA N G AGL +SP
Sbjct: 95 NFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPL 154
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---QTDGGSPNRLKVAACCKHYTA 223
NIN FRDPRWGRGQET GEDPL S++A V GLQ + N+L VAA CKH+ A
Sbjct: 155 NINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPHAQNEAEGNKLTVAATCKHFLA 214
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
YDL+ + +RY F+A+V + ++L +F+
Sbjct: 215 YDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTSYNAVNNVPPSASKY 274
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y+ SDCD+V +Y+ Y + EAAAKSI AG DL+CG+
Sbjct: 275 YLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVEAAAKSINAGTDLDCGATYS 334
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
++ AA+K L + + I A+ +A+L+RLG+FD P+ QP QL KDV + ++Q LA
Sbjct: 335 ENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFDD-PASQPLRQLTWKDVNSPSSQRLA 393
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
+A I LLKN +LP+ K +A+IGP NV+ + GNY G P + + A
Sbjct: 394 YTSALSSITLLKNLDSTLPIKQKPTK-IAIIGPYTNVSTSFSGNYAG-PAAFNMTMVHAA 451
Query: 415 AVV---ATIYQAGCSNVQCGTAQVD--DAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
+ V A I +++ D DA K + AD+ V G D SIE ESHDR D+
Sbjct: 452 SQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDASIERESHDRKDIA 511
Query: 470 LPGQQQLLITEVA----KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
P Q LI E++ K K ++++ GG D + K+D + +++W GYPGQ+
Sbjct: 512 WPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGALVWAGYPGQSASL 571
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ D+ G+ P+GRLP+T YP SY
Sbjct: 572 AVWDILAGKAVPAGRLPVTQYPASY 596
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 216/564 (38%), Positives = 301/564 (53%), Gaps = 73/564 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L++ C+TSL R LV T QE I +++A V RLGIP Y+WW+EALHGV+
Sbjct: 30 LSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHGVA- 88
Query: 108 VGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF 163
PG +F++ V AT+FPQ+I A+F+ +L++ + ++ E RA N G AGL
Sbjct: 89 GSPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAGKAGLNM 148
Query: 164 WSP-NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
+SP NIN FRDPRWGRGQET GEDPL S+YA V+GLQ G + + L++AA CKHY
Sbjct: 149 YSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQ---GPNQDELRLAATCKHYL 205
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
AYDL+ W G +RY F+A V + +YL +F+
Sbjct: 206 AYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAVNNVPPSASR 265
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
Y+ SDCD+V +++ HY + +AAA SI AG DLNCG+
Sbjct: 266 YYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSINAGTDLNCGATY 325
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQD 352
+ A++ L + I A++ +A+ +RLG FD P + QP +LG + V T+A QD
Sbjct: 326 SDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFD--PKQGQPLRELGWEHVNTKAAQD 383
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA +A + LLKN G+LP+ +AVIGP +N T + GNY G P + +
Sbjct: 384 LAYSSAAASVTLLKNN-GTLPVD--GATKVAVIGPYSNATFALRGNYAG-PGPFAITMTE 439
Query: 413 LAAVV---ATIYQAGCSNVQCGTAQVDDAKKA---AASADATVLVMGADQSIEAESHDRL 466
A V ATI A + + GT DA+ A A AD + G D +IE+E DR
Sbjct: 440 AAQRVFSQATISSANGTTI-SGTYNHTDAEAAMQLAKEADLVIFAGGIDPTIESEELDRA 498
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ P Q LI + +AK + ++ GG D + K D I ++LW GYPGQ+G A
Sbjct: 499 TIAWPPNQLQLIHALGGMAK-KMAVVQFGGGQIDGASIKADGNIGALLWAGYPGQSGALA 557
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ DV G P+GRLP+T YP Y
Sbjct: 558 VMDVIAGNTAPAGRLPITQYPAEY 581
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 235/598 (39%), Positives = 307/598 (51%), Gaps = 87/598 (14%)
Query: 8 NRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDL 67
N AP ++ L F C Q+SP+ S+P+ CNTSL I RV L
Sbjct: 139 NPAPTITPLNKFTPVC---------QTSPL-----CSSPA------CNTSLPIADRVRWL 178
Query: 68 VKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV------SYVGP-GTHFSNVVPG 120
V +LTLQEKIT LVD A +R+G+P YEWWSEALHGV ++ GP GT FS
Sbjct: 179 VGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVAASPGVTFAGPNGTAFSY---- 234
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
ATSFP I +A+F+ L I VV E RA N GL+G FW+PNIN FRDPRWGRG
Sbjct: 235 ATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSGFDFWTPNINPFRDPRWGRGP 294
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ETPGED +Y + GLQ +D P ++ A CKHY YD++ + Y
Sbjct: 295 ETPGEDAFRIQQYIRHLIPGLQGSD---PLDKQIIATCKHYAVYDVETGRYEYDYDPQPH 351
Query: 241 VIYTYYLIKFK---------------------------------------------YIVS 255
+ YYL FK Y+VS
Sbjct: 352 DLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASEYLLQSVLRDHWGFTEPYQYVVS 411
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAI 315
DCD+V +Y+ ++T +P AAA ++ AG DL CGS + ++ + + E+A+D A+
Sbjct: 412 DCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTY-LNLNQSLASNMTTEAALDRAL 470
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+ + L +GFFDG YG LG V T Q LA AA G VLLKN LPL
Sbjct: 471 TRLYTALHTIGFFDGSAR---YGGLGWDAVGTGDAQVLAYQAAVDGAVLLKNEKSLLPLD 527
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIYQAGCSNVQCGT 432
++ LAVIGP AN T M GNY G +PL + ++ G T
Sbjct: 528 SKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAAFQSAWGADNVLFANGTGIAGNST 587
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A A AA +ADA V + G D S+E+ES DR + PG Q LI ++A V K P++++
Sbjct: 588 AGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISWPGNQLDLIAQLAAVGK-PLVVV 646
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG D S +P++ ++LW GYPGQAGGAAIAD+ G+ P+GRLP+T Y SY
Sbjct: 647 QCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADLLTGKQAPAGRLPVTQYAASY 704
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 223/580 (38%), Positives = 312/580 (53%), Gaps = 68/580 (11%)
Query: 24 SLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDS 83
S+ + S + +P F D +P L S C+T+ I R +VK +TL EK+ + S
Sbjct: 47 SISNPSPNPPLTPSFP-DCTRDP-LCSNDVCDTTKSIAERAAAIVKPMTLNEKVANVGSS 104
Query: 84 AGSVSRLGIPKYEWWSEALHGVS------YVGP-GTHFSNVVPGATSFPQVILTAASFNA 136
A +RLG+P Y+W +EALHGV+ + P G +FS ATSFP IL +A+F+
Sbjct: 105 ASGSARLGLPAYQWQNEALHGVAGSTGVQFQSPLGANFS----AATSFPMPILLSAAFDD 160
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
+L +++ +STEARA N G AGL FW+PNIN FRDPRWGRG ETPGED Y
Sbjct: 161 ALVKSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLA 220
Query: 197 YVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----- 251
V GLQ G P+ + + CKH+ AYD++N + + + YYL F+
Sbjct: 221 LVDGLQ--GGIDPDFYRTLSTCKHFAAYDIENGRTANNLSPTQQDMADYYLPMFETCVRD 278
Query: 252 ----------------------------------------YIVSDCDSVDVLYNSQHYTK 271
Y+VSDCD+V+ +++ HY
Sbjct: 279 AKVASIMCAYNAVDGVPACADSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAA 338
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
+AAA SI AG DL+CGS A+V+AGL E+ +D ++ ++ L+++G+FD
Sbjct: 339 NLTQAAAMSINAGTDLDCGSSY-NVLNASVQAGLTTEATLDKSLIRLYSALVKVGYFD-Q 396
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
P++ Y LG +V T +Q LA DAA +G+ LLKN G+LPLS T + N+AVIGP ANV
Sbjct: 397 PAE--YNSLGWGNVNTTQSQALAHDAATEGMTLLKND-GTLPLSRT-LSNVAVIGPWANV 452
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVL 450
T M GNY GT PL + Y G + T+ + A AA+S+D V
Sbjct: 453 TTQMQGNYAGTAPLLVNPLSVFQQKWRNVKYAQGTAINSQDTSGFNAALSAASSSDVIVY 512
Query: 451 VMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKI 510
+ G D S+E E DR + PG Q LI+++A + K P++++ GG D S ++ K+
Sbjct: 513 LGGIDISVENEGFDRSSITWPGNQLNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKV 571
Query: 511 TSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
SILW GYPGQ GG AI DV G P+GRLP+T YP +Y
Sbjct: 572 NSILWAGYPGQDGGNAIFDVLTGANPPAGRLPVTQYPANY 611
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 207/538 (38%), Positives = 282/538 (52%), Gaps = 60/538 (11%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV---VP 119
RV L+ LTL+EKI LVD++ RLG+P YEWW+EA HGV PG F+
Sbjct: 25 RVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVG-SAPGVQFTEKPVNFS 83
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
ATSFP ILTAASF+ +L + I V+ E RA N G +G FW+PNIN FRDPRWGRG
Sbjct: 84 YATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPFRDPRWGRG 143
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
QETPGED + Y ++ GLQ G P +V A CKHY AYDL+ + + Y+
Sbjct: 144 QETPGEDSFVVQSYIRNFIPGLQ---GDDPEDKQVIATCKHYAAYDLETGRYGNDYNPTQ 200
Query: 240 MVIYTYYLIKFK---------------------------------------------YIV 254
+ Y+L FK Y+V
Sbjct: 201 QDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHWNFTADYNYVV 260
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDC +V ++ ++T T E AA+ S+ AG+DL CGS K E+ + A A+D A
Sbjct: 261 SDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLECGSSYLKLNES-LAANQTTVQALDQA 319
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
++ ++ L +GFFDG Y LG DV T Q LA +AA +G+ LLKN LP+
Sbjct: 320 LTRLYSALFTVGFFDG----GKYTALGFADVSTPEAQSLAYEAAVEGMTLLKNDKRLLPI 375
Query: 375 -SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGT 432
S K++A+IGP AN T M G+Y G P +PL+ + Y G T
Sbjct: 376 RSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAMGTGINNQTT 435
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A AA +D + + G D SIEAE+ DR L PG Q L+T+++K+ K P+I++
Sbjct: 436 TGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLSKLHK-PLIVV 494
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG D S + + +++W GYP Q+GG+A+ DV G+ + +GRLP+T YP SY
Sbjct: 495 QFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLPVTQYPASY 552
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 212/555 (38%), Positives = 296/555 (53%), Gaps = 98/555 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T+ EKI+ ++ + ++ RLGIP+Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + IG VVSTE R +N G+ GLTFW+PN+NIFRDPRWGR
Sbjct: 62 AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L K Y++GLQ G P+ LK AAC KH+ + + R+ F+
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQ---GDDPDHLKSAACAKHFAVH---SGPEAIRHEFD 175
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A +Y YL FK +
Sbjct: 176 AKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGH 235
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS-FLGKHTEAAVKAGLVNESAI 311
+VSDC ++ + H T T EE+AA ++ G DLNCGS FL H + A GLV++ AI
Sbjct: 236 VVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDKGLVSDEAI 293
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A+ +RLG +PS PY + + V + + +L+++AAR+ +VLLKN
Sbjct: 294 TAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNF 351
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC-- 425
LPL +K +AVIGPNAN +IGNY GT +Y TPL+GL + +Y GC
Sbjct: 352 LPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHL 411
Query: 426 --SNVQCGTAQVDDAKKA---AASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
VQ + D K+A A +D V+ +G D +IE E S D+L L+LP
Sbjct: 412 YKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLP 471
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q+ L+ VA V K PVIL++ +G D+S+A+ + +I+ YPG GG A+A+
Sbjct: 472 GLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAI 528
Query: 532 FGRYNPSGRLPMTWY 546
FG Y+PSG+LP+T+Y
Sbjct: 529 FGEYSPSGKLPVTFY 543
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 337 bits (865), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 225/563 (39%), Positives = 304/563 (53%), Gaps = 61/563 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P LA+ C+TS R L+ TL EK+ +++ V R+G+P YEWW+E
Sbjct: 22 DCVNGP-LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNE 80
Query: 101 ALHGVSYVGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
ALHG++ PGT F+ +TSFPQ IL A+F+ L + VSTEARA NV
Sbjct: 81 ALHGIAR-SPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVN 139
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL FW+PNIN ++DPRWGRGQETPGEDP S Y + GLQ P + K A
Sbjct: 140 RFGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYK-KGVAT 198
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+ YDL++ G RY F+A++ + YYL F+
Sbjct: 199 CKHFAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVP 258
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
++ SDCD+V +++ +YT TPE++AA ++ AG DL+
Sbjct: 259 TCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLD 318
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+F + +A GL + S +D +++ +A+L+RLG+FD PS QPY QL +V T
Sbjct: 319 CGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTP 377
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A Q LAL AA GIVLLKN G LPLS + I N+A+IGP AN TK M GNY GT +
Sbjct: 378 AAQQLALQAAEDGIVLLKND-GILPLS-SNITNVALIGPLANATKQMQGNYYGTAPYLRS 435
Query: 409 PL-QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL A Y G T A AA SAD + V G D SIEAE DR
Sbjct: 436 PLIAAQNAGFKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTS 495
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ P Q LI ++A ++ P+I+ M G D S ++ + ++LW GYPGQ GG AI
Sbjct: 496 ISWPSSQLSLINQLANLST-PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAI 553
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
++ G+ P+GRLP+T YP +Y
Sbjct: 554 FNILTGKTAPAGRLPITQYPSNY 576
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 337 bits (865), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 210/555 (37%), Positives = 296/555 (53%), Gaps = 98/555 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T+ EKI+ ++ + ++ RLGIP+Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVFPQ 61
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + IG VVSTE R +N G+ GLTFW+PN+NIFRDPRWGR
Sbjct: 62 AIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRWGR 121
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L K Y++GLQ G P+ LK AAC KH+ + + R+ F+
Sbjct: 122 GHETYGEDPYLTGKLGCAYIRGLQ---GDDPDHLKSAACAKHFAVH---SGPEAIRHEFD 175
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A +Y YL FK +
Sbjct: 176 AKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILRDEFGFEGH 235
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS-FLGKHTEAAVKAGLVNESAI 311
+VSDC ++ + H T T EE+AA ++ G DLNCGS FL H + A G+V++ AI
Sbjct: 236 VVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDKGMVSDEAI 293
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A+ +RLG +PS PY + + V + + +L+++AAR+ +VLLKN
Sbjct: 294 TAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLVLLKNKDNF 351
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC-- 425
LPL +K +AVIGPNAN +IGNY GT +Y TPL+GL + +Y GC
Sbjct: 352 LPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVLYAEGCHL 411
Query: 426 --SNVQCGTAQVDDAKKA---AASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
VQ + D K+A A +D V+ +G D +IE E S D+L L+LP
Sbjct: 412 YKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGDKLGLMLP 471
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q+ L+ VA V K PVIL++ +G D+S+A+ + +I+ YPG GG A+A+
Sbjct: 472 GLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE--HVDAIIDSWYPGARGGKAVAEAI 528
Query: 532 FGRYNPSGRLPMTWY 546
FG Y+P+G+LP+T+Y
Sbjct: 529 FGEYSPNGKLPVTFY 543
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 337 bits (865), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 221/588 (37%), Positives = 312/588 (53%), Gaps = 84/588 (14%)
Query: 24 SLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDS 83
S+ + S + +P F D +P L S C+T+ I R +VK +TL EK+ + S
Sbjct: 111 SITNPSPNPPLTPSFP-DCTRDP-LCSNDVCDTTKSIAARAAAIVKPMTLNEKVANVGSS 168
Query: 84 AGSVSRLGIPKYEWWSEALHGVS------YVGP-GTHFSNVVPGATSFPQVILTAASFNA 136
A RLG+P Y+W +EALHGV+ + P G +FS ATSFP IL +A+F+
Sbjct: 169 ASGSGRLGLPAYQWQNEALHGVAGSTGVQFQSPLGANFS----AATSFPMPILLSAAFDD 224
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATG 196
+L Q++ +STEARA N G AGL FW+PNIN FRDPRWGRG ETPGED Y
Sbjct: 225 ALVQSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLS 284
Query: 197 YVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK----- 251
+ GLQ G P+ + + CKH+ AYD++N + + + YYL F+
Sbjct: 285 LINGLQ--GGIDPDFFRTISTCKHFAAYDIENGRTANNLSPTQQDMADYYLPMFETCVRD 342
Query: 252 ----------------------------------------YIVSDCDSVDVLYNSQHYTK 271
Y+VSDCD+V+ +Y+ HY
Sbjct: 343 AKVGSIMCAYNSVNGVPACADSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAA 402
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
+AAA S+ AG DL+CGS A+V+AG+ E+ +D ++ ++ L+++G+FD
Sbjct: 403 NLTQAAAMSLNAGTDLDCGSSY-NVLNASVQAGMTTEATLDKSLIRLYSALIKVGWFD-Q 460
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
P+K Y LG +V T + LA DAA G+ LLKN G+LPLSPT ++N+AVIGP N
Sbjct: 461 PAK--YSSLGWGNVNTTQTRALAHDAATGGMTLLKND-GTLPLSPT-LQNVAVIGPWVNA 516
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCG---------TAQVDDAKKAA 442
T + GNY GT PL T++Q NV+ T+ + A AA
Sbjct: 517 TTQLQGNYAGTAPVLVNPL--------TVFQQKWRNVKYAQGTAINSQDTSGFNAAISAA 568
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
+S+D V + G D S+E E DR + PG Q LI+++A + K P++++ GG D S
Sbjct: 569 SSSDVIVYLGGIDISVENEGFDRTAITWPGNQLSLISQLANLGK-PLVIVQFGGGQIDDS 627
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++ K+ SILW GYPGQ GG A+ DV G P+GRLP+T YP +Y
Sbjct: 628 SLLSNSKVNSILWAGYPGQEGGNALFDVLTGANPPAGRLPITQYPANY 675
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 337 bits (864), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 169/274 (61%), Positives = 207/274 (75%), Gaps = 1/274 (0%)
Query: 278 AKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
A +I AGLDL+CG FL HTEAAV+ GLV++ I+ A++N MRLG FDG PS Y
Sbjct: 1 ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG 397
G LGP+DVCT A+Q LAL+AARQGIVLL+N SLPLS + +AVIGPN++VT TMIG
Sbjct: 61 GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120
Query: 398 NYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADATVLVMGADQ 456
NY G C YTTPLQG+ TI+QAGC++V C Q+ A+ AA ADATVLVMG DQ
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
SIEAE DR LLLPG QQ L++ VA+ ++GP IL++MSGG D++FAKNDP+I++I+WV
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GYPGQAGG AIA+V FG NP G+LPMTWYPQ+Y
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNY 274
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 211/561 (37%), Positives = 291/561 (51%), Gaps = 91/561 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL + R DLV R+TL EKI+ ++ A ++ RL IP Y WW+E LHGV+ G
Sbjct: 13 YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA+FN L + + +S EARA ++ + GLTFW
Sbjct: 71 --------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYKGLTFW 122
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRG ET GEDP L S+ +VKGLQ G P LKV A KHY +
Sbjct: 123 SPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQ---GDDPKYLKVVATPKHYAVH 179
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ + R+ F+A V + YL F+
Sbjct: 180 ---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPCCASKTL 236
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+VSDC ++D ++ TKT E+AA ++ G +LNCG ++
Sbjct: 237 LKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCGKTY-EYLC 295
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC-TQANQDLALDA 357
AV+ GL++E ID A+ F MRLG FD P Y + P DV + +++LAL+
Sbjct: 296 QAVEQGLISEETIDQAVIKLFTARMRLGMFDP-PEMVRYAHI-PYDVNDSPEHRELALET 353
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
ARQ IVLLKN LPLS +K +AVIGPNA+ ++ NY GTP KY TPL+G+ V
Sbjct: 354 ARQSIVLLKNDENILPLS-KKLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLEGIKNKV 412
Query: 418 A----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD--------- 464
+ +Y GC D+A A AD ++ +G IE E D
Sbjct: 413 SPDTKVLYAKGCEVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGDVADSDGGGD 472
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
RL + LPG Q+ L+ + K P++L++++G I++A + +I+ YPG+ GG
Sbjct: 473 RLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHE--HVPAIIEAWYPGEEGG 529
Query: 525 AAIADVCFGRYNPSGRLPMTW 545
AIADV FG YNP+GRLP+T+
Sbjct: 530 TAIADVLFGDYNPAGRLPITF 550
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 220/565 (38%), Positives = 303/565 (53%), Gaps = 65/565 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V P L++ CNT+ + R LV LT +EK +++ V RLG+ Y+WW E
Sbjct: 23 DCVHGP-LSNNTVCNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQE 81
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG +FS ATSFPQ IL +A+F+ +L A+ VVSTEARA NV
Sbjct: 82 ALHGVA-SSPGVNFSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNR 140
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL FW+PNIN ++DPRWGRGQETPGED Y + GLQ G +P KV A C
Sbjct: 141 SGLDFWTPNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQ--GGLNPPIKKVIATC 198
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ AYDL++W TDRY+F+A+V + YY+ F+
Sbjct: 199 KHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPT 258
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
Y+ SDCD++ +Y +Y T E+A A ++ AG DLNC
Sbjct: 259 CADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDLNC 318
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G++ H AA GL N++ ID I+ ++ L++LG+FD PS PY L DV T A
Sbjct: 319 GTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFD-PPSATPYRSLNWSDVSTPA 377
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYT 407
+ LAL AA +GIVLLKN G LPLS KN +A+IG AN T TM GNY G
Sbjct: 378 AEALALKAAEEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLH 436
Query: 408 TPLQGLAAV--VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
+PL L + + +Y G T D+ AA AD ++ G S E+ES+DR
Sbjct: 437 SPLYALQQLPNINAVYGGGFG--VPTTDGWDELLGAAGEADLIIIADGLTTSDESESNDR 494
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+ +I +++ + K P + + M G D + N+P I++++W GYPG AGG
Sbjct: 495 YTIGWQPAAIDIINQLSGMGK-PTVFLQM-GDQLDNTPLLNNPNISALIWGGYPGMAGGD 552
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ ++ G+ P+GRLP+T YP Y
Sbjct: 553 ALINILTGKAAPAGRLPVTQYPADY 577
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 174/387 (44%), Positives = 231/387 (59%), Gaps = 49/387 (12%)
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------- 251
L+ +ACCKH+TAYDL+NWKG R+ F+A V + Y FK
Sbjct: 5 LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYN 64
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
YI SDCD+V ++++ Q Y K PE+A A + AG+
Sbjct: 65 RVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAGM 124
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D+NCG ++ H +A + G + ID A+ N FA MRLG F+G+P YG +G V
Sbjct: 125 DVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQV 184
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
C + +QDLAL AA+ GIVLLKN AG+LPLS + + ++AVIGPN N ++GNY G PC
Sbjct: 185 CKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPCI 244
Query: 406 YTTPLQGLAAVV--ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TP Q L V AT Q GC+ C + + +A AA+SAD VL MG DQ+ E E
Sbjct: 245 SVTPFQALQGYVKDATFVQ-GCNAAVCNVSNIGEAVHAASSADYVVLFMGLDQNQEREEV 303
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DRL+L LPG Q+ L+ +VA AK PVIL+++ GG D++FAKN+PKI +I+W GYPGQAG
Sbjct: 304 DRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAG 363
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G AIA V FG +NP GRLP+TWYP+ +
Sbjct: 364 GIAIAQVLFGEHNPGGRLPVTWYPKEF 390
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 213/564 (37%), Positives = 296/564 (52%), Gaps = 64/564 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P L++ C+ S R LV T+ EK+ +++ V RLG+P Y+WW E
Sbjct: 32 DCVNGP-LSNNTVCDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQE 90
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG F+ ATSFPQ IL A+F+ +L Q++ +VVSTEARA N G
Sbjct: 91 ALHGVA-SSPGVTFNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGR 149
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
AGL FW+PNIN +RDPRWGRGQETPGEDP S Y + GLQ G P K+ A C
Sbjct: 150 AGLDFWTPNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQ--GGEDPEIRKITATC 207
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ YD+++W G RY + + + YYL F+
Sbjct: 208 KHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPT 267
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
++ SDCDS+ ++ +++ T + AAA ++ AG DL+C
Sbjct: 268 CADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDLDC 327
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G++ H A GL+N++ +D A+ + +L+R G+FDG P+ Y L DV T
Sbjct: 328 GTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDG-PNAM-YRNLTWSDVGTTH 385
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
Q LAL AA +G+VLLKN G LPLS + +A+IG AN T M GNY G P +P
Sbjct: 386 AQQLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSP 444
Query: 410 LQGLAAVVATIYQAGCSNVQCGTAQVDD---AKKAAASADATVLVMGADQSIEAESHDRL 466
L A ++ A Q G D AA AD + + G D S+EAE DR
Sbjct: 445 LYAAQQTGAQVFYAQGPGGQ-GDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDRE 503
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
D+ G Q +I E+A K P++L M G D + N+ I++++W GYPGQ GG A
Sbjct: 504 DINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVA 561
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ ++ G+ P+GRLP+T YP Y
Sbjct: 562 LFNIITGKTAPAGRLPVTQYPAHY 585
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 334 bits (857), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 214/562 (38%), Positives = 295/562 (52%), Gaps = 61/562 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D +P L S C+ SL R LV LT+ EK+ LV+ A + RL +P YEWWSE
Sbjct: 158 DCTRDP-LCSNDVCDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSE 216
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
LHGV+ PGT F++ ATSFPQ IL ++F+ L +A+G+VVS EARA N G
Sbjct: 217 GLHGVAR-SPGTKFTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGR 275
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+GL +SPNIN F+DPRWGRGQETPGED KY + + GL+ G P++ K+ A C
Sbjct: 276 SGLDLYSPNINAFKDPRWGRGQETPGEDTFHLQKYVSAMLSGLE---GDDPDK-KLIATC 331
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KHY A D +N+KG DR FNA++ + YYL FK
Sbjct: 332 KHYAANDFENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPL 391
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
Y+ +DCD V ++ + HY AAA S+ AG DL C
Sbjct: 392 CANSYLIEDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLEC 451
Query: 290 GSFLGKHT-EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
+F G ++A L++E +D A++ + +L+ +G FD K P LG +V T+
Sbjct: 452 NAFPGSEALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFD-LDRKDPLRSLGWDEVNTK 510
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
QDLA AA +G VL+KN G LPLSP + K A+IGP + T M GNY G +
Sbjct: 511 EAQDLAYRAAVEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLIS 569
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
P + + S + A KAA +AD + + G D ++E E+ DR L
Sbjct: 570 PRKAAKDLGLDFTYFLGSRTNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTL 629
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
P Q L+ +++V K P++++ GG D + + + +ILW GYPGQ+GG AI
Sbjct: 630 AWPEPQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAIL 688
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
D+ FGR P+GRL +T YP SY
Sbjct: 689 DIVFGRAAPAGRLSVTQYPASY 710
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 213/563 (37%), Positives = 296/563 (52%), Gaps = 65/563 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D VS P L C+TS R L+ TL+E I +S V RLG+P YE WSE
Sbjct: 52 DCVSGP-LKDNIVCDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSE 110
Query: 101 ALHGVS---YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
LHG+ +V G ++ ATSFP IL+ A+ N +L I +++T+ARA NVG
Sbjct: 111 GLHGLDRAHFVKSGDEWT----WATSFPMPILSMAALNRTLINQIASIIATQARAFNNVG 166
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDP-LLASKYATGYVKGLQQTDGGSPNRLKVAA 216
GL ++PNIN FR P WGRGQETPGED L S YA Y+ GLQ G P+ LK+AA
Sbjct: 167 RYGLDAYAPNINGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQ--GGIDPDNLKIAA 224
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF---------------------- 250
KH+ YDL+NW G R F+A + + YY +F
Sbjct: 225 TAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAI 284
Query: 251 ----------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
Y+ SDCD+V ++N Y AAA+S+ AG D++
Sbjct: 285 PSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDID 344
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG H + G V I+ +I ++ L++LG+FDG K Y QLG DV T
Sbjct: 345 CGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGD--KNEYRQLGWNDVVTT 402
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+++ +AA +GIVLLKN G LPLS +K++A++GP AN TK + GNY GT T
Sbjct: 403 DAWNISYEAAVEGIVLLKND-GVLPLSKN-VKSVALVGPWANATKQLQGNYFGTAPYLIT 460
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PLQG + + A +N+ T +A AA +D V + G D +IEAE DR++
Sbjct: 461 PLQGASDAGYKVNYALGTNISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMN 520
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ P Q LI ++++ K P++++ M GG D S K++ K+ +++W GYPGQ+GG AI
Sbjct: 521 VTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAI 579
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
D+ G+ P+GRL T YP Y
Sbjct: 580 FDILKGKRAPAGRLVSTQYPAEY 602
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 211/556 (37%), Positives = 291/556 (52%), Gaps = 98/556 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV+++TL+EKI+ + + ++ RL IP Y WW+EALHGV+ G AT FPQ
Sbjct: 12 LVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV----------ATMFPQ 61
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + IG VVSTE RA + G+ GLTFW+PNINIFRDPRWGR
Sbjct: 62 AIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNINIFRDPRWGR 121
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L +K Y++G+Q G P+ LK AAC KH+ + + R+ F+
Sbjct: 122 GHETYGEDPCLTAKLGCAYIRGIQ---GKDPDHLKAAACAKHFAVH---SGPEALRHEFD 175
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A V +Y YL FK +
Sbjct: 176 AKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSKTLLQDILREQFGFEGH 235
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAI 311
+VSDC ++ + H TKT EE+AA ++ G DLNCG +FL + A + GLV E I
Sbjct: 236 VVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAFL--YLSRACEQGLVEEKTI 293
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ-DLALDAARQGIVLLKNTAG 370
A+ +RLG + +PS PY + P DV L+L+A+++ +VLLKN
Sbjct: 294 TEAVERLMDVRIRLGMMEDYPS--PYANI-PYDVVECPEHIALSLEASKRSMVLLKNDNH 350
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC- 425
LPL + +AVIGPNAN ++GNYEGT +Y TPL+G+ +Y GC
Sbjct: 351 FLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYTGEKTRVLYAQGCH 410
Query: 426 ---SNVQCGTAQVDDAKK---AAASADATVLVMGADQSIEAE---------SHDRLDLLL 470
V+ D K+ AA AD V+ +G D IE E S D+L L L
Sbjct: 411 LYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAGNEYASGDKLGLKL 470
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ L+ VA V K P++L +++G D+S+A+ +I +IL YPG GG AIA+
Sbjct: 471 PGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQIRAILDCWYPGARGGKAIAEA 529
Query: 531 CFGRYNPSGRLPMTWY 546
FG ++P G+LP+T+Y
Sbjct: 530 LFGEFSPCGKLPVTFY 545
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 332 bits (850), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 208/549 (37%), Positives = 292/549 (53%), Gaps = 63/549 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
CNTS R LV TL+E I ++A V RLG+P+Y+ W+EALHG+ +
Sbjct: 64 CNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDRA----N 119
Query: 114 FSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
FS+ ATSFP IL+ ASFN +L I +++T+ARA N G GL ++PNIN F
Sbjct: 120 FSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGF 179
Query: 172 RDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
R P WGRGQETPGED L+S YA Y+ GLQ G P +K+ A KH+ YDL+NW
Sbjct: 180 RSPLWGRGQETPGEDAFFLSSAYAYEYITGLQ--GGVDPEHVKIVATAKHFAGYDLENWG 237
Query: 231 GTDRYHFNAMV----IYTYYLIKF------------------------------------ 250
R NA++ + YY +F
Sbjct: 238 NVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPSCSNSFFLQTLLR 297
Query: 251 --------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVK 302
Y+ SDCD+V ++N Y AAA S+LAG D++CG + H +
Sbjct: 298 ESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADSLLAGTDIDCGQTMPWHLNESFY 357
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGI 362
V+ I+ +++ +A L+RLG+FDG+ S Y L DV T +++ +AA +GI
Sbjct: 358 ERYVSRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNLNWNDVVTTDAWNISYEAAVEGI 415
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQ 422
LLKN G+LPLS ++++A+IGP AN T M GNY GTP +PL+ A T+
Sbjct: 416 TLLKND-GTLPLS-KKVRSIALIGPWANATVQMQGNYYGTPPYLISPLEAAKASGFTVNY 473
Query: 423 AGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
A +N+ + Q +A AA +D + G D +IEAE DR DL PG Q LI ++
Sbjct: 474 AFGTNISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDLKWPGNQLDLIEQL 533
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
+KV K P++++ M GG D S K + + +++W GYPGQ+GGAA+ D+ G+ P+GRL
Sbjct: 534 SKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALFDILTGKRAPAGRL 592
Query: 542 PMTWYPQSY 550
T YP Y
Sbjct: 593 VSTQYPAEY 601
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 207/547 (37%), Positives = 289/547 (52%), Gaps = 58/547 (10%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
C+TS R L+ TL+E I ++A V RLG+P Y+ WSEALHG+ T
Sbjct: 64 CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFATS 123
Query: 114 FSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRD 173
+ ATSFP IL+ A+ N +L I ++ T+ARA N G GL ++PNIN FR
Sbjct: 124 -GDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNINGFRS 182
Query: 174 PRWGRGQETPGEDP-LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
P WGRGQETPGED L+S YA Y+ GLQ G P+ LKV A KH+ YDL+NW G
Sbjct: 183 PLWGRGQETPGEDANFLSSSYAYEYITGLQ--GGVDPDHLKVVATAKHFAGYDLENWGGN 240
Query: 233 DRYHFNAMV----IYTYYLIKF-------------------------------------- 250
R F+A + + YY +F
Sbjct: 241 SRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLLRDN 300
Query: 251 ------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAG 304
Y+ SDCD+V ++N Y AAA S+ AG D++CG + + G
Sbjct: 301 WDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDIDCGQTYPWNLNQSFIEG 360
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
V I+ +I ++ L++LG+FDG S+ Y QLG DV T +++ +AA +GIVL
Sbjct: 361 SVTRGEIERSIVRLYSNLVKLGYFDGDKSE--YRQLGWNDVVTTDAWNISYEAAVEGIVL 418
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAG 424
LKN G LPLS +K++A+IGP AN T+ + GNY GT TPLQG + + A
Sbjct: 419 LKND-GILPLS-KHVKSIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAGYKVNYAL 476
Query: 425 CSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
+N+ T + DA AA +D V + G D +IEAE DR+++ PG Q LI ++++
Sbjct: 477 GTNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQLSQ 536
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P++++ M GG D S K + K+ +++W GYPGQ+GG AI D+ G+ P+GRL
Sbjct: 537 TGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGKRVPAGRLVT 595
Query: 544 TWYPQSY 550
T YP Y
Sbjct: 596 TQYPAEY 602
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 214/554 (38%), Positives = 292/554 (52%), Gaps = 62/554 (11%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L S C+TSL I RV LVK LTL+EKI LVD+A +RLG+P YEWW+EA HGV
Sbjct: 157 LCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGVG- 215
Query: 108 VGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PG F++ ATSFP IL AASF+ +L + I +V+ E RA N G +G FW
Sbjct: 216 SAPGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFW 275
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN FRDPRWGRGQETPGED +A Y ++ GLQ G P +V A CKHY Y
Sbjct: 276 APNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQ---GDDPKNKQVIATCKHYAVY 332
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
DL+ + + Y+ + Y+L FK
Sbjct: 333 DLETGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANEYLLDEV 392
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
Y+VSDC++V ++ ++T T E AAA ++ AG+DL CGS K E
Sbjct: 393 LRKHWGFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAGVDLECGSSYLKLNE- 451
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
++ A + A+D +++ ++ L +GFFDG Y L DV A Q LA +AA
Sbjct: 452 SLAANQTSVKAMDQSLARLYSALFTIGFFDG----GKYDHLDFSDVSIPAAQALAYEAAV 507
Query: 360 QGIVLLKNTAGSLPL-SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA--V 416
+G+ LLKN G LPL S K++AVIGP AN T M G Y G +PL +
Sbjct: 508 EGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLISPLVAFESDHR 566
Query: 417 VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
Y G + T + + AA +D V + G D SIE+E+ DR L PG Q
Sbjct: 567 WKVNYAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDRTSLAWPGNQLD 626
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI ++ ++K P++++ GG D S + I +++W GYP Q+GG A+ D+ G+ +
Sbjct: 627 LIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGTALLDILVGKRS 685
Query: 537 PSGRLPMTWYPQSY 550
P+GRLP+T YP SY
Sbjct: 686 PAGRLPVTQYPASY 699
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 162/304 (53%), Positives = 209/304 (68%), Gaps = 3/304 (0%)
Query: 250 FKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNES 309
+ YI SDCD+V +++++Q Y KT E+A A + AG+D+NCGS++ H +A++ G + E
Sbjct: 30 YGYITSDCDAVAIIHDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQ 89
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT- 368
I+ A+ N FA MRLG F+G P + YG +GP VCTQ +QDLAL+AA+ GIVLLKN
Sbjct: 90 DINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDG 149
Query: 369 -AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCS 426
AG+LPLS + +LAVIG NAN + GNY G PC TPLQ L V T + AGC+
Sbjct: 150 GAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCN 209
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK 486
+ C + +A +AA+SAD+ VL MG DQ E E DRLDL LPGQQQ LI VA AK
Sbjct: 210 SAACNVTTIPEAVQAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAK 269
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PVIL+++ GG D+SFAK +PKI +ILW GYPG+AGG AIA V FG +NP GRLP+TWY
Sbjct: 270 KPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWY 329
Query: 547 PQSY 550
PQ +
Sbjct: 330 PQDF 333
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 210/591 (35%), Positives = 300/591 (50%), Gaps = 69/591 (11%)
Query: 19 FLTYCSLQHSSS--SAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
+ +YC L +S+ SA P N LASL C+ +L R LV +T +EK
Sbjct: 5 WASYCLLSCTSALVSAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEK 64
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN---VVPGATSFPQVILTAAS 133
+ LV + R+G+P Y WWSEALHGV+Y PGT F + +TSFP +L AA+
Sbjct: 65 LQNLVSKSKGAPRIGLPAYNWWSEALHGVAYA-PGTQFRSGDGPFNSSTSFPMPLLMAAT 123
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
F+ L + +G+V+ TE RA N G +G +W+PN+N F+DPRWGRG ETPGED L +Y
Sbjct: 124 FDDELIEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRY 183
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
A ++GLQ G P R +V A CKHY A D ++W G+ R+ F+A V + YYL
Sbjct: 184 AASMIRGLQ---GPLPER-RVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSP 239
Query: 250 FK---------------------------------------------YIVSDCDSVDVLY 264
F+ YI SDC++V ++
Sbjct: 240 FQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIF 299
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
+ HY KT E A + AG D +C A GL+ +S +D A++ + L+R
Sbjct: 300 ANHHYAKTNAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVR 359
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP--LSPTAIKNL 382
+G+FDG+ S+ Y LG KDV + +Q++AL A +GIVLLKN +LP L L
Sbjct: 360 VGYFDGNHSE--YASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQ-TLPLGLKTDPKSKL 416
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ---VDDAK 439
A+IG AN KT+ G Y G P +P+ A+ + AG +Q T+ A
Sbjct: 417 AMIGFWANDPKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAAL 476
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
+AA A+ + G D S E+ DR + P Q LI + K+ K P++++ M G
Sbjct: 477 EAAQDANYILYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQL 534
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D + + SILW +PGQ GG A+ + G +P+GRLP+T YP +Y
Sbjct: 535 DNTPLLATKTVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANY 585
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 195/531 (36%), Positives = 281/531 (52%), Gaps = 63/531 (11%)
Query: 71 LTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF---SNVVPGATSFPQV 127
++++EK+ LVD++ V LG+P + WW+E LHGV + PG F S ATSFP
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVGF-SPGVLFAQDSEPFGYATSFPLP 59
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 187
ILTAASF+ LF AIG+V+ E RA N G AG FW+PN+N FRDPRWGRGQETPGED
Sbjct: 60 ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119
Query: 188 LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYL 247
L+ S Y YV GLQ +D P + A CKH+ AYD++ + + Y+ + YYL
Sbjct: 120 LVVSNYVQSYVTGLQGSD---PTDKVIIAACKHFAAYDIETARRANNYNPTQQDLQDYYL 176
Query: 248 IKFK---------------------------------------------YIVSDCDSVDV 262
F+ ++VSDC +V
Sbjct: 177 PAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQFVVSDCGAVTD 236
Query: 263 LYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFAT 321
++ ++T T ++AA+ S+ AG DL CG S+L H ++ V + +D A++ +
Sbjct: 237 VWLLHNFTNTEQDAASVSMAAGTDLECGSSYL--HLNGSLADKQVTQERVDEALTRLYKA 294
Query: 322 LMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKN 381
L +G+FDG + LG DV T Q +A +AAR G+ LLKN G LPL+ K+
Sbjct: 295 LFTVGYFDG----SSHSSLGWSDVSTIDAQQIACEAARAGMTLLKND-GVLPLADGKYKS 349
Query: 382 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA--AVVATIYQAGCSNVQCGTAQVDDAK 439
+A+IGP AN T M GNY G +PL + + Y AG + DA
Sbjct: 350 VALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGTDINSTSDSGFADAL 409
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
AA ++D + G D +IEAE+ DR+ + PG Q LI++++ + K P+++ GG
Sbjct: 410 AAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLGK-PLVVAQFGGGQV 468
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D + ++ + ++ W G PGQAGG A+ D+ G+ + +GRLP T YP SY
Sbjct: 469 DDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASY 519
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 210/591 (35%), Positives = 297/591 (50%), Gaps = 69/591 (11%)
Query: 19 FLTYCSLQHSSS--SAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
+ YC L SS SA P N LASL C+ +L R LV +T +EK
Sbjct: 5 LVPYCLLSCSSVLVSAIDLPFQTYPDCVNGPLASLKVCDVTLSPPQRAAALVAAMTTEEK 64
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF---SNVVPGATSFPQVILTAAS 133
+ LV + R+G+P Y WWSEALHGV+Y PGT F +TSFP +L AA+
Sbjct: 65 LQNLVSKSKGAPRIGLPAYNWWSEALHGVAY-APGTQFWSGDGPFNASTSFPMPLLMAAT 123
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
F+ L + +G+V+ TE RA N G +G +W+PN+N F+DPRWGRG ETPGED L +Y
Sbjct: 124 FDDELIEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRY 183
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
A ++GLQ G +V A CKHY A D ++W G+ R+ FNA V + YYL
Sbjct: 184 AASMIRGLQ----GPARERRVVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSP 239
Query: 250 FK---------------------------------------------YIVSDCDSVDVLY 264
F+ YI SDC++V +
Sbjct: 240 FQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIS 299
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMR 324
+ HY +T E A + AG+D +C A GL+ +S +D A+ + L+R
Sbjct: 300 ANHHYAETNAEGTALAFEAGIDSSCEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVR 359
Query: 325 LGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS--PTAIKNL 382
+G+FDG+ S+ Y LG KDV + +Q++AL AA +GIVLLKN +LPL L
Sbjct: 360 VGYFDGNHSE--YASLGWKDVNSPKSQEVALQAAVEGIVLLKNDK-TLPLDLRTDPKSKL 416
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ---VDDAK 439
A+IG AN KT+ G Y G P +P+ A+ ++ AG +Q T+ A
Sbjct: 417 AMIGFWANDPKTLSGGYSGKPAFEHSPVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAAL 476
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
+AA A+ + G D S E+ DR + P Q LIT ++K+ K P++++ M G
Sbjct: 477 EAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQL 534
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D + + +ILW + GQ GG A+ + G NP+GRLP+T YP +Y
Sbjct: 535 DNTPLLAAKAVNAILWANWLGQDGGTAVMQILTGLKNPAGRLPVTQYPANY 585
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 215/562 (38%), Positives = 299/562 (53%), Gaps = 61/562 (10%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D + P L+ L C+TSL R LV LTL+EKI A SRLG+P Y WW+E
Sbjct: 27 DCTTGP-LSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNE 85
Query: 101 ALHGVSYV-GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
ALHGV+ G S ATSFP I+ A+FN +L + + +++STEARA N A
Sbjct: 86 ALHGVAEKHGVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHA 145
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
G+ +W+PN+N F+DPRWGRGQETPGEDPL S+Y +V GLQ G P + KV A CK
Sbjct: 146 GIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQ---GDDPEKPKVVATCK 202
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
H AYDL+ W G R+ F+A V + YYL FK
Sbjct: 203 HLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVPAC 262
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
++ DC +V+ + HY ++ EAAA ++ AG+DL+CG
Sbjct: 263 ADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYVESGPEAAAAALNAGVDLDCG 322
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQA 349
++L + A + GL++ +D A++ + +L++LG+FD P++ QP LG DV T
Sbjct: 323 TWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFD--PAEGQPLRSLGWDDVATSE 380
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
++LA A QG VLLKN +LPL A LA+IGP N T + NY G T
Sbjct: 381 AEELAKTVAIQGTVLLKNIDWTLPLK--ANGTLALIGPFINFTTELQSNYAGPAKHIPTM 438
Query: 410 LQGLAAVVATIYQAGCSNVQ-CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
++ + + A + V T DDA AA ADA + G D ++E ES DR +
Sbjct: 439 IEAAERLGYNVLTAPGTEVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDRTRI 498
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
PG Q+ LI E+A++ + P+ ++ GG D S + +I+W GYP QAGGA +
Sbjct: 499 DWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGAGVF 557
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
DV G+ P+GRLP+T YP+SY
Sbjct: 558 DVLTGKAAPAGRLPITQYPKSY 579
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 164/230 (71%), Positives = 190/230 (82%), Gaps = 3/230 (1%)
Query: 323 MRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNL 382
MRLGFFDG P + P+G LGP DVCT +NQ+LA +AARQGIVLLKNT G LPLS T+IK++
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSM 59
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC--GTAQVDDAKK 440
AVIGPNAN + TMIGNYEGTPCKYTTPLQGL A VAT+YQ GC+NV C + Q+D A K
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
AAASAD TVLV+GADQSIE ES DR LLLPGQQ L++ VA + GP IL++MSGG FD
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ISFAK+ KI +ILWVGYPG+AGGAAIADV FG +NPSGRLP+TWYP+S+
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESF 229
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 200/556 (35%), Positives = 288/556 (51%), Gaps = 101/556 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
+V R+TL EKI+ + A ++ RL IP+Y WW+EALHGV+ G AT FPQ
Sbjct: 17 IVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATVFPQ 66
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWGR 178
I AA F++ + + IG V+STE RA YN + GLTFWSPN+NIFRDPRWGR
Sbjct: 67 AIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPRWGR 126
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYH 236
GQET GEDP L ++ A +++G+Q G LK AAC KH+ + G + R+
Sbjct: 127 GQETYGEDPYLTARLAVAFIRGIQ----GDGKYLKAAACAKHFAVH-----SGPEALRHE 177
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
F+A V ++ YL FK
Sbjct: 178 FDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILRSEWGFE 237
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSD ++++ ++ HY A ++ AG +L C + +H ++V GL++E
Sbjct: 238 GHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIARHLRSSVDEGLISEDE 296
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
I A+ F T + +G PY +G ++ T + LA++AA + VLLKN G
Sbjct: 297 ITEAVERLFTTRIMMGMM---ADDCPYDSIGYEENDTPEHHQLAVEAASRSFVLLKND-G 352
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC- 425
LPL I ++AVIGPNAN K + GNY GT +Y T L+G+ +V Y GC
Sbjct: 353 LLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVWYSEGCH 412
Query: 426 ------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLL 470
S++ ++ +A AA AD VL +G D ++E E S D+ +L L
Sbjct: 413 LYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGDKPNLSL 472
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+QQLL+ + V K PVIL++ SG + +ND + +IL + YPG GG A+ADV
Sbjct: 473 PGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIWYPGAMGGKAVADV 531
Query: 531 CFGRYNPSGRLPMTWY 546
FGR P+G+LP+T+Y
Sbjct: 532 LFGRRAPAGKLPVTFY 547
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 220/563 (39%), Positives = 294/563 (52%), Gaps = 78/563 (13%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P LA+ C+TS R L+ TL EK+ +++ V R+G+P YEWW+E
Sbjct: 22 DCVNGP-LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNE 80
Query: 101 ALHGVSYVGPGTHFSNVVPG---ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
ALHG++ PGT F+ +TSFPQ IL A+F+ L + VSTEARA NV
Sbjct: 81 ALHGIAR-SPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVN 139
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL FW+PNIN ++DPRWGRGQETPGEDP S Y + GLQ P + K A
Sbjct: 140 RFGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYK-KGVAT 198
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+ YDL+N G RY F+A++ + YYL F+
Sbjct: 199 CKHFAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVP 258
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
++ SDCD+V +++ +YT TPE++AA ++ AG DL+
Sbjct: 259 TCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLD 318
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+F + +A GL + S +D +++ +A+L+RLG+FD PS QPY QL +V T
Sbjct: 319 CGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTP 377
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A Q LAL AA GIVLLKN G LPLS + I N+A+IGP AN TK M GNY GT +
Sbjct: 378 AAQQLALQAAEDGIVLLKND-GILPLS-SNITNVALIGPLANATKQMQGNYYGTAPYLRS 435
Query: 409 PL-QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL A Y G T A AA SAD + V G D SIEAE
Sbjct: 436 PLIAAQNAGFKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAE------ 489
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
E+ P+I+ M G D S ++ + ++LW GYPGQ GG AI
Sbjct: 490 ------------EILANLSTPLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAI 536
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
++ G+ P+GRLP+T YP +Y
Sbjct: 537 FNILTGKTAPAGRLPITQYPSNY 559
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 202/568 (35%), Positives = 293/568 (51%), Gaps = 95/568 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L + R DLV R+T++EK+T +V S+ ++SRLGIP Y WWSEALHGV+ G
Sbjct: 6 YQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARAGV-- 63
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFW 164
AT FPQ I AA+F+ L + +++S EARA ++ G+ GLTFW
Sbjct: 64 --------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYKGLTFW 115
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRGQET GEDP L + ++KGLQ G L+ AAC KH+ +
Sbjct: 116 SPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQ---GQDKKYLRAAACAKHFAVH 172
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ ++R+ F+A+V + YL FK
Sbjct: 173 ---SGPESERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNIL 229
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + + T + E+ A ++ G DLNCG+ +
Sbjct: 230 LKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGNMY-LNLL 288
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
A + GLV E AI+ A++ T M+LG FD PY +G Q +++ AL+ +
Sbjct: 289 IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDA-AENVPYTNIGFHQNDCQEHREFALEVS 347
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
++ +VLLKN LPL I ++AVIGPNAN + + GNY GT Y T L+G+ V
Sbjct: 348 KKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLEGIREAVG 407
Query: 419 ----TIYQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE------ 461
Y GC N+ + +A A AD V+ MG D SIE E
Sbjct: 408 KDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEGEEGDVSN 467
Query: 462 ---SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
S D+L L LPG QQ L+ + K K P+IL++++G +++A K+ +I+ Y
Sbjct: 468 EYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAAE--KVPAIIQAWY 524
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PG GG A+A FG Y+P G+LP+T+Y
Sbjct: 525 PGAEGGKALASAIFGEYSPVGKLPITFY 552
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 321 bits (822), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 161/358 (44%), Positives = 227/358 (63%), Gaps = 10/358 (2%)
Query: 198 VKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK---YIV 254
V+G + + LKV++CCKHY YD+D+W + ++ YIV
Sbjct: 5 VEGTENVTDLNSRPLKVSSCCKHYATYDIDSWLNVSEQDMKETFFSPFERDEWDLHGYIV 64
Query: 255 SDCDSVDVLYNSQHY-TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
SDC ++V+ ++Q+Y ++ +A AK++ AGLDL CG + +V G V++ +D
Sbjct: 65 SDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDR 124
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N + LMR+G+FDG P+ Y LG KD+C + +LA +AARQGIVLLKN LP
Sbjct: 125 ALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQGIVLLKNDYEVLP 181
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT- 432
L P K L ++GP+AN T+ MIGNY G P KY +PL+ +A+ Y GC + C
Sbjct: 182 LKPG--KKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASCSND 239
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
+AK+AA A+ T++ +G D SIEAE DR+D LLPG Q LI +VA+V+ GPVIL+
Sbjct: 240 TYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILV 299
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++SG DI+FAKN+P+I++ILWVG+PG+ GG AIADV FG+YNP GRLP+TWY Y
Sbjct: 300 VLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADY 357
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 207/578 (35%), Positives = 297/578 (51%), Gaps = 85/578 (14%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
+ +C +V N + F N ++ RV DL+ +LTL EKI+ L + V RLGIP Y
Sbjct: 1 MLSCFIVINTQAQTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYN 60
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
WW+EALHGV+ G AT FPQ I AA+FN L + V+STEARA YN+
Sbjct: 61 WWNEALHGVARAGV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNL 110
Query: 157 GLA--------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
LA GLTFWSPNINIFRDPRWGRGQET GEDP L + T +VKGLQ G
Sbjct: 111 SLAQGRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQ---GND 167
Query: 209 PNRLKVAACCKHYTAY--------------DLDNWKGTDRYHFNAM-------VIYTYYL 247
P LK +AC KH+ + D + + T Y F+A+ V+ Y
Sbjct: 168 PRYLKASACAKHFAVHSGPENGRHTFNAIVDEKDLRETYLYAFHALVDAGVESVMCAYNR 227
Query: 248 I--------------------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
+ KFK ++V+DC ++D ++ + E AA +I AG++
Sbjct: 228 VNDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVN 287
Query: 287 LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
L+C + L K E AV+ L+NE ID ++++ T ++LGF+D P+ P+ + G V
Sbjct: 288 LDCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVA 346
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
A+ LA A+Q +VLLKN+ LPL + V+G N+ ++GNY G +
Sbjct: 347 NTAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRA 406
Query: 407 TTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKK-----AAASADATVLVMG-------- 453
+ ++G+ V AG + +D AA +AD TV V+G
Sbjct: 407 VSFVEGITNAV----DAGTRVEYDQGSDYNDTTHFGGIWAAGNADITVAVIGLTPVYEGE 462
Query: 454 -ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
D + A+ D+ D+ LP + + K K P+I +I +G DIS +P +
Sbjct: 463 EGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAI--EPYADA 520
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
IL YPG+ GG A+AD+ FG+ +P+GRLP+T+Y QS+
Sbjct: 521 ILLAWYPGEQGGNALADILFGKVSPAGRLPVTFY-QSF 557
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 200/560 (35%), Positives = 282/560 (50%), Gaps = 99/560 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LV ++TL+EK+ + A +V RL I Y WW+EALHGV+ G AT
Sbjct: 16 RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------AT 65
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ L + +G VSTEARA +N+ GLTFW+PN+NIFRDP
Sbjct: 66 VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRG ET GEDP L S+ Y++GLQ G N LK AAC KH+ + G +
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQ---GHDENYLKAAACAKHFAVH-----SGPEA 177
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ F+A V + YL F+
Sbjct: 178 VRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKE 237
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
++ SDC ++ + H T T E+ A ++ G DLNCG+ G +A V+ GLV
Sbjct: 238 WGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQA-VRQGLV 296
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D A++N F M+LG FD + PY ++ ++ + L AR+ +VLLK
Sbjct: 297 KEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLK 355
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQ 422
N LPL IK + VIGPNA+ + ++GNYEGT +Y T L+G+ V +Y
Sbjct: 356 NKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYS 415
Query: 423 AGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRL 466
GC SN+ ++ + +D V V+G D IE E S D+
Sbjct: 416 EGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKP 475
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
DL LPG Q+ ++ K PVIL+++SG +++A D + +I+ YPG GGAA
Sbjct: 476 DLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAA 532
Query: 527 IADVCFGRYNPSGRLPMTWY 546
IAD+ FG NP G+LP+T+Y
Sbjct: 533 IADILFGEANPEGKLPVTFY 552
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 318 bits (816), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 200/560 (35%), Positives = 282/560 (50%), Gaps = 99/560 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LV ++TL+EK+ + A +V RL I Y WW+EALHGV+ G AT
Sbjct: 16 RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------AT 65
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ L + +G VSTEARA +N+ GLTFW+PN+NIFRDP
Sbjct: 66 VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRG ET GEDP L S+ Y++GLQ G N LK AAC KH+ + G +
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQ---GHDENYLKAAACAKHFAVH-----SGPEA 177
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ F+A V + YL F+
Sbjct: 178 VRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILRKE 237
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
++ SDC ++ + H T T E+ A ++ G DLNCG+ G +A V+ GLV
Sbjct: 238 WGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQA-VRQGLV 296
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D A++N F M+LG FD + PY ++ ++ + L AR+ +VLLK
Sbjct: 297 KEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVLLK 355
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQ 422
N LPL IK + VIGPNA+ + ++GNYEGT +Y T L+G+ V +Y
Sbjct: 356 NKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVLYS 415
Query: 423 AGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRL 466
GC SN+ ++ + +D V V+G D IE E S D+
Sbjct: 416 EGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGDKP 475
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
DL LPG Q+ ++ K PVIL+++SG +++A D + +I+ YPG GGAA
Sbjct: 476 DLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGGAA 532
Query: 527 IADVCFGRYNPSGRLPMTWY 546
IAD+ FG NP G+LP+T+Y
Sbjct: 533 IADILFGEANPEGKLPVTFY 552
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 203/562 (36%), Positives = 289/562 (51%), Gaps = 97/562 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +TL+EK+ + +A ++ RLG+P Y +W+EALHGV+ G AT
Sbjct: 16 RAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAGV----------AT 65
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AASF+ L + +STEARA +N+ GLTFWSPN+NIFRDP
Sbjct: 66 VFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFWSPNVNIFRDP 125
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L+ + +++G+Q G +KVAAC KH+ + + R
Sbjct: 126 RWGRGHETFGEDPFLSGRLGVSFIRGMQ---GDDERYMKVAACAKHFAVH---SGPEDQR 179
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ FNA+V + YL F
Sbjct: 180 HSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCGSKKLLVDILRGEWG 239
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
++ SDC ++ + TK EE A ++ +G DLNCG+ L H AV+ GLV E
Sbjct: 240 FRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LYVHLLQAVRDGLVEE 298
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
S ID A++ F T M+LG FD + PY +G V T+AN+ L +A+R+ + LLKN
Sbjct: 299 SVIDRAVTRLFTTRMKLGLFD-RSEEVPYNGIGYDRVDTEANRKLNREASRRTVCLLKNA 357
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQAG 424
G LPL + ++ + V+GPNA+ K ++GNYEGT +Y T L G+ + V +Y G
Sbjct: 358 DGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIRELAGDDVRVVYSEG 417
Query: 425 C----SNVQCGTAQVDD----AKKAAASADATVLVMGADQSIEAE---------SHDRLD 467
C VQ G Q +D A+ A +D + VMG D +E E S D+ +
Sbjct: 418 CHLFRDRVQ-GLGQPNDRIAEARAVAELSDVVIAVMGLDPGLEGEEGDQGNEFASGDKPN 476
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
L LPG Q ++ + + K PV+L+++ G I +A+ + +IL YPG GG A+
Sbjct: 477 LELPGLQGEVLKALVESGK-PVVLVLLGGSALAIPWAEE--HVPAILDAWYPGAQGGRAV 533
Query: 528 ADVCFGRYNPSGRLPMTWYPQS 549
ADV FGR P G+LP+T+Y S
Sbjct: 534 ADVLFGRACPEGKLPVTFYRTS 555
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 292/562 (51%), Gaps = 65/562 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+ SL R LV T E + ++ VSRLG+P Y+ W E
Sbjct: 49 DCASGP-LSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGE 107
Query: 101 ALHGV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
ALHGV ++V G +FS ATSFP I A+ N +L IG +VST+ RA N G
Sbjct: 108 ALHGVGRANFVESG-NFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAG 162
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
L G+ +SPNIN FR P WGRGQETPGED L S Y Y+ LQ G P LK+ A
Sbjct: 163 LGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQ--GGVDPETLKIIAT 220
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD+++W R
Sbjct: 221 AKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVP 280
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ + Y+ DC +V ++N Y A+A SILAG D++C
Sbjct: 281 SCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDC 340
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G+ H+E A + LV+ S I+ + ++ L++ G+FDG + PY + DV +
Sbjct: 341 GTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTD 398
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
++A +AA +GIVLLKN +LPLS IK++AVIGP ANVT+ + GNY G +P
Sbjct: 399 AWNIAYEAAVEGIVLLKNDE-TLPLSKD-IKSVAVIGPWANVTEELQGNYFGPAPYLISP 456
Query: 410 LQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
L G ++ A +N+ T+ ++A AA ADA + G D +IEAE+ DR ++
Sbjct: 457 LTGFRDSGLDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENI 516
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
PG Q LI++++++ K P++++ M GG D S K++ + +++W GYPGQ+GG A+A
Sbjct: 517 TWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALA 575
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
D+ G+ P+GRL T YP Y
Sbjct: 576 DIITGKRAPAGRLVTTQYPAEY 597
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 158/302 (52%), Positives = 207/302 (68%), Gaps = 5/302 (1%)
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
YI SDC++V ++Y++Q Y KTPE+A A + AG+D+ CG +L KH +AAV V S I
Sbjct: 316 YITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPISQI 375
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
D A+ N F +RLG FDG+P+K YG++GP VC++ N DLAL+AAR GIVLLKNTA
Sbjct: 376 DRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASI 435
Query: 372 LPLSPTAIKNLAVIGPNAN-VTKTMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGC-SNV 428
LPL + L VIGPNAN +K ++GNY G PC+ L+G + T Y++GC
Sbjct: 436 LPLP--RVNTLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLDGT 493
Query: 429 QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGP 488
+C +A++D A + A +D +LVMG DQS E ES DR DL LPG+QQ LI VAK +K P
Sbjct: 494 KCASAEIDRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAKASKKP 553
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
VIL+++ GG DI+FAKN+ KI I+W GYPG+ GG A+A V FG YNP GRLPMTWYP+
Sbjct: 554 VILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPK 613
Query: 549 SY 550
+
Sbjct: 614 DF 615
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/241 (57%), Positives = 172/241 (71%), Gaps = 8/241 (3%)
Query: 13 VSVLFLFLT-YCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
+S +FLFLT Y L H+ S P ++CD +NP SL FCN +L I R D+V RL
Sbjct: 11 ISFIFLFLTRYHRLVHADSPTHVPP-YSCDT-TNPLTKSLPFCNLNLTITQRAKDIVSRL 68
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TL EKI+ LV++A S+ RLGIP Y+WW EALHGV+ G G + V GATSFPQVILTA
Sbjct: 69 TLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTA 128
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLA 190
ASF++ L+ I KV+ TEAR +YN G A G+TFW+PNINIFRDPRWGRGQET GEDPL+
Sbjct: 129 ASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 188
Query: 191 SKYATGYVKGLQ--QTDGGS--PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYY 246
SKY YV+GLQ +GG +RLK +ACCKH+TAYDLDNWKG DR+ F+A V + +
Sbjct: 189 SKYGVSYVRGLQGDSFEGGKLIGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFS 248
Query: 247 L 247
+
Sbjct: 249 M 249
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 201/568 (35%), Positives = 292/568 (51%), Gaps = 95/568 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L + R DLV R+TL+EK+T +V + ++ RLG+P Y WWSEALHGV+ G
Sbjct: 32 YQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV-- 89
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFW 164
AT FPQ I AA+F+ L + +V+S E RA ++ G+ GLTFW
Sbjct: 90 --------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIYKGLTFW 141
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRGQET GEDP L + ++KGLQ G L+ AAC KH+ +
Sbjct: 142 SPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQ---GQDKKYLRAAACAKHFAVH 198
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ ++R+ F+A+V + YL FK
Sbjct: 199 ---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSNML 255
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + + T + E+ A ++ G DLNCG+ +
Sbjct: 256 LKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGNMY-LNLL 314
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
A + GLV E AI+ A++ T M+LG FD PY +G Q +++ AL+ +
Sbjct: 315 IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDT-AENVPYTNIGFHQNDCQEHREFALEVS 373
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
++ +VLLKN LPL I ++AVIGPNAN + + GNY GT Y T L+G+ V
Sbjct: 374 KKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGIREAVG 433
Query: 419 --TI--YQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE------ 461
TI Y GC N+ + +A A AD V+ MG D SIE E
Sbjct: 434 KDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEGEEGDVSN 493
Query: 462 ---SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
S D+L L LPG QQ L+ + + K P+IL++++G +++A K+ +I+ Y
Sbjct: 494 EYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KVPAIIQAWY 550
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PG GG A+A FG Y+P G+LP+T+Y
Sbjct: 551 PGAEGGKALASAIFGEYSPVGKLPITFY 578
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 207/567 (36%), Positives = 286/567 (50%), Gaps = 94/567 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N +L +D RV DLV R+TL+EK++ +++SA ++ RLG+P Y WW+E LHGV+ T
Sbjct: 33 FNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVAR----T 88
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
F T +PQ I AA+++ + +G + E RA+YN + GLT+W
Sbjct: 89 PFK-----VTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHDIYLGLTYW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L + + +VKGLQ G P LK A C KHY +
Sbjct: 144 TPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQ---GDDPRYLKAAGCAKHYAVH 200
Query: 225 ----DLDNWKGTDRYHFNAMVIYTYYLIKFK----------------------------- 251
DL + TD ++ ++ YL F+
Sbjct: 201 SGPEDLRHKFNTDISDYD---LWDTYLPAFRKLVVDAKVTGVMCAYNAFKGQPCCGSDLL 257
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGKH 296
Y+ SDC +D Y + P E AAA ++L G D+ CG+ K
Sbjct: 258 MNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDAESAAADAVLHGTDVECGNVTYKS 317
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
AVK G ++E ID ++ F+ +LG FD + + Y Q+G + A+ AL
Sbjct: 318 LVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVK-YNQIGKDALEAPAHGAQALK 376
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG---- 412
A Q IVLLKN LPLS +K +AV+GPNA+ +++GNY GTP + T LQG
Sbjct: 377 MAHQSIVLLKNEGNLLPLSKN-LKKIAVLGPNADNAVSVLGNYNGTPSRIVTALQGIKNK 435
Query: 413 LAAVVATIYQAGCSNVQCGTAQVDDAKKAAA--SADATVLVMGADQSIEAESH------- 463
L A IY V A+ + A AA ADA + + G +E E
Sbjct: 436 LPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAIIYIGGISPELEGEEMPVSKPGF 495
Query: 464 ---DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF-AKNDPKITSILWVGYP 519
DR +LLPG Q L+ + K PV+ ++M+G + A+N P I + W G
Sbjct: 496 HGGDRSTILLPGVQTELLKALKATGK-PVVFVMMTGSAIATPWEAENLPAIVNA-WYG-- 551
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQA G AIADV FG YNP+GRLP+T+Y
Sbjct: 552 GQAAGTAIADVLFGDYNPAGRLPVTFY 578
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 200/570 (35%), Positives = 291/570 (51%), Gaps = 95/570 (16%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
+ + +L + R DLV R+TL+EK+T +V + ++ RLG+P Y WWSEALHGV+ G
Sbjct: 4 FAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV 63
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLT 162
AT FPQ I AA+F+ L + +V+S E RA ++ G+ GLT
Sbjct: 64 ----------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYKGLT 113
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSPN+NIFRDPRWGRGQET GEDP L + ++KGLQ G L+ AAC KH+
Sbjct: 114 FWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQ---GQDKKYLRAAACAKHFA 170
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
+ + ++R+ F+A+V + YL FK
Sbjct: 171 VH---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEPCCGSN 227
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
++VSDC ++ + + T + E+ A ++ G DLNCG+ +
Sbjct: 228 MLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGNMY-LN 286
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
A + GLV E AI+ A++ T M+LG FD PY ++G Q +++ AL+
Sbjct: 287 LLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDT-AENVPYTKIGFHQNDCQEHREFALE 345
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
+++ +VLLKN LPL I ++AVIGPNAN + + GNY GT Y T L+G+
Sbjct: 346 VSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLEGIREA 405
Query: 417 VA----TIYQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---- 461
V Y GC N+ + +A A AD V+ MG D SIE E
Sbjct: 406 VGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEGEEGDV 465
Query: 462 -----SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
S D+L L LPG QQ L+ + + K P+IL++++G +++A KI +I+
Sbjct: 466 SNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE--KIPAIIQA 522
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
YPG GG A+A FG Y+P G+LP+T+Y
Sbjct: 523 WYPGAEGGKALASAIFGEYSPVGKLPITFY 552
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 213/575 (37%), Positives = 285/575 (49%), Gaps = 105/575 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL R DLV R+TL+EK + L A V RLGIP+Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA F+ + I V++TE RA YN G+TFW
Sbjct: 64 --------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRG ET GEDP L S+ +VKGLQ G LK AAC KH+ +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----GDGKYLKTAACAKHFAVH 171
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+A+V +Y YL F+
Sbjct: 172 ---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTL 228
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + TKTP E+ A ++ +G DLNCG+
Sbjct: 229 LKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLIL 287
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDG--HPSKQPYGQLGPKDVCTQANQDLALD 356
A+K G + E ID A T MRLG FD K PY +L + + L+L+
Sbjct: 288 LALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPY-ELND----SVEHNKLSLE 342
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AA++ +VLLKN G LPL IKN+AVIGPNA+ + + NY GTP + T L G+
Sbjct: 343 AAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRKR 401
Query: 417 VA----TIYQAGCS---NVQCGTAQVDDAKKAAAS----ADATVLVMGADQSIEAESH-- 463
V+ Y G N + AQ DD K A S +D VL +G D S+E E +
Sbjct: 402 VSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAERSDVVVLCLGLDASVEGEQNDQ 461
Query: 464 ---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSIL 514
D+ DL LP Q+ L+ V K P I+ ++SG I A + K +I+
Sbjct: 462 GTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD--KAAAIV 518
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
YPG GG A A++ FG Y+P+GRLP+T+Y +
Sbjct: 519 QCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKST 553
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 314 bits (804), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 202/557 (36%), Positives = 292/557 (52%), Gaps = 62/557 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N +L++ CN SL R LV+ TL+E ++A V RLG+P YE W+EALHG
Sbjct: 51 NSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHG 110
Query: 105 VSYVGPGTHFSN--VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+S+ HF+ ATSFP IL+ AS N +L IG ++ST+ RA N G GL
Sbjct: 111 ISH----GHFATNGTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRYGLD 166
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
++PNIN FR P WGRGQETPGED L+S YA Y+ G+Q G +P K+ A KH+
Sbjct: 167 SYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQ--GGKAPAVPKLVAVPKHF 224
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
YD++NW R + + + YY +F+
Sbjct: 225 AGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPSCSN 284
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
++ SDCD+V +YN Y A A S+ AG D++CG+
Sbjct: 285 SFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDIDCGTSYP 344
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+ A AGLV+ + I+ A++ ++ L+ G+FDG+ S Y LG DV T +++
Sbjct: 345 FYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSL--YRNLGWNDVLTTDAWNIS 402
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
+AA +GI LLKN G+LPLS + +++A+IGP AN T + GNY +PLQ
Sbjct: 403 YEAAVEGITLLKND-GTLPLS-KSTRSVALIGPWANATLQLQGNYYAAAPYLISPLQAFR 460
Query: 415 AVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQ 473
A T+ + G + T+ +A A +D + G D SIEAE DR ++ PG
Sbjct: 461 ASGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITWPGN 520
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q LI ++++V K P++++ M GG D S KN+ K+ +++W GYPGQ+GG A+ D+ G
Sbjct: 521 QLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDIIMG 579
Query: 534 RYNPSGRLPMTWYPQSY 550
P+GRL T YP SY
Sbjct: 580 NRAPAGRLVTTQYPASY 596
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 199/547 (36%), Positives = 287/547 (52%), Gaps = 59/547 (10%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
C+++ G R L+ TL+E I +S V RLG+P Y+ W+EALHG+ T
Sbjct: 64 CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 123
Query: 114 FSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRD 173
G TSFP IL+ A+ N +L I ++ST+ARA N G GL ++PNIN FR
Sbjct: 124 GGEFEWG-TSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNINGFRS 182
Query: 174 PRWGRGQETPGEDP-LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
P WGRGQETPGED +L S Y Y+ G+Q G P LK+AA KH+ YDL+N+
Sbjct: 183 PLWGRGQETPGEDANVLTSAYTYEYITGMQ--GGVDPENLKIAATAKHFAGYDLENYNNQ 240
Query: 233 DRYHFNAMV----IYTYYLIKF-------------------------------------- 250
R F+A++ + YY +F
Sbjct: 241 SRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFFLQTLLRES 300
Query: 251 ------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAG 304
Y+ SDCD++ ++N +Y + AAA S+ AG D++CG H + AG
Sbjct: 301 WGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDIDCGQTYPWHLNESFVAG 360
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
V+ I+ +++ +A L+RLG+FD K Y LG KDV +++ +AA +GIVL
Sbjct: 361 TVSRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEGIVL 417
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQA 423
LKN G+LPLS ++++A+IGP N T+ + GNY GT +PLQ + Y+
Sbjct: 418 LKND-GTLPLS-KKVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAGYEVNYEL 475
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
G TA A AA +DA + + G D +IE E DR D+ PG Q LI ++++
Sbjct: 476 GTGINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQLDLIKQLSE 535
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
V K P++++ M GG D S K++ K+ S++W GYPGQ+GG A+ D+ G+ P+GRL
Sbjct: 536 VGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKRAPAGRLVS 594
Query: 544 TWYPQSY 550
T YP Y
Sbjct: 595 TQYPAEY 601
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 202/550 (36%), Positives = 286/550 (52%), Gaps = 65/550 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV---SYVGP 110
C++S G R L+ TL+E I +S V RLG+P Y+ W+EALHG+ ++
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G F ATSFP ILT A+ N +L I ++ST+ARA N G GL ++PN+N
Sbjct: 123 GGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178
Query: 171 FRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
FR P WGRGQETPGED L+S Y Y+ G+Q G P LKVAA KH+ YDL+NW
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQ--GGVDPEHLKVAATVKHFAGYDLENW 236
Query: 230 KGTDRYHFNAMV----IYTYYLIKF----------------------------------- 250
R F+A++ + YY +F
Sbjct: 237 NNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCANSFFLQTLL 296
Query: 251 ---------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAV 301
Y+ SDCD+V ++N Y AAA S+ AG D++CG H +
Sbjct: 297 RESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESF 356
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
AG V+ I+ +++ +A L+RLG+FD K Y LG KDV +++ +AA +G
Sbjct: 357 VAGEVSRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEG 413
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI- 420
IVLLKN G+LPLS ++++A+IGP AN T M GNY G +PL+ +
Sbjct: 414 IVLLKND-GTLPLS-KKVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKKAGYHVN 471
Query: 421 YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
++ G T A AA +DA + + G D +IE E DR D+ PG Q LI +
Sbjct: 472 FELGTEIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQ 531
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
+++V K P++++ M GG D S K++ K+ S++W GYPGQ+GG A+ D+ G+ P+GR
Sbjct: 532 LSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGR 590
Query: 541 LPMTWYPQSY 550
L T YP Y
Sbjct: 591 LVTTQYPAEY 600
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 196/549 (35%), Positives = 285/549 (51%), Gaps = 64/549 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV---SYVGP 110
C+ SL R LV T E + ++ VSRLG+P Y+ W EALHGV ++V
Sbjct: 60 CDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRANFVES 119
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G +FS ATSFP I A+ N +L IG +VST+ RA N GL G+ +SPNIN
Sbjct: 120 G-NFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINT 174
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
FR P WGRGQETPGED L S Y Y+ LQ P K+ A KHY YD+++W
Sbjct: 175 FRHPVWGRGQETPGEDAFLTSVYGYEYITALQ--GAVDPETSKIIATAKHYAGYDIESWN 232
Query: 231 GTDR------------------------------------------------YHFNAMVI 242
R + ++
Sbjct: 233 NHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLR 292
Query: 243 YTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVK 302
T+ + Y+ DC +V ++N Y A+A SILAG D++CG+ H+E A +
Sbjct: 293 DTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFE 352
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGI 362
LV+ S I+ + ++ L++ G+FDG + PY + DV + ++A +AA +GI
Sbjct: 353 DSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAAVEGI 410
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQ 422
VLLKN +LPLS IK++AVIGP ANVT+ + GNY G +PL G ++
Sbjct: 411 VLLKNDE-TLPLSKD-IKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHY 468
Query: 423 AGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
A +N+ T+ ++A AA ADA + G D +IEAE+ DR ++ PG Q LI+++
Sbjct: 469 ALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKL 528
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
+++ K P++++ M GG D S K++ + +++W GYPGQ+GG A+AD+ G+ P+GRL
Sbjct: 529 SELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRL 587
Query: 542 PMTWYPQSY 550
T YP Y
Sbjct: 588 VTTQYPAEY 596
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 218/584 (37%), Positives = 286/584 (48%), Gaps = 88/584 (15%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
S L S C+ +L R LV LT+ EK+ LV+ A V RL IP YEWWSE LH
Sbjct: 55 SRDPLCSNDVCDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLH 114
Query: 104 GVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
GV+ PGT F S ATSFPQ I+ ++F+ L +AIG+VVS EARA N G +GL
Sbjct: 115 GVA-SSPGTKFAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGL 173
Query: 162 TFW---------------------SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKG 200
+ SPNIN F+DPRWGRGQETPGEDP Y + G
Sbjct: 174 DLYVSSISRHIEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTG 233
Query: 201 LQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----- 251
L+ GG P++ K+ A CKHY A D +N+KG DR F+A + + YYL FK
Sbjct: 234 LE---GGDPSK-KLIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVD 289
Query: 252 ----------------------------------------YIVSDCDSVDVLYNSQHYTK 271
Y+ +DCD V ++ + HY
Sbjct: 290 KKVGSFMCSYNAINGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAP 349
Query: 272 TPEEAAAKSILAGLDLNCGSFLGKHT-EAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
AAA ++ AG DL C +F G + A L++E +D +++ + L+ +G FD
Sbjct: 350 DLGHAAAWAMKAGTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDS 409
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT-AIKNLAVIGPNA 389
QP L DV T+ Q LA A +G VLLKN G LPLS K A+IGP
Sbjct: 410 -ARGQPLRSLSWDDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWI 467
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATV 449
N T M GNY G P Y L A + + T D K+A SA A
Sbjct: 468 NATTQMQGNYFG-PAPYLISLYQAAKEFGLDFTYSLGSRINSTD--DSFKQALDSAHAAA 524
Query: 450 LVM---GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
L++ G D ++EAE+ DR L P Q L+ V+ + K PVI++ GG D +
Sbjct: 525 LIVFAGGVDNTLEAETRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLA 583
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ I ++LW GYPGQ+GG A+ D+ FGR P+GRL +T YP SY
Sbjct: 584 NHSINALLWGGYPGQSGGKAVIDLLFGRAAPAGRLSVTQYPASY 627
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/303 (52%), Positives = 209/303 (68%), Gaps = 3/303 (0%)
Query: 251 KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
KY+ SDCD+V + ++ HYT +PE+ A SI AG+D+NCG++ H AAV+ G + E
Sbjct: 15 KYVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKD 74
Query: 311 IDHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A+ N FA MRLG FDG P S YG LG DVC+ A++ LAL+AA+ GIVLLKN A
Sbjct: 75 IDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDA 134
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--TIYQAGCSN 427
G+LPL P+A+ +LAVIGPNA+ + GNY G PC+ TTPLQG+ + + AGC +
Sbjct: 135 GALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDS 194
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG 487
C A ++A A+S+D VL MG Q E E DR LLLPG+QQ LIT VA A+
Sbjct: 195 PACAVAATNEAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARR 254
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYP 547
PVIL++++GG D++FAK++PKI +IL GYPGQAGG AIA V FG +NPSGRLP+TWYP
Sbjct: 255 PVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYP 314
Query: 548 QSY 550
+ +
Sbjct: 315 EEF 317
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 311 bits (797), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 213/571 (37%), Positives = 294/571 (51%), Gaps = 105/571 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + S + RV DLV ++TL+EK++ L A +V RLGIP Y WW+EALHGV+ G
Sbjct: 5 YLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG--- 61
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA F+ +L + IG V + E RA Y+ + G+TFW
Sbjct: 62 -------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFW 114
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRG ET GEDP L + T Y+KG+Q G+ RLK AAC KH+ A+
Sbjct: 115 SPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQ----GNGKRLKAAACVKHFAAH 170
Query: 225 DLDNWKGTDRYHFNAMV-----IYTYY--------------------------------- 246
KG R+ FN++V TY+
Sbjct: 171 SGPE-KG--RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSHHL 227
Query: 247 ---LIKFK-----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
+++ K Y VSDC ++ + T TP+E+AA ++ +G DLNCG+ H
Sbjct: 228 ITEILREKWGFDGYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGAVY-LHVM 286
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS--KQPYGQLGPKDVCTQANQDLALD 356
+A GLV+ ID A+++ T MRLG FD H + PY + C + + LAL
Sbjct: 287 SAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEIPY----EINDCAE-HHGLALK 341
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AA + +VLLKN G LPL TA+K +AVIGPN + + + GNY GT + T L+G+ AV
Sbjct: 342 AAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRAV 400
Query: 417 VATIYQAGCS--------NVQCGTAQVDDAKKAAAS----ADATVLVMGADQSIEAES-- 462
+ + CS NV+ A+ DD K A S +D L +G + ++E E
Sbjct: 401 LGKETRIFCSEGSHLYRDNVE-NLAEADDRLKEAVSMAVRSDVVFLCLGLNGTLEGEEGD 459
Query: 463 -------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
D+ DL LP Q L+ V PVIL++ +G I++A ++IL
Sbjct: 460 ANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLAAGSAMAINYAAE--HCSAILH 516
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ YPGQ GG A A + G PSGRLP+T+Y
Sbjct: 517 IWYPGQMGGLAAARLLTGEAVPSGRLPVTFY 547
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 158/302 (52%), Positives = 208/302 (68%), Gaps = 3/302 (0%)
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
Y+ SDCD+V + ++ HYT +PE+ A SI G+D+NCG++ H AAV+ G + E I
Sbjct: 300 YVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDI 359
Query: 312 DHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
D A+ N FA MRLG FDG P S YG LG DVC+ A++ LAL+AA+ GIVLLKN AG
Sbjct: 360 DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 419
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--TIYQAGCSNV 428
+LPL P+A+ +LAVIGPNA+ + GNY G PC+ TTPLQG+ + + AGC +
Sbjct: 420 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 479
Query: 429 QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGP 488
C A ++A A+S+D VL MG Q E + DR LLLPG+QQ LIT VA A+ P
Sbjct: 480 ACAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRP 539
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
VIL++++GG D++FAK++PKI +ILW GYPGQAGG AIA V FG +NPSGRLP+TWYP+
Sbjct: 540 VILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPE 599
Query: 549 SY 550
+
Sbjct: 600 EF 601
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 107/188 (56%), Positives = 133/188 (70%), Gaps = 7/188 (3%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ L + R DLV LTL EK++ L D A V+RLG+P YEWWSE LHG+S G G
Sbjct: 32 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIF 171
F+ V TSFPQVILTAA+F+A L++ +G+ V EARA+YN+G A GLT WSPN+NIF
Sbjct: 92 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQETPGEDP+ AS+YA +V GLQ G + +ACCKH TAYDLD W
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------EASACCKHATAYDLDYWNN 205
Query: 232 TDRYHFNA 239
RY++++
Sbjct: 206 VVRYNYDS 213
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 200/564 (35%), Positives = 284/564 (50%), Gaps = 70/564 (12%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L++ CNT+L R LV LT +EK+ +V + R+G+P Y WWSEALHGV+Y
Sbjct: 34 LSTNQVCNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAY 93
Query: 108 VGPGTHF---SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
PGT F +TSFP +L AA+F+ L + I +V+ E RA N G +GL +W
Sbjct: 94 A-PGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFSGLDYW 152
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR-LKVAACCKHYTA 223
+PN+N F+DPRWGRG ETPGED LL +YA +KGL+ G P + +V A CKHY A
Sbjct: 153 TPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGLE---GPVPEKERRVVATCKHYAA 209
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
D ++W G R++FNA + + YY + F+
Sbjct: 210 NDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPSCASPY 269
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
YI SDC++V + + Y T E A S AG+D +C
Sbjct: 270 LLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGTAISFEAGMDTSCEYEGS 329
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
A GL+ ES +D A+ + ++R G+FDG S Y LG DV + Q L+
Sbjct: 330 SDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSL--YSSLGWADVNKPSAQKLS 387
Query: 355 LDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
L AA G VLLKN G+LPLS + K +A+IG ++ + G Y GT TP
Sbjct: 388 LQAAVDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGTAAYLHTPA 446
Query: 411 QGLAAVVATIYQAGC----SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL 466
+ + A S++ + D+A AA AD + G D S E+ DR
Sbjct: 447 YAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAAGETKDRY 506
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
DL PG Q LI + ++K P+I++ M G D + ++PKI +ILW +PGQ GG A
Sbjct: 507 DLDWPGAQLSLINLLTTLSK-PLIVLQM-GDQLDNTPLLSNPKINAILWANWPGQDGGTA 564
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ ++ G +P+GRLP+T YP ++
Sbjct: 565 VMELVTGLKSPAGRLPVTQYPSNF 588
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 200/550 (36%), Positives = 285/550 (51%), Gaps = 64/550 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV---SYVGP 110
CNTS+ R L+ TL+E I +SA V RLG+P Y+ WSE LHG+ ++
Sbjct: 64 CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWAKS 123
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G + ATSFP IL+ A+ N +L I +++T+ARA NVG GL ++PNIN
Sbjct: 124 GEEWK----WATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNING 179
Query: 171 FRDPRWGRGQETPGEDP-LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
FR P WGRGQETPGED L+S YA Y+ GLQ G P LK+ A KH+ YDL+NW
Sbjct: 180 FRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQ--GGVDPEHLKIVATAKHFAGYDLENW 237
Query: 230 KGTDRYHFNAMV----IYTYYLIKF----------------------------------- 250
R F+A + + YY +F
Sbjct: 238 NNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQTLL 297
Query: 251 ---------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAV 301
Y+ SDCD+ ++N Y AAA S+ AG D++CG + +
Sbjct: 298 RENWDFPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDIDCGQTYPWYLNQSF 357
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
G V I+ ++ ++ L++LG+FDG+ S+ Y QLG DV +++ +AA +G
Sbjct: 358 IEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSE--YRQLGWNDVVATDAWNISYEAAVEG 415
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIY 421
IVLLKN G LPLS +K++AVIGP AN T+ + GNY G TPLQ +
Sbjct: 416 IVLLKND-GVLPLS-EKLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAGYKVN 473
Query: 422 QAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
A +N+ T A AA +D + + G D +IEAE DR+++ PG Q LI +
Sbjct: 474 YAFGTNILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQLDLIQQ 533
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
+++ K P++++ M GG D S K++ + +++W GYPGQ+GG AI D+ G+ P+GR
Sbjct: 534 LSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGKRAPAGR 592
Query: 541 LPMTWYPQSY 550
L T YP Y
Sbjct: 593 LVTTQYPAEY 602
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 213/575 (37%), Positives = 283/575 (49%), Gaps = 105/575 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL R DLV R+TL+EK + L A V RLGIP+Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA F+ + I V++TE RA YN G+TFW
Sbjct: 64 --------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRG ET GEDP L S+ +VKGLQ G LK AAC KH+ +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----GDGKYLKSAACAKHFAVH 171
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HFNA+ +Y YL F+
Sbjct: 172 ---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSKTL 228
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + TKTP E+ A ++ G DLNCG+
Sbjct: 229 LKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGNMY-LLIL 287
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDG--HPSKQPYGQLGPKDVCTQANQDLALD 356
A+K G + E ID A T M+LG FD K PY + + N+ L+L+
Sbjct: 288 LALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKIPY----EVNDSIEHNK-LSLE 342
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AAR+ +VLLKN G LPL IKN+AVIGPNA+ + + NY GTP T L G+ +
Sbjct: 343 AARKSMVLLKNN-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVRSR 401
Query: 417 VA----TIYQAGCS---NVQCGTAQVDDAKKAAAS----ADATVLVMGADQSIEAESH-- 463
V+ Y G N + AQ DD K A S +D VL +G D S+E E +
Sbjct: 402 VSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAERSDVVVLCLGLDASVEGEQNDQ 461
Query: 464 ---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSIL 514
D+ DL LP Q+ L+ V K P I+ ++SG I A + K +I+
Sbjct: 462 GTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD--KAAAIV 518
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
YPG GG A A++ FG Y+P+GRLP+T+Y +
Sbjct: 519 QCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKST 553
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 204/559 (36%), Positives = 282/559 (50%), Gaps = 89/559 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N + +D R +VK+LT++EKI L+++A +V RL IP+Y+WW+E LHGV+ G
Sbjct: 17 FRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECLHGVARAGR-- 74
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+++ +L +G +STEARA YNV GLTFW
Sbjct: 75 --------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKHGYRGQYKGLTFW 126
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L S+ +VKGLQ G P LKVAA KHY +
Sbjct: 127 TPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQ---GNHPKYLKVAALAKHYAVH 183
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
N R+ F+A V ++ YL F+
Sbjct: 184 ---NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVMGAYNRTNGDPCCAHPYL 240
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y VSDC ++ Y TPEEAAA ++ AG +LNCG +
Sbjct: 241 MQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMALNAGCNLNCGDTYASLLK 300
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
+++ GL E ID ++ F T +RLG F PY + + ++ +Q LAL+AA
Sbjct: 301 -SLEKGLTTEEEIDRSVKQLFKTRLRLGLF-APEGAVPYDTISTDVIRSKEHQKLALEAA 358
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV- 417
R+ +VLLKN A +LP++ +K + V GP A + ++ NY G TT L+G+ V
Sbjct: 359 RKSVVLLKNEANTLPVA-RDVKKVYVTGPTATHVQALLANYYGVSEDMTTILEGIVGKVS 417
Query: 418 ---ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------HDR 465
+ Y+ G + +D AAASAD TV +G Q IE E DR
Sbjct: 418 PQTSVQYRQGALLYEANRNTMDWFSGAAASADVTVACLGISQLIEGEEGEAIASEHRGDR 477
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
LP Q + + AK ++++I SG IS + ++L+V YPG+ GG
Sbjct: 478 ERTRLPQNQIDFLKRIRASAK-KLVVVITSGSA--ISLPEIYDMADALLYVWYPGEQGGK 534
Query: 526 AIADVCFGRYNPSGRLPMT 544
A+ADV FG PSGRLP+T
Sbjct: 535 AVADVLFGDAVPSGRLPVT 553
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 198/560 (35%), Positives = 286/560 (51%), Gaps = 65/560 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L+ C+ S R LV TL+E + ++ V RLG+PKY+ WSE+LH
Sbjct: 53 ENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLH 112
Query: 104 GV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
GV ++ G + ATSFPQ ILT A+ N +L IG ++ST+ARA NVG G
Sbjct: 113 GVYRANWASEGDY-----SWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYG 167
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
L ++PNIN FR P WGRGQETPGED LAS YA Y+ G+Q G P LK+ A K
Sbjct: 168 LDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQ--GGVDPETLKLVATAK 225
Query: 220 HYTAYDLDNWKGTDR--------------------------------------------- 234
HY YD++NW G R
Sbjct: 226 HYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSC 285
Query: 235 ---YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+ ++ T+ ++ Y+ DC +V +N Y A+A SI AG D++CG+
Sbjct: 286 SNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGT 345
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
H A G ++ I+ + + L+RLG+FDG+ S+ Y L DV T
Sbjct: 346 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQ--YRDLTWSDVQTTDAW 403
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+++ +AA +G VLLKN G+LPL+ +I+++A+IGP AN T M GNY G T+PL
Sbjct: 404 NISHEAAVEGTVLLKND-GTLPLA-DSIRSVALIGPWANATTQMQGNYYGPAPYLTSPLA 461
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA-DQSIEAESHDRLDLLL 470
L A ++ A +N+ T AAA ++ G D +IE E+ DR+++
Sbjct: 462 ALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITW 521
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q LI +++ + K P++++ M GG D S K++ ++++LW GYPGQ+GG A+ D+
Sbjct: 522 PGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDI 580
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G P+GRL T YP Y
Sbjct: 581 IRGVRAPAGRLVTTQYPAGY 600
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 198/560 (35%), Positives = 286/560 (51%), Gaps = 65/560 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L+ C+ S R LV TL+E + ++ V RLG+PKY+ WSE+LH
Sbjct: 71 ENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLH 130
Query: 104 GV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
GV ++ G + ATSFPQ ILT A+ N +L IG ++ST+ARA NVG G
Sbjct: 131 GVYRANWASEGDY-----SWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYG 185
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
L ++PNIN FR P WGRGQETPGED LAS YA Y+ G+Q G P LK+ A K
Sbjct: 186 LDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQ--GGVDPETLKLVATAK 243
Query: 220 HYTAYDLDNWKGTDR--------------------------------------------- 234
HY YD++NW G R
Sbjct: 244 HYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSC 303
Query: 235 ---YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+ ++ T+ ++ Y+ DC +V +N Y A+A SI AG D++CG+
Sbjct: 304 SNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDIDCGT 363
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
H A G ++ I+ + + L+RLG+FDG+ S+ Y L DV T
Sbjct: 364 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQ--YRDLTWSDVQTTDAW 421
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+++ +AA +G VLLKN G+LPL+ +I+++A+IGP AN T M GNY G T+PL
Sbjct: 422 NISHEAAVEGTVLLKND-GTLPLA-DSIRSVALIGPWANATTQMQGNYYGPAPYLTSPLA 479
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA-DQSIEAESHDRLDLLL 470
L A ++ A +N+ T AAA ++ G D +IE E+ DR+++
Sbjct: 480 ALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITW 539
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q LI +++ + K P++++ M GG D S K++ ++++LW GYPGQ+GG A+ D+
Sbjct: 540 PGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDI 598
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G P+GRL T YP Y
Sbjct: 599 IRGVRAPAGRLVTTQYPAGY 618
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 209/574 (36%), Positives = 283/574 (49%), Gaps = 103/574 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL R DLV ++TL+EK + L A V RLGIP+Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA F+ + I V++TE RA YN G+TFW
Sbjct: 64 --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRG ET GEDP L S+ +VKGLQ G LK AAC KHY +
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----GDGKYLKTAACAKHYAVH 171
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+ F+A+V +Y YL F+
Sbjct: 172 ---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGSKTL 228
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
++VSDC ++ + TKTP E+ A ++ +G DLNCG+
Sbjct: 229 LKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMY-LLIL 287
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDA 357
A+K GL+ E ID A T M+LG FD P ++ A + ++L+A
Sbjct: 288 LALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNI----PYELNDSAEHNKISLEA 343
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A++ +VLLKN G LPL IKN+AVIGPNA+ + + NY GTP + T ++G+ V
Sbjct: 344 AKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRKRV 402
Query: 418 A----TIYQAGCS---NVQCGTAQVDDAKKAAASA----DATVLVMGADQSIEAESH--- 463
+ Y G N AQ DD K A SA D VL +G D S+E E +
Sbjct: 403 SENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQNDQG 462
Query: 464 --------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
D+ DL LP Q+ L+ V K P I+ ++SG I A + K +I+
Sbjct: 463 TVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD--KAAAIVQ 519
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
YPG GG A A++ FG Y+P+GRLP+T+Y +
Sbjct: 520 CWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKST 553
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 193/563 (34%), Positives = 290/563 (51%), Gaps = 68/563 (12%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CD N L+ C+T R L+ TL+E + +++ V RLG+P Y+ W+
Sbjct: 52 CD---NGPLSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWN 108
Query: 100 EALHGV--SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
EALHG+ +Y FS +TSFP ILT ++ N +L + ++ST+ RA N G
Sbjct: 109 EALHGLDRAYFTDEGQFS----WSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAG 164
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAA 216
GL +SPNIN FR P WGRGQETPGED L+S YA Y+ G+Q G P LK+ A
Sbjct: 165 RYGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQ--GGVDPKSLKLVA 222
Query: 217 CCKHYTAYDLDNWKGTDR------------------------------------------ 234
KHY YD++NW G R
Sbjct: 223 TAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGV 282
Query: 235 ------YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
+ ++ T+ ++ YI SDCDS ++N Y AAA SI AG D++
Sbjct: 283 PSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTDID 342
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
CG+ + + AV L++ + I+ + ++ LMRLG+FDG+ S Y L DV T
Sbjct: 343 CGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSA--YRNLTWNDVVTT 400
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
+ +++ + +G VLLKN G+LPLS +I+++A++GP NV+ + GNY G +
Sbjct: 401 NSWNISYEV--EGTVLLKND-GTLPLS-ESIRSIALVGPWMNVSTQLQGNYFGPAPYLIS 456
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
PL + A +N+ + A AA +DA + G D S+EAE+ DR++
Sbjct: 457 PLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRMN 516
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
+ PG+Q LI +++++ K P+I++ M GG D S K++ + S++W GYPGQ+GG A+
Sbjct: 517 ITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQAL 575
Query: 528 ADVCFGRYNPSGRLPMTWYPQSY 550
D+ G+ P+GRL +T YP Y
Sbjct: 576 LDIITGKRAPAGRLVVTQYPAEY 598
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 197/561 (35%), Positives = 289/561 (51%), Gaps = 89/561 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N ++ RV DL+ R+TL+EKI+ + +S ++ RLGIP Y++WSEALHGV+ G
Sbjct: 8 YLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-- 65
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT FPQ I AA+++ L + + +++EARA ++ L GLT
Sbjct: 66 --------ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTM 117
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
WSPNINIFRDPRWGRGQET GEDP L + +V+GLQ G P+ LK AAC KHYT
Sbjct: 118 WSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQ---GKDPHYLKTAACAKHYTV 174
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ + +R+ FNA+V ++ YL FK
Sbjct: 175 H---SGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSPY 231
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC-GSFLGKH 296
++VSDC +++ + TK E+AA I G D+ C ++ ++
Sbjct: 232 LLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYEN 291
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
A+ GL+ E IDHA+ N T +LG FD K PY + V +A++ LA +
Sbjct: 292 LTEALNRGLITEEDIDHALRNTLRTRFKLGLFDPQ-EKVPYAHISMSVVGCEAHRKLAYE 350
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
A + VLLKN LP+ P +K++ ++GPNA ++GNY G TT ++GL
Sbjct: 351 TAVKSAVLLKNHNHILPVKPD-VKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVGR 409
Query: 417 ----VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE--------SHD 464
V + G + D + +AAS D + MG +E E + D
Sbjct: 410 LPEGVRMEFMPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNGD 469
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R D+ LP QQ I ++A A G I+++++GG I+ + + +ILWVGYPGQ GG
Sbjct: 470 REDIALPKAQQEYIRDLA--ATGAKIVLVLTGGS-AIALNGIEDLVEAILWVGYPGQEGG 526
Query: 525 AAIADVCFGRYNPSGRLPMTW 545
AIAD+ FG ++PSG+LP+T+
Sbjct: 527 RAIADLIFGDHSPSGKLPITF 547
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 213/566 (37%), Positives = 294/566 (51%), Gaps = 66/566 (11%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSV----SRLGIP 93
F+ SN L+ C+T+ R L+ T +E LV+S G+V RLG+P
Sbjct: 45 FSFPDCSNGPLSKTMVCDTTAKPHDRAAALIAMFTFEE----LVNSTGNVMPAIPRLGLP 100
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+ W+EALHG+ T F + ATSFP ILT A+ N +L IG +VST+ RA
Sbjct: 101 PYQVWNEALHGLDRANL-TEFGDY-SWATSFPSPILTMAALNRTLINQIGGIVSTQGRAF 158
Query: 154 YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
N G GL +SPNIN FR P WGRGQETPGED L S Y Y+ GLQ G P LK
Sbjct: 159 NNGGRYGLDVYSPNINSFRHPVWGRGQETPGEDIQLCSVYGLEYITGLQ--GGLDPKELK 216
Query: 214 VAACCKHYTAYDLDNWK-----GTD-----------------------RYH-----FNA- 239
+AA KH+ YD++NW G D R H +NA
Sbjct: 217 LAATAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAV 276
Query: 240 --------------MVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++ T+ ++ Y+ SDCDSV ++N Y + AAAKSI AG
Sbjct: 277 NGVPASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGT 336
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++CG+ + + G ++ S I+ A + ++ L+ LG+FDG SK Y L DV
Sbjct: 337 DIDCGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDV 394
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+++ +AA +GIVLLKN G+LPLS ++A+IGP ANVT TM GNY G
Sbjct: 395 VATDAWNISYEAAVEGIVLLKND-GTLPLSKDT-HSVALIGPWANVTTTMQGNYYGAAPY 452
Query: 406 YTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHD 464
T PL L A + A +N+ T + A AA +D + G D S+EAE D
Sbjct: 453 LTGPLAALQASDLDVNYAFGTNISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVD 512
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R + PG Q LI +++++ K P++++ M GG D S K + + S++W GYPGQ+GG
Sbjct: 513 RETITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGG 571
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AI D+ G+ P+GRL +T YP Y
Sbjct: 572 PAILDILTGKRAPAGRLTVTQYPAEY 597
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 199/555 (35%), Positives = 285/555 (51%), Gaps = 65/555 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
C+++ R LV TL+E I +++ V RLG+P Y+ WSEALHG++ +
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLARA----N 115
Query: 114 FSN--VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
F++ ATSFP IL+AA+FN +L I ++ST+ RA N G GL +SPNIN F
Sbjct: 116 FTDNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTF 175
Query: 172 RDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
R P WGRGQETPGED L + YA Y+ G+Q G +P LK+AA KH+ YD++NW
Sbjct: 176 RHPVWGRGQETPGEDAYTLTAAYAYEYITGIQ--GGVNPEHLKLAATAKHFAGYDIENWD 233
Query: 231 GTDR------------------------------------------------YHFNAMVI 242
R + ++
Sbjct: 234 NHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLR 293
Query: 243 YTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVK 302
T+ + Y+ DC +V ++N Y AAA +ILAG D++CG+ H ++
Sbjct: 294 DTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESIT 353
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQG 361
G V I+ +A L+ LG+FDG+ S PY LG DV +++ +AA +G
Sbjct: 354 TGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEG 413
Query: 362 IVLLKNTAGSLPL-SPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
IVLLKN G+LPL SP+ KN +A+IGP AN T + GNY G +P+ A
Sbjct: 414 IVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGY 472
Query: 419 TIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
T++ A + + TA A AA +AD V + G D +IEAE+ DR + PG Q L
Sbjct: 473 TVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLEL 532
Query: 478 ITEVA--KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRY 535
I+++A K P+++ M GG D S K++ K+ ++LW GYPGQ+GG A+ D+ G
Sbjct: 533 ISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSGGLALRDILTGAR 592
Query: 536 NPSGRLPMTWYPQSY 550
P+GRL T YP +Y
Sbjct: 593 APAGRLTTTQYPAAY 607
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 204/550 (37%), Positives = 287/550 (52%), Gaps = 65/550 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV---SYVGP 110
C++S G R L+ TL+E I +S V RLG+P Y+ W+EALHG+ ++
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G F ATSFP ILT A+ N +L I ++ST+ARA N G GL ++PN+N
Sbjct: 123 GGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178
Query: 171 FRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
FR P WGRGQETPGED L+S Y Y+ G+Q G P +LKVAA KH+ YDL+NW
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQ--GGVDPEQLKVAATVKHFAGYDLENW 236
Query: 230 KGTDRYHFNAMV----IYTYYLIKF----------------------------------- 250
R F+A++ + YY +F
Sbjct: 237 NNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCANSFFLQTLL 296
Query: 251 ---------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAV 301
Y+ SDCD+V ++N Y AAA S+ AG D++CG H +
Sbjct: 297 RESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESF 356
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
AG V I+ +++ +A L+RLG+FD K Y LG KDV +++ +AA +G
Sbjct: 357 VAGEVTRGEIERSVTRLYANLVRLGYFD---KKNQYRSLGWKDVVKTDAWNISYEAAVEG 413
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI- 420
IVLLKN G+LPLS ++++A+IGP AN T M GNY G +PL+ +
Sbjct: 414 IVLLKND-GTLPLS-KKVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKKAGYHVN 471
Query: 421 YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
++ G TA A AA +DA V + G D +IE E DR D+ PG Q LI +
Sbjct: 472 FELGTEIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPGNQLDLIKQ 531
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
+++V K P++++ M GG D S K++ K+ S++W GYPGQ+GG A+ D+ G+ P+GR
Sbjct: 532 LSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGR 590
Query: 541 LPMTWYPQSY 550
L T YP Y
Sbjct: 591 LITTQYPAEY 600
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 203/580 (35%), Positives = 289/580 (49%), Gaps = 92/580 (15%)
Query: 37 VFACDVVSNPSLASLG---FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+F C +V + + + N L + R LV +T+ EKIT L S ++ RL +P
Sbjct: 11 MFTCHLVGAAQINHVKQQIWFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVP 70
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
+Y WW+EALHG++ G AT FPQ I A+F+ L Q + +S EARA
Sbjct: 71 QYNWWNEALHGIARNGK----------ATIFPQAIGLGATFDPELAQEVANAISDEARAK 120
Query: 154 YNVG--------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD 205
Y + AGLTFW+PN+NIFRDPRWGRGQET GEDPLL S+ T +VKGLQ
Sbjct: 121 YAIAQSIGNQGQYAGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQ--- 177
Query: 206 GGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------- 251
G P LK A KH+ + + + R+ F+ +Y YL F+
Sbjct: 178 GDDPKYLKSAGVAKHFAVH---SGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAG 234
Query: 252 --------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
Y+VSDC ++ ++ T E+AA
Sbjct: 235 VMCAYNGVYGQPSCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAAL 294
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYG 338
++ AG+DLNCG K +AA + GL+ +S ID + N RLG FD PS+ P+
Sbjct: 295 ALRAGVDLNCGFTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFD--PSELNPHN 352
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
+G + + + + +LA A + IVLLKN LPLS IK V GP A + ++GN
Sbjct: 353 AIGQEVIHSLEHIELARKVAAKSIVLLKNEKQVLPLSKD-IKVPYVTGPFAASSDMLMGN 411
Query: 399 YEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA 454
Y G T L+G+A V + Y+AG ++ A + A +ADA + V+G
Sbjct: 412 YYGISDSLVTVLEGIAGKVSLGSSLNYRAGALPFHSNINPLNWAPEVAKTADAVIAVVGI 471
Query: 455 DQSIEAES---------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
+E E DR+ + LP Q + ++A+ KGP+IL++ +G DIS +
Sbjct: 472 SADMEGEEVDAIASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--E 529
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
DP +ILW+ YPG+ GG A+ADV FG NPSG LP+T+
Sbjct: 530 LDPLADAILWIWYPGEQGGNAVADVIFGDTNPSGHLPLTF 569
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 200/550 (36%), Positives = 286/550 (52%), Gaps = 65/550 (11%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV---SYVGP 110
C++S G R L+ TL+E I +S V RLG+P Y+ W+EALHG+ ++
Sbjct: 63 CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G F ATSFP IL+ A+ N +L I ++ST+ARA N G GL ++PNIN
Sbjct: 123 GGQFQ----WATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNING 178
Query: 171 FRDPRWGRGQETPGEDP-LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
FR P WGRGQETPGED +L S Y Y+ G+Q G P LK+AA KH+ YDL+NW
Sbjct: 179 FRSPLWGRGQETPGEDANVLTSAYTYEYITGMQ--GGVDPENLKIAATAKHFAGYDLENW 236
Query: 230 KGTDRYHFNAMV----IYTYYLIKF----------------------------------- 250
R F+A++ + YY +F
Sbjct: 237 NNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFFLQTLL 296
Query: 251 ---------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAV 301
Y+ SDCD+V ++N Y AAA S+ AG D++CG H +
Sbjct: 297 RESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHLNESF 356
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
AG V+ I+ +++ +A L+RLG+FD K Y LG KDV +++ +AA +G
Sbjct: 357 VAGEVSRGEIERSVTRLYANLVRLGYFD---KKNEYRSLGWKDVVKTDAWNISYEAAVEG 413
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI- 420
IVLLKN G+LPLS ++++A+IGP AN T M GNY G +PL+ +
Sbjct: 414 IVLLKND-GTLPLS-KKVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAGYQVN 471
Query: 421 YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
++ G TA A AA +DA + G D ++E E DR D+ PG Q LI +
Sbjct: 472 FELGTETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQLDLIKQ 531
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
++++ K P++++ M GG D S K++ K+ S++W GYPGQ+GG A+ D+ G+ P+GR
Sbjct: 532 LSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRAPAGR 590
Query: 541 LPMTWYPQSY 550
L T YP Y
Sbjct: 591 LVSTQYPADY 600
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 195/556 (35%), Positives = 279/556 (50%), Gaps = 98/556 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LVK++TL+EK+ ++ A ++ RLGI Y WW+E LHGV+ G AT FPQ
Sbjct: 28 LVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------ATIFPQ 77
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + +G+ VSTEARA Y++ GLT W+PNINIFRDPRWGR
Sbjct: 78 AIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRDPRWGR 137
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYH 236
G ET GEDP L S+ Y++GLQ G LK AAC KH+ + G + R+
Sbjct: 138 GHETYGEDPWLTSRLGIRYIRGLQ---GSHEKYLKTAACVKHFAVH-----SGPEELRHS 189
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
F+A V + YL F+
Sbjct: 190 FDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGNEYLLETILRKEWGFH 249
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDC ++ + T +P E+ + ++ G DLNCG+ + AVK G V E
Sbjct: 250 GHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLF-TYLIQAVKEGKVKEER 308
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D A+ F T ++LG PY + +V + A + L AA + +VLLKNT G
Sbjct: 309 LDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLNRSAAGKSVVLLKNTEG 368
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC- 425
LP+ K + VIGPNA+ + ++GNYEGT +Y T L+G+ +Y GC
Sbjct: 369 LLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIREAAEPEARVLYSEGCH 428
Query: 426 ---SNVQCGTAQVD---DAKKAAASADATVLVMGADQSIEAES---------HDRLDLLL 470
SNV A+ D + K +D + MG D ++E E D+ DL+L
Sbjct: 429 LYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQGDTGNIYAGGDKPDLML 488
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG QQ ++ E A + PV+L++++G +++A D + +IL YPG GG +ADV
Sbjct: 489 PGLQQKIL-ETAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAILTAWYPGAEGGRGVADV 545
Query: 531 CFGRYNPSGRLPMTWY 546
FG NP GRLP+T+Y
Sbjct: 546 LFGTVNPEGRLPVTFY 561
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 206/566 (36%), Positives = 293/566 (51%), Gaps = 66/566 (11%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG----SVSRLGIP 93
F+ SN L+ C+T+ R L TL+E LV+S G +V RLG+P
Sbjct: 46 FSFPDCSNGPLSKTIVCDTTAKPHDRAAALTSMFTLEE----LVNSTGNVIPAVPRLGLP 101
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
Y+ WSEALHG+ S ATSFP IL A+ N +L IG+++ST+ RA
Sbjct: 102 PYQVWSEALHGLDRANLTE--SGDYSWATSFPSPILIMAALNRTLINQIGEIISTQGRAF 159
Query: 154 YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
N G GL ++PNIN FR P WGRGQETPGED L S Y Y+ G+Q G +P LK
Sbjct: 160 NNGGRYGLDVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQ--GGLNPRDLK 217
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF------------------- 250
+AA KH+ YDL+NW R N + + +YY +F
Sbjct: 218 LAATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAV 277
Query: 251 -------------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
Y+ SDCD+V ++N Y + AAAKSI AG
Sbjct: 278 NGVPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSIQAGT 337
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++CG+ + ++ ++ S I+ A++ ++TL+ LG+FDG SK Y L DV
Sbjct: 338 DIDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSK--YRHLHWPDV 395
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+++ +AA +GIVLLKN G+LPLS +++A+IGP ANVT T+ GNY G
Sbjct: 396 VATDAWNISYEAAVEGIVLLKND-GTLPLS-NNTRSVALIGPWANVTTTLQGNYYGAAPY 453
Query: 406 YTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
T PL L A + A +N+ T+ + A AA ++ + G D ++EAE D
Sbjct: 454 LTGPLAALQASNLDVNYAFGTNISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVD 513
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R + PG Q LI +++K+ K P++++ M GG D S K + + S++W GYPGQ+GG
Sbjct: 514 RESITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGG 572
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
AI D+ G+ P+GRL +T YP Y
Sbjct: 573 PAILDILTGKRAPAGRLTVTQYPAEY 598
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 210/562 (37%), Positives = 282/562 (50%), Gaps = 66/562 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L S C+TS R LV + EK+ LV+++ VSRLG+ Y+WW+EALH
Sbjct: 29 ANGPLKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALH 88
Query: 104 GVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF 163
GV++ G + AT FPQ I T+A+F+ +L + IG ++STEARA N G A L F
Sbjct: 89 GVAH-NRGITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDF 147
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+N FRDPRWGRG ETPGED K+A +VKG+Q G P +V A CKHY A
Sbjct: 148 WTPNVNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQ---GPGPTH-RVIATCKHYAA 203
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKF----------------------------- 250
YDL+N T R++F+A V + YYL F
Sbjct: 204 YDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACANPY 263
Query: 251 ----------------KYIVSDCDSVDVLYNS---QHYTKTPEEAAAKSILAGLDLNCGS 291
+YIVSDCD+V L N+ Y + A S+ AG D C +
Sbjct: 264 LMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRYKPSYAAAIGASLEAGCDNMCWA 323
Query: 292 FLGKHTE--AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G + +A +G +++ +D AI L+ G+FDG P Y L DV TQ
Sbjct: 324 TGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDG-PGGM-YRNLSVADVNTQT 381
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIK-NLAVIGPNANVTKTMIGNYEGTPCKYTT 408
QD AL AA GIVLLKN G LPLS +A+IG AN M+G Y G+P
Sbjct: 382 AQDTALKAAEGGIVLLKND-GILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPFNHD 440
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
P+ ++ T+ Q A AA ++A V G D ++E ES DR +
Sbjct: 441 PVTAARSMGITVNYVNGPLTQP-NGDTSAALNAAQKSNAVVFFGGIDNTVEKESQDRTSI 499
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
P Q LI +A+ K PVI++ + G D + + P + +ILW GYPGQ GG A+
Sbjct: 500 EWPSGQLALIRRLAETGK-PVIVVRL-GTHVDDTPLLSIPNVRAILWAGYPGQDGGTAVV 557
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
+ G +P+GRLP T YP SY
Sbjct: 558 KIITGLASPAGRLPATVYPSSY 579
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 143/267 (53%), Positives = 189/267 (70%), Gaps = 1/267 (0%)
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCG++L +T++AV+ V+ES ID A+ N F+ MRLG F+G+P+K PYG +
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VC+Q +Q +AL+AAR GIVLLKN+ LPLS + +LA+IGPNA+ + ++GNY G PC
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
K TP QGL + T Y GCS V C +A +D A K A AD VLVMG DQ+ E E H
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAIDQAIKIAKEADQVVLVMGLDQTQEREEH 180
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR+DL+LPG+QQ LI VA+ AK PV+L+++ GG DISFAK D I ILW GYPG+AG
Sbjct: 181 DRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEAG 240
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+A++ FG +NP GRLP+TWYPQ +
Sbjct: 241 GIALAEIIFGNHNPGGRLPVTWYPQDF 267
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 208/576 (36%), Positives = 295/576 (51%), Gaps = 89/576 (15%)
Query: 42 VVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
+V+ L F +T I+ R LV ++TL+EKI L ++A ++SRL +P Y+WW+EA
Sbjct: 10 IVTFSQQVDLSFLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEA 69
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---- 157
LHGV+ G AT FPQ I A+F+ L + +STEARA Y +
Sbjct: 70 LHGVARNGK----------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMG 119
Query: 158 ----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
AGLTFW+PN+NIFRDPRWGRGQET GEDP L ++ +VKGLQ G PN LK
Sbjct: 120 NHSRYAGLTFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQ---GDDPNYLK 176
Query: 214 VAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------ 251
AAC KHY + + + R FNA+ +Y YL F+
Sbjct: 177 SAACAKHYAVH---SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAV 233
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
Y+V+DC ++ + Y + AAA ++ AG +L
Sbjct: 234 FGAPMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNL 293
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
NCG+ K + A+ GLV E + F T RLG FD SK PY ++GP+ + +
Sbjct: 294 NCGATY-KELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHS 352
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+ + +LA +AA++ IV+LKN LPL PT IK V GP AN + ++G+Y G
Sbjct: 353 KEHIELAREAAQKSIVMLKNKNNLLPL-PTDIKVPYVTGPFANSSDMLMGSYYGVSPGVV 411
Query: 408 TPLQGLAAVV----ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG--ADQSIE-- 459
T L G+ V + Y++G Q + A A +D T+ V+G AD+ E
Sbjct: 412 TILAGITDAVSLGTSLNYRSGALPFQKNINPKNWAPNVAGMSDVTICVVGLTADREGEGV 471
Query: 460 ---AESH--DRLDLLLPGQQQLLITEV-AKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
A +H DRLDL LP Q + ++ AK P++L+I SG +S + +I
Sbjct: 472 DAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGS--PVSLEGIEEHCDAI 529
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
L + YPG+ GG A+ADV FG+ +P+G LPMT +P+S
Sbjct: 530 LQIWYPGEQGGNAVADVLFGKVSPTGHLPMT-FPKS 564
>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
Length = 303
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 158/257 (61%), Positives = 187/257 (72%), Gaps = 10/257 (3%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
KV V+ +FL L HSS S + P+FACD +N +L FC ++ I +RV DL+ RL
Sbjct: 13 KVVVILVFLL--CLVHSSESLR--PLFACDP-ANGLTRTLRFCRANVPIHVRVQDLLGRL 67
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TLQEKI LV++A +V RLGI YEWWSEALHG+S VGPG F PGATSFPQVI TA
Sbjct: 68 TLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTA 127
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
ASFN SL++ IG+VVS EARAMYN G+AGLT+WSPN+NI RDPRWGRGQETPGEDP++A+
Sbjct: 128 ASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAA 187
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK 251
KYA YV+GLQ T G NRLKVAACCKHYTAYDLDNW G DR+HFNA V + L
Sbjct: 188 KYAASYVRGLQGTAAG--NRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVNLLHILYISN 245
Query: 252 YIVSD---CDSVDVLYN 265
+ S C+ + LYN
Sbjct: 246 IVYSKKIFCNLLSNLYN 262
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 200/559 (35%), Positives = 282/559 (50%), Gaps = 100/559 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+ +LV ++TL+EK + L A ++ RLG+P Y WW+EALHGV+ G AT
Sbjct: 10 KAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------AT 59
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
SFPQ I AA+F+ L + +G V+ E RA YN GLTFWSPN+NIFRDP
Sbjct: 60 SFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRDP 119
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ YV+GLQ + + +K AAC KH+ + + + R
Sbjct: 120 RWGRGHETYGEDPYLTSRLGVAYVEGLQGSQ--DDDFMKTAACAKHFAVH---SGPESVR 174
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+A +Y YL F+
Sbjct: 175 HEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQNILREEWD 234
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVN 307
+ VSDC ++ + TKTPEE+AA ++ +G D+NCG ++L H A + GLV
Sbjct: 235 FQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCGVTYL--HLLKAYQQGLVT 292
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E I A F T LG FD K Y + + V + + +LA A++ +VLLKN
Sbjct: 293 EEEITQAAERLFTTRFLLGCFD----KNEYDDIPYEVVECKEHLELAQKMAKESMVLLKN 348
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQA 423
G LPL+ +K + VIGPNA+ ++GNY GT +Y T L+G+ V Y
Sbjct: 349 D-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGEDVRVYYSE 407
Query: 424 GC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLD 467
GC + ++ +A A +D VL +G D+++E E S D+ D
Sbjct: 408 GCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSYASGDKKD 467
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
L LP Q+ L+ VA K PV+L +MSG D+ FA + +IL V YPG GG A
Sbjct: 468 LELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAE--HVNAILQVWYPGARGGKAA 524
Query: 528 ADVCFGRYNPSGRLPMTWY 546
A++ FG +PSG+LP+T+Y
Sbjct: 525 AEILFGACSPSGKLPVTFY 543
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 305 bits (780), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 198/553 (35%), Positives = 280/553 (50%), Gaps = 60/553 (10%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTH 113
C+++ R LV TL+E I +++ V RLG+P Y+ WSEALHG+ T
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGRAN-FTD 118
Query: 114 FSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRD 173
+ G SFP IL+AA+FN +L I ++ST+ RA N G GL +SPNIN FR
Sbjct: 119 NGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTFRH 178
Query: 174 PRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
P WGRGQETPGED L + YA Y+ G+Q G +P LK+AA KH+ YD++NW
Sbjct: 179 PVWGRGQETPGEDAYTLTAAYAYEYITGIQ--GGVNPEHLKLAATAKHFAGYDIENWDNH 236
Query: 233 DR------------------------------------------------YHFNAMVIYT 244
R + ++ T
Sbjct: 237 SRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTLLRDT 296
Query: 245 YYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAG 304
+ + Y+ DC +V ++N Y AAA +ILAG D++CG+ H ++ G
Sbjct: 297 FSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNESITTG 356
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGIV 363
V I+ +A L+ LG+FDG+ S PY LG DV +++ +AA +GIV
Sbjct: 357 AVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAVEGIV 416
Query: 364 LLKNTAGSLPL-SPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
LLKN G+LPL SP+ KN +A+IGP AN T + GNY G +P+ A T+
Sbjct: 417 LLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAAGYTV 475
Query: 421 YQAGCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
+ A + + TA A AA +AD V + G D +IEAE+ DR + PG Q LI+
Sbjct: 476 HYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQLELIS 535
Query: 480 EVA--KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
++A K P+++ M GG D S K + K+ ++LW GYPGQ+GG A+ D+ G P
Sbjct: 536 QLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSGGLALRDILTGARAP 595
Query: 538 SGRLPMTWYPQSY 550
+GRL T YP +Y
Sbjct: 596 AGRLTTTQYPAAY 608
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 200/562 (35%), Positives = 281/562 (50%), Gaps = 108/562 (19%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+ +LV ++TL+EK + L ++ ++ RLGIP Y WW+EALHGV+ G AT
Sbjct: 10 KAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT----------AT 59
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA F+ I ++ EARA YN GLT WSPNINIFRDP
Sbjct: 60 VFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRDP 119
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY----DLDNWK 230
RWGRG ET GEDP L+ K ++KGLQ G + + AAC KH+ AY DL
Sbjct: 120 RWGRGHETYGEDPFLSGKLGVAFIKGLQ----GDKDVMMTAACVKHFAAYSGPEDL---- 171
Query: 231 GTDRYHFNAMV----IYTYYLIKFK----------------------------------- 251
R+ FNA V ++ YL F+
Sbjct: 172 ---RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYTLLRDILR 228
Query: 252 -------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAG 304
++VSDC ++ + TKTPEE+ A +I AG DLNCG+ A++ G
Sbjct: 229 EKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGNMY-LMLLIALQEG 287
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ E I A F T +LG F+G + + + V ++++A++AAR+ VL
Sbjct: 288 LITEEHITRAAVRIFTTRFKLGLFEG----SEFDNIPYEVVECSEHKEMAIEAARKSAVL 343
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TI 420
LKN G LP++ AIK + VIGPNAN + GNY GT +Y T L+G+ V +
Sbjct: 344 LKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGDEVRVL 402
Query: 421 YQAGCSNVQCGTA-------QVDDAKKAAASADATVLVMGADQSIEAE---------SHD 464
Y GC V+ T ++ +A A +D VL +G D++IE E S D
Sbjct: 403 YSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNNGGSGD 462
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
+ DL LP Q+ L+ ++ K P +L +M+G ++S+A IL YPG GG
Sbjct: 463 KKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAINLSYAHE--HCNGILLTWYPGARGG 519
Query: 525 AAIADVCFGRYNPSGRLPMTWY 546
A+AD+ FG +PSG+LP+T+Y
Sbjct: 520 KAVADILFGNASPSGKLPVTFY 541
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 207/561 (36%), Positives = 285/561 (50%), Gaps = 103/561 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+ G LV ++ L EK + L + ++ RLG+P Y WWSEALHGV+ G AT
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA F+ I +++TEARA YN G+T W+PNINIFRDP
Sbjct: 58 VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ ++ GLQ + + + K AAC KH+ + + +R
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDE--NHHYWKAAACAKHFAVH---SGPEEER 172
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+HF+A+V +Y YL F+
Sbjct: 173 HHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDILKEEWG 232
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVN 307
Y+VSDC ++ + T T E+AA +I G LNCG ++L H A K GLV
Sbjct: 233 FDGYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGNTYL--HMLQAYKEGLVT 290
Query: 308 ESAIDHAISNNFATLMRLGFFDGH--PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
E I + A M+LG FD + +K PY + C + ++D+ALD AR+ +VLL
Sbjct: 291 EETITKSAQKLMAIRMKLGLFDKNCEYNKIPY----EVNDC-KVHRDIALDVARRSMVLL 345
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQ 422
KN G LPL+ K + VIGP AN + GNY GT +YTT L+G+ V A +Y
Sbjct: 346 KNN-GILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVYY 404
Query: 423 A-GC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDR 465
A GC S + ++ +A A +D +L +G D SIE E + D+
Sbjct: 405 AEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTGNAFAAGDK 464
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
DL L G+QQLL+ EV K+ K P ILI+ SG I A+ +IL YPGQ+GG
Sbjct: 465 SDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAILETWYPGQSGGK 521
Query: 526 AIADVCFGRYNPSGRLPMTWY 546
A+A + FG Y+PSG+LP+T+Y
Sbjct: 522 ALAQLLFGEYSPSGKLPITFY 542
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 191/560 (34%), Positives = 285/560 (50%), Gaps = 65/560 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L+ C+TS R LV T +E + +++ V RLG+P Y+ WSEALH
Sbjct: 53 ENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALH 112
Query: 104 GV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
G+ ++ G + ATSFP ILT ++ N +L I +++T+ RA NVG G
Sbjct: 113 GLDRANFTDEGEY-----SWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYG 167
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
L ++PNIN FR WGRGQETPGED LAS YA Y+ G+Q G P LK+ A K
Sbjct: 168 LDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQ--GGVDPEHLKLVATAK 225
Query: 220 HYTAYDLDNWKGTDR--------------------------------------------- 234
HY YDL+NW G R
Sbjct: 226 HYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSC 285
Query: 235 ---YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+ ++ T+ ++ Y+ SDCDS ++N + AAA SI AG D++CG+
Sbjct: 286 ANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGT 345
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ A V + I+ + ++ L+RLG+FDG+ S Y L DV T
Sbjct: 346 TYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAW 403
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+++ +AA +GIVLLKN G+LPL+ +++++A+IGP NVT + GNY G +PL
Sbjct: 404 NISYEAAVEGIVLLKND-GTLPLA-KSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLN 461
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
+ A +N+ T +A AA +D + G D ++EAE+ DR+++
Sbjct: 462 AFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITW 521
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q LI +++++ K P+I++ M GG D S K++ + S++W GYPGQ+GG A+ D+
Sbjct: 522 PGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDI 580
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G+ P+GRL +T YP Y
Sbjct: 581 ITGKRAPAGRLVVTQYPAEY 600
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 196/552 (35%), Positives = 287/552 (51%), Gaps = 96/552 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ L+ +++++EK ++ A + RLGIP Y WW+EALHGV+ G AT
Sbjct: 6 RMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------AT 55
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYN-VGLA-------GLTFWSPNINIFRDP 174
FPQ I AA+F+ L + + + +STEARA +N +G GLTFWSPNINI+RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRDP 115
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRGQET GEDP L SK +VKGLQ G P ++VAAC KHY + G +
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVSFVKGLQ---GDHPYYMRVAACAKHYAVH-----SGPEG 167
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ F+A V ++ YL F+
Sbjct: 168 LRHVFDARVSEKDLWETYLPAFEALVKAGVEAVMGAYNRVNGEPACGSKRLLDEILRKRW 227
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
++VSDC ++ + TK P E+ A ++ AG DLNCG+ +H AVKAG+V+
Sbjct: 228 GFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTY-EHLLDAVKAGVVS 286
Query: 308 ESAIDHAISNNFATLMRLGFF-DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D +++ +TL RLG F D HP Y +L D+ +A++ LA +AA + +VLLK
Sbjct: 287 EELVDRSVARLLSTLDRLGLFTDDHP----YARLSLSDIDWEAHRALAREAAEKSVVLLK 342
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQ 422
N G LP ++ + V GPNA ++GNY G + T L+G+ + Y+
Sbjct: 343 NN-GILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYK 401
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL---------DLLLPGQ 473
GC +D A A AD TV VMG D ++E E D + DL LP +
Sbjct: 402 IGCPLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDLPRE 461
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q + + ++ K P++++++SG + + + +I++ YPG+ GG AIA V FG
Sbjct: 462 QIEYLRRIKEIGK-PLVVVLLSGA--PVCSPELEELADAIVYAWYPGEEGGNAIARVLFG 518
Query: 534 RYNPSGRLPMTW 545
+PSGRLP+T+
Sbjct: 519 EISPSGRLPITF 530
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 191/560 (34%), Positives = 285/560 (50%), Gaps = 65/560 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L+ C+TS R LV T +E + +++ V RLG+P Y+ WSEALH
Sbjct: 53 ENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALH 112
Query: 104 GV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
G+ ++ G + ATSFP ILT ++ N +L I +++T+ RA NVG G
Sbjct: 113 GLDRANFTDEGEY-----SWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYG 167
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
L ++PNIN FR WGRGQETPGED LAS YA Y+ G+Q G P LK+ A K
Sbjct: 168 LDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQ--GGVDPEHLKLVATAK 225
Query: 220 HYTAYDLDNWKGTDR--------------------------------------------- 234
HY YDL+NW G R
Sbjct: 226 HYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSC 285
Query: 235 ---YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+ ++ T+ ++ Y+ SDCDS ++N + AAA SI AG D++CG+
Sbjct: 286 ANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGT 345
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ A V + I+ + ++ L+RLG+FDG+ S Y L DV T
Sbjct: 346 TYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAW 403
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+++ +AA +GIVLLKN G+LPL+ +++++A+IGP NVT + GNY G +PL
Sbjct: 404 NISYEAAVEGIVLLKND-GTLPLA-KSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLN 461
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
+ A +N+ T +A AA +D + G D ++EAE+ DR+++
Sbjct: 462 AFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITW 521
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q LI +++++ K P+I++ M GG D S K++ + S++W GYPGQ+GG A+ D+
Sbjct: 522 PGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDI 580
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G+ P+GRL +T YP Y
Sbjct: 581 ITGKRAPAGRLVVTQYPAEY 600
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 191/560 (34%), Positives = 285/560 (50%), Gaps = 65/560 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L+ C+TS R LV T +E + +++ V RLG+P Y+ WSEALH
Sbjct: 53 ENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALH 112
Query: 104 GV---SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
G+ ++ G + ATSFP ILT ++ N +L I +++T+ RA NVG G
Sbjct: 113 GLDRANFTDEGEY-----SWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYG 167
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
L ++PNIN FR WGRGQETPGED LAS YA Y+ G+Q G P LK+ A K
Sbjct: 168 LDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQ--GGVDPEHLKLVATAK 225
Query: 220 HYTAYDLDNWKGTDR--------------------------------------------- 234
HY YDL+NW G R
Sbjct: 226 HYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSC 285
Query: 235 ---YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+ ++ T+ ++ Y+ SDCDS ++N + AAA SI AG D++CG+
Sbjct: 286 ANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDIDCGT 345
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ A V + I+ + ++ L+RLG+FDG+ S Y L DV T
Sbjct: 346 TYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVTTDAW 403
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
+++ +AA +GIVLLKN G+LPL+ +++++A+IGP NVT + GNY G +PL
Sbjct: 404 NISYEAAVEGIVLLKND-GTLPLA-KSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLN 461
Query: 412 GLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
+ A +N+ T +A AA +D + G D ++EAE+ DR+++
Sbjct: 462 AFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITW 521
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q LI +++++ K P+I++ M GG D S K++ + S++W GYPGQ+GG A+ D+
Sbjct: 522 PGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDI 580
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
G+ P+GRL +T YP Y
Sbjct: 581 ITGKRAPAGRLVVTQYPAEY 600
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 200/579 (34%), Positives = 286/579 (49%), Gaps = 82/579 (14%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L++ C+ + R LV + L EK+ LV+ + R+G+P YEWWSEALH
Sbjct: 86 ENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALH 145
Query: 104 GVSYVGPGTHFSN----VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
GV+ PG F+ ATSF I+ +A+F+ L +A+ +STEARA N GLA
Sbjct: 146 GVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLA 204
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GL +W+PNIN ++DPRWGRG ETPGED L SKY ++GL+ +D P K+ A CK
Sbjct: 205 GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSD---PTTRKMVANCK 261
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF------------------------- 250
HY A DL+ W G RY+F+A V + YYL F
Sbjct: 262 HYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKD 321
Query: 251 -----------KYIVSD------------------CDSVDVLYNSQHYTKTPEEAAAKSI 281
KY+++D C++V ++N H++ T EEAA +
Sbjct: 322 LSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAY 381
Query: 282 LAGLDLNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
AG D C ++ + A GL++E +D A+ + L+R G+FDG PY
Sbjct: 382 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG--PDAPYRN 439
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI----KNLAVIGPNANVTKTM 395
+ DV T + LA +A +G+VL KN G LP+ + K +A+IG + + M
Sbjct: 440 ITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQM 498
Query: 396 IGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDD----AKKAAASADATVLV 451
+G Y G TPL A+ + AG V T D A AA AD +
Sbjct: 499 LGTYSGIAPFRNTPLAAAKALNLKMVTAGGP-VNQSTGSRDSWTRPALNAAIQADVVLYF 557
Query: 452 MGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
G D S+EAE DR L P Q L+++++ + K V++ + G D + ++ I+
Sbjct: 558 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL--GTMLDDTALLDNKNIS 615
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+I+W GYPGQ GG A D+ G+ PSGRLP+T YP Y
Sbjct: 616 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKY 654
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 200/579 (34%), Positives = 286/579 (49%), Gaps = 82/579 (14%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
N L++ C+ + R LV + L EK+ LV+ + R+G+P YEWWSEALH
Sbjct: 31 ENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALH 90
Query: 104 GVSYVGPGTHFSN----VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
GV+ PG F+ ATSF I+ +A+F+ L +A+ +STEARA N GLA
Sbjct: 91 GVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLA 149
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GL +W+PNIN ++DPRWGRG ETPGED L SKY ++GL+ +D P K+ A CK
Sbjct: 150 GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSD---PTTRKMVANCK 206
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF------------------------- 250
HY A DL+ W G RY+F+A V + YYL F
Sbjct: 207 HYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKD 266
Query: 251 -----------KYIVSD------------------CDSVDVLYNSQHYTKTPEEAAAKSI 281
KY+++D C++V ++N H++ T EEAA +
Sbjct: 267 LSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGSAY 326
Query: 282 LAGLDLNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
AG D C ++ + A GL++E +D A+ + L+R G+FDG PY
Sbjct: 327 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG--PDAPYRN 384
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI----KNLAVIGPNANVTKTM 395
+ DV T + LA +A +G+VL KN G LP+ + K +A+IG + + M
Sbjct: 385 ITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQM 443
Query: 396 IGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDD----AKKAAASADATVLV 451
+G Y G TPL A+ + AG V T D A AA AD +
Sbjct: 444 LGTYSGIAPFRNTPLAAAKALNLKMVTAG-GPVNQSTGSRDSWTRPALNAAIQADVVLYF 502
Query: 452 MGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
G D S+EAE DR L P Q L+++++ + K V++ + G D + ++ I+
Sbjct: 503 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL--GTMLDDTALLDNKNIS 560
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+I+W GYPGQ GG A D+ G+ PSGRLP+T YP Y
Sbjct: 561 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKY 599
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 195/567 (34%), Positives = 283/567 (49%), Gaps = 71/567 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+TS R LV LT +E + ++ R+G+P Y+ W+E
Sbjct: 51 DCESGP-LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNE 109
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ FS+ +TSFPQ I T A+ N +L I ++ST+ RA N G
Sbjct: 110 ALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR 165
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +SPNIN FR P WGRGQETPGED LAS YA Y+ G+Q G N LK+ A
Sbjct: 166 YGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQ--GGVDANPLKLIAT 223
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD++NW R
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ ++ Y+ DC +V ++N Y AAA SI AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G +H + + V+ ++ ++ +A+L+R G+FDG S PY + DV +
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLS-PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
Q+L+ +AA Q IVLLKN G LPL+ ++ K +A+IGP AN T M+GNY G +
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYLIS 460
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA-----SADATVLVMGADQSIEAESH 463
PLQ A + Y+ + T D ++ A AD + G D ++E E+
Sbjct: 461 PLQ---AFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQ 517
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR ++ P Q LIT++A + K P+I++ M GG D S KN+ + +++W GYPGQ+G
Sbjct: 518 DRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSG 576
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+AD+ G+ P+ RL T YP Y
Sbjct: 577 GQALADIITGKRAPAARLVTTQYPAEY 603
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 194/558 (34%), Positives = 280/558 (50%), Gaps = 97/558 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LV ++T++EK + ++ A ++ RLGIPKY WW+EALHGV+ G AT
Sbjct: 8 RAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------AT 57
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I A+F+ L + + V STE RA YN GLT+W+PN+NIFRDP
Sbjct: 58 VFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRDP 117
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L + YV+GLQ D +P K AAC KH+ + + +R
Sbjct: 118 RWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNP---KSAACAKHFAVH---SGPEAER 171
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+HF+A V +Y YL FK
Sbjct: 172 HHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDILRGDWG 231
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
++VSDC ++ + + T E+AA ++ G DLNCG K A KA LV E
Sbjct: 232 FEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYEKLL-YAYKANLVTE 290
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
I ++ +RLG SK Y + + V + +++LA++AA++ +VLLKN
Sbjct: 291 ETITESVERLIELRLRLGTLPERRSK--YDDIPYEVVECKEHKELAIEAAKRSMVLLKND 348
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT--------- 419
G LPL IK + VIGPN+N ++GNYEG +Y T L+G+ V
Sbjct: 349 -GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRVFHSDG 407
Query: 420 --IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDL 468
+++ + +A A +D VL MG D +IE E S D+ L
Sbjct: 408 TPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSGDKKGL 467
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
LPG QQ L+ ++ + K PV+L++++G D+S+A + + +I+ YPG GG AIA
Sbjct: 468 KLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHCWYPGARGGKAIA 524
Query: 529 DVCFGRYNPSGRLPMTWY 546
V FG +PSG+LP+T+Y
Sbjct: 525 QVLFGEDSPSGKLPLTFY 542
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 201/558 (36%), Positives = 289/558 (51%), Gaps = 106/558 (18%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+LV ++TL EK + L A ++ RL IP Y WW+E+LHGV+ G AT FP
Sbjct: 16 ELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------ATVFP 65
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWG 177
Q I AASF+ + + IG+ ++ E RA YN + GLTFW+PN+NIFRDPRWG
Sbjct: 66 QAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRDPRWG 125
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RY 235
RG ET GEDP L+S+ Y++GLQ G +K AAC KH+ + G + R+
Sbjct: 126 RGHETYGEDPYLSSRLGVSYIRGLQ----GDGETMKAAACAKHFAVH-----SGPEALRH 176
Query: 236 HFNAMV----IYTYYLIKFK---------------------------------------- 251
F+A V + YL F+
Sbjct: 177 EFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKILREEWGF 236
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNE 308
++VSDC ++ + + T TP ++AA ++ AG DLNCG ++L H A + GLV E
Sbjct: 237 DGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCGVTYL--HLVHACQEGLVTE 294
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ I A F T LG FDG Y + V + ++DL+ AAR+ IVLLKN
Sbjct: 295 AQITEAAIRLFTTRFLLGMFDG----SEYDSVPYTVVECKEHRDLSERAARESIVLLKNN 350
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAG 424
G LPL +K + +IGPNA+ K +IGNY GT +Y T L+G+ +V +Y G
Sbjct: 351 -GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRILYSDG 409
Query: 425 C-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDL 468
C N+ ++ +A+ A +D +L +G D+++E E S D++DL
Sbjct: 410 CHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEEGDTGNSYASGDKVDL 469
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
LP Q++L+ VA + K P +L +M+G D+SFA+ LW YPG GGAA A
Sbjct: 470 RLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHFDAIVDLW--YPGAYGGAAAA 526
Query: 529 DVCFGRYNPSGRLPMTWY 546
D+ FG+ +PSG+LP+T+Y
Sbjct: 527 DILFGKCSPSGKLPITFY 544
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 205/565 (36%), Positives = 285/565 (50%), Gaps = 107/565 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+ +LV ++TL+EK L + +V RL +P+Y WW+E LHGV+ G AT
Sbjct: 15 KAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------AT 64
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA F+ L I KV+STE RA YN G+TFWSPN+NIFRDP
Sbjct: 65 VFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRDP 124
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRG ET GEDP L S+ +VKGLQ G LK AAC KH+ + G +
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQ----GEGKYLKAAACAKHFAVHS-----GPEG 175
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ F+A+V +Y YL F+
Sbjct: 176 LRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDILRGK 235
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
++VSDC ++ + T T E+AA ++ G DLNCG+ + A K GLV
Sbjct: 236 WNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGNVYLQLL-LAYKEGLV 294
Query: 307 NESAIDHAISNNFATLMRLGFFDGHP--SKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
E I A AT +RLG FD +K PY + C + N+ L+L AAR +VL
Sbjct: 295 TEEDITTAAERLMATRIRLGMFDEECEYNKIPY----ELNDCKEHNE-LSLKAARNSMVL 349
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TI 420
LKN G LPL+ +K++AVIGPNA+ + GNY GT +Y T L+G+ V
Sbjct: 350 LKNN-GILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRVY 408
Query: 421 YQAGCSNVQCGTAQV----DDAKKA---AASADATVLVMGADQSIEAE---------SHD 464
Y GC + ++ D K+A A +D +L +G D +IE E + D
Sbjct: 409 YSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAGD 468
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
+ L LPG+QQ L+ ++ + PVIL+I G G ++F + K ++IL YPG GG
Sbjct: 469 KASLNLPGRQQELLEKIIETGT-PVILVI--GAGSALTFNNAEDKCSAILDAWYPGSRGG 525
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQS 549
A+AD+ FG+ +PSG+LP+T+Y +
Sbjct: 526 RAVADLIFGKCSPSGKLPITFYRNT 550
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 301 bits (772), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 202/555 (36%), Positives = 280/555 (50%), Gaps = 103/555 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV +TL+E+ + L + ++ RLG+P Y WW+EALHGV+ G ATSFPQ
Sbjct: 13 LVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------ATSFPQ 62
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + + +V++ E RA YN GLTFWSPN+NIFRDPRWGR
Sbjct: 63 AIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRDPRWGR 122
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L S+ +VKGLQ +G LK AAC KH+ + + DR+HF+
Sbjct: 123 GHETYGEDPYLTSRLGVAFVKGLQGEEG-----LKTAACAKHFAVH---SGPEADRHHFD 174
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A V ++ YL F+ +
Sbjct: 175 ARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDILREKWGFQGH 234
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAI 311
VSDC ++ + T T +E+AA ++ +G DLNCG ++L H A + GLV E I
Sbjct: 235 YVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGNTYL--HILMAYQNGLVTEEEI 292
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A F T LG FDG Y + + V ++ + +A +A + IVLLKN G
Sbjct: 293 TTAAERLFTTRYLLGLFDG----STYDAIPYEVVESKPHLSVADEATAKSIVLLKNN-GL 347
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAG--- 424
LPL+ +IK + VIGPNAN K +IGNY GT +Y T L+GL V +Y G
Sbjct: 348 LPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRILYSEGSHL 407
Query: 425 -CSNVQCGTAQVD---DAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
V+ Q D +AK A +D ++ +G D+++E E S D+ DL LP
Sbjct: 408 YADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGDTGNAYASGDKRDLALP 467
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
QQ L+ +AK+ K PVIL + +G D+ +A D ++L YPG GG IA
Sbjct: 468 EPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQAWYPGARGGQVIAKAL 524
Query: 532 FGRYNPSGRLPMTWY 546
G PSG+LP+T+Y
Sbjct: 525 LGEIVPSGKLPVTFY 539
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 301 bits (772), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 203/580 (35%), Positives = 284/580 (48%), Gaps = 91/580 (15%)
Query: 37 VFACDVVSNPSLASLGFC-------NTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSR 89
+FA V+++ L FC N + RV DL+ RLTL EK++ L + +V R
Sbjct: 9 IFA--VLTSLGLIKTAFCQQIPIYRNPDKKLSTRVQDLISRLTLAEKVSLLGYRSQAVPR 66
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
L IP Y WW+E LHGV+ G AT FPQ I AA+F+ +L + + VVSTE
Sbjct: 67 LNIPAYNWWNEGLHGVARAGE----------ATIFPQAIAMAATFDDNLVKQVANVVSTE 116
Query: 150 ARAMYNVGLA--------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
ARA YN+ A GLTFWSPNINIFRDPRWGRGQET GEDP L SK YV GL
Sbjct: 117 ARAKYNLSTAMGRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGL 176
Query: 202 QQTDGGSPNRLKVAACCKHYTAY--------------DLDNWKGTDRYHFNAMV------ 241
Q TD P LK +A KH+ A+ D + + T Y F ++V
Sbjct: 177 QGTD---PLHLKTSATAKHFVAHSGPEGERDYFDALVDEKDLRDTYLYAFKSLVDGGVES 233
Query: 242 IYTYY------------------LIK---FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
I T Y +IK FK ++V+DC ++D +Y + E AA
Sbjct: 234 IMTAYNRVNGVPNSINKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAA 293
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
+I AG+DL+C S A+ L+ E +D A++ +T +LGFFD PS P+
Sbjct: 294 AIKAGVDLDCSSIFQTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYS 352
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
G + ++ LA A++ +VLLKN LPL ++ V+GPNA ++ +Y
Sbjct: 353 FGADSIHNDSHVMLARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASY 412
Query: 400 EGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAK----KAAASADATVLVMGAD 455
G K ++G+ A V + G D A +AD TV V+G
Sbjct: 413 HGVSSKAVNFVEGITAAVD---KGTRVEYDLGADYRDTTHFGGIWGAGNADVTVAVIGLT 469
Query: 456 QSIEAES---------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
+E E+ D+ DL LP + + K K P+I ++ SG D+ A
Sbjct: 470 PVLEGEAGDAFLSQTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGS--DVDIAAI 527
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P +++ YPG+ GG A+AD+ FG+ +PSG LP+T+Y
Sbjct: 528 APYADAVILAWYPGEQGGNALADILFGKISPSGHLPLTFY 567
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 202/564 (35%), Positives = 284/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ + R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSN--VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPES--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A K P+I++ M GG D S KN+ K++++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 206/587 (35%), Positives = 282/587 (48%), Gaps = 103/587 (17%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
F+ V + + L F N L + RV DLV RLTL+EK+ +++ A ++ RLGIP Y+W
Sbjct: 13 FSLAVQAQADKSQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDW 72
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSE LHGV+ T T +PQ I AA+++ + + E RA++N
Sbjct: 73 WSEVLHGVARTPYHT---------TVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKA 123
Query: 158 ---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
GLT+W+PNINIFRDPRWGRGQET GEDP L + +V+GLQ G
Sbjct: 124 TEEGKNGDRYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQ---GED 180
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK----------- 251
P LK AAC KHY + G + R+ F+ V ++ YL FK
Sbjct: 181 PKYLKAAACAKHYAIHS-----GPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGV 235
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
Y+ SDC ++D +N E AA +
Sbjct: 236 MCAYNAFRKKPCCGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDA 295
Query: 281 ILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
+ G D+ CG+ AVK G + E ID ++ F MRLG FD S Y Q
Sbjct: 296 VTNGTDVECGNRAYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD-PVSMVSYAQT 354
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
P + + ++ AL A++ IVLLKN LPLS +IK +AV+GPNA+ + ++GNY
Sbjct: 355 SPAVLESAPHKAQALKMAQESIVLLKNENHLLPLS-KSIKKIAVVGPNADNSIAVLGNYN 413
Query: 401 GTPCKYTTPLQGLAAVVAT----IYQAGC--SNVQCGTAQVDDA--KKAAASADATVLVM 452
GTP K T L G+ A + T +Y+ +N + D A ADA + V
Sbjct: 414 GTPSKIVTALDGIKAKLGTNGSVVYEKAVNFTNAMLPEGKTDFAALTSRVKDADAIIFVG 473
Query: 453 GADQSIEAE----------SHDRLDLLLPGQQQLLITEVAKVAKG---PVILIIMSGGGF 499
G +E E S DR +LLP Q TE K K PV+ ++M+G
Sbjct: 474 GISPQLEGEEMKVNEPGFNSGDRTTILLPTVQ----TEAMKALKATGKPVVFVMMTGSAL 529
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I + + + I +I+ Y GQA G AIADV FG YNPSGRLP+T+Y
Sbjct: 530 AIPWEQEN--IPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFY 574
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 212/584 (36%), Positives = 295/584 (50%), Gaps = 95/584 (16%)
Query: 33 QSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
Q S V A + V P F +TS ID RV DLV RLTL EK ++ +A ++ RLGI
Sbjct: 26 QPSAVNAQNAVKYP------FQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGI 79
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA 152
Y WW+EALHGV+ G AT FP+ + AA+F+ L IG+ +S EA A
Sbjct: 80 LPYSWWNEALHGVARTG----------RATVFPENVGLAATFDEDLVYRIGQAISDEAWA 129
Query: 153 MYNVG--------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT 204
YN+ +G+TF++PN+NIFRDPRWGRGQET GEDP L S+ YVKG+Q
Sbjct: 130 KYNIAQRLENYGQYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQ-- 187
Query: 205 DGGSPNRLKVAACCKHYT----------AYD----LDNWKGTDRYHFNAMV--------- 241
G P LK AAC KHY +YD + ++ T F +V
Sbjct: 188 -GNDPKYLKTAACAKHYVVHSGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVM 246
Query: 242 ---------------IYTYYLIKFK-----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
+ L++ K Y+ +DC ++ Y K EA A +I
Sbjct: 247 CAYNRTFGKPCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAI 306
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
+G++LNCG+ + AAV+ GLV E +D A+S T RLG FD P++ PY ++
Sbjct: 307 KSGVNLNCGNEF-NYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFDS-PNENPYAKIK 364
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
+ + +Q N DLA +AA + +VLL+N +LPL +K+L V+GP A ++GNY G
Sbjct: 365 EEVIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKKD-MKSLYVVGPYAANQDILLGNYNG 423
Query: 402 TPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKA----------AASADATVLV 451
+ TT +Q A+V + N + G K + A + A +
Sbjct: 424 VNSRLTTIMQ---AIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGI 480
Query: 452 MGADQSIEAESH------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
G + E ES DRLDL LP Q + E+ K K P+IL++ GG I +
Sbjct: 481 SGVFEGEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLT--GGSPICTPE 538
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ +IL+V YPGQ GG A+ADV FG NPSGRL +T +P+S
Sbjct: 539 LADMVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCIT-FPKS 581
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 200/552 (36%), Positives = 284/552 (51%), Gaps = 96/552 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ L+ R++++EK +V A V RLGIP Y WW+EALHGV+ G AT
Sbjct: 6 RMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------AT 55
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYN-VGLA-------GLTFWSPNINIFRDP 174
FPQ I AA+F+ L + + +S EARA +N VG GLTFWSPNINI+RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRDP 115
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRGQET GEDP L SK +VKGLQ G P L+VAAC KHY + G +
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVAFVKGLQ---GDHPYYLRVAACAKHYAVH-----SGPEG 167
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ F+A V ++ YL F+
Sbjct: 168 LRHVFDARVSEKDLWETYLPAFEALVKAGVEAVMGAYNRVNGEPACGSKRLLEEILRKKW 227
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
++VSDC ++ + TK P E+ A ++ AG DLNCG+ +H AVKAG V+
Sbjct: 228 GFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTY-EHLLDAVKAGAVS 286
Query: 308 ESAIDHAISNNFATLMRLGFF-DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D +++ +TL RLG F D HP Y +L D+ +A++ LA +AA + +VLLK
Sbjct: 287 EELVDRSVARLLSTLDRLGLFTDDHP----YVRLSLADIDWEAHRALAREAAEKSVVLLK 342
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQ 422
N G LPL ++ + V GPNA ++GNY G + T L+G+ + Y+
Sbjct: 343 NN-GILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVTYK 401
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL---------DLLLPGQ 473
GC +D A A AD TV VMG D ++E E D + DL L +
Sbjct: 402 IGCPLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNLSRE 461
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q + + ++ K P++++++SG + + + +I++ YPG+ GG AIA V FG
Sbjct: 462 QIDYLRRIKEIGK-PLVVVLLSGA--PVCSPELEELADAIVYAWYPGEEGGNAIARVLFG 518
Query: 534 RYNPSGRLPMTW 545
+PSGRLP+T+
Sbjct: 519 EVSPSGRLPITF 530
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 196/569 (34%), Positives = 277/569 (48%), Gaps = 96/569 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F NT+L ID RV DLV +LTL+EK+ ++++ ++ RL IP Y WW+E LHG+ T
Sbjct: 27 FQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGR----T 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
+ T FPQ I AA++N L + + +S E RA+YN + GLT+W
Sbjct: 83 DYK-----VTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYGLTYW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L +V GLQ G LK AAC KHY +
Sbjct: 138 TPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQ---GDDTKYLKAAACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ R+ FN V ++ YL F+
Sbjct: 195 ---SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEPCCGNNFL 251
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGKH 296
Y+ SDC ++D Y QH+ P+ AAA ++ G D++CG+ K
Sbjct: 252 MQEILREKWNFTGYVTSDCGAIDDFY--QHHKTHPDAKYAAADAVYNGTDIDCGNEAYKA 309
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLAL 355
AVK G++ E ID ++ F RLG FD P++ Y Q+ + +Q ++DLAL
Sbjct: 310 LVDAVKTGIITEKQIDISLKRLFTIRFRLGMFD--PAENVKYSQISTSVLESQKHKDLAL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
R+ IVLLKN +LPLS +K +AV+GPNAN +++GNY G P + TP + +
Sbjct: 368 KITRESIVLLKNENNTLPLS-KKLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYEAVKQ 426
Query: 416 VV---ATIYQAGCSNVQCGT---AQVDDAKKAAASADATVLVMGADQSIEAESH------ 463
+ IY+ G V T +V K D + V G +E E
Sbjct: 427 KLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPVKIEG 486
Query: 464 ----DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
DR + LP Q + + K P + ++M+G + + I +I+ Y
Sbjct: 487 FTGGDRTSIKLPKIQTDFMKALV-AEKIPTVFVMMTGSAIATEWESQN--IPAIVNAWYG 543
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
GQ G AIADV FG YNPSG+LP+T+Y +
Sbjct: 544 GQDAGTAIADVLFGDYNPSGKLPVTFYAK 572
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 201/564 (35%), Positives = 283/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ + R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSN--VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPES--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A K P+I++ M GG D S KN+ ++++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 201/564 (35%), Positives = 283/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ + R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSN--VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPES--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A K P+I++ M GG D S KN+ ++++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 205/561 (36%), Positives = 285/561 (50%), Gaps = 105/561 (18%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+LV ++TLQEK L ++ +V RL IP+Y WW+E LHGV+ G AT FP
Sbjct: 31 ELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATVFP 80
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWG 177
Q I AA F+ + +++TE RA YN GLT+WSPN+NIFRDPRWG
Sbjct: 81 QAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPRWG 140
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RG ET GEDP L S+ +VKGLQ G LK++AC KH+ + + + R+ F
Sbjct: 141 RGHETYGEDPYLTSRLGVAFVKGLQ----GDGKYLKLSACAKHFAVH---SGPESLRHEF 193
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
NA+V ++ YL F+
Sbjct: 194 NAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILRGKWGFKG 253
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAI 311
++VSDC ++ + T T E+ A +I G DLNCG+ + A K GLV E I
Sbjct: 254 HVVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEEQI 312
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV--CTQANQDLALDAARQGIVLLKNTA 369
A T +LG FD Y Q+ P +V C + NQ ++L+A+R+ +VLLKN
Sbjct: 313 TTAAERLMTTRFKLGMFD---EDCEYNQI-PYEVNDCKEHNQ-VSLEASRKSMVLLKNN- 366
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQAGC 425
G LPL + +K +AVIGPNAN + GNY GT KYTT L G+ V V Y GC
Sbjct: 367 GILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYSEGC 426
Query: 426 ----SNVQCGTAQVDD----AKKAAASADATVLVMGADQSIEAE---------SHDRLDL 468
V+ A+ DD A A AD +L +G D +IE E + D+LDL
Sbjct: 427 HLYKEKVE-DLARRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDKLDL 485
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
LPG QQ L+ +V + K PV++++ +G G ++ A + + +IL YPG GG A A
Sbjct: 486 NLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGA--EERCAAILNAWYPGSHGGTAAA 542
Query: 529 DVCFGRYNPSGRLPMTWYPQS 549
D+ FG+ +PSG+LP+T+Y +
Sbjct: 543 DILFGKCSPSGKLPVTFYKDT 563
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 196/560 (35%), Positives = 279/560 (49%), Gaps = 99/560 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LVK++TL+EK+ + SA S+ RL I Y +W+EALHGV+ G AT
Sbjct: 13 RATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV----------AT 62
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ L + I +STE R +N GLTFWSPN+NIFRDP
Sbjct: 63 VFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPNVNIFRDP 122
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRG ET GEDP L+ +V G+Q G LK AAC KH+ + G +
Sbjct: 123 RWGRGHETFGEDPFLSGTLGGRFVDGIQ---GHDETYLKAAACAKHFAVH-----SGPED 174
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ FNA V + YL FK
Sbjct: 175 IRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGSKTLLEDILRGE 234
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
++ SDC ++ + T E+ A ++ G DLNCG+ L + AV+ GLV
Sbjct: 235 WEFVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LYVNLLQAVRDGLV 293
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E ID A+ F T M+LG FD S P+ + V T+++++L + A+++ +VLLK
Sbjct: 294 EEETIDTALIRLFTTRMKLGLFDKEESI-PFNTITYDQVDTKSSKELNIKASKKCVVLLK 352
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQ 422
N LPL+P I ++ VIGPNAN ++GNYEGT +Y T L+G+ V V +
Sbjct: 353 NEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQVVPEDVRVYFS 412
Query: 423 AGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRL 466
GC SN+ ++ + + +D + +G D +E E S D+
Sbjct: 413 EGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEGDQGNQFASGDKK 472
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
L LPG Q+ ++ + + K PVILI++SG + +A D I +IL YPG GG A
Sbjct: 473 TLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAILQGWYPGAQGGRA 529
Query: 527 IADVCFGRYNPSGRLPMTWY 546
IA++ FG NP G+LP+T+Y
Sbjct: 530 IAELIFGDGNPEGKLPVTFY 549
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 196/552 (35%), Positives = 283/552 (51%), Gaps = 96/552 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV+++TL+E+ + + A +V RLGIP Y WW E LHGV+ G AT FPQ
Sbjct: 13 LVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------ATMFPQ 62
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGR 178
I AA F+ L + I VVSTE RA YN GLTFWSPN+NIFRDPRWGR
Sbjct: 63 AIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDPRWGR 122
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT------AYDLDNWKGT 232
G ET GEDP L S+ T +V+GLQ G LK+AAC KH+ A + W T
Sbjct: 123 GHETYGEDPYLTSRLGTAFVRGLQ----GDGEHLKIAACAKHFAVHSGPEALRHEFWADT 178
Query: 233 DR-----------------------------YHFNAMVIYTYYLIKF--------KYIVS 255
+ YH T + + + VS
Sbjct: 179 SKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILRGQWGFEGHFVS 238
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHA 314
DC ++ + + T T E+AA ++ G DLNCG ++L A + GL++++ + A
Sbjct: 239 DCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTYL--QVLKACEEGLLDDACVTEA 296
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ F T LG + + Y + + V + +++LA++AAR+ +VLLKN G LPL
Sbjct: 297 VVRLFTTRYLLGMGE----ETEYDDIPYEVVECKEHRELAVEAARRSMVLLKND-GLLPL 351
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC----- 425
+ +AVIGPNA+ +IGNY GT YTT L+G+ V +Y GC
Sbjct: 352 HAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRVLYAEGCHLFKD 411
Query: 426 --SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLLPGQQ 474
++ ++ +A+ A +D VL +G D+++E E S D+ DLLLP Q
Sbjct: 412 RVEHLAVAGDRLSEARIVAKHSDVVVLCVGLDETLEGEEGDTGNSHASGDKKDLLLPESQ 471
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
+ L+ E+ + K PV++ MSG D+S A+ K +++ V YPG GG A+AD+ FG+
Sbjct: 472 RRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAVIQVWYPGAEGGRALADLLFGK 528
Query: 535 YNPSGRLPMTWY 546
+PSG+LP+T+Y
Sbjct: 529 ASPSGKLPVTFY 540
>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 241
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 145/203 (71%), Positives = 159/203 (78%), Gaps = 3/203 (1%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CD N ++GFC SLGI+ RV DL+ RLTL EKI LV++A +V RLGI YEWWS
Sbjct: 34 CDK-RNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWS 92
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
EALHGVS VGPGT F PGATSFPQVI TAASFN SL+ IG+VVS EARAMYN G A
Sbjct: 93 EALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA 152
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GLT+WSPN+NIFRDPRWGRGQETPGEDP+LA+KYA YV+GLQ DG RLKVAACCK
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--KKRLKVAACCK 210
Query: 220 HYTAYDLDNWKGTDRYHFNAMVI 242
HYTAYDLDNW G DRYHFNA V
Sbjct: 211 HYTAYDLDNWNGVDRYHFNAKVF 233
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 202/564 (35%), Positives = 282/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ S R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 34 NGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 93
Query: 105 VSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 94 LDRA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 149
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 150 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDS--NLKLAATAKHYA 207
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 208 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADS 267
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 268 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 327
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 328 WHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWN 387
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 388 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 447
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 448 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRES 507
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A P+I++ M GG D S KN+ +T++LW GYPGQ+GG A
Sbjct: 508 IAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFA 567
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 568 LRDIITGKKNPAGRLVTTQYPASY 591
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 202/564 (35%), Positives = 282/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ S R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDS--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A P+I++ M GG D S KN+ +T++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 197/562 (35%), Positives = 279/562 (49%), Gaps = 65/562 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N +L+ C+ +L R LVK LT++EK+ LV A R+G+P Y WWSEALHG
Sbjct: 34 NGTLSVNKVCDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHG 93
Query: 105 VSYVGPGTHFSN---VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
V+Y PGT+F +TS+P +L AA+F+ L + IG + EARA N G AGL
Sbjct: 94 VAYA-PGTYFPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGL 152
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKH 220
+W+PN+N F+DPRWGRG ETPGED L +YA +GL DG P + +V + CKH
Sbjct: 153 DYWTPNVNPFKDPRWGRGSETPGEDVLRVKRYAEYITRGL---DGPVPGEQRRVISTCKH 209
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
Y D ++W GT R+ F+A + + YYL+ F+
Sbjct: 210 YAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCA 269
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+ SDC++V + + Y T A AG+D +C
Sbjct: 270 NEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDTSCEY 329
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
A GL+ E +D A+ + L+R G+FDGH + Y +LG KDV + Q
Sbjct: 330 TGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNSAEAQ 387
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL AA +GIVLLKN G+LPL +A+IG A+ + G Y G TP
Sbjct: 388 SLALQAAVEGIVLLKNN-GTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAY 446
Query: 412 GLAAVVATIYQAGCSNVQCGTAQ---VDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
+ I A +Q A A +AA AD + G D S E+ DR DL
Sbjct: 447 AARQLGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDL 506
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
P Q +LI +++ + K P+++ ++ D + D +++SILW +PGQ GG AI
Sbjct: 507 EWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGVAIM 564
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
+ G +P+GRLP+T YP +Y
Sbjct: 565 KLITGEKSPAGRLPVTQYPSNY 586
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 201/566 (35%), Positives = 278/566 (49%), Gaps = 92/566 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +TSL + R+ LVK +TL+EKI L + V RL IP+Y WW+EALHGV+ G
Sbjct: 26 WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
+T FPQ I AA+F+ L + + +S EARA +N+ AGLTFW
Sbjct: 84 --------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L S+ +VKGLQ G P LK AAC KH+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQ---GNHPKYLKSAACAKHFAVH 192
Query: 225 DLDNWKGTD--RYHFNA----MVIYTYYLIKFK--------------------------- 251
G + R+HFNA +Y YL F+
Sbjct: 193 S-----GPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGVPAGSSE 247
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
YIVSDC ++ ++ KT EAAA ++ AG++LNCG
Sbjct: 248 FLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCGYVYNGA 307
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
E AV+ GLV+E ID + T +LGFFD + PY + + + + LA
Sbjct: 308 LEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEA-NPYNAIPTSVIHSDDHIALARK 366
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
A++ IVLLKN +LPL IK V GP A+ + ++ NY G + L+G+A
Sbjct: 367 TAQKSIVLLKNKNHTLPLDKN-IKVPYVTGPFASSSDVLLANYYGMTTNLVSVLEGIADK 425
Query: 417 V----ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------H 463
V + Y+ G + A A +ADA + V+G E E
Sbjct: 426 VSLGTSLNYRMGALPFNKNLNPKNWAPNVAKTADAVIAVVGLSADFEGEEVDAIASPNKG 485
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
D+ DL LP Q + E+A KGP+IL++ SG + + ++W YPG+ G
Sbjct: 486 DKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALGELYDLADAIVLMW--YPGEQG 543
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G A+ADV FG +PSG LP+T +P+S
Sbjct: 544 GNAVADVLFGDVSPSGHLPVT-FPKS 568
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 195/568 (34%), Positives = 282/568 (49%), Gaps = 72/568 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+TS R LV LT +E + ++ R+G+P Y+ W+E
Sbjct: 51 DCESGP-LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNE 109
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ FS+ +TSFPQ I T A+ N +L I ++ST+ RA N G
Sbjct: 110 ALHGVAHA----DFSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR 165
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +SPNIN FR P WGRGQETPGED LAS YA Y+ G+Q G N LK+ A
Sbjct: 166 YGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQ--GGVDANPLKLIAT 223
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD++NW R
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ ++ Y+ DC +V ++N Y AAA SI AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G +H + + V+ ++ ++ +A+L+R G+FDG S PY + DV +
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYT 407
Q+L+ +AA Q IVLLKN G LPL+ T+ +A+IGP AN T M+GNY G
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 408 TPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA-----SADATVLVMGADQSIEAES 462
+PLQ A + Y+ + T D ++ A AD + G D ++E E+
Sbjct: 461 SPLQ---AFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEA 517
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR ++ P Q LIT++A + K P+I++ M GG D S KN+ + +++W GYPGQ+
Sbjct: 518 QDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQS 576
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+AD+ G+ P+ RL T YP Y
Sbjct: 577 GGQALADIITGKRAPAARLVTTQYPAEY 604
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 298 bits (762), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 202/564 (35%), Positives = 282/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ S R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS+ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDS--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A P+I++ M GG D S KN+ +T++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 195/568 (34%), Positives = 282/568 (49%), Gaps = 72/568 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+TS R LV LT +E + ++ R+G+P Y+ W+E
Sbjct: 51 DCESGP-LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNE 109
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ FS+ +TSFPQ I T A+ N +L I ++ST+ RA N G
Sbjct: 110 ALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR 165
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +SPNIN FR P WGRGQETPGED LAS YA Y+ G+Q G N LK+ A
Sbjct: 166 YGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQ--GGVDANPLKLIAT 223
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD++NW R
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ ++ Y+ DC +V ++N Y AAA SI AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G +H + + V+ ++ ++ +A+L+R G+FDG S PY + DV +
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYT 407
Q+L+ +AA Q IVLLKN G LPL+ T+ +A+IGP AN T M+GNY G
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 408 TPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA-----SADATVLVMGADQSIEAES 462
+PLQ A + Y+ + T D ++ A AD + G D ++E E+
Sbjct: 461 SPLQ---AFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEA 517
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR ++ P Q LIT++A + K P+I++ M GG D S KN+ + +++W GYPGQ+
Sbjct: 518 QDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQS 576
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+AD+ G+ P+ RL T YP Y
Sbjct: 577 GGQALADIITGKRAPAARLVTTQYPAEY 604
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 297 bits (761), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 201/564 (35%), Positives = 283/564 (50%), Gaps = 64/564 (11%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ S R L+ TL E I ++ VSRLG+P Y+ WSEALHG
Sbjct: 60 NGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHG 119
Query: 105 VSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
+ +FS++ ATSFPQ ILT A+ N +L I ++ST+ RA N G GL
Sbjct: 120 LDRA----NFSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLD 175
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
++PNIN FR P WGRGQETPGED LA+ YA Y+ G+Q D S LK+AA KHY
Sbjct: 176 VYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDS--NLKLAATAKHYA 233
Query: 223 AYDLDNWKG------------------------------------------------TDR 234
YD++NW D
Sbjct: 234 GYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADS 293
Query: 235 YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y ++ T+ + Y+ SDCD+ +YN Y + AAA++ILAG D++CG+
Sbjct: 294 YFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQ 353
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQD 352
H ++ AG ++ I+ + + TL++ G+FD + +K PY L DV +
Sbjct: 354 WHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWN 413
Query: 353 LALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++GNY G +
Sbjct: 414 ISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMIS 473
Query: 409 PLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
P + + G T+ A AA SAD + G D ++EAE+ DR
Sbjct: 474 PRAAFEEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRES 533
Query: 468 LLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ PG Q LI ++A A P+I++ M GG D S KN+ ++++LW GYPGQ+GG A
Sbjct: 534 IAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFA 593
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ NP+GRL T YP SY
Sbjct: 594 LRDIITGKKNPAGRLVTTQYPASY 617
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/246 (59%), Positives = 173/246 (70%), Gaps = 8/246 (3%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FACD S + FC L I RV DL+ R+TLQEK+ LV+ A +V RLGI YEW
Sbjct: 27 FACDPEDGTS-RNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEW 85
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGVS VGPGT F PGATSFPQVI TAASFNA+L++AIG+VVS EARAM+N G
Sbjct: 86 WSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGG 145
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
+AGLT+WSPN+NIFRDPRWGRGQETPGEDP++A KYA YV+GLQ DG +RLKVAAC
Sbjct: 146 VAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDG---DRLKVAAC 202
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTP 273
CKH+TAYDLDNW G DR+HFNA V + + + F+ V + V+ + P
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262
Query: 274 EEAAAK 279
A K
Sbjct: 263 TCADPK 268
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 198/556 (35%), Positives = 282/556 (50%), Gaps = 104/556 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++TL EK + L A V RLG+P Y +W+EALHGV+ G AT FPQ
Sbjct: 18 LVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------ATMFPQ 67
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGR 178
I AA F+ + +G +++TE RA YN A GLTFWSPN+NIFRDPRWGR
Sbjct: 68 AIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRDPRWGR 127
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L S+ +V+G+Q G P +K AAC KHY + + + R+ F+
Sbjct: 128 GHETYGEDPYLTSRLGVKFVEGIQ---GDGP-VMKAAACAKHYAVH---SGPESLRHEFD 180
Query: 239 AMV----IYTYYLIKFKYIV---------------------------------------- 254
A ++ YL F+ +V
Sbjct: 181 AQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDVLRGKWKFEGH 240
Query: 255 --SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAI 311
SDC ++ + T TP ++AA ++ AG DLNCG ++L H A + GLV E I
Sbjct: 241 YTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTYL--HMMGAYQDGLVTEEKI 298
Query: 312 DHAISNNFATLMRLGFFDGHP-SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+ T LG FDG K PY + K+ + D AL AR+ VLLKN G
Sbjct: 299 TESAVRLLTTRYLLGLFDGSEYDKIPYSVVECKE-----HIDEALKMARKSCVLLKND-G 352
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQAGC- 425
LP+ T + + VIGPNA+ +IGNY GT +Y T L+G+ V +Y GC
Sbjct: 353 VLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRILYSQGCD 412
Query: 426 ------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLL 470
N+ ++ +A A ++D +L +G ++++E E S D++DL L
Sbjct: 413 LYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTGNSDASGDKVDLHL 472
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
P Q+ LI +V V K P I+++M+G D+++A+++ IL YPG GG AIAD+
Sbjct: 473 PKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGILLAWYPGARGGRAIADL 529
Query: 531 CFGRYNPSGRLPMTWY 546
FG+ +PSG+LP+T+Y
Sbjct: 530 LFGKESPSGKLPITFY 545
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 194/560 (34%), Positives = 285/560 (50%), Gaps = 102/560 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LV ++T++EK + L A ++ RLGIP Y WW+EALHGV+ G AT
Sbjct: 9 RAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------AT 58
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ L +G+V++ EARA YN GLTFW+PN+NIFRDP
Sbjct: 59 MFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRDP 118
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ A +VK +Q G +K AAC KH+ + + +R
Sbjct: 119 RWGRGHETYGEDPFLTSRLAVPFVKAMQ----GDGEYMKAAACAKHFAVH---SGPEGER 171
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+A + YL F+
Sbjct: 172 HFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDTLRGKWG 231
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
+ VSDC ++ + + T +PEE+A ++ G DLNCG + V+AGL++E
Sbjct: 232 FQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTY-QSIMNGVRAGLIDE 290
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
I + F T LG FD K + ++ + V + + +A AAR+ +VLLKN
Sbjct: 291 KLITESCERLFTTRFLLGMFD----KTEFDEIPYEKVECKEHLAVAKRAARESVVLLKND 346
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAG 424
G LPL+ +IK + V+GPNAN ++IGNY GT +Y T L+G+ V +Y G
Sbjct: 347 -GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVLYSEG 405
Query: 425 CSNVQCGTAQVDD---------AKKAAASADATVLVMGADQSIEAE---------SHDRL 466
C Q + + D A+ A +D V+V+G D+++E E S D++
Sbjct: 406 CDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFASGDKI 465
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+L LP Q+ L+ V K P I+I M+G D+S A+++ ++L YPG GGA
Sbjct: 466 NLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEA--NAVLQAFYPGARGGAD 522
Query: 527 IADVCFGRYNPSGRLPMTWY 546
+AD+ FG +PSG+LP+T+Y
Sbjct: 523 VADILFGDVSPSGKLPVTFY 542
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 211/572 (36%), Positives = 290/572 (50%), Gaps = 77/572 (13%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVD------SAGSVSRLGIPKYEWW 98
N LA+ C+T+ R LV+ + + EK+ LV+ S+ RLG+P YEWW
Sbjct: 2 NGPLANNTVCDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWW 61
Query: 99 SEALHGVSYVGPGTHFS---NVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
SEALHGV+ PG F+ ATSF I +A+F+ L Q + V+STEARA N
Sbjct: 62 SEALHGVA-ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFAN 120
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
G AGL FW+PNIN +RDPRWGRG ETPGEDP+ Y ++GL+ G + KV
Sbjct: 121 AGSAGLDFWTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLE----GEESIKKVI 176
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------- 251
A CKHY AYDL+ W RY F+A+V + YYL F+
Sbjct: 177 ATCKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNG 236
Query: 252 -------------------------YIVSDCDSV-DVLYNSQHYTKTPEEAAAKSILAGL 285
YI SDC+++ D L + ++T+T EAAA + AG
Sbjct: 237 TPACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGT 296
Query: 286 DLNC---GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
D C GS A L++E ID A+ + L+R G+FD S PY +G
Sbjct: 297 DTVCEVAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGW 355
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
DV T Q LAL +A G+VLLKN G+LP+ K +A+IG A+ T++M+G Y G
Sbjct: 356 SDVNTAEAQALALQSASDGLVLLKND-GTLPIKLEG-KTVALIGHWASGTRSMLGGYSGI 413
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD----DAKKAAASADATVLVMGADQSI 458
P Y +P+ A + Y+ V +A D DA AA +D + G DQS+
Sbjct: 414 PPYYHSPVYA-AGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSV 472
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
+E DR + P Q LI +A + K +++I G D + +P +++ILW GY
Sbjct: 473 ASEDKDRDSIAWPPAQLTLIQTLAGLGK--PLVVIQLGDQVDDTPLLTNPNVSAILWAGY 530
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQ+GG A+ + G P+GRLP+T YP SY
Sbjct: 531 PGQSGGTAVLNAITGVSPPAGRLPVTQYPSSY 562
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 195/568 (34%), Positives = 281/568 (49%), Gaps = 72/568 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+TS R LV LT +E + ++ R+G+P Y+ W+E
Sbjct: 51 DCESGP-LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNE 109
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ FS+ +TSFPQ I T A+ N +L I ++ST+ RA N G
Sbjct: 110 ALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR 165
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +SPNIN FR P WGRGQETPGED LAS YA Y+ G+Q G N LK+ A
Sbjct: 166 YGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQ--GGVDANPLKLIAT 223
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD++NW R
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ ++ Y+ DC +V ++N Y AAA SI AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G +H + + V+ ++ + +A+L+R G+FDG S PY + DV +
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYT 407
Q+L+ +AA Q IVLLKN G LPL+ T+ +A+IGP AN T M+GNY G
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 408 TPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA-----SADATVLVMGADQSIEAES 462
+PLQ A + Y+ + T D ++ A AD + G D ++E E+
Sbjct: 461 SPLQ---AFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEA 517
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR ++ P Q LIT++A + K P+I++ M GG D S KN+ + +++W GYPGQ+
Sbjct: 518 QDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQS 576
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+AD+ G+ P+ RL T YP Y
Sbjct: 577 GGQALADIITGKRAPAARLVTTQYPAEY 604
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 195/568 (34%), Positives = 281/568 (49%), Gaps = 72/568 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P L+ C+TS R LV LT +E + ++ R+G+P Y+ W+E
Sbjct: 51 DCESGP-LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNE 109
Query: 101 ALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV++ FS+ +TSFPQ I T A+ N +L I ++ST+ RA N G
Sbjct: 110 ALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR 165
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +SPNIN FR P WGRGQETPGED LAS YA Y+ G+Q G N LK+ A
Sbjct: 166 YGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQ--GGVDANPLKLIAT 223
Query: 218 CKHYTAYDLDNWKGTDR------------------------------------------- 234
KHY YD++NW R
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283
Query: 235 -----YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
+ ++ T+ ++ Y+ DC +V ++N Y AAA SI AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
G +H + + V+ ++ + +A+L+R G+FDG S PY + DV +
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKN--LAVIGPNANVTKTMIGNYEGTPCKYT 407
Q+L+ +AA Q IVLLKN G LPL+ T+ +A+IGP AN T M+GNY G
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460
Query: 408 TPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA-----SADATVLVMGADQSIEAES 462
+PLQ A + Y+ + T D ++ A AD + G D ++E E+
Sbjct: 461 SPLQ---AFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEA 517
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR ++ P Q LIT++A + K P+I++ M GG D S KN+ + +++W GYPGQ+
Sbjct: 518 QDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQS 576
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+AD+ G+ P+ RL T YP Y
Sbjct: 577 GGQALADIITGKRAPAARLVTTQYPAEY 604
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 200/562 (35%), Positives = 286/562 (50%), Gaps = 107/562 (19%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+LV ++TLQE+ L + ++ L +P+Y WW+E LHGV+ G AT FP
Sbjct: 18 ELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWG 177
Q I AA F+ I ++STE RA YN G+ GLT+WSPN+NIFRDPRWG
Sbjct: 68 QAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPRWG 127
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RY 235
RG ET GEDP L S+ ++KGLQ G LK+AAC KH+ + G + R+
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQ----GEGKYLKLAACAKHFAVHS-----GPEGLRH 178
Query: 236 HFNAMV----IYTYYLIKFK---------------------------------------- 251
FNA+V +Y YL F+
Sbjct: 179 EFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGF 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNES 309
++VSDC ++ + T T E+ A +I G DLNCG+ + A K GLV E
Sbjct: 239 KGHVVSDCWALADFHLHHMITSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEE 297
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV--CTQANQDLALDAARQGIVLLKN 367
I A T +LG FD Y ++ P +V C + N+ +AL A+R+ +VLLKN
Sbjct: 298 QITTAAERLMTTRFKLGMFD---EDCEYNRI-PYEVNDCKEHNE-IALIASRKSMVLLKN 352
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQA 423
G+LPL +++K++AVIGPNAN + GNY GT KYTT L+G+ V Y
Sbjct: 353 D-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYSE 411
Query: 424 GC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLD 467
GC ++ ++ +A A +D +L +G D +IE E + D+
Sbjct: 412 GCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDKES 471
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
L LPG+QQ L+ +V +V K PVI+++ G G ++F + K +IL YPG GG A+
Sbjct: 472 LNLPGRQQNLLEKVLEVGK-PVIVVL--GAGSALTFNGAEEKCAAILNAWYPGSHGGTAV 528
Query: 528 ADVCFGRYNPSGRLPMTWYPQS 549
AD+ FG+ +PSG+LP+T+Y +
Sbjct: 529 ADILFGKCSPSGKLPVTFYKDT 550
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 198/571 (34%), Positives = 293/571 (51%), Gaps = 101/571 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L +D RV DLV RLTL EK++ + +A ++ RLGIP Y +W+E LHGV+ G
Sbjct: 23 FRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG--- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-----------GL 161
AT FPQ+I AA+++ L + +S EARA ++ LA GL
Sbjct: 80 -------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQGL 132
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN-RLKVAACCKH 220
TFW+PNIN+FRDPRWGRGQET GEDP L ++ A +V+GLQ G +P+ LK+AAC KH
Sbjct: 133 TFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQ---GDTPDTHLKLAACAKH 189
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
Y + + +R+ FNA V ++ YL F+
Sbjct: 190 YAVH---SGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEPCCA 246
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS--- 291
++VSDC ++ ++ + T P E+AA ++ G DL CG+
Sbjct: 247 SQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGTTFE 306
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-GHPSKQPYGQ-LGPKDVCT-Q 348
LG+ AV+ GL+ E+ ID A+S + +LG FD ++ P+ P+ + T
Sbjct: 307 LLGE----AVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIVTCA 362
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A+ LA +AA VLL+N LPL P ++++ + GP A ++GNY G P + T
Sbjct: 363 AHTALACEAAVASCVLLQNHNHILPLRPD-VRSIYITGPLAATQDALLGNYYGLPPRAIT 421
Query: 409 PLQGLAAV----VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES-- 462
L GLAA + Y+ G ++ A+ AS D T+ +G +E E
Sbjct: 422 LLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCASCDVTIACLGLTALLEGEEGE 481
Query: 463 -------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
DR D+ LP Q+L + + + +G +++I+ GG +S K+ +ILW
Sbjct: 482 AIASSLHGDRDDISLPPPQRLFLESL--IQRGARVIVILFGGSA-LSLGPLADKVEAILW 538
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
GYPGQ GG A+AD+ GR +PSGRLP+T+Y
Sbjct: 539 AGYPGQEGGRALADILLGRASPSGRLPITFY 569
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 203/571 (35%), Positives = 281/571 (49%), Gaps = 99/571 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L I+ RV DLV RLTL+EK+ ++++A ++ RLGIP Y WW+E LHGV T
Sbjct: 714 FRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR----T 769
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
+ T FPQ I AAS+N L + + ++ E RA+YN LT+W
Sbjct: 770 KYH-----VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHALTYW 824
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L SK +V GLQ G P LK +AC KHY +
Sbjct: 825 TPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQ---GDDPRYLKASACAKHYAVH 881
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+R+ FN+ V ++ YL F+
Sbjct: 882 SGPE---KNRHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDLL 938
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+ SDC ++D ++N AAA ++ G DL+CG
Sbjct: 939 MQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLALV 998
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDA 357
AVK G++ E +D ++ F RLG FD P++Q Y + + + +QDLA
Sbjct: 999 KAVKNGIITEKQLDVSVKRLFTIRFRLGLFD--PAEQVDYAHIPISVLECKKHQDLAKQL 1056
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----L 413
AR+ +VLLKN LPL +K + V+GPNA+ ++GNY G P + TPLQ L
Sbjct: 1057 ARESMVLLKNDR-LLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRERL 1115
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKK---AAASADATVLVMGADQSIEAESH------- 463
V +Y +G + T D+ K+ A ADA + + G +E E
Sbjct: 1116 KGVAEVVYVSGIDYIN--TVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGF 1173
Query: 464 ---DRLDLLLPG-QQQLLITEVAKVAKGPVILIIMSGGGFDISF-AKNDPKITSILWVGY 518
DR + LP Q QL+ VA + P + ++M+G I + AK+ P +IL Y
Sbjct: 1174 DGGDRTSIALPTVQTQLMKALVA--GRIPTVFVMMTGSALAIPWEAKHVP---AILNAWY 1228
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
GQ GG AIADV FG YNPSG+LP+T+Y +
Sbjct: 1229 GGQYGGEAIADVLFGDYNPSGKLPVTFYAKD 1259
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 188/559 (33%), Positives = 275/559 (49%), Gaps = 97/559 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV ++TL+EKI+ ++ A ++ RLGIP Y WW+E +HGV G AT
Sbjct: 14 RARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT----------AT 63
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDP 174
FPQ I AASF+ L +G+ V EAR YN+ + GLT W+PN+NIFRDP
Sbjct: 64 VFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRDP 123
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ +V+G+Q G P+ L+ AAC KH+ + + R
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVRFVEGMQ---GDDPDYLRAAACAKHFAVH---SGPEDQR 177
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
++F+A V ++ YL F+
Sbjct: 178 HYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLLVDILRGKWN 237
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
++ SDC ++ + T P ++ A ++ G DLNCG + E AV G V E
Sbjct: 238 FQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGDLYA-YLEEAVAEGKVKE 296
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
ID ++ F T M+LG FD K PY ++G V ++ Q L L+ A + +VLLKN
Sbjct: 297 ETIDRSLVRLFTTRMKLGMFDAE-EKVPYNKIGYDAVDSREMQALNLEVAEKILVLLKNE 355
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAG 424
+LPL + + +AV+GPNA+ K ++GNYEGT +Y T L G L V Y G
Sbjct: 356 NHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGEDVQVRYSEG 415
Query: 425 C----SNVQCGTAQ----VDDAKKAAASADATVLVMGADQSIEAE---------SHDRLD 467
C +Q G A+ + + + A D + +G D +E E S D+
Sbjct: 416 CHLYADKIQ-GLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQFASGDKQS 474
Query: 468 LLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAI 527
L LPG Q+ ++ + K PV+++++SG + A+ ++L YPG GG A+
Sbjct: 475 LSLPGNQESVLKACIESGK-PVVVVVLSGSALALGTAQEGA--AAVLQAWYPGAQGGRAV 531
Query: 528 ADVCFGRYNPSGRLPMTWY 546
A FG NP G+LP+T+Y
Sbjct: 532 ARALFGECNPQGKLPVTFY 550
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 202/554 (36%), Positives = 282/554 (50%), Gaps = 95/554 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV ++TL+EK++ + A ++ RLGIP Y WW+E LHGV G AT
Sbjct: 7 RARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT----------AT 56
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYN-------VGL-AGLTFWSPNINIFRDP 174
FPQ I AASF+ SL + + + +S EARA YN G+ GLTFWSPNIN+FRDP
Sbjct: 57 VFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLFRDP 116
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDPLL + T +++GLQ+ G K+ A KH+ A+ + R
Sbjct: 117 RWGRGHETYGEDPLLTGRMGTAFIRGLQE--GEDSQYRKLDATVKHFAAH---SGPEAGR 171
Query: 235 YHFNAMV----IYTYYLIKFKY-------------------------------------- 252
+ FNA V + YL F+Y
Sbjct: 172 HSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYLKGVLYEEWK 231
Query: 253 ----IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
+VSDC ++ + + H TK +E+AA ++ G LNCG +AAV+ GL++E
Sbjct: 232 FDGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGKAY-HWVKAAVEDGLISE 290
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ A+ F RLG FD S Y + + + +++L A++ IVLLKN
Sbjct: 291 DTVTCAVERLFEARFRLGMFD---SDCVYDSIPMNVIECRKHRELNRKMAQESIVLLKNN 347
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA--AVVATIYQAGCS 426
G LPL+P K +AVIGPNA+ ++GNY GTP +TT L+G+ A Y G
Sbjct: 348 -GILPLNPE--KTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEVYYARGSV 404
Query: 427 NVQ-----CGTAQVDDAKKAAASADATVLVMGADQSIE---------AESHDRLDLLLPG 472
V+ + +A A +AD VL +G +E A+S DR D+ LP
Sbjct: 405 LVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDRKDISLPD 464
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
QQ L+ + K PV+L+ +SGG D+ A D + +IL YPG GG A+AD+ F
Sbjct: 465 IQQQLLCAILDTEK-PVVLVNVSGGCVDLRQA--DERCAAILQCFYPGAEGGNALADILF 521
Query: 533 GRYNPSGRLPMTWY 546
GR +PSGRLP+T+Y
Sbjct: 522 GRVSPSGRLPVTFY 535
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 195/556 (35%), Positives = 280/556 (50%), Gaps = 99/556 (17%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+LV+++ L EK + L A ++ RLGIP Y WW+EALHGV+ G AT FP
Sbjct: 10 ELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV----------ATVFP 59
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWG 177
Q I AA F+ I +++ E RA YN G+TFW+PNINIFRDPRWG
Sbjct: 60 QAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIFRDPRWG 119
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RG ET GEDP L ++ ++KGLQ + + + LK AAC KH+ + + DR+HF
Sbjct: 120 RGHETYGEDPYLTARLGVAFIKGLQGDE--NEDYLKAAACAKHFAVH---SGPEEDRHHF 174
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
+A+V +Y YL F+
Sbjct: 175 DAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDILKKDWGFDG 234
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
YIVSDC ++ + T T E+AA +I G +LNCG ++L H A + GLV E
Sbjct: 235 YIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGNTYL--HMLEAHQEGLVKEEI 292
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
I A M+LG FD + Y ++ + ++++AL+A+R+ +V+LKN G
Sbjct: 293 ITEAAEKLMRIRMQLGLFDKNCK---YNEIPYAVNDCKVHREVALEASRRSMVMLKND-G 348
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC- 425
LPL+ +K++ +IGP AN + GNY GT +YTT ++G+ V Y GC
Sbjct: 349 ILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDDVRVYYSEGCH 408
Query: 426 ------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD---------RLDLLL 470
SN+ + +A A +D VL +G D +IE E D +L L L
Sbjct: 409 LFANGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQGDTGNAFAGGDKLSLNL 468
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
G+QQ L+ +V V K PVIL++ +G I++A D +I YPG GG A+A +
Sbjct: 469 IGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAIFQTWYPGAQGGKALAQL 525
Query: 531 CFGRYNPSGRLPMTWY 546
FG Y+PSG+LP+T+Y
Sbjct: 526 LFGEYSPSGKLPVTFY 541
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 203/564 (35%), Positives = 282/564 (50%), Gaps = 110/564 (19%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+ +LV ++TLQE+ L A ++ L I +Y WW+E LHGV+ G AT
Sbjct: 15 KAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------AT 64
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA F+ L + I +++TE RA YN GLTFWSPN+NIFRDP
Sbjct: 65 VFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRDP 124
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRG ET GEDP L S+ +VKGLQ G LK+AAC KH+ + G +
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQ----GDEKYLKIAACAKHFAVH-----SGPEG 175
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ FNA+V +Y YL F+
Sbjct: 176 LRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDILRGK 235
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
++VSDC ++ + T T E+AA +I G DLNCG+ + A K GLV
Sbjct: 236 WQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGNVYLQML-LAYKEGLV 294
Query: 307 NESAIDHAISNNFATLMRLGFFDGHP--SKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
E I A AT +RLG FD +K PY K+ + +++L A+R+ IV+
Sbjct: 295 TEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCKE-----HHEVSLMASRKSIVM 349
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----- 419
L+N G LPL + +K++ +IGPNA+ + GNY GT KY T L+G+ V +
Sbjct: 350 LRNN-GLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIRI 408
Query: 420 IYQAGC----SNVQCGTAQVDD----AKKAAASADATVLVMGADQSIEAE---------S 462
Y GC VQ A+ DD A A +D +L +G D SIE E +
Sbjct: 409 FYSEGCHLYKDRVQ-DLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGA 467
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
D+L+L LPG+QQ L+ +V K PVI+++ G G ++ + +IL YPG
Sbjct: 468 GDKLNLNLPGKQQELLEKVIATGK-PVIVVL--GAGSALTLQGQEENCAAILNAWYPGSF 524
Query: 523 GGAAIADVCFGRYNPSGRLPMTWY 546
GG AIAD+ FG+ +PSG+LP+T+Y
Sbjct: 525 GGRAIADLIFGKCSPSGKLPVTFY 548
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 295 bits (754), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 196/542 (36%), Positives = 275/542 (50%), Gaps = 64/542 (11%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV--VPGATSF 124
L+ TL E I ++ VSRLG+P Y+ WSEALHG+ +FS+ ATSF
Sbjct: 66 LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDRA----NFSDSGSYNWATSF 121
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQ ILT A+ N +L I ++ST+ RA N G GL ++PNIN FR P WGRGQETPG
Sbjct: 122 PQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQETPG 181
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG------------- 231
ED LA+ YA Y+ G+Q D S LK+AA KHY YD++NW
Sbjct: 182 EDVSLAAVYAYEYITGIQGPDPDS--NLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQ 239
Query: 232 -----------------------------------TDRYHFNAMVIYTYYLIKFKYIVSD 256
D Y ++ T+ + Y+ SD
Sbjct: 240 DLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSD 299
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
CD+ +YN Y + AAA++ILAG D++CG+ H ++ AG ++ I+ +
Sbjct: 300 CDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVI 359
Query: 317 NNFATLMRLGFFDGHPSK--QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ TL++ G+FD + +K PY L DV +++ AA QGIVLLKN+ LPL
Sbjct: 360 RLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPL 419
Query: 375 S----PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQ 429
+ P + +A+IGP AN T ++GNY G +P + + G
Sbjct: 420 TEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISS 479
Query: 430 CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA-KGP 488
T+ A AA SAD + G D ++EAE+ DR + PG Q LI ++A A P
Sbjct: 480 TSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKP 539
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
+I++ M GG D S KN+ +T++LW GYPGQ+GG A+ D+ G+ NP+GRL T YP
Sbjct: 540 LIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPA 599
Query: 549 SY 550
SY
Sbjct: 600 SY 601
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 196/560 (35%), Positives = 285/560 (50%), Gaps = 103/560 (18%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+LV ++TLQEK L + ++ L +P+Y WW+E LHGV+ G AT FP
Sbjct: 18 ELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVFP 67
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNV-------GL-AGLTFWSPNINIFRDPRWG 177
Q I AA F+ + +++TE RA YN G+ GLT+WSPNINIFRDPRWG
Sbjct: 68 QAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPRWG 127
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RY 235
RG ET GEDP L S+ ++KGLQ G LK+AAC KH+ + G + R+
Sbjct: 128 RGHETYGEDPYLTSRLGVAFIKGLQ----GEGKYLKLAACAKHFAVHS-----GPEGLRH 178
Query: 236 HFNAMV----IYTYYLIKFK---------------------------------------- 251
FNA+V +Y YL F+
Sbjct: 179 EFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGKWGF 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNES 309
++VSDC ++ + T T E+ A +I G DLNCG+ + A K GLV E
Sbjct: 239 KGHVVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCGNMY-LNLLLAYKEGLVTEE 297
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I A T +LG FD + Y ++ + ++ + ++AL A+R+ +VLLKN
Sbjct: 298 QITTAAERLMTTRFKLGMFD---EECEYNKIPYEVNDSREHNEVALIASRKSMVLLKNN- 353
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----IYQAGC 425
G+LPL + +K++AVIGPNAN + GNY GT KYTT L+G+ V Y GC
Sbjct: 354 GTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYSEGC 413
Query: 426 -------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLL 469
++ ++ +A A +D VL +G D +IE E + D+ +L
Sbjct: 414 HLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDKENLN 473
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LPG+QQ L+ +V +V K PVI+++ +G ++ A + K +IL YPG GG A+AD
Sbjct: 474 LPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGA--EEKCAAILNAWYPGSHGGTAVAD 530
Query: 530 VCFGRYNPSGRLPMTWYPQS 549
+ FG+ +PSG+LP+T+Y +
Sbjct: 531 ILFGKCSPSGKLPVTFYKDT 550
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 195/553 (35%), Positives = 276/553 (49%), Gaps = 71/553 (12%)
Query: 54 CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV------SY 107
C+TSL D R+ L+ ++T++EK T LVD A + R+G+P YEWW+EALHGV S+
Sbjct: 146 CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRIGLPPYEWWNEALHGVAGSRGVSF 205
Query: 108 VGP-GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSP 166
P G+ FS ATSFP IL A+F+ L + ++ EARA N +G FW+P
Sbjct: 206 DSPNGSDFSY----ATSFPLPILMGAAFDDPLIYDVASIIGKEARAFANYAHSGYDFWTP 261
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP--NRLKVAACCKHYTAY 224
N+N F DPRWGRG E P ED A +Y V GLQ GG + ++ A CKH+ Y
Sbjct: 262 NMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQ---GGKEKTDHKQIIATCKHFAVY 318
Query: 225 DLDNWKGTDRYHFNAMVIYTYYLIKFK--------------------------------- 251
D++ + Y + YYL FK
Sbjct: 319 DVETNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACASEYFLQDV 378
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTE 298
Y+ SDC++V ++ ++T T AAA ++ AG D NCG S+L +T
Sbjct: 379 LRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTDTNCGTSYLQLNT- 437
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
+V E+ +D +++ + L +G+FDG P Y L DV T Q A AA
Sbjct: 438 -SVANNWTTEAQMDISLTRLYNALFTVGYFDGQPE---YDGLSFADVSTPFAQATAYRAA 493
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
+GI LLKN G LPL + ++A+IGP AN T M G Y+G +PL A
Sbjct: 494 SEGITLLKND-GLLPLK-KSYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAAQAQWG 551
Query: 419 TI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
I + G + T A AA AD + G D SIE ES DR + PG Q L
Sbjct: 552 HISFTNGTAINSTNTTGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWPGNQLDL 611
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ +++++ K P++++ GG D S + + S++W GYPGQ GG+A+ DV G+ +P
Sbjct: 612 VQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVLVGKQSP 670
Query: 538 SGRLPMTWYPQSY 550
+GRL +T YP Y
Sbjct: 671 AGRLTITQYPADY 683
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 192/551 (34%), Positives = 277/551 (50%), Gaps = 89/551 (16%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +T+ EKI L + +++RL +P+Y WW+EALHG++ G AT
Sbjct: 39 RANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNG----------KAT 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ L + +S EARA Y + AGLTFW+PN+NIFRDP
Sbjct: 89 IFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVNIFRDP 148
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRGQET GEDP L ++ T +VKGLQ G P LK A KH+ + + + R
Sbjct: 149 RWGRGQETYGEDPFLTAQMGTAFVKGLQ---GDDPKYLKSAGVAKHFAVH---SGPESLR 202
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+HF+ +Y YL F+
Sbjct: 203 HHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEPACASAQLLDGILKKQWG 262
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
YIVSDC +++ TK+ E+AA ++ +G++LNCGS +AA++ LV
Sbjct: 263 FHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCGSTYEHFLKAALEQNLVPL 322
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ID ++ +LGFFD P+ PY ++ P + + + +L+ D AR+ IVLLKN
Sbjct: 323 ELIDQRLTQLLMIRFQLGFFD--PAGLNPYNEVTPDVIHSPEHINLSRDVARKSIVLLKN 380
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----IYQA 423
LPLS IK V GP A + +IGNY G + L+G+A V+ Y++
Sbjct: 381 DNHVLPLSKD-IKVPYVTGPFAASSDMLIGNYYGISDSLVSVLEGIAGKVSLGSSLNYRS 439
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQ 474
G ++ A + A +ADA + V+G +E E DR+ + LP Q
Sbjct: 440 GSLPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDAIASADRGDRVAITLPQNQ 499
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
+ ++A KGP+IL++ +G DIS +P +ILW+ YPG+ GG A+ADV FG
Sbjct: 500 VDYVKQLAAHKKGPLILVVAAGSPVDIS--DLEPLADAILWIWYPGEQGGNAVADVLFGD 557
Query: 535 YNPSGRLPMTW 545
NPSG LP+T+
Sbjct: 558 TNPSGHLPLTF 568
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 199/585 (34%), Positives = 294/585 (50%), Gaps = 65/585 (11%)
Query: 19 FLTYCSLQHSSSSAQSSPV--FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
++ Y + H S S+ + F+ SN L C+ + R L +TL+E
Sbjct: 25 YVDYNIVAHPDLSPLSTDLVDFSFPDCSNGPLRGSLVCDRTASAHDRAAALTSMMTLEEL 84
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
+ + ++ RLG+P Y+ W+EALHG+ Y+ T S +TSFP ILT A+ N
Sbjct: 85 MNSTGNRIPAIPRLGLPPYQIWNEALHGL-YLANFTE-SGPFSWSTSFPSPILTMATLNR 142
Query: 137 SLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP-LLASKYAT 195
+L I ++++T+ RA N G GL +SPNIN FR P WGRGQETPGED L S YA
Sbjct: 143 TLIHQIAQIIATQGRAFNNAGRYGLNAFSPNINAFRHPVWGRGQETPGEDANCLCSAYAY 202
Query: 196 GYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY-------------------- 235
Y+ GLQ G+ K+ A KHY YD++NW+ R+
Sbjct: 203 EYITGLQ----GNATNPKIIATAKHYAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFV 258
Query: 236 -------------HFNA---------------MVIYTYYLIKFKYIVSDCDSVDVLYNSQ 267
+NA +V ++ I+ Y+ SDCD+V ++N
Sbjct: 259 VAVRDAQVRSVMPSYNAVNGVPSSANTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPH 318
Query: 268 HYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
Y A+A S+ AG D++CG S+L E+ + G ++ S I+ A++ ++ L+ G
Sbjct: 319 GYAANLSSASAMSLRAGTDIDCGISYLTTLNESLTQ-GQISRSEIERAVTRFYSNLVSAG 377
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
+FDG PY L DV ++A +AA G+VLLKN G LPLS +++ +A+IG
Sbjct: 378 YFDG--PDAPYRDLSWSDVVRTNRWNVAYEAAVAGVVLLKND-GVLPLS-KSVQRVALIG 433
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASA 445
P AN T+ M GNY G T+PL + A + A +N+ A AA +
Sbjct: 434 PWANATEQMQGNYHGVAPYLTSPLAAVQASGLEVNYAFGTNITSNVTNCFAAALAAAEKS 493
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
D + G D ++EAE DR ++ PG Q LI + ++ K P++++ M GG D S K
Sbjct: 494 DIIIFAGGIDNTLEAEELDRANITWPGNQLELIHRLGELGK-PLVVLQMGGGQVDSSALK 552
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
K+ ++LW GYPGQAGG A+ D+ G+ P+GRL T YP Y
Sbjct: 553 ASEKVGALLWGGYPGQAGGQALWDILTGQRAPAGRLTTTQYPAEY 597
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 200/567 (35%), Positives = 287/567 (50%), Gaps = 107/567 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ +++ ++TL+EK++ L A +V GIPKY WW+E LHGV+ G AT
Sbjct: 6 RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDP 174
FPQ I AA+F+ + +++ +S E RA YN + GLTFW+PN+NIFRDP
Sbjct: 56 VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRGQET GEDP L + ++KGLQ G LKVAAC KHY + G +
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQ---GDDTEHLKVAACAKHYAVH-----SGPEK 167
Query: 234 -RYHFNAMV----IYTYYLIKFKYIV---------------------------------- 254
R+ F+A+V ++ YL FK +V
Sbjct: 168 LRHTFDAVVSKKDLFETYLPAFKLLVENGVEAVMGAYNRTLGEPCGGSTYLLKEILRGRW 227
Query: 255 -------SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
SDC ++ + + TK+PEE+AA ++ AG DLNCG + + K GLV
Sbjct: 228 GFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGCTY-PYLTVSHKKGLVT 286
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
+ ID A++ T +LG FD P + PY LG V + +++LAL+AA++ IVLLKN
Sbjct: 287 DETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHRNLALEAAQKSIVLLKN 345
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI------Y 421
+ LPL +A K L ++GP A T++ NY G + T L+GLA + T Y
Sbjct: 346 DSNILPLDDSARKIL-LMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTAISFEY 404
Query: 422 QAGC--------SNVQCGTAQVDDAKK--AAASADATVLVMGADQSIEAE---------S 462
+ G SNV G+ VD D + V G D S+E E +
Sbjct: 405 RQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSIASDAN 464
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR + LP Q + + K K V+LI+ GG I+F ++ ++L+ YPG+
Sbjct: 465 GDRDTIELPSWQLNFLRRIRKAGK-KVVLILT--GGSPIAFPED--LADAVLFAWYPGEQ 519
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQS 549
GG A+AD+ FG +PSG+LP+T +PQS
Sbjct: 520 GGNAVADILFGDVSPSGKLPIT-FPQS 545
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 196/562 (34%), Positives = 275/562 (48%), Gaps = 70/562 (12%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N L S C+ + R LV + QEK+ LV + V+RLG+P Y WW EALHG
Sbjct: 26 NGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEALHG 85
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
V+ PG +F+ ATSFP +L +A+F+ L I V+ EARA N G+A + FW
Sbjct: 86 VA-GAPGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFW 144
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P+IN FRDPRWGRG ETPGED L Y + GL+ G + K+ A CKHY Y
Sbjct: 145 TPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLE----GDKAQRKIIATCKHYVGY 200
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
D++NW GTDR+HF+A + + Y++ F+
Sbjct: 201 DVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYV 260
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
YI SDC++V + Y T +EA A + G+DL+C
Sbjct: 261 LEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLSCEYSGTS 320
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
A GL+N S ID A++ + L+ G+FDG + Y LG +D+ T Q L L
Sbjct: 321 DIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDG--AAATYAHLGVQDINTPEAQKLVL 378
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ---- 411
A +G+ LLKN +LPLS + +A++G AN T + G Y G TP+
Sbjct: 379 QVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHTPVYAGNK 437
Query: 412 -GLAAVVAT--IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
GL VAT I Q + T ++ AKK +D + G D S AE DR D+
Sbjct: 438 LGLDMAVATGPILQTSGAADNWTTTALNAAKK----SDFILYFGGLDPSAAAEGSDRTDI 493
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
P Q LIT++A + K P+++I + G D + + S++W +PGQ GG A+
Sbjct: 494 SWPSAQIDLITKLAALGK-PLVVIAL-GDMVDHTPILKMKGVNSLIWANWPGQDGGTAVM 551
Query: 529 DVCFGRYNPSGRLPMTWYPQSY 550
V G + +GRLP+T YP Y
Sbjct: 552 QVITGEHAIAGRLPITQYPAEY 573
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 196/568 (34%), Positives = 287/568 (50%), Gaps = 97/568 (17%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
+ F +T L +D R+ DL+ RLTL+EK +++++ ++ RLGIP Y+WW+EALHG+
Sbjct: 30 SEFDFYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRS 89
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AG 160
G AT FPQ I A+F+ L + +S EARA +N + G
Sbjct: 90 GV----------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGG 139
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LTFW+PN+NIFRDPRWGRGQET GEDP L SK +VKGLQ G + LK AA KH
Sbjct: 140 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQ---GDNDKYLKTAAAAKH 196
Query: 221 YTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK----------------------- 251
Y + G + R+ FNA V ++ YL FK
Sbjct: 197 YAVH-----SGPEKLRHEFNADVSEKDLWETYLPAFKTLVDANVETIMCAYNSTNGEPCC 251
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG--- 290
++VSDC ++ + ++PE AAA ++ G++LNCG
Sbjct: 252 ANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTY 311
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
+FL K AV+ GLV+E +D + T +LG FD S PY ++G + + + +
Sbjct: 312 NFLAK----AVEDGLVSEELVDKRLHKLLETRFKLGLFDPEES-NPYNKIGVEVMNSDEH 366
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
+ LA + AR+ IVLLKN G LPL K + GPNA + ++GNY G T L
Sbjct: 367 RALARETARKSIVLLKND-GVLPLKNNLSKYF-ITGPNATNIEVLLGNYHGVNPDMVTVL 424
Query: 411 QGLAAVVA----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---- 462
+G+A + Y+ G D A A ++DAT +VMG +E E
Sbjct: 425 EGIAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAGNSDATFVVMGISGLLEGEEGESI 484
Query: 463 -----HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
DR+D LP Q + +V++ A+ ++ I++GG ++ + ++L V
Sbjct: 485 ASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGS-PMNLTEVHKLADAVLLVW 543
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTW 545
YPG+ GG A+AD+ FG+ +PSGRLP+T+
Sbjct: 544 YPGEEGGNAVADIIFGKNSPSGRLPITF 571
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 200/558 (35%), Positives = 276/558 (49%), Gaps = 101/558 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV ++T++EK + L A ++ RLGIP Y WW+EALHGV+ G AT
Sbjct: 9 RAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------AT 58
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDP 174
FPQ I A+F+ +L I V++TE RA YN A GLTFWSPN+NIFRDP
Sbjct: 59 VFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRDP 118
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ +VKGLQ G +K AAC KH+ + + R
Sbjct: 119 RWGRGHETYGEDPCLTSRLGVAFVKGLQ----GDGETMKAAACAKHFAVH---SGPEAVR 171
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ FNA + YL F+
Sbjct: 172 HEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCASPVLQKILREDWGF 231
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNE 308
+ VSDC ++ + T T +E+AA +I +G DLNCG ++L H A + GLV+E
Sbjct: 232 EGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTYL--HILHAYRDGLVSE 289
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
I A F T LG FDG Y + V ++ + LA AA + VLLKN
Sbjct: 290 ETITEAAVRLFTTRFLLGLFDG----SEYDDIPYTVVESKEHLALAEKAALESAVLLKNN 345
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAG 424
G LPL ++ + VIGPNA+ + GNY GT +Y T QGL V + G
Sbjct: 346 -GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRVLTSVG 404
Query: 425 CSNVQCGTAQVD-------DAKKAAASADATVLVMGADQSIEAE---------SHDRLDL 468
C+ + T ++ +A+ A ++D +L +G D+++E E S D+ L
Sbjct: 405 CALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGLDETLEGEEGDTGNSYASGDKETL 464
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIA 528
LLP Q+ L+ VA K PV+L +MSG D+S+A +IL + YPG GG+A A
Sbjct: 465 LLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAE--HFDAILQLWYPGSQGGSAAA 521
Query: 529 DVCFGRYNPSGRLPMTWY 546
+ FG +PSG+LP+T+Y
Sbjct: 522 KLLFGEVSPSGKLPVTFY 539
>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
Length = 238
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 142/238 (59%), Positives = 176/238 (73%), Gaps = 1/238 (0%)
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
++ G VNE I++A++N MRLG FDG PS Q YG LG DVC ++ +LAL+AARQ
Sbjct: 1 MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
GIVLL+N SLPLS + +AVIGPN++VT+TMIGNY G C YTTPLQG+A TI
Sbjct: 61 GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120
Query: 421 YQAGCSNVQC-GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
+QAGC++V C G + A+ AA ADATVLV+G DQSIEAE DR +LLLPG QQ L++
Sbjct: 121 HQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELVS 180
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
VA+ ++GP IL+IMSGG D+ FAKNDP+I +I+WVGYPGQAGG AIADV FG NP
Sbjct: 181 RVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 194/582 (33%), Positives = 289/582 (49%), Gaps = 81/582 (13%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+ P L+ + C+ +L R LV LT +EK+ LV A R+G+P Y WWSEALH
Sbjct: 19 TKPPLSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEALH 78
Query: 104 GVSYVGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
GV++ PGT F + PG +TSFP +L AA+F+ L +A+G V+ TEARA N G +
Sbjct: 79 GVAHA-PGTQFRDG-PGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEARAFGNAGWS 136
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL------- 212
GL +W+PN+N FRDPRWGRG ETPGED + +YA ++GL+ S +
Sbjct: 137 GLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFGSGGEP 196
Query: 213 -KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------- 251
+V + CKHY D ++W GT R+ F+A++ + YYL F+
Sbjct: 197 PRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYN 256
Query: 252 -----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
Y+ SDC++V + HY T E
Sbjct: 257 AVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYADTNAEGTGLCFE 316
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
AG+D +C A G + A+D A++ + +L+R+G+FDG + P+ LG
Sbjct: 317 AGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDG--PESPHASLGW 374
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLS-PTAI--------KNLAVIGPNANVTK 393
DV Q+LAL AA +GIVLLKN +LPL P + + +A+IG A+
Sbjct: 375 ADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPD 434
Query: 394 TMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA-----KKAAASADAT 448
+ G Y G P +P + + AG ++ + + +D +AAA AD
Sbjct: 435 KLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYI 494
Query: 449 VLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDP 508
V G D S E+ DR+ + P Q LI+E+A++ K PV+++ M D + D
Sbjct: 495 VYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPLFELD- 552
Query: 509 KITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ ++LW +PGQ GG A+ + G +P+GRLP+T YP +Y
Sbjct: 553 GVGAVLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANY 594
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 290 bits (743), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 201/575 (34%), Positives = 285/575 (49%), Gaps = 82/575 (14%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
LAS C+ SL R LV L + EK+ LV +A +R+G+PKY WWSEALHGV+Y
Sbjct: 38 LASNKVCDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVAY 97
Query: 108 VGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF 163
PGT F PG +TSFP +L AASF+ SL + IG V+ TE+RA N +GL +
Sbjct: 98 A-PGTQFRRG-PGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDY 155
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR-LKVAACCKHYT 222
W+PN+N F+DPRWGRG ETPGED L +YA +KGL+ G P + +V + CKHY
Sbjct: 156 WTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLE---GPHPEKERRVVSTCKHYA 212
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
A D ++W GT R+ F+A + + YYL+ F+
Sbjct: 213 ANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVPSCANS 272
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
Y+ SDC++V + Y +T E A AG D +C
Sbjct: 273 YLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDTSCEYTP 332
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
A GL+ E +D A+ + L+R+G+FDG+ S + + DV A QDL
Sbjct: 333 SSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNAPAAQDL 390
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIK-------------NLAVIGPNANVTKTMIGNYE 400
+L +A +GIV+LKN G+LPL A LA+IG A+ + + G Y
Sbjct: 391 SLQSAVEGIVMLKND-GTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLRGGYS 449
Query: 401 GTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDD-----AKKAAASADATVLVMGAD 455
GT TP + + AG +Q A D A AA AD V G D
Sbjct: 450 GTAAYLRTPAYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYIVYFGGLD 509
Query: 456 QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
++ E+ DR D+ PG Q L+ +A + K P++++ M G D + + + ++LW
Sbjct: 510 ETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQM-GDQLDGTPLLANAGVGAVLW 567
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+PGQ GG A+ + G +P+GRLP+T YP +Y
Sbjct: 568 ASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANY 602
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 209/616 (33%), Positives = 312/616 (50%), Gaps = 109/616 (17%)
Query: 5 QNRNRAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
Q++ KV ++ +FLT ++ +++ F + +T+ + R
Sbjct: 6 QHKLLPTKVVLMLIFLTLLPKHLAAQKNENTKNFE-------------WFDTNKSFEKRA 52
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
LV LTL+EKI+ +VD + + RL IP+Y WW+E LHGV+ G AT F
Sbjct: 53 KALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNGR----------ATVF 102
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRW 176
PQ I AA+F+ L + +STEARA +N +A GLTFW+PNINIFRDPRW
Sbjct: 103 PQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGLTFWTPNINIFRDPRW 162
Query: 177 GRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--R 234
GRGQET GEDP L S+ +VKGLQ G P LK AAC KHY + G + R
Sbjct: 163 GRGQETYGEDPYLTSQIGVNFVKGLQ---GNHPKYLKSAACAKHYAVHS-----GPEELR 214
Query: 235 YHFNAM-----VIYTY--------------------------------YLIK-------- 249
+ F+A+ + TY YL++
Sbjct: 215 HEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWG 274
Query: 250 FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
FK YIVSDC ++ L+ T+T EE+AA ++ GL++NCG+ + A+K GL +E
Sbjct: 275 FKGYIVSDCWALSDLHKFHKVTQTAEESAAAALNVGLNVNCGNVY-PALDGAIKQGLTSE 333
Query: 309 SAIDHAISNNFATLMRLGFFDGHPS-KQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
+D+ + + T +LGFFD PS PY ++ V ++A++ +AL+AA++ IVLLKN
Sbjct: 334 KQLDNVLQHQLLTRFKLGFFD--PSNNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKN 391
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----IYQA 423
L +K++ V GPNA ++GNY G K T L G+ + V+ Y+
Sbjct: 392 NNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQ 450
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQ 474
G Q +D + + AD ++VMG + E E DR+D+ LP Q
Sbjct: 451 GLLPFQKNVNPIDWSTGEISRADVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQ 510
Query: 475 QLLITEV-AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
I ++ AK P++L++ GG I+ + + +I++ YPG+ GG A+AD+ FG
Sbjct: 511 IDYIKKIKAKNTGNPLVLVLT--GGSPIAMPEVYDLVDAIVFAWYPGEEGGQAVADILFG 568
Query: 534 RYNPSGRLPMTWYPQS 549
PSG+LP+T +P+S
Sbjct: 569 DVVPSGKLPIT-FPKS 583
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 202/576 (35%), Positives = 289/576 (50%), Gaps = 65/576 (11%)
Query: 33 QSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGI 92
++ P+ + D + P L S C+ S R L+ TL E I ++ VSRLG+
Sbjct: 49 ETIPLSSPDCQNGP-LRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGL 107
Query: 93 PKYEWWSEALHGVSYVGPGTHFSNV--VPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
P Y+ WSEALHG+ +FS+ ATSFPQ ILT A+ N +L I ++ST+
Sbjct: 108 PVYQVWSEALHGLDRA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQG 163
Query: 151 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN 210
RA N G GL ++PNIN FR P GRGQETPGED LA+ YA Y+ G+Q D S
Sbjct: 164 RAFNNAGRYGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDS-- 221
Query: 211 RLKVAACCKHYTAYDLDNWKG--------------------------------------- 231
LK+AA KHY YD++NW
Sbjct: 222 NLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAY 281
Query: 232 ---------TDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSIL 282
D Y ++ T+ + Y+ SDCD+ +YN Y + AAA++IL
Sbjct: 282 NAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAIL 341
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK--QPYGQL 340
AG D++CG+ H ++ AG ++ I+ + + TL++ G+FD + +K PY L
Sbjct: 342 AGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDL 401
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS----PTAIKNLAVIGPNANVTKTMI 396
DV +++ AA QGIVLLKN+ LPL+ P + +A+IGP AN T ++
Sbjct: 402 TWSDVLETDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLL 461
Query: 397 GNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQ-CGTAQVDDAKKAAASADATVLVMGAD 455
GNY G +P + A + + T+ A AA SAD + G D
Sbjct: 462 GNYYGNAPYMISPRVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGID 521
Query: 456 QSIEAESHDRLDLLLPGQQQLLITEVAKVA-KGPVILIIMSGGGFDISFAKNDPKITSIL 514
++EAE+ DR + PG Q LI ++A A P+I++ M GG D S KN+ ++++L
Sbjct: 522 NTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALL 581
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
W GYPGQ+GG A+ D+ G+ NP+GRL T YP SY
Sbjct: 582 WGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASY 617
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/269 (54%), Positives = 184/269 (68%), Gaps = 3/269 (1%)
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCGS++ H +A++ G + E I+ A+ N FA MRLG F+G P + YG +GP
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 345 VCTQANQDLALDAARQGIVLLKNT--AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
VCTQ +QDLAL+AA+ GIVLLKN AG+LPLS + +LAVIG NAN + GNY G
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 403 PCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE 461
PC TPLQ L V T + AGC++ C + +A +AA+SAD+ VL MG DQ E E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQDQERE 180
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
DRLDL LPGQQQ LI VA AK PVIL+++ GG D+SFAK +PKI +ILW GYPG+
Sbjct: 181 EVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGE 240
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
AGG AIA V FG +NP GRLP+TWYPQ +
Sbjct: 241 AGGIAIAQVLFGEHNPGGRLPVTWYPQDF 269
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 189/567 (33%), Positives = 273/567 (48%), Gaps = 92/567 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F +T L D RV DLV RLTL+EK+ ++++ ++ RL IP Y WW+E LHG+ T
Sbjct: 26 FRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR----T 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
+ T FPQ I AA+++A L + + +S E RA+YN A GLT+W
Sbjct: 82 EYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLTYW 136
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L +V GLQ G LK AAC KHY +
Sbjct: 137 TPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQ---GDDSQYLKAAACAKHYAVH 193
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ R+ FN V ++ YL F+
Sbjct: 194 ---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCCGNNLL 250
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+ SDC ++D Y + AAA ++ +G D++CG+ K
Sbjct: 251 MQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKYAAADAVYSGTDIDCGNEAYKALV 310
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDA 357
AVK GL+ E ID ++ F RLG FD P++ + ++ + +Q ++DLAL
Sbjct: 311 DAVKTGLITEEQIDISLKRLFEIRFRLGMFD--PAEDVKFSKIPLSVLESQPHKDLALKI 368
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
R+ IVLLKN LPLS +K +AVIGPNA+ +++GNY G P + TP + + +
Sbjct: 369 TRESIVLLKNENNFLPLS-KKLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKAIKNKL 427
Query: 418 ---ATIYQAGCSNV---QCGTAQVDDAKKAAASADATVLVMGADQSIEAESH-------- 463
IY+ G V + ++ K D + G +E E
Sbjct: 428 KNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVKIEGFT 487
Query: 464 --DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
DR + LP Q L+ + K + P + ++M+G + + + +IL Y GQ
Sbjct: 488 GGDRTSIKLPKIQTELM-QALKAERIPTVFVMMTGSAIAAEWESQN--VPAILNAWYGGQ 544
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQ 548
G AIADV FG YNPSG+LP+T+Y +
Sbjct: 545 DAGTAIADVLFGDYNPSGKLPVTFYTK 571
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 211/608 (34%), Positives = 292/608 (48%), Gaps = 109/608 (17%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS-------------- 86
D +S P LA C+T+ R LV+ L + EK+ LV+ S
Sbjct: 24 DCISGP-LAKTLACDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITP 82
Query: 87 ------VSRLGIPKYEWWSEALHGVSYVGPGTHFSNV---VPGATSFPQVILTAASFNAS 137
R+G+P Y WW+EALHGV+ PG F+ ATSF I AA+F+
Sbjct: 83 HSMSLGAERIGLPAYAWWNEALHGVA-ASPGVSFNQAGQEFSHATSFANTITLAAAFDND 141
Query: 138 LFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG------------------ 179
L + +STEARA N LAGL +W+PNIN ++DPRWGRG
Sbjct: 142 LVYEVADTISTEARAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRT 201
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
Q+TPGEDP+ + GYV+ L + G KV A CKH+ AYDL+ W+G RY FNA
Sbjct: 202 QKTPGEDPV----HIKGYVQALLEGLEGRDKIRKVIATCKHFAAYDLERWQGALRYRFNA 257
Query: 240 MV----IYTYYLIKFK-------------------------------------------- 251
+V + YYL F+
Sbjct: 258 VVTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHN 317
Query: 252 -YIVSDCDSV-DVLYNSQHYTKTPEEAAAKSILAGLDLNC---GSFLGKHTEAAVKAGLV 306
YI SDC+++ D L N ++++TP +AAA + AG D C G A L+
Sbjct: 318 NYITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLL 377
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
+E ID A+ + L+R G+ D S PY ++ V T Q LAL +A GIVLLK
Sbjct: 378 SEEIIDRALRRLYEGLIRAGYLDSA-SPHPYTKISWSQVNTPKAQALALQSATDGIVLLK 436
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCS 426
N G LPL T K +A+IG AN T+ M+G Y G P Y P+ + T + A
Sbjct: 437 NN-GLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGP 494
Query: 427 NVQCGTAQVDD----AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
Q + D A AA+ +D + + G D SI AE DR + P Q L+T +A
Sbjct: 495 VNQSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLA 554
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
++ K P I+ + G D + ++P I+SILWVGYPGQ+GG A+ ++ G +P+ RLP
Sbjct: 555 QMGK-PTIVARL-GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLP 612
Query: 543 MTWYPQSY 550
+T YP++Y
Sbjct: 613 VTVYPETY 620
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 194/558 (34%), Positives = 270/558 (48%), Gaps = 93/558 (16%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
D+RV DL+ +LTL+EK+ ++D + SV RL +PKY WW+EALHGV+ G
Sbjct: 32 DVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV---------- 81
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFR 172
AT FPQ I A+F+ L + +S EARAMYN + GLTFW+PNINIFR
Sbjct: 82 ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFWTPNINIFR 141
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGRGQET GEDP L S+ +++GLQ G P LKVAAC KH+ + G
Sbjct: 142 DPRWGRGQETYGEDPFLTSQIGVAFIQGLQ---GDDPEHLKVAACAKHFAVHS-----GP 193
Query: 233 D--RYHFNAMV----IYTYYLIKFK----------------------------------- 251
+ R+ FNA+ + YL FK
Sbjct: 194 ERLRHSFNAIASPKDLRETYLPAFKALVNARVEAVMCAYNRTNSEVCCGSNLLLDQILRD 253
Query: 252 ------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGL 305
++VSDC ++ Y EA A ++ G+DLNCG E AVK GL
Sbjct: 254 EWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLNCGDEYPALIE-AVKRGL 312
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E ID A++ T +LG FD PY + + + ++ LA + A + IVLL
Sbjct: 313 ITEKEIDKALATLLKTRFKLGLFDPK-QNSPYNNIPVSVINSTDHRALAKEVALKSIVLL 371
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIY 421
KN LPL + + GPNA ++GNY G +T L+G+A + Y
Sbjct: 372 KNEK-CLPLK-NNLSKYYITGPNAASVDALMGNYYGVNPHMSTILEGIAGAIQPGSQMQY 429
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPG 472
+ G + +D A ++D T +VMG +E E DRLD LP
Sbjct: 430 KPGILLDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDYNLPK 489
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q + ++ K K V+ II G ++S + W YPG+ GG A+AD+ F
Sbjct: 490 NQIDFLRKIRKGNKNKVVAIITGGSPMNLSEVHELADAVLLAW--YPGEEGGNAVADILF 547
Query: 533 GRYNPSGRLPMTWYPQSY 550
G+ +PSGRLP+T +P+S+
Sbjct: 548 GKVSPSGRLPVT-FPKSF 564
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 206/594 (34%), Positives = 289/594 (48%), Gaps = 91/594 (15%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA--- 84
S+ + +S AC S+P FCN SL RV DL++RL L EK+T L A
Sbjct: 15 STPTVDASIPRAC---SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPK 71
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
G++S +G+P+Y W + +HGV T +N ATSFP + A F+ + +
Sbjct: 72 GNMSSIGLPEYNWGANCVHGVQ----STCGTNC---ATSFPNPVNLGAIFDPRAVFDMAQ 124
Query: 145 VVSTEARAMYNVGLA---------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAT 195
VV E RA++ G GL WSPNINI RDPRWGR ETP EDPL+ SKY
Sbjct: 125 VVGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGV 184
Query: 196 GYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-IYTY---YLIKFK 251
Y KGLQ+ G L+ KHY AY +++ G DR FNA+V Y + YL F+
Sbjct: 185 AYTKGLQE--GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFE 242
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YI SD ++ +Y+ +HY
Sbjct: 243 ASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHY 302
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
TKT EA +IL+G D+N GS + V +G + E A+D A+ LG FD
Sbjct: 303 TKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD 362
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
QPY + P +V T ++ L+LD +R+ IVLL+N LPL+ K LAVIGP+A
Sbjct: 363 -PIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHA 419
Query: 390 NVTKTMIGNYEGTPCK--------YTTPLQGLA---AVVATIYQAGCSNVQCGTAQVDDA 438
+ ++GNY G C TPL+ + T+Y G TA D+A
Sbjct: 420 AAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAGFDEA 479
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG- 497
+ AA A+ VL +G D SIE E+ DR ++ +P Q L+ V + K P ++++ +GG
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538
Query: 498 -GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G + D + + YPG G A++D+ FG PSG+LP+T YP +Y
Sbjct: 539 VGAEELILHTDGVVEAF----YPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNY 588
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 200/572 (34%), Positives = 284/572 (49%), Gaps = 67/572 (11%)
Query: 41 DVVSNPSLASLGF-CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
D V+ P+L + C+T+ R ++ + + EK+ L+D + +RLG+P YEWWS
Sbjct: 29 DCVNGPALLTSNLVCDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWS 88
Query: 100 EALHGVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
EALHGV+ PG +F S ATSFP I +++F+ Q I V+STEARA N
Sbjct: 89 EALHGVA-GSPGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAA 147
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR-----L 212
GL +++PNIN F+DPRWGRG ETPGEDPL Y + GL+ TD G N
Sbjct: 148 RGGLDYFTPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYK 207
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
K+ A CKH+ YDL++W G RY ++A + + YYL F+
Sbjct: 208 KMIATCKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNS 267
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
YI SDC+++ +Y + +Y+ AA S+
Sbjct: 268 VNSVPACANSYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGLSLSN 327
Query: 284 GLDLNC---GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
G+D C + + + G V E+ I A+ + L+ G+FD S PY +
Sbjct: 328 GMDTACIVANTGVMTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFD-PASSNPYRSI 386
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
G V T A Q LA AA +G LLKNT G LP T+ +A+IG AN T M G Y
Sbjct: 387 GWSSVNTPAAQTLARQAATEGTTLLKNT-GLLPYKFTSQTKVAMIGMWANGTSQMQGGYS 445
Query: 401 GTPCKYTTPLQGLAAV-VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA-DQSI 458
G +PL + + ++ Y G N T+ AAA +L G D S+
Sbjct: 446 GPAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSV 505
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
EAE+ DR + PG QQ LI ++A + K P+I++ M G D + ++ I++++WVGY
Sbjct: 506 EAEAMDRYQIAWPGAQQALIAQLAALGK-PMIVLQM-GSMLDATPILSNNNISALVWVGY 563
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PGQ GG A D+ G P+GRLP+T YP Y
Sbjct: 564 PGQDGGVAAFDILTGAVAPAGRLPVTMYPADY 595
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 194/570 (34%), Positives = 277/570 (48%), Gaps = 98/570 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L +D+RV D++ RLTL+EK+ + +A +V RLGIP Y WW+EALHGV+
Sbjct: 28 FRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVA------ 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
T FPQ I AA+F+ Q +G + S+E RA++N L GLT+
Sbjct: 82 ---RTKEKVTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGEIYRGLTY 138
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L +K + V GL+ G +P LK AC KHY
Sbjct: 139 WTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLE---GNNPEYLKSVACAKHYAV 195
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ + +R+ ++A V ++ YL F+
Sbjct: 196 H---SGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEGTPCCGHNE 252
Query: 252 --------------YIVSDCDSVDVL--YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
Y+ SDC +V Y+ H T EA A ++L G DL CG+ K
Sbjct: 253 LLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDT--EAVADAVLNGTDLECGNLYQK 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ V+ GL++E I+ +++ F +LG +D + PY +G + + A++ A
Sbjct: 311 -LQQGVEKGLISEKDINVSLARLFEIQFKLGMYDP-ADRVPYASIGREVIECDAHKKHAY 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL-- 413
+ A++ +VLLKN LPL+ + IK +A+IGPN + T++ NY GTP + TP + L
Sbjct: 369 EMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITPYKSLQK 428
Query: 414 ----AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIE---------- 459
+ + T+ G G A AD + V G E
Sbjct: 429 RFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAGDAGAAG 488
Query: 460 ---AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
S DR + LP Q L+ E+ K + P+IL+ MSG +SF +IL
Sbjct: 489 YGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSV--MSFDWESRNADAILQA 545
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
Y GQA G AI DV FG YNP+GR+P+T Y
Sbjct: 546 WYGGQAAGDAITDVLFGDYNPAGRMPLTTY 575
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 204/584 (34%), Positives = 296/584 (50%), Gaps = 106/584 (18%)
Query: 42 VVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
V+ P++ + ++S + +R DL++R++L EK++ L ++A + RLG+P Y++W+EA
Sbjct: 198 TVAEPAI----WRDSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEA 253
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN------ 155
HG++ G AT FPQ I AA++N +L G V+ E RA +N
Sbjct: 254 AHGIANNGI----------ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRH 303
Query: 156 ----VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
GLT+W+PNIN+FRDPRWGRGQET GEDP L ++ +VKG+Q G P
Sbjct: 304 NGDSKWWTGLTYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQ---GDDPRY 360
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------- 251
+ AC KHY + + R+ FNA + ++ YL F+
Sbjct: 361 MLAMACAKHYAVH---SGPERTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYN 417
Query: 252 --------------------------YIVSDCDSVDVLYNSQ--HYTKTPEEAAAKSILA 283
Y+ SDCD++ +Y + HY KT EEAAA ++ A
Sbjct: 418 AVNGVPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKA 477
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGP 342
G +L CG A V+ GLV E +D A+ + T RLG FD P++Q P+
Sbjct: 478 GCNLCCGGDYNALVRA-VQQGLVTEKDLDGALYHTLWTRFRLGLFD--PAEQVPFSGYTL 534
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
KD A+ +AL+ ARQ IVLLKN G+LPL T +K +AVIGPNA + GNY G+
Sbjct: 535 KDNDLPAHSQVALELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYHGS 593
Query: 403 PCKYTTPLQGLAAVVAT---IYQAGCSNV--QCGTA---------------QVDDAKKAA 442
+ + L + +V + I A S V + GTA +A K A
Sbjct: 594 ASRSISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLA 653
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
A ADA + V G + E ES DR + LP +Q+ LI + K PV+++ SG ++
Sbjct: 654 AEADAIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-PVVMVNCSGSAMALT 712
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ D + +I+ YPGQ GG A+A+V FG NPSG LP+T+Y
Sbjct: 713 W--QDENLPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFY 754
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 197/568 (34%), Positives = 281/568 (49%), Gaps = 99/568 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L ++ RV DLV RLTL+EK+ +++S V RLGIP Y WW+E LHG+ T
Sbjct: 27 FRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGIGR----T 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------VGLAGLTFW 164
+ T FPQ I AA++N +L + + ++ E RA+YN LT+W
Sbjct: 83 KYH-----VTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQYHALTYW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G +P LK +AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQ---GDNPRYLKASACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ +R+ FN+ V ++ YL F+
Sbjct: 195 ---SGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYNAFQGQPCCGNDLL 251
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+ SDC ++D ++N AAA ++ G DL+CG
Sbjct: 252 MQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFHGTDLDCGHSAYLALV 311
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDA 357
AVK G++ E +D ++ F RLG FD P + Y ++ + + +QDLA
Sbjct: 312 KAVKDGIITEKQLDVSVKRLFTIRFRLGLFD--PVELVDYARIPISILECRKHQDLAKQL 369
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----L 413
AR+ +VLLKN LPL +K + V+GPNA+ ++++GNY G P + TPLQ L
Sbjct: 370 ARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSRMLTPLQAIRERL 428
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKK---AAASADATVLVMGADQSIEAESH------- 463
Y G +V T DD K+ A ADA + + G +E E
Sbjct: 429 GGWTEVEYIEGVDHVN--TISADDLKQYVNRAKGADAVIFIGGISPRLEGEEMPVSKDGF 486
Query: 464 ---DRLDLLLPG-QQQLLITEVAKVAKGPVILIIMSGGGFDISF-AKNDPKITSILWVGY 518
DR + LP Q Q++ VA+ P + ++M+G I + A+N P +IL Y
Sbjct: 487 DGGDRTTIALPAVQTQMMKAWVAEHI--PTVFVMMTGSALAIPWEAQNVP---AILNAWY 541
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQ GG AIADV FG YNPSG+LP+T+Y
Sbjct: 542 GGQYGGEAIADVLFGDYNPSGKLPVTFY 569
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 287 bits (735), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 201/577 (34%), Positives = 286/577 (49%), Gaps = 102/577 (17%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ +L F N L +++RV DLV RLTL+EK+ + + ++ RLGIP Y+WW+EALHGV+
Sbjct: 21 VENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA- 79
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGL 158
+ T FPQ I AA+F+ Q +G + STE RA++N
Sbjct: 80 --------RTLEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRY 131
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
GLT+W+PNINIFRDPRWGRGQET GEDP L +K V+GL+ G P+ LK AC
Sbjct: 132 RGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLE---GEDPHYLKSVACA 188
Query: 219 KHYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFKYIVS------------------- 255
KHY + + +R+ F+A ++ Y+ F+ +V+
Sbjct: 189 KHYAVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPC 245
Query: 256 ---DCDSVDVLYNSQHYT-----------------KT-PEEAAAKS--ILAGLDLNCGSF 292
D VD+L N H+ KT PE A S +LAG DL CG+
Sbjct: 246 CGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHTIAMSDALLAGTDLECGNL 305
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQ 351
E VK GL +E I+ ++S F L ++G FD P+++ PY +G + + +A++
Sbjct: 306 YHLLAE-GVKKGLHSERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHK 362
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
A A++ IVLL+N LPL + IK++A+IGPNA+ +T + NY GTP + TP
Sbjct: 363 QHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYM 422
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKA---------AASADATVLVMGADQSIE--- 459
L + + N G VD K A AA +D V V G E
Sbjct: 423 SLKRRLGDKIKI---NYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEA 479
Query: 460 ----------AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
S DR + LP Q L+ ++ K + P+I++ MSG +SF
Sbjct: 480 GDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSV--MSFEWESQN 536
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++L Y GQA G AI DV FG NP+GR+P+T Y
Sbjct: 537 ADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTY 573
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 287 bits (735), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 191/568 (33%), Positives = 274/568 (48%), Gaps = 100/568 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL R DL RLTL+E+ + L A +V RL IP Y WWSE LHGV+ G
Sbjct: 4 YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT-- 61
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA F+ +G ++ EARA YN A GL W
Sbjct: 62 --------ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLW 113
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRGQET GEDP L ++ + KGLQ G LK AAC KH +
Sbjct: 114 SPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQ----GEGEVLKTAACAKHLAVH 169
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ R+ F+A+ + YL F+
Sbjct: 170 ---SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACASKFL 226
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y VSDC ++ + + TKT E+AA ++ G DLNCG ++L H
Sbjct: 227 MGKLDEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGNTYL--HLLH 284
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A GL+N+ I A ++ T +RLG FD + Y +L V + N+ A +
Sbjct: 285 AYNEGLINDEDIKKACTHLMRTRVRLGMFD---DETEYDKLDYSIVANEENKAYARKCSE 341
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-- 417
+ +V+LKN G LPL P+ IK + VIGPNA+ + GNY G +Y T L+G+
Sbjct: 342 RSMVMLKNN-GILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQDAFGG 400
Query: 418 -------ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE--------- 461
+ +Y+ C + ++ +A+ +D VL +G D +IE E
Sbjct: 401 RVLYSEGSHLYKDRCMGLAVADDRLSEAEIVTEHSDVVVLCVGLDATIEGEEGDTGNEFS 460
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
S D+ DL LP Q+ L+ V + K PVI++ +G ++ + +++ YPGQ
Sbjct: 461 SGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----EADCDALIHAWYPGQ 514
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQS 549
GG A+AD+ FG+ +PSG+LP+T+Y +
Sbjct: 515 FGGTALADILFGKISPSGKLPVTFYTDT 542
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 287 bits (735), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 191/547 (34%), Positives = 275/547 (50%), Gaps = 89/547 (16%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+++ R+T+ EKI+ L++ + ++ LGI Y+WWSE LHGV G AT FP
Sbjct: 36 EIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------ATVFP 85
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWG 177
Q I A+F+ +L + IG V+TE RA +NV AGLTFWSPN+NIFRDPRWG
Sbjct: 86 QPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIFRDPRWG 145
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-LDNWKGTDRYH 236
RG ET GEDPLL+ T YV+GLQ D LK AC KHY + + + H
Sbjct: 146 RGMETYGEDPLLSGMLGTAYVRGLQGDDA---FYLKTGACAKHYAVHSGPEGTRHEADIH 202
Query: 237 FNAMVIYTYYLIKFK------------------------------------------YIV 254
+ ++ YL +FK +IV
Sbjct: 203 PSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKSWGFNGHIV 262
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ Y Y KTPEEA A +I AGL++ CG K + A+ GL+ E+ +D A
Sbjct: 263 SDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALDQGLLAEADLDRA 321
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ T ++LG + S PY ++C+ A+ LAL AA + +VLLKN G LPL
Sbjct: 322 LFPLVMTRLKLGILE-PDSACPYNSYDESEICSPAHTALALRAADEAMVLLKNN-GILPL 379
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
I+ L V GP A+ ++GNY G +Y+T LQG +V+ + N + Q
Sbjct: 380 DKN-IRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQG---IVSRVSSGTSVNFRPAFMQ 435
Query: 435 VDD-------AKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQQLLI 478
+ + A A +A+ ++VMG + ++E E DR+ + LP Q +
Sbjct: 436 ITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYL 495
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
V K KG I+++++GG I K +++ YPGQ GG A+ D+ FG N S
Sbjct: 496 RRV-KARKGGRIVVVLTGGS-PIDLRKISKLADAVVMAWYPGQEGGEALGDLLFGDKNFS 553
Query: 539 GRLPMTW 545
GRLP+T+
Sbjct: 554 GRLPITF 560
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 287 bits (735), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 205/594 (34%), Positives = 288/594 (48%), Gaps = 91/594 (15%)
Query: 28 SSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA--- 84
S+ + +S AC S+P FCN SL RV DL++RL L EK+T L A
Sbjct: 15 STPTVDASIPRAC---SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPK 71
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
G++S +G+P+Y W + +HGV T +N ATSFP + A F+ + +
Sbjct: 72 GNMSSIGLPEYNWGANCVHGVQ----STCGTNC---ATSFPNPVNLGAIFDPRAVFDMAQ 124
Query: 145 VVSTEARAMYNVGLA---------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAT 195
VV E RA++ G GL WSPNINI RDPRWGR ETP EDPL+ SKY
Sbjct: 125 VVGWELRALWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGV 184
Query: 196 GYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-IYTY---YLIKFK 251
Y KGLQ+ G L+ KHY AY +++ G DR FNA+V Y + YL F+
Sbjct: 185 AYTKGLQE--GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFE 242
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YI SD ++ +Y+ +HY
Sbjct: 243 ASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHY 302
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
TKT EA +IL+G D+N GS + V +G + E A+D A+ LG FD
Sbjct: 303 TKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD 362
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
QPY + P +V T ++ L+LD +R+ IVLL+N LPL+ K LAVIGP+A
Sbjct: 363 -PIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHA 419
Query: 390 NVTKTMIGNYEGTPCK--------YTTPLQGLA---AVVATIYQAGCSNVQCGTAQVDDA 438
+ ++GNY G C TPL+ + T+Y G T D+A
Sbjct: 420 AAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTGGFDEA 479
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG- 497
+ AA A+ VL +G D SIE E+ DR ++ +P Q L+ V + K P ++++ +GG
Sbjct: 480 EAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGV 538
Query: 498 -GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G + D + + YPG G A++D+ FG PSG+LP+T YP +Y
Sbjct: 539 VGAEELILHTDGVVEAF----YPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNY 588
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 190/547 (34%), Positives = 275/547 (50%), Gaps = 89/547 (16%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+++ R+T+ EKI+ L++ + ++ LGI Y+WWSE LHGV G AT FP
Sbjct: 36 EIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------ATVFP 85
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWG 177
Q I A+F+ +L + IG V+TE RA +NV AGLTFWSPN+NIFRDPRWG
Sbjct: 86 QPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFRDPRWG 145
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-LDNWKGTDRYH 236
RG ET GEDPLL+ T YV+GLQ D LK AC KHY + + + H
Sbjct: 146 RGMETYGEDPLLSGMLGTAYVRGLQGDDA---FYLKTGACAKHYAVHSGPEGTRHEADIH 202
Query: 237 FNAMVIYTYYLIKFK------------------------------------------YIV 254
+ ++ YL +FK +IV
Sbjct: 203 PSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKSWGFNGHIV 262
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ Y Y KTPEEA A +I AGL++ CG K + A+ GL+ E+ +D A
Sbjct: 263 SDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALDQGLLAEADLDRA 321
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ T ++LG + S PY ++C+ A+ LAL AA + +VLLKN G LPL
Sbjct: 322 LFPLVMTRLKLGILE-PDSACPYNSYDESEICSPAHTALALRAADEAMVLLKNN-GILPL 379
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
I+ L V GP A+ ++GNY G +Y+T LQG +V+ + N + Q
Sbjct: 380 DKN-IRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQG---IVSRVSSGTSVNFRPAFMQ 435
Query: 435 VDD-------AKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQQLLI 478
+ + A A +A+ ++VMG + ++E E DR+ + LP Q +
Sbjct: 436 ITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIGLPASQMNYL 495
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
V K KG I+++++GG I + +++ YPGQ GG A+ D+ FG N S
Sbjct: 496 RRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQEGGEALGDLLFGDKNFS 553
Query: 539 GRLPMTW 545
GRLP+T+
Sbjct: 554 GRLPITF 560
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 196/566 (34%), Positives = 276/566 (48%), Gaps = 71/566 (12%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P L S C+ + R LV + QEK+ LV + V+RLG+P Y WW E
Sbjct: 23 DCVNGP-LKSNAICDVTASPAKRAAALVAAMQTQEKLDNLVSKSKGVARLGLPAYNWWGE 81
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
ALHGV+ PG +F+ ATSFP +L +A+F+ L I V+ EARA N G+A
Sbjct: 82 ALHGVAGA-PGINFTGPYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAP 140
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
+ FW+P+IN FRDPRWGRG ETPGED L Y + GL+ G + K+ A CKH
Sbjct: 141 VDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLE----GDKAQRKIIATCKH 196
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
Y YD+++W GTDR+ F+A + + Y++ F+
Sbjct: 197 YVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCA 256
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
YI SDC++V + Y T +EA A + G+DL+C
Sbjct: 257 DTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLSCEY 316
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
A GL+N S ID A++ + L+ G+FDG + Y LG +D+ T Q
Sbjct: 317 SGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDG--AAATYANLGVQDINTPEAQ 374
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
L L A +G+ LLKN +LPLS + +A++G AN + + G Y G P+
Sbjct: 375 KLVLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANDSSKLSGIYSGPAPYLHNPVY 433
Query: 412 -----GLAAVVAT--IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
GL VAT I Q + T +D AKK +D + G D S AE D
Sbjct: 434 AGNKLGLDMAVATGPILQKSGAADNWTTKALDAAKK----SDTILYFGGLDPSAAAEGSD 489
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
R D+ P Q LIT++A + K P+++I + G D N + S++W +PGQ GG
Sbjct: 490 RTDISWPSAQIDLITKLAALGK-PLVVIAL-GDMVDHMPILNMKGVNSLIWANWPGQDGG 547
Query: 525 AAIADVCFGRYNPSGRLPMTWYPQSY 550
A+ V G + +GRLP+T YP Y
Sbjct: 548 TAVMQVITGEHAIAGRLPITQYPAKY 573
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 185/564 (32%), Positives = 272/564 (48%), Gaps = 113/564 (20%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N + +L FC+ +L IDLR D V R+T+ EKI L G ++ LG+P Y WWSEA G
Sbjct: 31 NATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASSG 90
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
V P T F+ +P + TA SFN +L++A G + EARA+ N G A T+W
Sbjct: 91 VMGSRPTTKFA--------YP--VTTAMSFNRTLWRATGAAIGREARALMNAGAAYSTYW 140
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +N+ R+PRWGR E PGEDP L +YAT +V G Q P L+ +ACCKHY A
Sbjct: 141 APVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAP-EDPYHLQASACCKHYVAN 199
Query: 225 DLDNWKGT-----DRYHFNAMV----IYTYYLIKFK------------------------ 251
+L+N + DR H ++ V + Y++ F+
Sbjct: 200 ELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNAVNGVPSC 259
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
YI SDCD+ +Y++ HY TPEEA A + AG D++C SF+
Sbjct: 260 ANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVADVLKAGTDVDCQSFV 319
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-GHPSKQPYGQLGPKD----VCTQ 348
G+H +A+ GL+ E+ +D + N F +RLG FD + +P G L D VC+
Sbjct: 320 GQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEIDADAVVCSD 379
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
A+ D +++ Q LLKN G+LPL P+ AV+GPNA ++K G Y T
Sbjct: 380 AHLDASMEGLAQSATLLKND-GALPLKPSGTA--AVVGPNALLSKADAGYYGPTD----- 431
Query: 409 PLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
+ADA VL +G D + AE D +
Sbjct: 432 -----------------------------------AADAVVLAVGTDLTWAAEGKDATSI 456
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS--FAKNDPKITSILWVGYPGQAGGAA 526
+ Q LI VA + PV++++ S D++ A++D K+ +++ VG P
Sbjct: 457 VFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVGQP-SVTVKG 515
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ +GR + +GR T YP +Y
Sbjct: 516 LGDLLYGRRSFAGRAVQTVYPAAY 539
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 285 bits (730), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 194/556 (34%), Positives = 268/556 (48%), Gaps = 105/556 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV R+TL+EK + L A ++ RLGIP Y WW+E LHGV+ G AT FPQ
Sbjct: 13 LVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------ATVFPQ 62
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I AA+F+ + +V+TE RA YN GLTFWSPN+NIFRDPRWGR
Sbjct: 63 AIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRDPRWGR 122
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYH 236
G ET GEDP L + +VK LQ G+ + +K AAC KH+ + G + R+
Sbjct: 123 GHETYGEDPYLTKELGVSFVKALQ----GNGDTMKAAACAKHFAVH-----SGPEALRHE 173
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
F+A + YL F+
Sbjct: 174 FDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSPTLQKKLRGEWKFQG 233
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+ VSDC ++ + T T E+AA +I G DLNCG ++L H A + GLV E
Sbjct: 234 HFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGNTYL--HIMKAYEKGLVTEET 291
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
I A F T LG FDG Y L +V + + D A AA + VLLKN G
Sbjct: 292 ITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEVESPRHLDAAEKAAEKSFVLLKNN-G 346
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
LPL +K + +IGPNA+ + +IGNY GT +Y T +G+ V + GC
Sbjct: 347 ILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRILTSRGCD 406
Query: 427 NVQCGTA-------QVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLL 470
+ T ++ +AK A ++D +L MG D+++E E S D+ D+ L
Sbjct: 407 LFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGDTGNSYVSGDKEDIEL 466
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG Q+ L+ +A K PV+ +++G D+ +A +LW YPG GG A A V
Sbjct: 467 PGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQGGKAAAKV 523
Query: 531 CFGRYNPSGRLPMTWY 546
FG +PSG+LP+T+Y
Sbjct: 524 LFGEISPSGKLPVTFY 539
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 285 bits (728), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 202/566 (35%), Positives = 285/566 (50%), Gaps = 101/566 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L + R DLV R+TL+EKI + A S+ RLG+P Y WW+EALHGV+ G
Sbjct: 30 YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV-- 87
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
+T FPQ I AA+F+A L + V+STE RA Y+ GLTFW
Sbjct: 88 --------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFW 139
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SP INI RDPRWGRGQET GEDP L S+ A +++G+Q G LK AAC KH+ +
Sbjct: 140 SPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQ----GRGRYLKAAACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ ++R+ FNA V ++ YL F+
Sbjct: 196 ---SGPESERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCGSGTL 252
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+ SDC ++ + TKT EE++A ++ +G DLNCG +
Sbjct: 253 LGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYASLVK 312
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG-PKDVCTQANQDLALDA 357
A +AGL+ E ID A+ T MRLG FD P K PY + K+ C + ++ AL+
Sbjct: 313 -AYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAE-HRAFALEV 369
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A + +VLL+N +G LPL + I+++AVIGPNA+ + GNY GT +Y T L G+ V
Sbjct: 370 AEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIREAV 429
Query: 418 ---ATIYQAGCS----NVQCGTAQVD----DAKKAAASADATVLVMGADQSIEAE----- 461
A +Y A S N G +Q + +A AA AD V+ +G ++ IE E
Sbjct: 430 GDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEGDPS 489
Query: 462 ----SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ D+ DL LPG Q+ L+ E K PV+L+++SG +++A + W
Sbjct: 490 NEYPAGDKRDLRLPGLQEELL-ETVKATGTPVVLVLLSGSALAVNWADENADAVVQAW-- 546
Query: 518 YPG-QAGGAAIADVCFGRYNPSGRLP 542
YPG QA G A FG P+G P
Sbjct: 547 YPGAQAEGRRGA--LFGIIRPAGGFP 570
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 199/573 (34%), Positives = 280/573 (48%), Gaps = 100/573 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA---GSVSRLGIPKYEWWSEALHGVSYVG 109
FCN+SL +DLRV DL++RL L EK+ L A GS+ RLG+P+Y W + +HGV
Sbjct: 36 FCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGVQSTC 95
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------G 160
GTH ATSFP + A F+ + + +V+ E RA+ G G
Sbjct: 96 -GTH------CATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGARENYARGPHIG 148
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L WSPNINI RDPRWGR ETP EDP + +KY Y KGLQ+ G L+ KH
Sbjct: 149 LDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQE--GQDSRFLQAVVTLKH 206
Query: 221 YTAYDLDNWKGTDRYHFNAMV-IYTY---YLIKFK------------------------- 251
Y AY +N+ GTDR F+A+V Y + Y F+
Sbjct: 207 YLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNGIPTCA 266
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-------N 288
YI SD ++ +++ YTKT EA ++ +G+D+ N
Sbjct: 267 NKWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICSGNAYWN 326
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
C L T + ++ID AI +LG FD QP+ GP+DV T
Sbjct: 327 CLKQLANSTNFS--------ASIDEAIRRTLKLRFQLGLFDA-IGDQPH--FGPEDVRTA 375
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK--- 405
+ L+LD AR+ IVLL+N +LPL +AVIGP++ + ++GNY G C
Sbjct: 376 KSLQLSLDLARKSIVLLQNHGNTLPLRLGL--RIAVIGPHSMTRRGIMGNYYGQLCHGDY 433
Query: 406 -----YTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQS 457
+PL+ + +V T + GC TA+ DDA +A +AD VL +G D S
Sbjct: 434 DEVRCIQSPLEAIQSVNGRNNTHHVNGCGINDTSTAEFDDALQAVRTADVAVLFLGIDIS 493
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
IE ES DR ++ +P QL + + +VA P ++++ +GG I K S+L
Sbjct: 494 IERESKDRDNIDVP-HIQLELLKAIRVAGKPTVVVLFNGGILGIE--KLILYADSVLEAF 550
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
YPG G AIA++ FG NPSG+LP+T Y ++
Sbjct: 551 YPGFFGAQAIAEILFGSINPSGKLPVTMYRSNF 583
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 284 bits (726), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 290/603 (48%), Gaps = 113/603 (18%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
VLF+ +C+ H+ + + P F N L + RV DLV RLTL+
Sbjct: 9 VLFVLWGFCA--HTGKAQNTFP----------------FQNPDLSFEKRVDDLVSRLTLE 50
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EKI+ +++ A ++ RLGIP Y+WW+E LHGV T + T FPQ I AA++
Sbjct: 51 EKISQMLNKAPAIERLGIPAYDWWNECLHGVGR----TPYK-----VTVFPQAIGMAATW 101
Query: 135 NASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGRGQETPGED 186
+ +LFQ + ++ E RA+Y+ ++ GLT+W+PNINIFRDPRWGRGQET GED
Sbjct: 102 DDALFQQVASSIADEGRAIYHDAISKGVHEIYHGLTYWTPNINIFRDPRWGRGQETYGED 161
Query: 187 PLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----I 242
P L +V GLQ G P LK +AC KHY + + R+ FN V +
Sbjct: 162 PYLTGTLGKAFVNGLQ---GDDPKYLKASACAKHYAVH---SGPEISRHFFNTEVSMYDL 215
Query: 243 YTYYLIKFK------------------------------------------YIVSDCDSV 260
+ YL F+ Y+ SDC ++
Sbjct: 216 WDTYLPAFRDLVVDAKVSSVMCAYNALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAI 275
Query: 261 DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
D +H T A+A ++L G DL CG + AVK GL+ E+ ID ++ F
Sbjct: 276 DDFL--KHKTHADAAHASADAVLHGTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLF 333
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCT-QANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
T RLG FD P+ + P V ++ LAL +R+ +VLLKN LPL
Sbjct: 334 MTRFRLGLFD--PADRVKYADTPLSVLECDEHKALALKMSRESVVLLKND-NVLPLRKN- 389
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCGTAQ 434
+K +AVIGPNA+ + ++GNY G P K TPL+ + + V IY V+ +
Sbjct: 390 LKKIAVIGPNADDSTVVLGNYNGFPSKVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEK 449
Query: 435 VDDAK-KAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
+A + D + V G +E E DR + LP Q L+ ++ K
Sbjct: 450 TLNALIERLKGVDQVIFVGGISPRLEGEELPISVDGFRGGDRTTIALPEVQTELMKKM-K 508
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
A PVI ++M+G I + + I +IL Y GQ G AIADV FG YNPSG+LP+
Sbjct: 509 EAGLPVIFVMMTGSALGIEWESQN--IPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPV 566
Query: 544 TWY 546
T+Y
Sbjct: 567 TFY 569
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 193/560 (34%), Positives = 281/560 (50%), Gaps = 102/560 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
+V ++T+ EKI + A ++ RL IP+Y +W+EALHGV+ G AT FPQ
Sbjct: 17 IVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------ATVFPQ 66
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I AA+F+ L I V+ TE RA YN GLTFWSPN+NIFRDPRWGR
Sbjct: 67 AIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDPRWGR 126
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYH 236
G ET GEDP L SK+ ++KGLQ G LK+AA KH+ + G + R+
Sbjct: 127 GHETYGEDPFLTSKFGVAFIKGLQ----GQAKYLKLAATAKHFAVH-----SGPEGLRHG 177
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
F+A+V +Y YL FK
Sbjct: 178 FDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDILHDKWSFE 237
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSD + + ++ + YTK E +I AGL+L G + + A+ GLV E
Sbjct: 238 GHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGH-IEQSLHEALNRGLVTEEE 296
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
I +A+ + +AT +RLG F + Y + + T+A+ +L+ AA + VLLKN G
Sbjct: 297 ITNAVISLYATRVRLGMF---ATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-G 352
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGCS 426
LPL ++ +AV+GPNA+ ++GNY GTP + T L+G L V Y G
Sbjct: 353 VLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSG 412
Query: 427 NVQCGTAQ--------VDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLL 469
Q A+ +A AA +D V V+G D +IE E + D+ +L
Sbjct: 413 VFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAGDKPNLS 472
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LPG+Q+ L+ + V K PV++++ SG + +N P + +I+ + YPG GG A+AD
Sbjct: 473 LPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVAD 531
Query: 530 VCFGRYNPSGRLPMTWYPQS 549
V FG +PSG+LP+T+Y +
Sbjct: 532 VLFGTVSPSGKLPVTFYKNT 551
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 203/596 (34%), Positives = 289/596 (48%), Gaps = 96/596 (16%)
Query: 22 YCSLQHSSSSAQSSPVFACDV-------VSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
YC S S+A V+A D+ N LASL C+ +L R LV +T +
Sbjct: 13 YCLFSLSCSAAL---VYAIDLPFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTE 69
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN---VVPGATSFPQVILTA 131
EK+ LV + R+G+P Y WWSEALHGV+Y PGT F + +TSFP +L A
Sbjct: 70 EKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY-APGTQFRSGNGTFNSSTSFPMPLLMA 128
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
A+F+ L + +G+V+ E RA N G +G +W+PN+N F+DPRWGRG ETPGED L
Sbjct: 129 ATFDDELIERVGEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIK 188
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYL 247
+YA ++GL+ R + A CKHY A D ++W G+ R+ FNA V + YYL
Sbjct: 189 RYAASMIRGLEGPVRERERR--IVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYL 246
Query: 248 IKFK---------------------------------------------YIVSDCDSVDV 262
F+ YI SDC++V
Sbjct: 247 SPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLD 306
Query: 263 LYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATL 322
+ + HY KT E A + AG+D +C A GL+ +S +D A+ + L
Sbjct: 307 ISANHHYAKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGL 366
Query: 323 MRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKN- 381
+++G+FDG+ S+ Y LG V +Q++AL AA +GIVLLKN +LPL KN
Sbjct: 367 VQVGYFDGNRSE--YASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-TLPLG--VKKNG 421
Query: 382 ----LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ--- 434
LA+IG AN KT+ G Y GTP +P+ A+ + AG +Q T++
Sbjct: 422 PKLKLAMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPVLQNSTSKDTW 481
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
A AA A+ + G D S E+ DR + P Q LIT+++K+ K P++++ M
Sbjct: 482 TQAALAAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM 540
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G D + I SILW +P P+GRLP+T Y +Y
Sbjct: 541 -GDQLDNTPLLASKAINSILWANWP----------------VPAGRLPVTQYHANY 579
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 187/519 (36%), Positives = 263/519 (50%), Gaps = 66/519 (12%)
Query: 87 VSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVV 146
VSRLG+ Y+WW+EALHGV++ G + AT FPQ I ++A+F+ L + IG ++
Sbjct: 48 VSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSSAAFDDHLIERIGVII 106
Query: 147 STEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
STEARA N G A L FW+PN+N FRDPRWGRG ETPGED K+A +V+G+Q
Sbjct: 107 STEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQGMQ---- 162
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKF------------ 250
G+ + +V A CKHY AYDL+N T R++F+A V + YYL F
Sbjct: 163 GTESTHRVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSI 222
Query: 251 ---------------------------------KYIVSDCDSVDVLYNS---QHYTKTPE 274
+Y+VSDCD+V L N+ Y +
Sbjct: 223 MCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANGGHRYKSSYA 282
Query: 275 EAAAKSILAGLDLNCGSFLGKHTE--AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP 332
A S+ AG D C + G + +A + ++ +D A+ L++ G+FDG
Sbjct: 283 AAIGASLEAGCDNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGLVKAGYFDGPN 342
Query: 333 SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS-PTAIKNLAVIGPNANV 391
S Y L DV TQ +D AL AA +GIVLLKN LPL+ + +A+IG AN
Sbjct: 343 SL--YRNLTAADVNTQVARDTALKAAEEGIVLLKND-NILPLTLGGSNTQVAMIGFWANA 399
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLV 451
M+G Y G+P P+ ++ T+ Q A A AA + +
Sbjct: 400 ADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGPLTQT-NADTSAAVNAAQKSSVVIFF 458
Query: 452 MGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
G D ++E ES DR + P Q +I +A+ K PVI++ M G D + + P +
Sbjct: 459 GGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRM-GTHVDDTPLLSIPNVK 516
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ILW GYPGQ GG A+ ++ G +P+GRLP+T YP SY
Sbjct: 517 AILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSY 555
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 195/574 (33%), Positives = 278/574 (48%), Gaps = 90/574 (15%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
C V S F + +L I+ R L+ LTL+EKI+ L + V RL IP Y WW+
Sbjct: 16 CTAVVELQGQSTNFRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWN 75
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA 159
EALHGV+ G AT FPQ I AA+F+ +L I +STEAR+ YN+ +
Sbjct: 76 EALHGVARAGE----------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRS 125
Query: 160 --------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN- 210
G+TFW+PNINIFRDPRWGRGQET GEDP L + +VKGLQ G P
Sbjct: 126 KGFQNQYLGITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQ---GSEPER 182
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------- 251
RLK AA KH+ + + DR+HFNA+V + YL FK
Sbjct: 183 RLKTAAGAKHFAVH---SGPEADRHHFNAVVDEKDLRETYLPAFKALVENGVTTIMCAYN 239
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+V+DC ++D ++ T E AA ++ AG+
Sbjct: 240 RVNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGV 299
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
+L+C + L + + A++ L+ +D A+ T ++LGF+D PS PY G V
Sbjct: 300 NLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYDD-PSHSPYRHYGIDSV 358
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+ LA +AA + +VLLKN G LPL I ++ V+G NA + GNY G
Sbjct: 359 NNSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGN 417
Query: 406 YTTPLQGLAAV----VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE 461
T ++GL ++ Y GCS AA D T+ V+G +E E
Sbjct: 418 MVTFVEGLVKAGGPGMSVQYDYGCSFADTSHF---GGIWAAGFTDVTIAVIGLSPLLEGE 474
Query: 462 S---------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
D+ DL +P ++ + ++ + PVI ++ G DIS +P +
Sbjct: 475 HGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAI--EPYADA 532
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ YPG+ GG A+AD+ FG +PSGRLP+T+Y
Sbjct: 533 IIYAWYPGEQGGTALADLIFGEVSPSGRLPITFY 566
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 192/541 (35%), Positives = 274/541 (50%), Gaps = 74/541 (13%)
Query: 71 LTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPG-------THFSNVVPGATS 123
LTL EK LV++A V RLG+P YEW +EALHG++ V PG T + +T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71
Query: 124 FPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETP 183
FP I+ A+F+ L I VSTEARA N AGL +W+PNIN +RDPRWGRGQETP
Sbjct: 72 FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131
Query: 184 GEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV-- 241
GEDP ++YA YV GL+ G P + KV + CKH+ YD+++ G R +NA++
Sbjct: 132 GEDPYHVAQYAYNYVVGLK--GGVGPAKSKVVSTCKHFAGYDIEDSDGVVRGSYNAIIST 189
Query: 242 --IYTYYLIKFK---------------------------------------------YIV 254
+ YYL F+ ++
Sbjct: 190 QDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGSSAHWVT 249
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
DC +VD ++N H ++ + A +I G DL+CG+ + +AV+ E+ +D A
Sbjct: 250 GDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTTEAQLDQA 309
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+S +++L+ LG+FD P Q Y LG DV T + Q LA A +GI + LP+
Sbjct: 310 LSRLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQKLAYTALVEGINI-------LPI 361
Query: 375 SPTAIKNLAVIGPNA-NVTKTMIGNYEGTPCKYTTPL---QGLAAVVATIYQAGCSNVQC 430
P + + +GP A N + +M GNY G T P+ A Y G V
Sbjct: 362 RPMG-QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQGLQYVLS 420
Query: 431 G-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
T+Q A AA AD V + G D+ +EAE+HDR + PG Q LI ++A V PV
Sbjct: 421 NDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQLAAVK--PV 478
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+++ + GG D S + + +LW+GYPGQ G+ + D+ G P+GRLP+T YP +
Sbjct: 479 VVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPVTQYPAN 538
Query: 550 Y 550
Y
Sbjct: 539 Y 539
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 189/547 (34%), Positives = 274/547 (50%), Gaps = 89/547 (16%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
+++ R+T+ EKI+ L++ + ++ LGI Y+WWSE LHGV G AT FP
Sbjct: 36 EIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------ATVFP 85
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWG 177
Q I A+F+ +L + IG V+TE RA +NV AGLTFWSPN+NIFRD RWG
Sbjct: 86 QPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFRDLRWG 145
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-LDNWKGTDRYH 236
RG ET GEDPLL+ T YV+GLQ D LK AC KHY + + + H
Sbjct: 146 RGMETYGEDPLLSGMLGTAYVRGLQGDDA---FYLKTGACAKHYAVHSGPEGTRHEADIH 202
Query: 237 FNAMVIYTYYLIKFK------------------------------------------YIV 254
+ ++ YL +FK +IV
Sbjct: 203 PSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDILRKSWGFNGHIV 262
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ Y Y KTPEEA A +I AGL++ CG K + A+ GL+ E+ +D A
Sbjct: 263 SDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALDQGLLAEADLDRA 321
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ T ++LG + S PY ++C+ A+ LAL AA + +VLLKN G LPL
Sbjct: 322 LFPLVMTRLKLGILE-PDSACPYNSYDESEICSPAHTALALRAADEAMVLLKNN-GILPL 379
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
I+ L V GP A+ ++GNY G +Y+T LQG +V+ + N + Q
Sbjct: 380 DKN-IRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQG---IVSRVSSGTSVNFRPAFMQ 435
Query: 435 VDD-------AKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQQLLI 478
+ + A A +A+ ++VMG + ++E E DR+ + LP Q +
Sbjct: 436 ITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYL 495
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
V K KG I+++++GG I + +++ YPGQ GG A+ D+ FG N S
Sbjct: 496 RRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQEGGEALGDLLFGDKNFS 553
Query: 539 GRLPMTW 545
GRLP+T+
Sbjct: 554 GRLPITF 560
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 201/578 (34%), Positives = 285/578 (49%), Gaps = 95/578 (16%)
Query: 43 VSNPSLAS-LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
V++P+LA+ + +T L + R+ L+ +TL+EK + LV+ ++ RLG+P+Y++W+EA
Sbjct: 15 VASPTLANDHPWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEA 74
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---- 157
LHGV+ G AT FPQ I AA+F+ L V+S EARA +NV
Sbjct: 75 LHGVARNGR----------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIG 124
Query: 158 ----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
+GLTFW+PNINIFRDPRWGRGQET GEDP L ++ V GLQ G P LK
Sbjct: 125 NRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQ---GDHPKYLK 181
Query: 214 VAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK---------------- 251
AA KH+ + G + R+ F+A+ +Y Y F+
Sbjct: 182 TAAAAKHFAVH-----SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYN 236
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+IVSDC + + T E+AA +I G
Sbjct: 237 RVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESAALAINTGT 296
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCGS + AV+AGLV+E ID + AT +LGFFD PY + V
Sbjct: 297 DLNCGSVYTALPD-AVEAGLVDEKTIDTRLHKVLATKFKLGFFDPK-DDNPYNSISADVV 354
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+ A+ D+A + A + IVLL+N LPL I+N+ V GP A+ ++ ++GNY G K
Sbjct: 355 NSDAHADVAYEMAVKSIVLLQNENQVLPLDKN-IRNVYVTGPFASSSEVLLGNYYGLSGK 413
Query: 406 YTTPLQGLAAVVA---TI-YQAGCSNVQCGTAQVD-DAKKAAASADATVLVMGADQSIEA 460
T L G+ A V+ TI Y+ G Q +D +A D + VMG + E
Sbjct: 414 TTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEG 473
Query: 461 ES---------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
E DRL L LP Q + ++ K PVI+++ +G +++
Sbjct: 474 EEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPVIVVLTAGTPVNVTEIAQLADAI 533
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
W YPGQ GG A+AD+ FG +PSGRLP+T +P+S
Sbjct: 534 VFAW--YPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 200/578 (34%), Positives = 287/578 (49%), Gaps = 95/578 (16%)
Query: 43 VSNPSLAS-LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
V+ P+LA+ + +T L R+ L+ +TL+EK + LV+ ++ RLG+P+Y++W+EA
Sbjct: 15 VATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEA 74
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---- 157
LHGV+ G AT FPQ I AA+F+ L V+S EARA +NV
Sbjct: 75 LHGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIG 124
Query: 158 ----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
+GLTFW+PNINIFRDPRWGRGQET GEDP L ++ V GLQ G P LK
Sbjct: 125 NRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQ---GDHPKYLK 181
Query: 214 VAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK---------------- 251
AA KH+ + G + R+ F+A+ +Y Y F+
Sbjct: 182 TAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYN 236
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++VSDC + + T E+AA +I G
Sbjct: 237 RVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGT 296
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG+ +A V+AGLV+E ID +S AT +LGFFD PY + V
Sbjct: 297 DLNCGAVYNALPDA-VEAGLVDEKTIDKRLSKVLATKFKLGFFDPK-DDNPYNNISADVV 354
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
++A+ +A + A + IVLL+N LPL I+NL V GP A+ ++ ++GNY G K
Sbjct: 355 NSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLYVTGPFASSSEVLLGNYYGLSGK 413
Query: 406 YTTPLQGLAAVVA---TI-YQAGCSNVQCGTAQVD-DAKKAAASADATVLVMGADQSIEA 460
T L G+ A V+ TI Y+ G Q +D +A D + VMG + E
Sbjct: 414 TTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEG 473
Query: 461 ESH---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
E DRL L LP Q + ++ K PVI+++ +G ++ +
Sbjct: 474 EEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGT--PVNLTEIAELAD 531
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+I++ YPGQ GG A+AD+ FG +PSGRLP+T +P+S
Sbjct: 532 AIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 281 bits (719), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 200/578 (34%), Positives = 287/578 (49%), Gaps = 95/578 (16%)
Query: 43 VSNPSLAS-LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
V+ P+LA+ + +T L R+ L+ +TL+EK + LV+ ++ RLG+P+Y++W+EA
Sbjct: 15 VATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEA 74
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---- 157
LHGV+ G AT FPQ I AA+F+ L V+S EARA +NV
Sbjct: 75 LHGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIG 124
Query: 158 ----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
+GLTFW+PNINIFRDPRWGRGQET GEDP L ++ V GLQ G P LK
Sbjct: 125 NRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQ---GDHPKYLK 181
Query: 214 VAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK---------------- 251
AA KH+ + G + R+ F+A+ +Y Y F+
Sbjct: 182 TAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYN 236
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++VSDC + + T E+AA +I G
Sbjct: 237 RVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGT 296
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG+ +A V+AGLV+E ID +S AT +LGFFD PY + V
Sbjct: 297 DLNCGAVYNALPDA-VEAGLVDEKTIDKRLSKVLATKFKLGFFDPK-DDNPYNNISADVV 354
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
++A+ +A + A + IVLL+N LPL I+NL V GP A+ ++ ++GNY G K
Sbjct: 355 NSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLYVTGPFASSSEVLLGNYYGLSGK 413
Query: 406 YTTPLQGLAAVVA---TI-YQAGCSNVQCGTAQVD-DAKKAAASADATVLVMGADQSIEA 460
T L G+ A V+ TI Y+ G Q +D +A D + VMG + E
Sbjct: 414 TTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEG 473
Query: 461 ESH---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
E DRL L LP Q + ++ K PVI+++ +G ++ +
Sbjct: 474 EEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGT--PVNLTEIAELAD 531
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+I++ YPGQ GG A+AD+ FG +PSGRLP+T +P+S
Sbjct: 532 AIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 281 bits (719), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 200/578 (34%), Positives = 287/578 (49%), Gaps = 95/578 (16%)
Query: 43 VSNPSLAS-LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
V+ P+LA+ + +T L R+ L+ +TL+EK + LV+ ++ RLG+P+Y++W+EA
Sbjct: 15 VATPALANDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEA 74
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---- 157
LHGV+ G AT FPQ I AA+F+ L V+S EARA +NV
Sbjct: 75 LHGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIG 124
Query: 158 ----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
+GLTFW+PNINIFRDPRWGRGQET GEDP L ++ V GLQ G P LK
Sbjct: 125 NRSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQ---GDHPKYLK 181
Query: 214 VAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK---------------- 251
AA KH+ + G + R+ F+A+ +Y Y F+
Sbjct: 182 TAAAAKHFAVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYN 236
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++VSDC + + T E+AA +I G
Sbjct: 237 RVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGT 296
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG+ +A V+AGLV+E ID +S AT +LGFFD PY + V
Sbjct: 297 DLNCGAVYNALPDA-VEAGLVDEKTIDKRLSKVLATKFKLGFFDPK-DDNPYNNISADVV 354
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
++A+ +A + A + IVLL+N LPL I+NL V GP A+ ++ ++GNY G K
Sbjct: 355 NSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLYVTGPFASSSEVLLGNYYGLSGK 413
Query: 406 YTTPLQGLAAVVA---TI-YQAGCSNVQCGTAQVD-DAKKAAASADATVLVMGADQSIEA 460
T L G+ A V+ TI Y+ G Q +D +A D + VMG + E
Sbjct: 414 TTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEG 473
Query: 461 ESH---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
E DRL L LP Q + ++ K PVI+++ +G ++ +
Sbjct: 474 EEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGT--PVNLTEIAELAD 531
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+I++ YPGQ GG A+AD+ FG +PSGRLP+T +P+S
Sbjct: 532 AIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 197/563 (34%), Positives = 278/563 (49%), Gaps = 102/563 (18%)
Query: 62 LRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGA 121
+RV DLV RLTL+EK+ + + ++ RLGIP Y+WW+EALHGV+ +
Sbjct: 1 MRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA---------RTLEKV 51
Query: 122 TSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAGLTFWSPNINIFR 172
T FPQ I AA+F+ Q +G + STE RA++N GLT+W+PNINIFR
Sbjct: 52 TVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFR 111
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGRGQET GEDP L +K V+GL+ G P+ LK AC KHY + +
Sbjct: 112 DPRWGRGQETYGEDPYLTAKMGAAIVRGLE---GEDPHYLKSVACAKHYAVHSGPEY--- 165
Query: 233 DRYHFNA----MVIYTYYLIKFKYIVS----------------------DCDSVDVLYNS 266
+R+ F+A ++ Y+ F+ +V+ D VD+L N
Sbjct: 166 NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQ 225
Query: 267 QHYT-----------------KT-PEEAAAKS--ILAGLDLNCGSFLGKHTEAAVKAGLV 306
H+ KT PE A S +LAG DL CG+ E VK GL
Sbjct: 226 WHFDGYVTSDCWALKDFAEFHKTHPEHTIAMSDALLAGTDLECGNLYHLLAE-GVKKGLH 284
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+E I+ ++S F L ++G FD P+++ PY +G + + +A++ A A++ IVLL
Sbjct: 285 SERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHKQHAERMAKESIVLL 342
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGC 425
+N LPL + IK++A+IGPNA+ +T + NY GTP + TP L + +
Sbjct: 343 ENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKI-- 400
Query: 426 SNVQCGTAQVDDAKKA---------AASADATVLVMGADQSIE-------------AESH 463
N G VD K A AA +D V V G E S
Sbjct: 401 -NYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASG 459
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q L+ ++ K + P+I++ MSG +SF ++L Y GQA
Sbjct: 460 DRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSV--MSFEWESQNADALLQAWYGGQAA 516
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AI DV FG NP+GR+P+T Y
Sbjct: 517 GDAIVDVLFGHCNPAGRMPLTTY 539
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 203/572 (35%), Positives = 272/572 (47%), Gaps = 87/572 (15%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS--------VSRLGIPKYEWWS 99
L F NTSL RV DLV RLTL+E I + G+ + RLGI Y W +
Sbjct: 22 LGDFPFRNTSLPWSERVDDLVGRLTLEE-IVLQMSRGGTGSNGPAPPIDRLGIGPYSWNT 80
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--- 156
E LHG GP ATSFPQ AA+F+A L + I + E RA YN
Sbjct: 81 ECLHGDVAAGP----------ATSFPQAFGLAATFDAVLIEQIANATAYEVRAKYNNYAK 130
Query: 157 -----GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
GL+ +SP INI R P WGR QET GEDP L+ A YV GLQ G P
Sbjct: 131 HKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQ---GNHPRY 187
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------- 251
+ A CKH+ AY + R F+A V + +L F
Sbjct: 188 VTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAGTHSLMCSYNS 247
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
Y++SD +V+ +Y++ HYTK + A + +GL+
Sbjct: 248 INGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLN 307
Query: 287 LNCGSFLGKH----TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
L S L + T AVK G V + +S F T MRLG FD P PY +L
Sbjct: 308 LELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFDP-PEMNPYSKLDL 366
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+ +Q +Q+L+L AA + VLLKN LPL I LAV+GP A+ + G+Y T
Sbjct: 367 SIIQSQEHQELSLKAAAKSFVLLKNENRFLPLK-EKIDKLAVVGPLADNVDALYGDYSAT 425
Query: 403 PCKYT-TPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
P YT TP GLA + T Y +GC N +C K A + AD V+ +G IE+
Sbjct: 426 PNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSGQVKSAVSGADMVVVCVGTGTDIES 485
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E +DR +L LPG+Q L+ + K PVIL++ + G D+S+A +P + +I+ +P
Sbjct: 486 EGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAVENPAVQTIVACFFPA 545
Query: 521 QAGGAAIADVCFG---RYNPSGRLPMTWYPQS 549
QA G A+ + NP+GRLPMTW P+S
Sbjct: 546 QATGDALYRMFMNTSPESNPAGRLPMTW-PRS 576
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 194/557 (34%), Positives = 275/557 (49%), Gaps = 99/557 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
++ L+ ++L+EKI ++ A + RLGIP Y WW+EALHGV+ G AT
Sbjct: 11 QIETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------AT 60
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYN-VGLA-------GLTFWSPNINIFRDP 174
FPQ I A+F+ L + + +S EARA +N VG GLTFW+PNINIFRDP
Sbjct: 61 VFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDP 120
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRGQET GEDP+L S+ T YV+GLQ +D P L+ AAC KH+ + G +
Sbjct: 121 RWGRGQETYGEDPVLTSRLGTAYVRGLQGSD---PYYLRAAACAKHFAVH-----SGPEG 172
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ FNA V + YL FK
Sbjct: 173 LRHTFNAEVSQKDLEETYLPAFKALVKSGVESVMGAYNRVNGEPACGSTYLLKQKLREEW 232
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
++VSDC ++ + + T E+ A ++ +G DLNCG EA +K G V
Sbjct: 233 QFQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYNYLAEAVLK-GYVT 291
Query: 308 ESAIDHAISNNFATLMRLGFF--DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
E I+ A+ TL +LG DG PY + + + + LAL+AA + IVLL
Sbjct: 292 EDDINRAVVRLLITLDKLGLIHDDG-----PYQGITIHQIDWKKHDSLALEAAEKSIVLL 346
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA----VVATIY 421
KN G LPL I + V GPNA + ++GNY G + T L+ + + Y
Sbjct: 347 KNN-GVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEITVTY 405
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRL---------DLLLPG 472
+ GC + D A AD T+ VMG D S+E E D + DL L
Sbjct: 406 KKGCPLAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDLNLND 465
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
+Q + ++ + K P+I+++M GG I + +IL YPGQAGG A++++ F
Sbjct: 466 EQLSYLHKLKESGK-PLIVVLM--GGAPICSPELHEIADAILVAWYPGQAGGTAVSNIVF 522
Query: 533 GRYNPSGRLPMTWYPQS 549
G+ NPSG+LP+T +P+S
Sbjct: 523 GKTNPSGKLPVT-FPKS 538
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 201/574 (35%), Positives = 266/574 (46%), Gaps = 103/574 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL R DLV RLTL+EKIT + +++ +V RLGI YEWW+EALHGV+ G
Sbjct: 25 YQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT +PQ I ASFN +L + +S EAR Y GLTFW
Sbjct: 83 --------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTFW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L S+ V GLQ N K AC KHY +
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYN--KTHACAKHYAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL F+
Sbjct: 193 SGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDPCCGSDR 249
Query: 252 --------------YIVSDCDSVDVLY-NSQHYT-KTPEEAAAKSILAGLDLNCG-SFLG 294
+VSDC ++D Y +H T K +A+A ++L+G DL CG S+ G
Sbjct: 250 LLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLECGRSYTG 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+AVK GL+NESAID ++ LG D P+ QL + A+Q LA
Sbjct: 310 --LISAVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLSCHAHQQLA 364
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L AR+ + LL+N LPL +A+IGPNAN + NY G P T L+GL
Sbjct: 365 LQMARESMTLLQNHKNILPLDKEM--TVALIGPNANDSVMQWANYNGFPVHTITLLEGLT 422
Query: 415 AVV--ATIYQAGCSNVQCGT--------AQVDDAKKAAASADATVLVMGADQSIEAESH- 463
+ + N++ + AA AD + G S+E E
Sbjct: 423 QYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAKADVIIYAGGISASLEGEEMD 482
Query: 464 ---------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSIL 514
DR + LP Q+ L+ + K P++ + SG + +IL
Sbjct: 483 VDAEGFRGGDRTTIELPNVQRKLVKALKATGK-PIVFVNFSGCA--MGLQPESQICDAIL 539
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
YPGQAGG AIA+V FG YNP+GRLP+T+Y +
Sbjct: 540 QAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKK 573
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 198/309 (64%), Gaps = 18/309 (5%)
Query: 252 YIVSDCDSVDVLYNSQHYTK-TPEEAAAKSILAGLDLNCG-------SFLGKHTEAAVKA 303
YIVSDCDSV V+ + T EA A ++ AGLDL+CG F + AV+
Sbjct: 26 YIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQ 85
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
G + ESA+D+A++N + TLMRLGFFDG P + LG DVCT+ +++LA DAARQG+V
Sbjct: 86 GKLKESAVDNALTNLYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMV 142
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGP--NANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIY 421
LLKN A LPLSP + ++A+ G + N T M+G+Y G PC+ TP G+ VV++
Sbjct: 143 LLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTS 202
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
C C D A AA + DAT++V G + S+E ES+DR DLLLP Q I V
Sbjct: 203 VHACDKGSC-----DTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAV 257
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
A+ + P++L+IMS GG D+SFA+++PKI +++W GYPG+ GG AIADV FG+YNP GRL
Sbjct: 258 AEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRL 317
Query: 542 PMTWYPQSY 550
P+TWY Y
Sbjct: 318 PLTWYKNEY 326
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 189/555 (34%), Positives = 278/555 (50%), Gaps = 100/555 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
+V ++TL EKI+ + A ++ RL IP Y +W+E LHGV+ G AT FPQ
Sbjct: 15 IVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------ATVFPQ 64
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I AA+F+ L + I +V+S E RA YN GLTFWSPNIN+FRDPRWGR
Sbjct: 65 AIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDPRWGR 124
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
GQET GEDP L ++ ++KGLQ G L++AAC KH+ + + DR++F+
Sbjct: 125 GQETYGEDPFLTAQIGVAFIKGLQ----GEGKYLRLAACTKHFAVH---SGPEADRHYFD 177
Query: 239 AMV----IYTYYLIKFK------------------------------------------Y 252
A+V + +YL +FK +
Sbjct: 178 AVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLLGKWGFEGH 237
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
+VSD +++ ++ + HYT+T E A ++ G +L C + AV GLV E+ I
Sbjct: 238 VVSDYAALEDVHENHHYTQTAAETMALAMKIGTNL-CAGKISDALFEAVGKGLVTETEIT 296
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDAARQGIVLLKNTAGS 371
++ + T +RLG F Y + P +V A + L+L AA + +VLLKN
Sbjct: 297 ASVVKLYTTHVRLGMF---AEDNDYDTI-PYEVNASAEHEMLSLKAAEKSMVLLKND-NF 351
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC-- 425
LPLS + IK++AVIGP A + GNY GT Y T + G+ ++ Y GC
Sbjct: 352 LPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVTYALGCHL 411
Query: 426 ------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLL 470
S++ + +A AA AD VL +G D +IE E S D+ L L
Sbjct: 412 YADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSGDKPSLSL 471
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PGQQ+ LI +V + K VIL++ SG + + + +I+ YPG GG A+A++
Sbjct: 472 PGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHGGTALANI 530
Query: 531 CFGRYNPSGRLPMTW 545
G+ +PSG+LP+T+
Sbjct: 531 LLGKVSPSGKLPVTF 545
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 277 bits (709), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 193/565 (34%), Positives = 279/565 (49%), Gaps = 100/565 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL D R DL RLT++E+ + L A + RLGIP Y WW+E LHGV+ G
Sbjct: 9 YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA+F+ +L IG++ +TEARA + GLT W
Sbjct: 67 --------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNIN+FRDPRWGRG ET GEDP L ++ +VKG+Q G LK AAC KH+ +
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQ----GEGKVLKAAACAKHFAVH 174
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ R+ F+A V + YL F
Sbjct: 175 ---SGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCASPML 231
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y VSDC ++ + TK E+AA ++ G DLNCG ++L + A
Sbjct: 232 MDKLHQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTYL--YVLA 289
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A++ GL++ + I A T +RLG FD P + + + A++ ++L A
Sbjct: 290 ALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVSLSCAE 346
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV-- 417
+ +VLLKN G LPL + + +AVIGPNA+ + GNY GT +Y T L+G+
Sbjct: 347 KSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQDAFPG 405
Query: 418 ATIYQAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE--------- 461
Y GC SN+ + +A AA ++D +L +G D ++E E
Sbjct: 406 RVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDTGNEFS 465
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
S D+ DL LP Q L+ ++ V K PVIL++ +G + + N ++L YPGQ
Sbjct: 466 SGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALNPEISCN-----AVLQAWYPGQ 519
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWY 546
GG A+A + FG+ +PSG+LP+T+Y
Sbjct: 520 CGGQALAHILFGKVSPSGKLPVTFY 544
>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
vinifera]
Length = 785
Score = 277 bits (709), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 139/290 (47%), Positives = 195/290 (67%), Gaps = 7/290 (2%)
Query: 250 FKYIVSDCDSVDVLYNSQHY-TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
+YIVSDC ++V+ ++Q+Y ++ +A AK++ AGLDL CG + +V G V++
Sbjct: 8 IRYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQ 67
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+D A+ N + LMR+G+FDG P+ Y LG KD+C + +LA +AARQGIVLLKN
Sbjct: 68 YELDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQGIVLLKND 124
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNV 428
LPL P K L ++GP+AN T+ MIGNY G P KY +PL+ +A+ Y GC +
Sbjct: 125 YEVLPLKPG--KKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDA 182
Query: 429 QCGT-AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG 487
C +AK+AA A+ T++ +G D SIEAE DR+D LLPG Q LI +VA+V+ G
Sbjct: 183 SCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSG 242
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
PVIL+++SG DI+FAKN+P+I++ILWVG+PG+ GG AIADV FG+YNP
Sbjct: 243 PVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNP 292
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 186/584 (31%), Positives = 280/584 (47%), Gaps = 63/584 (10%)
Query: 19 FLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKIT 78
FL+ + +S ++S D + P L++ C+ + R LV + QEK+
Sbjct: 3 FLSTAVVLGASLISKSHAAIGPDCTNGP-LSTNAICDVNAPPHERAAALVAAMEPQEKLD 61
Query: 79 FLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASL 138
LV + VSRLG+P Y WW EALHGV+ PG F ATSFP IL +A+F+ L
Sbjct: 62 NLVSKSKGVSRLGLPAYNWWGEALHGVA-GAPGIKFVEPYKNATSFPMPILMSAAFDDDL 120
Query: 139 FQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYV 198
I ++ EARA N G+A + +W+P+IN RD RWGR E+PGED + GY
Sbjct: 121 IFKIANIIGNEARAFGNGGVAPVDYWTPDINPVRDIRWGRASESPGED----IRRIKGYT 176
Query: 199 KGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--- 251
K L G + K+ A CKHY YD++ W G DR++F+A + + YY+ F+
Sbjct: 177 KALLAGLEGDQAQRKIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCA 236
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YI SDC++V + + Y
Sbjct: 237 RDSKVGSFMCSYNAVNGIPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKY 296
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
+T + A + G+DL+C A GL+N S ID A++ + L+ G+FD
Sbjct: 297 VETLAQGTALAFAKGMDLSCEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFD 356
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
G +K Y L KD+ T + L+L +G+V+LKN +LPL T +A+IG A
Sbjct: 357 G--AKATYANLSYKDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWA 413
Query: 390 NVTKTMIGNYEGTPCKYTTPL---QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASAD 446
N + + G Y G P +P+ + + +A + N +A AA +D
Sbjct: 414 NDSSKLQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSD 473
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
+ G D ++ E +DR + P Q L+T++AK+ K P+++I + G D S +
Sbjct: 474 YILYFGGQDWTVAQEGYDRTTISFPQVQIDLLTKLAKLGK-PLVVITL-GDMTDHSPLLS 531
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ SI+W +PGQ GG AI +V G + P+GRLP+T YP Y
Sbjct: 532 MEGVNSIIWANWPGQDGGPAILNVVSGAHAPAGRLPITEYPADY 575
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 190/557 (34%), Positives = 270/557 (48%), Gaps = 104/557 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++L+E + L+ A ++ RLG+P+Y WW+EALHG + G AT FPQ
Sbjct: 13 LVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------ATVFPQ 62
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGR 178
I AA F+ + I V+STE RA YN A GLT WSPN+NIFRDPRWGR
Sbjct: 63 AIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRDPRWGR 122
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYH 236
GQET GEDP LAS+ +++GLQ G LK AAC KH+ + G + R+
Sbjct: 123 GQETYGEDPYLASQLGVSFIQGLQ----GDGPYLKTAACVKHFAVH-----SGPEPLRHD 173
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
FNA+V +Y YL F+
Sbjct: 174 FNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDILRNDWGFE 233
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
+SDC ++ + + TK ++ A ++ AG DLNCG +L E A + GL++
Sbjct: 234 GMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCGCEYLS--LEKAYQQGLIDRK 291
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I A T LG F Y +G + T+ ++ +A A+ +VLLKN
Sbjct: 292 TITQACIRVMTTRFALGLFS---EDCTYSNIGYEQNDTEEHRKVAFKASCNSLVLLKND- 347
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGC 425
G LPL ++ +A+IGPNA+ + + GNY GT YTT L+G L V Y G
Sbjct: 348 GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVKYSQGS 407
Query: 426 S-------NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD---------RLDLL 469
+ + ++ +A A +D +L +G D+++E E HD + DL
Sbjct: 408 AIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGDKQDLR 467
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LP Q+ L+ VA K P++L+++SGG D + P + ++L YPGQ GG AIA
Sbjct: 468 LPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIERF-PNVKALLQGWYPGQEGGLAIAH 525
Query: 530 VCFGRYNPSGRLPMTWY 546
G NPSG LP+T+Y
Sbjct: 526 TILGLNNPSGHLPVTFY 542
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 208/607 (34%), Positives = 290/607 (47%), Gaps = 90/607 (14%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
VL L L S + +A + P AC +N L FCNTSL RV DL+ RL LQ
Sbjct: 3 VLPLLLAIASASLVTPTAATIPR-ACVGKANQELP---FCNTSLSTADRVEDLLSRLPLQ 58
Query: 75 EKITFLVDSA---GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
EK T L A G++S +G+P+Y W + +HGV GT+ TSFP +
Sbjct: 59 EKATLLTARASPRGNMSSIGLPEYNWGANCVHGVQSTC-GTNC------PTSFPNPVNLG 111
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTFWSPNINIFRDPRWGRGQET 182
A F+ + + +V+ E RA++ G GL WSPNINI RDPRWGR ET
Sbjct: 112 AIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNTET 171
Query: 183 PGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI 242
P EDPL+ SKY Y +GLQ+ P L+ KHY AY +N+ G +R F+A+V
Sbjct: 172 PSEDPLVNSKYGVAYTRGLQEGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVS 231
Query: 243 -YTY---YLIKFK------------------------------------------YIVSD 256
Y + Y F+ Y+ SD
Sbjct: 232 PYDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSD 291
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
+V+ + + HY + EAA +ILAG D+N G + V + E A+D A+
Sbjct: 292 SGAVEAISDMHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALR 351
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+ LG FD QPY + P +V T A + L+L+A R+ +V+L+N A LPL
Sbjct: 352 HTLKLRFELGLFD-PIDDQPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQ- 409
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCK--------YTTPLQGLAAVVA---TIYQAGC 425
+K LAV+GP+A + ++GNY G C TPL + A T + GC
Sbjct: 410 KGVK-LAVLGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGC 468
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
TA + A AA ADA VL +G D+SIE E DR ++ LP Q L+ V V
Sbjct: 469 GISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVG 528
Query: 486 KGPVILIIMSGG--GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
+ P ++++++GG G + + D + + YPG G A+ADV FG NPSG+LP+
Sbjct: 529 R-PTVVVLINGGVIGAEEIIERTDALVEAF----YPGFFGARAMADVLFGDTNPSGKLPV 583
Query: 544 TWYPQSY 550
T Y Y
Sbjct: 584 TMYRSDY 590
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 197/569 (34%), Positives = 274/569 (48%), Gaps = 88/569 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA---GSVSRLGIPKYEWWSEALHGVSYVG 109
FC+ SL RV DL++RL L EK+T L A G++S +G+P+Y W + +HGV
Sbjct: 36 FCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGVQ--- 92
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------G 160
T +N ATSFP + A F+ + +V+ E RA++ G G
Sbjct: 93 -STCGTNC---ATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGARENYAAGPHLG 148
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L WSPNINI RDPRWGR ETP EDPL+ SKY Y +GLQ+ G L+ KH
Sbjct: 149 LDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQE--GKDKRFLQAVVTLKH 206
Query: 221 YTAYDLDNWKGTDRYHFNAMV-IYTY---YLIKFK------------------------- 251
Y AY +++ G DR FNA V Y + YL F
Sbjct: 207 YAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGVMCSYNSVNGMPMCA 266
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
YI SD +++ +Y +HYTK+ EA +I++G D+N GS
Sbjct: 267 NEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLCEAGRLAIMSGTDVNSGSVYK 326
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
K V +G + E A+D A+ LG FD QPY + P +V ++ L+
Sbjct: 327 KCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PIDDQPYWHVAPSEVGKTESKQLS 385
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK--------Y 406
L+ R+ IVLL+N LPL K LAVIGP+A + ++GNY G C
Sbjct: 386 LELTRKSIVLLQNHGNVLPLRKG--KKLAVIGPHAKAKRALLGNYLGQMCHGDYLEVGCV 443
Query: 407 TTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
TPL+ + A T+Y G TA D A+ AA ADA VL +G D SIE E+
Sbjct: 444 QTPLEAITAANGASNTVYAKGSGINDTSTADFDAAEAAARGADAVVLFLGIDTSIEREAW 503
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG--GFDISFAKNDPKITSILWVGYPGQ 521
DR ++ +P Q L+ V + K P ++++ +GG G + D + YPG
Sbjct: 504 DRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAEELILHTD----GVAEAFYPGF 558
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A++D+ FG PSG+LP+T YP +Y
Sbjct: 559 FGAQAVSDILFGDAIPSGKLPVTMYPSNY 587
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 274 bits (701), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 189/553 (34%), Positives = 280/553 (50%), Gaps = 99/553 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R L L+ +E+ L A ++ + G+P Y WW+E LHGV+ G AT
Sbjct: 14 RAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT----------AT 63
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ + +G+VVSTEARAMYN GLT W+PNINIFRDP
Sbjct: 64 VFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLWAPNINIFRDP 123
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ +VKG+Q G L+ AAC KH+ + + + R
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVNFVKGIQ----GEEKYLRAAACAKHFAVH---SGPESLR 176
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+A V + YL FK
Sbjct: 177 HEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGKLREWGFD 236
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y VSDC ++ + + T T ++AA ++ AG D+NCG ++L H AA++ GL+ +
Sbjct: 237 GYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTYL--HILAALEEGLITKQ 294
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I A + T +RLG D + + L + N+ L+L+AA + +VLL N
Sbjct: 295 DIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND- 349
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQA-GCSN 427
G LPL + I ++AVIGPNA+ ++GNYEGTP + T L+G+ A +Y A GC
Sbjct: 350 GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQDAFDGRVYYAEGCQL 409
Query: 428 VQCGTA-------QVDDAKKAAASADATVLVMGADQSIEAE-------SHDRLDLLLPGQ 473
+ T + +A A +AD TV+ +G D ++E E S D+ DL LP
Sbjct: 410 FRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEGDTENKSGDKPDLRLPEV 469
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q++L+ ++ K P+I+++ +G + N +++ YPGQ GG A+A++ FG
Sbjct: 470 QRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQYGGKALAEILFG 523
Query: 534 RYNPSGRLPMTWY 546
+PSG+LP+T+Y
Sbjct: 524 EVSPSGKLPVTFY 536
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 274 bits (701), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 198/572 (34%), Positives = 276/572 (48%), Gaps = 86/572 (15%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA---GSVSRLGIPKYEWWSEALHGVS 106
L FCNTSL RV DL+ RL LQEK T L A G++S +G+P+Y W + +HGV
Sbjct: 33 ELPFCNTSLSTADRVEDLLSRLPLQEKATLLTARASPRGNMSSIGLPEYNWGANCVHGVQ 92
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA------- 159
GT+ TSFP + A F+ + + +V+ E RA++ G
Sbjct: 93 STC-GTNC------PTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGP 145
Query: 160 --GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL WSPNINI RDPRWGR ETP EDPL+ SKY Y +GLQ+ P L+
Sbjct: 146 HLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQEGKRQDPRFLQAVVT 205
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVI-YTY---YLIKFK---------------------- 251
KHY AY +N+ G +R F+A+V Y + Y F+
Sbjct: 206 LKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIP 265
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+ SD +V+ + + HY + EAA +ILAG D+N G
Sbjct: 266 MCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILAGTDINSGK 325
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ V + E A+D A+ + LG FD QPY + P +V T A +
Sbjct: 326 SYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFD-PIDDQPYWNVTPSEVNTAAAK 384
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK------ 405
L+L+A R+ +V+L+N A LPL +K LAV+GP+A + ++GNY G C
Sbjct: 385 ALSLNATRKSLVMLQNNASVLPLQ-KGVK-LAVLGPHAKSKRGLLGNYLGQMCHGDYDEV 442
Query: 406 --YTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
TPL + A T + GC TA + A AA ADA VL +G D+SIE
Sbjct: 443 GCVQTPLDAIRAANGASNTTFAEGCGISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEG 502
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG--GFDISFAKNDPKITSILWVGY 518
E DR ++ LP Q L+ V V + P ++++++GG G + + D + + Y
Sbjct: 503 EVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLINGGVIGAEEIIERTDALVEAF----Y 557
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
PG G A+ADV FG NPSG+LP+T Y Y
Sbjct: 558 PGFFGARAMADVLFGDTNPSGKLPVTMYRSDY 589
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 187/561 (33%), Positives = 265/561 (47%), Gaps = 67/561 (11%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L C+ + R LV+ + EK+ L+ V+RLG+PKY WW EALH
Sbjct: 24 ANGPLKDNKICDVTAAPAERAAALVEAMQTNEKLDNLMRG---VTRLGLPKYNWWGEALH 80
Query: 104 GVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF 163
GV+ PG +F+ ATSFP +L +A+F+ L I ++ EARA N G+A + F
Sbjct: 81 GVA-GAPGINFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDF 139
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+P+IN FRDPRWGRG ETPGED + Y + GL+ G P R K+ A CKHY
Sbjct: 140 WTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLLAGLE---GDKPQR-KIIATCKHYVG 195
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
YD++ W G DR+ FNA + + YY+ F+
Sbjct: 196 YDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTY 255
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
YI SDC++V + Y KT E + AG+D +C
Sbjct: 256 VLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGLAFTAGMDNSCEYTGS 315
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
A ++ ID A+ + L+R G+FDG + Y LG KD+ T Q L+
Sbjct: 316 SDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAAAT--YANLGVKDINTPEAQQLS 373
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ--- 411
L A +G+VLLKN +LPLS T +A++G AN T + G Y G +P+
Sbjct: 374 LQVASEGLVLLKND-DTLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLRSPVWAGQ 432
Query: 412 --GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
GL +A+ SN +A AA +D + G D S AE DR +
Sbjct: 433 KLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGFDRNSIA 492
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
P Q LI ++A + K V+L++ G D S + S++W +PGQ GG+A+
Sbjct: 493 WPTAQVDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVIWANWPGQDGGSAVMQ 550
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
V G +GRLP+T YP +Y
Sbjct: 551 VVTGAVAVAGRLPITQYPANY 571
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 183/508 (36%), Positives = 254/508 (50%), Gaps = 87/508 (17%)
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
Y WWSEAL+ FS+ ATSFP I A+F+ L AI V+STEARA
Sbjct: 1 YNWWSEALN----------FSS----ATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46
Query: 155 NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
NV GL F++PNIN F+DPRWGRGQETPGEDP S+Y V GLQ G P LK+
Sbjct: 47 NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQ--GGVGPTNLKI 104
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------- 251
AA CKH+ AYDL+N G R+ F+A V + +Y F+
Sbjct: 105 AADCKHWAAYDLEN-LGVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSYNAVN 163
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
+I DC +V ++ HYT P A ++ AG D
Sbjct: 164 GIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALNAGTD 223
Query: 287 LNCGSFLGKHTE---AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
++C S +++ A+ LV+E + A++ + +L+RL +
Sbjct: 224 IDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW---------------D 268
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
DV T+ Q LA AA +GIVLLKN G LPL+ +++K +AV+GP AN T M NY G
Sbjct: 269 DVNTEPAQQLAYQAAVEGIVLLKND-GILPLA-SSVKKVAVVGPMANATTQMQSNYNGIA 326
Query: 404 CKYTTPLQGLA-AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
+P Q A + G T+ A AA AD V G D +IE E
Sbjct: 327 PFLVSPQQAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTIERED 386
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR ++ G Q L+ ++A + K P+I++ M GG D S +++ + +++W GYPGQ+
Sbjct: 387 RDRPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIWGGYPGQS 445
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG A+ D+ G+ P+GRLP+T YP SY
Sbjct: 446 GGTALVDLITGKQAPAGRLPITQYPASY 473
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 185/584 (31%), Positives = 278/584 (47%), Gaps = 63/584 (10%)
Query: 19 FLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKIT 78
FL+ + +S ++S D + P L++ C+ + R LV + QEK+
Sbjct: 3 FLSTAVVLGASLVSKSYAAIGPDCTNGP-LSTNAICDVNAPPHERAAALVAAMEPQEKLD 61
Query: 79 FLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASL 138
LV + VSRLG+P Y WW EALHGV+ PG F ATSFP IL +A+F+ L
Sbjct: 62 NLVSKSKGVSRLGLPAYNWWGEALHGVA-GAPGIKFVEPYKNATSFPMPILMSAAFDDDL 120
Query: 139 FQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYV 198
I ++ EARA N G+A + +W+P+IN RD RWGR E+PGED + GY
Sbjct: 121 IFKIANIIGNEARAFGNGGVAPMDYWTPDINPVRDIRWGRASESPGED----IRRIKGYT 176
Query: 199 KGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--- 251
K L G + K+ A CKHY YD++ W G DR++F+A + + YY+ F+
Sbjct: 177 KALLAGLEGDQAQRKIIATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCA 236
Query: 252 ------------------------------------------YIVSDCDSVDVLYNSQHY 269
YI SDC++V + + Y
Sbjct: 237 RDSKVGSFMCSYNAVNGVPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKY 296
Query: 270 TKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
+T + A + G+DL+C A GL+N S ID A++ + L+ G+FD
Sbjct: 297 VETLAQGTALAFAKGMDLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFD 356
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
G +K Y L D+ T + L+L +G+V+LKN +LPL T +A+IG A
Sbjct: 357 G--AKATYANLSYNDINTPEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWA 413
Query: 390 NVTKTMIGNYEGTPCKYTTPL---QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASAD 446
N + + G Y G P +P+ + + +A + N +A AA +D
Sbjct: 414 NDSSKLQGIYSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSD 473
Query: 447 ATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
+ G D ++ E +DR + P Q L+ ++AK+ K P+++I + G D S +
Sbjct: 474 YILYFGGQDWTVAQEGYDRTTISFPQVQIDLLAKLAKLGK-PLVVITL-GDMTDHSPLLS 531
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
I SI+W +PGQ GG AI +V G + P+GRLP+T YP Y
Sbjct: 532 MEGINSIIWANWPGQDGGPAILNVISGVHAPAGRLPITEYPADY 575
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 189/560 (33%), Positives = 276/560 (49%), Gaps = 102/560 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R +LV+R+ L + ++ L A ++ LGIP Y WW+E LHG + G AT
Sbjct: 6 RAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------AT 55
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I A+ F+ A+ VVSTE RA YN+ GLT WSPN+NIFRDP
Sbjct: 56 VFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRDP 115
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGRGQET GEDP L ++ A +++GLQ G LK A+C KH+ A+ G +
Sbjct: 116 RWGRGQETFGEDPYLTARLAVAFIRGLQ----GEGPVLKTASCVKHFAAH-----SGPEP 166
Query: 234 -RYHFNAMV----IYTYYLIKFK------------------------------------- 251
R+ FNA+V + YL F
Sbjct: 167 LRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLMEETLRLR 226
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
+SDC ++ + + TK EE+AA ++ G DL CG + E A + GL+
Sbjct: 227 WGFEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEY-QSLEKAFQKGLI 285
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
I A T +LG FD Y LG + + + + LA +A+ + +VLLK
Sbjct: 286 TREQIKKAAIRVMTTRFKLGQFD---QGTAYDTLGLESLDSDEHAALAFEASCRSLVLLK 342
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----IYQ 422
N A LPL A+ LAVIGPNA+ + + GNY GT +Y T L+GL V + +Y
Sbjct: 343 NDA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTRILYS 401
Query: 423 AGCSNVQCGTAQV--DDAKKAAA-----SADATVLVMGADQSIEAESH---------DRL 466
G + + ++ DD + + A ++D VL +G ++++E E H D+
Sbjct: 402 EGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWAGDKD 461
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
DL LP Q+ L+ VA+ K P+I++++SGG D + + +++ YPGQ GG A
Sbjct: 462 DLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEIEQY-ANVKALIQAWYPGQEGGKA 519
Query: 527 IADVCFGRYNPSGRLPMTWY 546
IA + +G PSG+LP+T+Y
Sbjct: 520 IAHLLYGALCPSGKLPVTFY 539
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 188/527 (35%), Positives = 266/527 (50%), Gaps = 34/527 (6%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P D P L S G C+T+ R LV LT +EK+ LV +A R+G+P+Y
Sbjct: 25 PYQQPDCTKEP-LRSNGICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRY 83
Query: 96 EWWSEALHGVSYVGPGTHFSNVVP--GATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
WW+EALHGV+ PG +++ P ATSFP +L ++F+ L IG+V+ TEARA
Sbjct: 84 NWWNEALHGVA-GAPGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAW 142
Query: 154 YNVGLAG-LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG---SP 209
N G G + +W+PN+N F+DPRWGRG ETPGED L S+YA G S
Sbjct: 143 NNGGWGGGVDYWTPNVNPFKDPRWGRGSETPGEDALHVSRYARAMECTRDAKVGSIMCSY 202
Query: 210 NRLK-VAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQH 268
N + + AC Y L R H+N + +I SDC ++ ++ +
Sbjct: 203 NAVNGIPACANSYLQETL------LRKHWN-------WTHTNNWITSDCGAMQDIWQHHN 249
Query: 269 YTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
YTKT EAA + G D +C K + + GL+ E +D A+ F L+ GFF
Sbjct: 250 YTKTGAEAAKAAFENGQDSSCEYTTTKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGFF 309
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
DG K + L DV T+ QDLAL +A +G VLLKN +LPL+ +++A+IG
Sbjct: 310 DG--DKSEWSSLDFDDVNTRHAQDLALQSAVRGAVLLKND-NTLPLNIKKKESVALIGFW 366
Query: 389 ANVTKTMIGNYEGTPCKYTTP-----LQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAA 443
A+ + G Y G TP + GL VA + N +A +AA
Sbjct: 367 ADDKTKLQGGYSGPAPHVRTPAYAAKMLGLNTNVA--WGPTLQNSSVPDNWTTNALEAAK 424
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
+D V + G D + E DR DL P Q L+ +++ + K P++++ + D
Sbjct: 425 KSDYIVYLGGLDATAAGEERDRTDLDWPSTQLTLLKKLSNLGK-PLVVVQLGDQVDDTPL 483
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
KN + SILWV YPGQ GG A+ ++ GR P+GRLP+T YP Y
Sbjct: 484 LKNK-GVNSILWVNYPGQEGGTAVMELITGRKGPAGRLPLTQYPSKY 529
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 172/530 (32%), Positives = 267/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ E+ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 172/530 (32%), Positives = 267/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ E+ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|116194378|ref|XP_001223001.1| hypothetical protein CHGG_03787 [Chaetomium globosum CBS 148.51]
gi|88179700|gb|EAQ87168.1| hypothetical protein CHGG_03787 [Chaetomium globosum CBS 148.51]
Length = 572
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 188/530 (35%), Positives = 263/530 (49%), Gaps = 63/530 (11%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D S P LA C+ SL R LV LT +EK+ LV + SRLG+P Y WWSE
Sbjct: 27 DCTSGP-LAENKVCDRSLPPAERAAALVAALTNEEKLQNLVSKSKGASRLGLPAYNWWSE 85
Query: 101 ALHGVSYVGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
ALHGV+Y PGT F + PG +TSFP +L AA+F+ L + IG V+ E RA N
Sbjct: 86 ALHGVAYA-PGTQFRDG-PGDFNSSTSFPMPLLMAAAFDDQLIEDIGTVIGIEGRAFGNA 143
Query: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
G +G+ +W+PN+N FRDPRWGRG ETPGED L +YA V+GL+ G P R ++ A
Sbjct: 144 GWSGMDYWTPNVNPFRDPRWGRGSETPGEDILRVKRYAASMVRGLE---GPGPER-RIVA 199
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------YIVSDCDSVDV 262
CKHY A D ++WKG R++FNA V + YYL F+ Y+ SD
Sbjct: 200 TCKHYAANDFEDWKGATRHNFNAEVSAQDLAEYYLSPFQHTGNWTEHDNYVTSDWRGPCW 259
Query: 263 LYNSQHYTKTPE-EAAAKSILAGLDLNCGSFLGKHTE-AAVKAGLVNESAIDHAISNNFA 320
+++H + E A S T A GL++ +D A+ +
Sbjct: 260 DVSAKHPLRQDHVEGTALCFEDPAWTPAASTRAPPTSTGASSQGLLSWDTVDRALRRLYQ 319
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
++R+G+FDG + P+ +LG DV Q LAL A +GIVLLKN G+LP+ P
Sbjct: 320 GIIRVGYFDG--DQSPHAKLGWSDVNQPEAQRLALQVAAEGIVLLKND-GTLPI-PAKKP 375
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKK 440
N+A+IG A+ + G + A
Sbjct: 376 NVAMIGFWADAGDKLSGG------------------------------SAKDTWTEGALA 405
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
AA AD + G D S E+ DR+ + P Q L+ ++ + K P++++ M D
Sbjct: 406 AAQDADYILYFGGLDTSAAGETKDRMTIDWPAAQLALLEKLGALKK-PLVVVQMGDQIDD 464
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+ D + SILW +PGQ GG A+ + G +P+GRLP+T YP +Y
Sbjct: 465 TPLLEMD-AVNSILWANWPGQDGGTAVLQLLSGAKSPAGRLPLTQYPANY 513
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 185/534 (34%), Positives = 266/534 (49%), Gaps = 102/534 (19%)
Query: 90 LGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTE 149
+ IP+Y +W+EALHGV+ G AT FPQ I AA+F+ L I V+ TE
Sbjct: 1 MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50
Query: 150 ARAMYNVG--------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
RA YN GLTFWSPN+NIFRDPRWGRG ET GEDP L SK+ ++KGL
Sbjct: 51 GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK---- 251
Q G LK+AA KH+ + G + R+ F+A+V +Y YL FK
Sbjct: 111 Q----GQAKYLKLAATAKHFAVH-----SGPEGLRHGFDAVVSDKDLYETYLPAFKAAVE 161
Query: 252 --------------------------------------YIVSDCDSVDVLYNSQHYTKTP 273
++VSD + + ++ + YTK
Sbjct: 162 EADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDA 221
Query: 274 EEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS 333
E +I AGL+L G + + A+ GLV E I +A+ + +AT +RLG F +
Sbjct: 222 AETMGLAIKAGLNLVAGH-IEQSLHEALDRGLVTEEEITNAVISLYATRVRLGMF---AT 277
Query: 334 KQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTK 393
Y + + T+A+ +L+ AA + VLLKN G LPL ++ +AV+GPNA+
Sbjct: 278 DNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVVGPNAHSEI 336
Query: 394 TMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCGTAQ--------VDDAKKA 441
++GNY GTP + T L+G L V Y G Q A+ +A A
Sbjct: 337 ALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVIA 396
Query: 442 AASADATVLVMGADQSIEAE---------SHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A +D V V+G D +IE E + D+ +L LPG+Q+ L+ + V K PV+++
Sbjct: 397 AEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVL 455
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ SG + +N P + +I+ + YPG GG A+ADV FG +PSG+LP+T+Y
Sbjct: 456 LASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFY 509
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 187/555 (33%), Positives = 277/555 (49%), Gaps = 101/555 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R L L+ +E+ L A ++ + G+P Y WW+E LHGV+ G AT
Sbjct: 14 RAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT----------AT 63
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ + +G+V+STEARAMYN GLT W+PNINIFRDP
Sbjct: 64 VFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNINIFRDP 123
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ +VKG+Q G L+ AAC KH+ + + + R
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVSFVKGIQ----GEEEYLRAAACAKHFAVH---SGPESLR 176
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+A V + YL FK
Sbjct: 177 HEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGKLREWGFD 236
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y VSDC ++ + + T T ++AA ++ AG D+NCG ++L H AA++ GL+ +
Sbjct: 237 GYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILAALEEGLITKQ 294
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I A + T +RLG D + L + N+ L+L+AA + +VLL N
Sbjct: 295 DIRTACIHALRTRIRLGQLD----DNEFDDLPFDIIACDGNKALSLEAAEKSMVLLHND- 349
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQA-GCSN 427
G LPL + I ++AVIGPNA+ ++GNY GTP + T L+G+ A +Y A GC
Sbjct: 350 GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQL 409
Query: 428 VQCGTA-------QVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
+ T + +A A +AD TV+ +G D ++E E S D+ DL LP
Sbjct: 410 FRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEGDTGNEFASGDKPDLRLP 469
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
Q++L+ + K P+I+++ +G + N +++ YPGQ GG A+A++
Sbjct: 470 EVQRVLLQNLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQYGGKALAEIL 523
Query: 532 FGRYNPSGRLPMTWY 546
FG +PSG+LP+T+Y
Sbjct: 524 FGEVSPSGKLPVTFY 538
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 187/555 (33%), Positives = 278/555 (50%), Gaps = 101/555 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R L L+ +E+ L A ++ + G+P Y WW+E LHGV+ G AT
Sbjct: 14 RAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT----------AT 63
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+F+ + +G+V+STEARAMYN GLT W+PNINIFRDP
Sbjct: 64 VFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNINIFRDP 123
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRG ET GEDP L S+ +VKG+Q G L+ AAC KH+ + + + R
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVNFVKGIQ----GEEEYLRAAACAKHFAVH---SGPESLR 176
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+A V + YL FK
Sbjct: 177 HEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMGKLREWGFD 236
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y VSDC ++ + + T T ++AA ++ AG D+NCG ++L H AA++ GL+ +
Sbjct: 237 GYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILAALEEGLITKQ 294
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I A + T +RLG D + L + N+ L+L+AA + +VLL N
Sbjct: 295 NIRTACIHALRTRIRLGQLD----DNEFDDLPFDIIACDGNKALSLEAAEKSMVLLHND- 349
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA-AVVATIYQA-GCSN 427
G LPL + I ++AVIGPNA+ ++GNY GTP + T L+G+ A +Y A GC
Sbjct: 350 GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQL 409
Query: 428 VQCGTA-------QVDDAKKAAASADATVLVMGADQSIEAE---------SHDRLDLLLP 471
+ T + +A A +AD TV+ +G D ++E E S D+ DL LP
Sbjct: 410 FRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEGDTGNEFASGDKPDLRLP 469
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
Q++L+ ++ K P+I+++ +G + N +++ YPGQ GG A+A++
Sbjct: 470 EVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----ALINAWYPGQYGGKALAEIL 523
Query: 532 FGRYNPSGRLPMTWY 546
FG +PSG+LP+T+Y
Sbjct: 524 FGEVSPSGKLPVTFY 538
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 176/531 (33%), Positives = 264/531 (49%), Gaps = 68/531 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I +RV L++++TL EK + LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 IAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPYLTSRLGVAFVKGLQ---GDHPAYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AVK GL++E+ ID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATIDQALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN LPLS
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN-ENLLPLSK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
K++AV+GP A+ +G Y G P T L+G+ ++ + G N G
Sbjct: 402 EKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMG---KRGKVNYLNGIGASR 456
Query: 437 DAKKAAASA-DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D+ AA D ++ +G+D+ + E+HD + LP +Q+ L+ + +V ++L+ S
Sbjct: 457 DSIVAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQV-NPRIVLVFHS 515
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G +A D I +I+ YPGQ G A+AD+ FG NPSG+LPMT Y
Sbjct: 516 GNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIY 564
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 176/531 (33%), Positives = 264/531 (49%), Gaps = 68/531 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I +RV L++++TL EK + LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 IAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPYLTSRLGVAFVKGLQ---GDHPAYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AVK GL++E+ ID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATIDQALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN LPLS
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
K++AV+GP A+ +G Y G P T L+G+ ++ + G N G
Sbjct: 402 EKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMG---KRGKVNYLNGIGASR 456
Query: 437 DAKKAAASA-DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D+ AA D ++ +G+D+ + E+HD + LP +Q+ L+ + +V ++L+ S
Sbjct: 457 DSIVAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQV-NPRIVLVFHS 515
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G +A D I +I+ YPGQ G A+AD+ FG NPSG+LPMT Y
Sbjct: 516 GNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTIY 564
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 172/530 (32%), Positives = 266/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP Q+ L+ E+ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 174/531 (32%), Positives = 267/531 (50%), Gaps = 68/531 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVI-LIIMS 495
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ E+ +V P I L+ +
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEIYQV--NPRIALVFHT 515
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G +A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 516 GNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 171/530 (32%), Positives = 267/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ ++ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 171/530 (32%), Positives = 267/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ ++ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 171/530 (32%), Positives = 266/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEVAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ L+ ++ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 175/531 (32%), Positives = 264/531 (49%), Gaps = 68/531 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I +RV L++++TL EK + LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 IAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPYLTSRLGVAFVKGLQ---GDHPAYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AVK GL++E+ ID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLISEATIDQALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN LPLS
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
K++AV+GP A+ +G Y G P T L+G+ ++ + G N G
Sbjct: 402 EKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDLMG---KRGKVNYLNGIGASR 456
Query: 437 DAKKAAAS-ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D+ AA D ++ +G+D+ + E+HD + LP +Q+ L+ + +V ++L+ S
Sbjct: 457 DSIVAAVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQV-NPRIVLVFHS 515
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G +A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 516 GNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 186/552 (33%), Positives = 267/552 (48%), Gaps = 95/552 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
L+ +TL EKI+ +++ + RLGI Y+WW+E LHGV G AT FPQ
Sbjct: 16 LINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR----------ATVFPQ 65
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I A+FN +L + IG ++TE RA YNV GLTFWSPNINIFRDPRWGR
Sbjct: 66 PIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIFRDPRWGR 125
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L YV+G+Q G P LKVAAC KHY + G +
Sbjct: 126 GMETYGEDPFLTGTLGIAYVQGMQ---GNDPFYLKVAACGKHYAVHS-----GPEATRHE 177
Query: 239 AMV------IYTYYLIKFK----------------------------------------- 251
A V ++ YL FK
Sbjct: 178 ANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLTDVLRKQWGFR 237
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
+IVSDCD+V ++ KT EA A +I AGL++ CG + + AV L+ E
Sbjct: 238 GHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTF-EAMKQAVAQKLLTEQE 296
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
ID A+ T ++LG + + ++ PY ++ ++C+ + LA AA + +VLLKN G
Sbjct: 297 IDRALLPLMMTRLKLGILE-YDAECPYNEVKETEICSPEHIALARKAATESMVLLKNN-G 354
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCS 426
LPL + L + GP A+ + ++GNY G +Y T LQG+A V A ++
Sbjct: 355 ILPLDKN-LHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSGTAVNFRPAFG 413
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES---------HDRLDLLLPGQQQLL 477
++ A A +A+ T++VMG + ++E E DR+ + LP Q
Sbjct: 414 ESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDRVSMRLPASQMKF 473
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ ++ K K +++++ G D+ + W YPGQ GG A+AD+ FG N
Sbjct: 474 LRDL-KARKNGIVVVLTGGSPIDVREISRLADAVVMAW--YPGQEGGYALADLLFGDENF 530
Query: 538 SGRLPMTWYPQS 549
SGRLP+T +P+S
Sbjct: 531 SGRLPVT-FPES 541
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 170/530 (32%), Positives = 266/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GEDP L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEDPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP +Q+ + ++ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKFLKKIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHILAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 171/530 (32%), Positives = 266/530 (50%), Gaps = 66/530 (12%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+ +RV L++++TL EK+ LV + S+ RL +P Y +W+E LHGV+ G
Sbjct: 58 VAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE--------- 108
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDPRWGR
Sbjct: 109 -VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
+ET GE+P L S+ +VKGLQ G P LK A KH+ A + +N + +
Sbjct: 168 EETYGEEPHLTSRLGVAFVKGLQ---GDHPTYLKTVATIKHFVANNEENNRFSSSSQIPT 224
Query: 240 MVIYTYYLIKFK------------------------------------------YIVSDC 257
+Y YY ++ ++VSDC
Sbjct: 225 KQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDC 284
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
++ V+ + EEAAA + +G DL CG+ + AV+ GL++E+AID A++
Sbjct: 285 GAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEAAIDRALTR 344
Query: 318 NFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+LG FD P + PY K + + +LA +AA + +VLLKN A LPL+
Sbjct: 345 VLTARFKLGEFD--PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDA-LLPLNK 401
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVD 436
IK++AV+GP A+ +G Y G P + L+G+ ++ + N +A D
Sbjct: 402 EKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSA--D 457
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+ AD ++ +G+D+ + E+HD + LP Q+ L+ E+ +V ++L+ +G
Sbjct: 458 SIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEIYQV-NPRIVLVFHTG 516
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A D I +I+ YPGQ G A+A++ FG NPSG+LPMT Y
Sbjct: 517 NPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENPSGKLPMTIY 564
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 192/588 (32%), Positives = 279/588 (47%), Gaps = 103/588 (17%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
AC S + + F + L + R DL KRLTL+EK + +++ V RLGI +++W
Sbjct: 11 LACAAGSIQAQVTYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQW 70
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W EALHG + G AT FPQ I AASF+ L + + STEARA YNV
Sbjct: 71 WGEALHGSARTGL----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVA 120
Query: 158 LA--------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP 209
++ W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ G P
Sbjct: 121 AKKGYFDTSWSVSLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQ--GGKGP 178
Query: 210 NR-LKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFKY----------- 252
++ K AC KH+ + W +R+ + + + YL FK+
Sbjct: 179 HKYYKAFACAKHFAVHSGPEW---NRHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVM 235
Query: 253 -------------------------------IVSDCDSVDVLYNSQHYTKTPE--EAAAK 279
+VSDC ++D ++ + P+ A+A+
Sbjct: 236 CAYNSIDGEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASAR 295
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
++ G D++CG G E AV+ G V E ID ++ M+LG FD S +
Sbjct: 296 AVKGGTDMSCGQTYGSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPD-SITRWNA 353
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
+ KDV T A++++AL AR+ + LL N +LPLS +K + V+GPNAN + M GNY
Sbjct: 354 ISMKDVSTPASREVALKMARETMTLLHNPMHALPLS-KQLKQVVVMGPNANDSVMMWGNY 412
Query: 400 EGTPCKYTTPLQGLAAVVA---TIYQAGCSNV------------QCGTAQVDDAKKAAAS 444
GTP T L G+ + + GC V Q +V D K
Sbjct: 413 NGTPHHTVTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFV 472
Query: 445 ADATVLVMGADQSIEAESH---DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDI 501
+ + G +EA+ DR+ + LP Q+ +I + K VI++ SG +
Sbjct: 473 GGISPQLEGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGK-QVIMVNCSGSAIGL 531
Query: 502 SFAKNDPKIT---SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P++T +IL YPG+ GG A+ADV FG YNP+G+LP+T+Y
Sbjct: 532 V-----PEVTHTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFY 574
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 194/603 (32%), Positives = 292/603 (48%), Gaps = 88/603 (14%)
Query: 9 RAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLV 68
RA +V+V+ L L + +S S + FA S+ +L S F N SL I+ R+ DL+
Sbjct: 2 RALRVTVVCLTL----ISTASFSQDNVLRFAP---SSHALDSFPFRNVSLPIETRLNDLI 54
Query: 69 KRLTLQEKITFLVDSAGS----VSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
RLT+++ I V G + RLGI E+ +E L GV AT F
Sbjct: 55 SRLTIEDAINQTVARYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGF 103
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYN-------VGLAGLTFWSPNINIFRDPRWG 177
PQ + AASF+ L Q + VS E RA YN G G+T +SP INI R P WG
Sbjct: 104 PQALGLAASFSRDLMQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWG 163
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
R QET GEDP L+ + A+ YV GLQ G P L+V+A CKH+ A+ + ++ F
Sbjct: 164 RNQETYGEDPYLSGELASQYVSGLQ---GDDPRYLRVSAGCKHFDAHGGPDTIPVRKFGF 220
Query: 238 NAMV----IYTYYLIKFK-----------------------------------------Y 252
+A + + +L FK +
Sbjct: 221 DAKIEERDLQMTFLPAFKKCIAAKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGF 280
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN-CGSFLGKHTE--AAVKAGLVNES 309
+VSD +V+ ++ HY + E AA ++I +G ++ G F + + A+ L+ +
Sbjct: 281 VVSDDAAVEYIFTEHHYNSSFETAAVEAIKSGCNMELVGKFDPSYWQLTKALNEHLITKD 340
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
+ + F T LG FD P+ P+ Q+ V + +Q LAL+AA + VLLKN
Sbjct: 341 ELMENVRPVFLTRFLLGEFDP-PALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDR 399
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVATI-YQAGCS 426
LPL ++K +AV+GP +N T +IG+Y + P TPL G+ + + + +GCS
Sbjct: 400 NFLPLLKNSLKTVAVVGPMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCS 459
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK 486
N C + D A A + +G +EAE++DR D++LPG Q L+ + A
Sbjct: 460 NSTCTDYRATDVAAAVDGAQVVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHAN 519
Query: 487 G-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF---GRYNPSGRLP 542
G PV+L++ +GG D++FA+ I SI+ +P G AI + G +P+GRLP
Sbjct: 520 GRPVVLLLFNGGPLDVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLP 579
Query: 543 MTW 545
+TW
Sbjct: 580 LTW 582
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 127/248 (51%), Positives = 171/248 (68%), Gaps = 4/248 (1%)
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
G E +D ++ N + L ++GFFDG PS Y L KD+CT+ + +LA DAARQGIV
Sbjct: 3 GKAREEDVDTSLRNLYIVLTQVGFFDGIPS---YESLDKKDLCTKEHIELAADAARQGIV 59
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQA 423
LLKN +LPL P +KNLA+IGP+AN T M+GNY G PC+Y++PL G +A Y+
Sbjct: 60 LLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYEM 119
Query: 424 GCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
GC+NV C + A +A+ +ADAT+L++G D+++E E DR DLLLPG Q LI +V
Sbjct: 120 GCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVI 179
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
+KGP+IL+IMSG DISF+K D ++ +ILW GYPG+ GG AIADV +G+YNP GRLP
Sbjct: 180 VASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRLP 239
Query: 543 MTWYPQSY 550
+TW+ Y
Sbjct: 240 LTWHQNDY 247
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 130/268 (48%), Positives = 178/268 (66%), Gaps = 2/268 (0%)
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCGS+ ++ ++AV G + E ID A+ N F+ +RLG FDG + +LGP+D
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VCT+ ++ LAL+AARQGIVLLKN LPL+ A+ +LA+IGP AN ++ G+Y G C
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 405 KYTTPLQGLAAVVA-TIYQAGCSNVQCGTA-QVDDAKKAAASADATVLVMGADQSIEAES 462
+ G+ A + T Y GCSNV C + Q +A A +AD ++V G D S E E
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR+ LLLPG+Q L++ VA +K PVIL++ GG D+SFAK D +I SILW+GYPG+A
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQSY 550
G A+AD+ FG YNP GRLPMTWYP+S+
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESF 268
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 204/592 (34%), Positives = 283/592 (47%), Gaps = 88/592 (14%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P LA C+ + R LV +LT+ EK+ LVD A SR+G+PKY WWSE
Sbjct: 25 DCVNGP-LAKSKACDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSE 83
Query: 101 ALHGVSYVGPGTHFSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
LHGV+ PG F+ ATSF I ASF+ L +G +STEARA N G
Sbjct: 84 GLHGVA-GSPGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFG 142
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKG----------------- 200
GL +W+PN+N ++DPRWGRG ETPGEDPL Y + G
Sbjct: 143 FGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVRKVIATCKHY 202
Query: 201 ----LQQTDGGSPNRLKVAACCKHYTAYDL--------DNWKGTDRYHFNAMVIY----- 243
L++ G + + + + Y L D+ G+ +NA+ I
Sbjct: 203 AAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGG 262
Query: 244 ------------------TYYLIKFK----------YIVSDCDSV-DVLYNSQHYTKTPE 274
TY + + YI SDC+++ D L ++ ++++TP
Sbjct: 263 SKPDEIINLTTAQPACANTYLMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPA 322
Query: 275 EAAAKSILAGLDLNC---GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-- 329
EAAA + AG D C GS L A L+ E+ ID A+ + L+R G+ D
Sbjct: 323 EAAAAAYKAGTDTVCEVSGSPL-TDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHG 381
Query: 330 ---------GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
G S Y L DV T + Q+LAL +A +GIVLLKN+ LPL + K
Sbjct: 382 RSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSG-K 440
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV--DDA 438
+A+IG AN T TM G Y G P Y PL + ++ A V A
Sbjct: 441 KVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPA 500
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
AA AD + G D ++ +E DR + P Q L++E+A + K PV++I + G
Sbjct: 501 LAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAGLGK-PVVVIQL-GDQ 558
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
D S N+ ++SILWVGYPGQ+GG A+ DV G+ P+GRLP+T YP+ Y
Sbjct: 559 VDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGY 610
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 183/553 (33%), Positives = 265/553 (47%), Gaps = 96/553 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
++ +LTL EKI+ L A + RLGI Y W +EALHGV G AT FPQ
Sbjct: 38 IISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------ATVFPQ 87
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I A+F+ + IG ++TE RA + V AGLTFW+PN+NIFRDPRWGR
Sbjct: 88 PINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDPRWGR 147
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L T +VKG+Q G P LK AAC KH+ + G +R
Sbjct: 148 GMETYGEDPFLTGTLGTAFVKGMQ---GDDPFYLKAAACGKHFAVHS-----GPERTRHT 199
Query: 239 AMV------IYTYYLIKFK----------------------------------------- 251
A V +Y YL FK
Sbjct: 200 ANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWGFK 259
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDC +V +Y K+ EA A +I AGL+L CG+ + + + A++ L+ E
Sbjct: 260 GHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDAIQQKLITEKD 318
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
+D A+ T ++LG P PY + + ++AN+ +A AA + +VLLKN
Sbjct: 319 LDKALLPLMMTRLKLGIL--QPDAACPYNEFPESVIGSEANRKIAEQAAEESMVLLKNN- 375
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGC 425
G LP++ I+ L V GP A ++GNY G +Y+T L+G+ V + Y+ G
Sbjct: 376 GVLPIAKD-IRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSNGTSVNYKQGF 434
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIE---------AESHDRLDLLLPGQQQL 476
V V+ + + A+ ++L+MG + E AE DR++L LP Q
Sbjct: 435 MQVFKNLNDVNWSVSESRGAEVSILIMGNSGNTEGEEGDAIASAERGDRVNLRLPDSQME 494
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
+ EV+K +++++ G D+ + W YPGQ GG A+A++ FG N
Sbjct: 495 YLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YPGQEGGVALANLLFGDAN 552
Query: 537 PSGRLPMTWYPQS 549
SGRLP+T +P+S
Sbjct: 553 FSGRLPVT-FPES 564
>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
Length = 246
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 132/239 (55%), Positives = 166/239 (69%), Gaps = 10/239 (4%)
Query: 12 KVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
K+S+L LF L + + P F+CD SNPS S FC TSL I R DLV RL
Sbjct: 5 KISLLTLFHLTSFLIFIFADSAPQPPFSCDY-SNPSTKSYLFCQTSLPISQRAKDLVSRL 63
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV-----GPGTHFSNVVPGATSFPQ 126
TL EKI+ LVD+A S+ RLG+P Y WWSEALHGV+++ G HF+ + ATSFPQ
Sbjct: 64 TLDEKISQLVDTAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIRSATSFPQ 123
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGE 185
VI+TAASF+ L+ IG+V EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGE
Sbjct: 124 VIITAASFDTKLWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 183
Query: 186 DPLLASKYATGYVKGLQ--QTDGGS-PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
DP++ KYA +V+G+Q +GG +L+ +ACCKH+TAYDL+ W GT+R+ FNA V
Sbjct: 184 DPMVTGKYAVSFVRGVQGDSFEGGVLGEKLQASACCKHFTAYDLEKWNGTNRFIFNAQV 242
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 182/553 (32%), Positives = 265/553 (47%), Gaps = 96/553 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
++ +LTL EKI+ L A + RLGI Y W +EALHGV G AT FPQ
Sbjct: 38 IISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------ATVFPQ 87
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGR 178
I A+F+ + Q IG ++TE RA + V AGLTFW+PN+NIFRDPRWGR
Sbjct: 88 PISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDPRWGR 147
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
G ET GEDP L T +VKG+Q G P LK AAC KH+ + G +R
Sbjct: 148 GMETYGEDPFLTGVLGTAFVKGMQ---GNDPFYLKAAACGKHFAVHS-----GPERTRHT 199
Query: 239 AMV------IYTYYLIKFK----------------------------------------- 251
A V +Y YL FK
Sbjct: 200 ANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCSGSKYLLTDILRKDWGFK 259
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDC +V +Y K+ EA A +I AGL+L CG+ + + + A+K L+ E
Sbjct: 260 GHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDALKQKLITEKD 318
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
+D A+ T ++LG P PY + + + N+++A AA + +VLLKN
Sbjct: 319 LDKALLPLMMTRLKLGIL--QPDVACPYNEFPESVIGSIDNRNIAQRAAEESMVLLKND- 375
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGC 425
G LP++ I+ L V GP A ++GNY G +Y+T L+G+ V + Y+ G
Sbjct: 376 GVLPIAKD-IRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGIVGKVSNGTSVNYKQGF 434
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIE---------AESHDRLDLLLPGQQQL 476
V V+ + + A+ ++++MG + E +E DR+DL LP Q
Sbjct: 435 MQVFKNLNDVNWSVSESRGAEVSIIIMGNSGNTEGEEGDAIASSERGDRVDLRLPEPQMQ 494
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
+ EV+K +++++ G D+ + W YPGQ GG A+A++ FG N
Sbjct: 495 YLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YPGQEGGVALANLLFGDAN 552
Query: 537 PSGRLPMTWYPQS 549
SGRLP+T +P++
Sbjct: 553 FSGRLPVT-FPET 564
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 192/573 (33%), Positives = 280/573 (48%), Gaps = 80/573 (13%)
Query: 44 SNPSLA-SLGFCNTSLGIDLRVGDLVKRLTLQE------KITFLVDS-AGSVSRLGIPKY 95
SNP+ L FC+ SL R DL +RLTL E +F + A V RLG+ Y
Sbjct: 34 SNPNCQLDLPFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNY 93
Query: 96 EWWSEALHGVSYVGPGTHFSNVVP-GATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
+ +E LHG+ +NVV AT +PQV AA+ NASL + ++ TE RA+
Sbjct: 94 SYHAEGLHGI-------RDANVVNYPATLYPQVTAMAATANASLIHEMSTIMGTELRAVN 146
Query: 155 N--------VGLAG-LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD 205
N G G L+ + P +NI RD RWGR QE+ EDP L YA +V GL+Q +
Sbjct: 147 NRAQELGEIFGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRN 206
Query: 206 GGSPNRLKVAACCKHYTAYDLDNWKGT-DRYHFNAMV----IYTYYLIKFK--------- 251
S L+ A CKH AY + + T R+ FNA++ I+ YL F+
Sbjct: 207 --SSKYLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQ 264
Query: 252 ---------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAA 278
IVSDCD+V +YN+ +YT+TPE+A
Sbjct: 265 QIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPEDAVT 324
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
++ G DL+CG F +H +AV+ L +A+ +++ LG FD S PY
Sbjct: 325 VALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTSV-PYR 383
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMI-G 397
QLG + + T +D +L A+R+ +VLL+N LP++ +A +A+IGP N+T M+ G
Sbjct: 384 QLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIMMGG 443
Query: 398 NYEGTPCKYTTPLQGLAAVVATIYQA--GCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
+ TP TT QG A+ T + GC+ +D A + A AD VL +G
Sbjct: 444 KLDYTPSFITTYFQGFQAIGITHLTSSPGCNITAPLPGALDKAVQIATQADLVVLTLGLS 503
Query: 456 QSIEAESHDRLDLLLPGQQQLLITEV-AKVAKGPVILIIMSGGGFDISFAK-NDPKITSI 513
IE E DR L LP QQ L + A + +++++++GG + K + +I
Sbjct: 504 SDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTI 563
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ Y GQ+ G A+A+ FG+ NPSG LP T +
Sbjct: 564 IEAFYGGQSAGTALAETIFGQNNPSGTLPYTVF 596
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 164/433 (37%), Positives = 224/433 (51%), Gaps = 76/433 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T D R DLV R+TL+EK + ++A ++ RL +P+Y+WW+EALHGV+ G
Sbjct: 20 YLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 76
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ LA GLTFW
Sbjct: 77 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTFW 129
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P R K+ A KH+ +
Sbjct: 130 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG--PYR-KLDATAKHFAVH 186
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+ ++ YL F+
Sbjct: 187 ---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESASASTRL 243
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++ ++ + TPE AAA + G DL+CG A
Sbjct: 244 EGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPA 302
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+AGL++E+ ID A+ T MRLG FD P+K P+ Q+ + + LA AR
Sbjct: 303 AVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALARRTAR 361
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL---AAV 416
+ +VLLKN G LPL PT +K +AVIGP A+ +++GNY GTP T LQG+ A
Sbjct: 362 ESLVLLKND-GVLPLKPT-LKRIAVIGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQ 419
Query: 417 VATIYQAGCSNVQ 429
IY G V+
Sbjct: 420 AQVIYARGSDLVE 432
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 65/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKV 484
+ +A AA A+ V V G +E E D R D LP Q+ L+ + +
Sbjct: 600 LQEAVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 658
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG+A+ DV FG+ +P GRLP+T
Sbjct: 659 TGTPVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVT 716
Query: 545 WYPQS 549
+Y ++
Sbjct: 717 FYKEA 721
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 174/564 (30%), Positives = 258/564 (45%), Gaps = 119/564 (21%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
CD N L+ C+T R L+ TL+E + +++ V RLG+P Y+ W+
Sbjct: 52 CD---NGPLSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWN 108
Query: 100 EALHGV--SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
EALHG+ +Y FS +TSFP ILT ++ N +L + ++ST+ RA N G
Sbjct: 109 EALHGLDRAYFTDEGQFS----WSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAG 164
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPL-LASKYATGYVKGLQQTDGGSPNRLKVAA 216
GL +SPNIN FR P WGRGQETPGED L+S YA Y+ G+Q G P LK+ A
Sbjct: 165 RYGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQ--GGVDPKSLKLVA 222
Query: 217 CCKHYTAYDLDNWKGTDR------------------------------------------ 234
KHY YD++NW G R
Sbjct: 223 TAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGV 282
Query: 235 ------YHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
+ ++ T+ ++ YI SDCDS ++N Y AAA SI AG D++
Sbjct: 283 PSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTDID 342
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP-KDVCT 347
CG+ + + AV L++ + I+ + ++ LMRLG+FD +GP +V T
Sbjct: 343 CGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFD----------VGPWMNVST 392
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
Q + GNY G
Sbjct: 393 Q----------------------------------------------LQGNYFGPAPYLI 406
Query: 408 TPLQGLAAVVATIYQAGCSNVQCGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRL 466
+PL + A +N+ + A AA +DA + G D S+EAE+ DR+
Sbjct: 407 SPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRM 466
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
++ PG+Q LI +++++ K P+I++ M GG D S K++ + S++W GYPGQ+GG A
Sbjct: 467 NITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQA 525
Query: 527 IADVCFGRYNPSGRLPMTWYPQSY 550
+ D+ G+ P+GRL +T YP Y
Sbjct: 526 LLDIITGKRAPAGRLVVTQYPAEY 549
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 258 bits (658), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 171/462 (37%), Positives = 239/462 (51%), Gaps = 81/462 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + + + R DLV R+TL+EK + ++A ++ RL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG--------LTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ LAG LTFW
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P R K+ A KHY +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG--PYR-KLDATAKHYAVH 206
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+ +Y YL F+
Sbjct: 207 ---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASASTRL 263
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++ ++ + TPE AAA + G DL+CG A
Sbjct: 264 EGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPA 322
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+AGL++E+ ID +++ A +RLG FD P+K P+ Q+ + + LA AR
Sbjct: 323 AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHDALARRTAR 381
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVV 417
+ +VLLKN G LPL PT +K +AV+GP A+ +++GNY GTP T LQG+ AA
Sbjct: 382 ESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQ 439
Query: 418 ATIYQAGCSNVQCGTAQVDDAKKAAASADATVL--VMGADQS 457
A + A S++ G + AAA DA L GA Q+
Sbjct: 440 AEVVYARGSDLVEGR----EDPNAAAPIDARYLRPAAGASQN 477
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKV 484
+ +A AA +AD V V G +E E D R D LP Q+ L+ + +
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+ DV FG+ +P GRLP+T
Sbjct: 679 TGTPVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 545 WYPQS 549
+Y +
Sbjct: 737 FYKED 741
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 257 bits (657), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 173/459 (37%), Positives = 236/459 (51%), Gaps = 81/459 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 40 YLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG--- 96
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ LA GLTFW
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTFW 149
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P R K+ A KHY +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG--PYR-KLDATAKHYAVH 206
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+ ++ YL F+
Sbjct: 207 ---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASASTRL 263
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++ ++ + TPE AAA + G DL+CG +
Sbjct: 264 EGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAALPK- 322
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDAA 358
AV+AGL++E+ ID ++ T MRLG FD P+K + Q+ P V D LA A
Sbjct: 323 AVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQI-PASVNQSPQHDALARRTA 380
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAV 416
R+ +VLLKN G LPL PT +K +AV+GP A+ +++GNY GTP T LQG+ AA
Sbjct: 381 RESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAP 438
Query: 417 VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
A + A S++ G + AAA DA L AD
Sbjct: 439 QAEVVYARGSDLVEGR----EDPNAAAPIDARYLRPAAD 473
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKV 484
+ +A AA +A+ V V G +E E D R D LP Q+ L+ + +
Sbjct: 620 LQEAVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G + +A+ + +IL YPGQ GG+A+ DV FG+ +P GRLP+T
Sbjct: 679 TGTPVVAVLTTGSALAVDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPIT 736
Query: 545 WYPQS 549
+Y ++
Sbjct: 737 FYKEA 741
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 171/462 (37%), Positives = 238/462 (51%), Gaps = 81/462 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + + + R DLV R+TL+EK + ++A ++ RL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ LA GLTFW
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKRYQGLTFW 149
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P R K+ A KHY +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG--PYR-KLDATAKHYAVH 206
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+ +Y YL F+
Sbjct: 207 ---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASASTRL 263
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++ ++ + TPE AAA + G DL+CG A
Sbjct: 264 EGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPA 322
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+AGL++E+ ID +++ A +RLG FD P+K P+ Q + + LA AR
Sbjct: 323 AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALARRTAR 381
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVV 417
+ +VLLKN G LPL PT +K +AV+GP A+ +++GNY GTP T LQG+ AA
Sbjct: 382 ESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQ 439
Query: 418 ATIYQAGCSNVQCGTAQVDDAKKAAASADATVL--VMGADQS 457
A + A S++ G + AAA DA L GA Q+
Sbjct: 440 AEVVYARGSDLVEGR----EDPNAAAPIDARYLRPAAGASQN 477
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKV 484
+ +A AA +AD V V G +E E D R D LP Q+ L+ + +
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+ DV FG+ +P GRLP+T
Sbjct: 679 TGTPVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 545 WYPQS 549
+Y +
Sbjct: 737 FYKED 741
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 160/432 (37%), Positives = 224/432 (51%), Gaps = 78/432 (18%)
Query: 52 GFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPG 111
G+ N +L + R DL++RL+L EKI+ +V + ++ RLGIP+Y WW+EALHGV+ G
Sbjct: 22 GYLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNGR- 80
Query: 112 THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTF 163
AT FP I AA+++ L I V+S EARA YN L G++
Sbjct: 81 ---------ATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISL 131
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRG ET GEDP L + A ++KGLQ G LK A KH
Sbjct: 132 WAPNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQ---GQDKKYLKTIATPKHLAV 188
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ + +R+HFNA+V + YL FK
Sbjct: 189 H---SGPEPERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKACCGHDT 245
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKH 296
+VSDC +V ++NS +PE+AAA ++ +G DL CG +FL
Sbjct: 246 LLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNTFLS-- 303
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+ A + GL+ E ID A+ +LG FD P Y Q+ + N+++AL+
Sbjct: 304 LKNAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNREIALE 362
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AAR+ IVLLKN LPL ++I +AVIGPNA+ ++++GNY G P +Y TPLQ + V
Sbjct: 363 AARKSIVLLKNDNKLLPLD-SSINKIAVIGPNADNLESLLGNYHGFPSEYITPLQAIRRV 421
Query: 417 V---ATIYQAGC 425
+ Y+ GC
Sbjct: 422 LKNGEVFYEKGC 433
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 13/121 (10%)
Query: 436 DDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVA 485
D A K A +DA ++ MG +E E+ DRL L LP Q LI ++
Sbjct: 589 DRAYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTG 648
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PVIL++++GG IS I +IL YPGQAGG AI DV +G+YNPSG+LP+T
Sbjct: 649 K-PVILVLLNGG--PISTVWESENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTI 705
Query: 546 Y 546
Y
Sbjct: 706 Y 706
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 171/462 (37%), Positives = 238/462 (51%), Gaps = 81/462 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + + + R DLV R+TL+EK + ++A ++ RL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG--------LTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ LAG LTFW
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P R K+ A KHY +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG--PYR-KLDATAKHYAVH 206
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+ +Y YL F+
Sbjct: 207 ---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASASTRL 263
Query: 252 ------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
YIVSDC ++ ++ + TPE AAA + G DL+CG A
Sbjct: 264 EGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA-LPA 322
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV+AGL++E+ ID +++ A +RLG FD P+K P+ Q + + LA AR
Sbjct: 323 AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALARRTAR 381
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVV 417
+ +VLLKN G LPL PT +K +AV+GP A+ +++GNY GTP T LQG+ AA
Sbjct: 382 ESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIRDAAPQ 439
Query: 418 ATIYQAGCSNVQCGTAQVDDAKKAAASADATVL--VMGADQS 457
A + A S++ G + AAA DA L GA Q+
Sbjct: 440 AEVVYARGSDLVEGR----EDPNAAAPIDARYLRPAAGASQN 477
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKV 484
+ +A AA +AD V V G +E E D R D LP Q+ L+ + +
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+ DV FG+ +P GRLP+T
Sbjct: 679 TGTPVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736
Query: 545 WYPQS 549
+Y +
Sbjct: 737 FYKED 741
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 174/553 (31%), Positives = 273/553 (49%), Gaps = 66/553 (11%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFL-VDSAGSVSRLGIPKYEWWSEALHGVSY 107
+S FC+T L +D R+ DLV R++ + T L + + +G+P Y W + A+HG+
Sbjct: 105 SSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQ- 163
Query: 108 VGPGTHFSNVVPGA--TSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFW 164
+ + + G TSFP +A+FN SL + +G+++ E RA YN GL W
Sbjct: 164 -----NTACLADGQCPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNGLDTW 218
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SP IN RDPRWGR E+PGE P + +Y Y +GLQ +G + + KH+ AY
Sbjct: 219 SPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ--NGDDKDYTQAVVTLKHWVAY 276
Query: 225 DLDNWKGTDRYHFNAMV-----IYTY-----YLIK------------------------- 249
++++ RY +NA+V + TY Y++K
Sbjct: 277 SVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCGNPAL 336
Query: 250 ---------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
F+ YI SD DS+ ++ HY A +L G D++ G + EA
Sbjct: 337 TAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGDTYADNLEA 396
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
AV LVN SA+D A++N++ LG FD + + Y ++ +V ++Q+ +L AAR
Sbjct: 397 AVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTN-AYDRISADEVGMSSSQETSLLAAR 455
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT--PLQGLAAVV 417
+ + LLKN +LP + K +AVIG ++N + ++GNY G C +Q L V
Sbjct: 456 KSMTLLKNDGQTLPFATG--KKVAVIGKSSNSAEDILGNYVGPICPSGAFDCVQTLYQGV 513
Query: 418 ATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
A Q G + + A ++ A + A AD VL + ++ E DR + L QQ L
Sbjct: 514 AAANQGGATTLSDDVADINTAIQLAMDADQVVLTI-SNYGQAGEGKDRTYIGLDTDQQEL 572
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V KV K P +++++GG + + K++ + +IL PG GG A+A+ FG NP
Sbjct: 573 VAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVAETIFGANNP 629
Query: 538 SGRLPMTWYPQSY 550
G+LP+T Y Y
Sbjct: 630 GGKLPVTMYASDY 642
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 182/524 (34%), Positives = 263/524 (50%), Gaps = 41/524 (7%)
Query: 41 DVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
D V+ P L S C+T+ R LVK + EK+ ++ + VSRLG+P Y WWSE
Sbjct: 53 DCVNGP-LKSNKVCDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSE 111
Query: 101 ALHGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
ALHGV+ PG FS+ P ATS P IL AA+F+ L + +G ++ TEARA N
Sbjct: 112 ALHGVA-GAPGIQFSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNH 170
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
+G+ FW+PNIN F+DPRWGRG ETPGED L Y ++GL+ G+ + ++ A C
Sbjct: 171 SGIDFWTPNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGLE----GNKAQRRIIATC 226
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPE 274
KHY A DL++W G R+ F+A + + YYL F+ D + + P
Sbjct: 227 KHYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVP- 285
Query: 275 EAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDH--AISNNFATLMRL------- 325
A K +L + + ++ ++ V + +++H A +N T +
Sbjct: 286 ACANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSS 345
Query: 326 ---GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNL 382
G+FDG SK Y LG DV T Q LAL A GIV+LKN G+LPL + +
Sbjct: 346 CEAGYFDG--SKALYSSLGWSDVNTPQAQQLALQATVDGIVMLKND-GTLPLKLDSKSKV 402
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQ-----GLAAVVAT--IYQAGCSNVQCGTAQV 435
A+IG A+ + + G Y G TP+ G VAT + Q+ + T
Sbjct: 403 AMIGFWASDSSKLQGGYSGKAPYLRTPVYAAQQLGFTPNVATGPVQQSASATDNWTT--- 459
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+A AA+ +D + G D S AE DR L P Q LI +++ + K P+I II
Sbjct: 460 -NALAAASKSDYILYFGGLDTSAAAEGVDRTSLEWPSAQLALIKKLSALGK-PLI-IIQE 516
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
G D + + ++SILW +PGQ GG A+ + G +P+G
Sbjct: 517 GDQMDNTPLLTNKGVSSILWASWPGQDGGPAVMQIISGAKSPAG 560
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 185/566 (32%), Positives = 272/566 (48%), Gaps = 96/566 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL + R L +T +E+ + L A ++ RLGIP Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA F+ L + ++ S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGR ET GEDP L ++ V+GLQ G +K AAC KH+ +
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQ----GDGKVMKAAACAKHFAVH 169
Query: 225 --------------DLDNWKGTDRYHFNAMVIYT---------------------YYLIK 249
D + + T F A+V Y + K
Sbjct: 170 SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLMEK 229
Query: 250 FK------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
K Y VSDC ++ + T E+AA ++ AG D+NCG ++ AA+
Sbjct: 230 LKEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTY-QNLLAALDK 288
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQ--PYGQLGPKDVCTQANQDLALDAARQG 361
GL+ + I A + T +RLG FD H PY ++ C + ++ ++L+ A +
Sbjct: 289 GLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA----CAE-HKAVSLECAEKS 343
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--T 419
+VLLKN G LPL K +AVIGPNA+ + GNY G +YTT L G+
Sbjct: 344 LVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEGRV 402
Query: 420 IYQAGCSNVQ---CGTAQVDD----AKKAAASADATVLVMGADQSIEAE---------SH 463
I+ GC + G AQ D A AA +AD ++ +G D +IE E S
Sbjct: 403 IFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEFSSG 462
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
D+ L LP Q++L+ ++ V K PV+ ++ +G + + + +++ YPG G
Sbjct: 463 DKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDALIHAFYPGAEG 516
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G A+A+V FG +PSG+LP+T+Y +
Sbjct: 517 GKALAEVLFGDVSPSGKLPVTFYEDT 542
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 185/566 (32%), Positives = 272/566 (48%), Gaps = 96/566 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL + R L +T +E+ + L A ++ RLGIP Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA F+ L + ++ S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L ++ V+GLQ G +K AAC KH+ +
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQ----GDGKVMKAAACAKHFAVH 169
Query: 225 --------------DLDNWKGTDRYHFNAMVIYT---------------------YYLIK 249
D + + T F A+V Y + K
Sbjct: 170 SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASDYLMEK 229
Query: 250 FK------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
K Y VSDC ++ + T E+AA ++ AG D+NCG ++ AA+
Sbjct: 230 LKEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTY-QNLLAALDK 288
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQ--PYGQLGPKDVCTQANQDLALDAARQG 361
GL+ + I A + T +RLG FD H PY ++ C + ++ ++L+ A +
Sbjct: 289 GLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA----CAE-HKAVSLECAEKS 343
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--T 419
+VLLKN G LPL K +AVIGPNA+ + GNY G +YTT L G+
Sbjct: 344 LVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFEGRV 402
Query: 420 IYQAGCSNVQ---CGTAQVDD----AKKAAASADATVLVMGADQSIEAE---------SH 463
I+ GC + G AQ D A AA +AD ++ +G D +IE E S
Sbjct: 403 IFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGDTGNEFSSG 462
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
D+ L LP Q++L+ ++ V K PV+ ++ +G + + + +++ YPG G
Sbjct: 463 DKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDALIHAFYPGAEG 516
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
A+A+V FG +PSG+LP+T+Y +
Sbjct: 517 SKALAEVLFGDVSPSGKLPVTFYEDT 542
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 182/568 (32%), Positives = 265/568 (46%), Gaps = 98/568 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP-KYEWWSEALHGVSYVGPG 111
F N + ++ RV DL+ RLTL++K L +V G + + W++ LHGV + P
Sbjct: 33 FLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEP- 91
Query: 112 THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV---------GLAGLT 162
T+FP I A+++ L + V+S EARA+YN GL
Sbjct: 92 ---------TTNFPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRGEHKGLI 142
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
+ SP INI R+P WGR E GEDP + YVKGLQ G + LK+A+ KHY
Sbjct: 143 YRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQ---GDDSHYLKLASTLKHYA 199
Query: 223 AYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------------- 251
+++ DR +A V +Y Y+L FK
Sbjct: 200 VNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASYNAINGVPNNINK 255
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTP-EEAAAKSILAGLDLNCGSFLGK 295
++VSD V + H + EEA +SI+AG D + + K
Sbjct: 256 LLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQISCEEAVGRSIMAGCDFSDAEY-EK 314
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ A++ G + E ++ A+ RLG FD S PY ++ P + + +++L+L
Sbjct: 315 YIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKS-VPYSRISPDVIGCKEHRNLSL 373
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+AAR+ IVLLKN LP+ + IK +AVIGP A++ GNY G P TPLQG+
Sbjct: 374 EAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPKDPVTPLQGIKN 431
Query: 416 VVA----TIYQAGCS----NVQCG---------TAQVDDAKKAAASADATVLVMGADQSI 458
V +Y G V+ G A++ A + A ++D L +G I
Sbjct: 432 AVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDVVFLFVGTTADI 491
Query: 459 EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGY 518
E E DR L+LPG Q L+ V +V K V++++MS G + K + I ++L +
Sbjct: 492 EVEGRDRKTLVLPGNQNELVKAVYEVNK-KVVVVLMSAGPVAVPEVKKN--IPAVLQAWW 548
Query: 519 PGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PG GG AIADV FG YNP G+LP T Y
Sbjct: 549 PGDEGGNAIADVLFGDYNPGGKLPYTMY 576
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 171/483 (35%), Positives = 238/483 (49%), Gaps = 86/483 (17%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
+ V+ LT CS + ++ Q A S A+ F + SL +D RV DLV RLT
Sbjct: 12 LGVILQGLTACSKSAAPNANQPQDTAA-------STATYPFRDASLSVDARVDDLVSRLT 64
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
EKI + + ++ RLGIP Y WW+E+LHGV+ G AT +PQ I A+
Sbjct: 65 TTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAGK----------ATVYPQAIGLAS 114
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGRGQETPG 184
+F+ L + +S E RA Y+ L+ GLTFWSPNINIFRDPRWGRGQET G
Sbjct: 115 TFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGGLTFWSPNINIFRDPRWGRGQETYG 174
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-LDNWKGTDRYHFNAMVIY 243
EDP L + A +VKG+Q + S + LK A KHY + + + +D YH ++
Sbjct: 175 EDPFLTGRMAINFVKGIQGENDNS-DYLKAVATIKHYAVHSGPEKTRHSDDYHPTRKDLF 233
Query: 244 TYYLIKFK------------------------------------------YIVSDCDSVD 261
YL F+ Y+VSDC ++
Sbjct: 234 ETYLPAFRMAIAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIA 293
Query: 262 VLYNSQ--HYTKTPEEAAAKSILAGLDLNCGSFLGK---HTEAAVKAGLVNESAIDHAIS 316
Y S+ H +P EAAA ++ +G DLNCG G + A++ GL+ E ID A+
Sbjct: 294 DFYESRSHHVVDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVK 353
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
F ++LG FD + PY ++G V + + L +AA + IVLLKN G LPL
Sbjct: 354 RLFKARIKLGMFD-EQDRVPYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNN-GVLPLK- 410
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNV--QCGTAQ 434
A +AVIGPNA ++GNY G P K PL+G+ + + G +NV G+AQ
Sbjct: 411 -AGVKVAVIGPNAVDEDVLVGNYHGVPVKPVLPLEGI------VNRVGEANVFYAPGSAQ 463
Query: 435 VDD 437
+ D
Sbjct: 464 IAD 466
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 64/121 (52%), Gaps = 13/121 (10%)
Query: 436 DDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVA 485
++A AA AD + + G D +E E DR + LP Q L+ ++
Sbjct: 618 EEALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATG 677
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PV+++ SG +++ K+ +IL YPG+A G A+A++ +G +PSGRLP+T+
Sbjct: 678 K-PVVMVNFSGSAMALNWESE--KLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTF 734
Query: 546 Y 546
Y
Sbjct: 735 Y 735
>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 916
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 161/440 (36%), Positives = 225/440 (51%), Gaps = 80/440 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +TSL + R LV R+TL+EK + + + ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 50 WLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG--- 106
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AASF+ L + +S EARA ++ L GLTFW
Sbjct: 107 -------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTFW 159
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS-----PNRLKVAACCK 219
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ D + P K+ A K
Sbjct: 160 SPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDPKYRKLDATAK 219
Query: 220 HYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFK------------------------ 251
H+ + + DR+ F+ +Y YL F+
Sbjct: 220 HFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESAS 276
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
Y++SDC ++ ++ + +TPEEAAA ++ G +LNCGS
Sbjct: 277 GSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKNGTELNCGSTY 336
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD- 352
H AVK GL++E+ +D A++ F M LG FD P + + Q+ P V A D
Sbjct: 337 ADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD-PPEQVRWAQV-PYSVNQSAEHDA 394
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA A++ +VLLKN G LPLS I+ LAV+GP A+ T ++GNY GTP T L+G
Sbjct: 395 LARKMAQESLVLLKND-GVLPLS-KDIRRLAVVGPTADDTMALLGNYYGTPADPVTILRG 452
Query: 413 L---AAVVATIYQAGCSNVQ 429
+ A V +Y G V+
Sbjct: 453 IREAAPGVDVVYARGVDLVE 472
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 64/119 (53%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A +AA SADA V V G +E E DR D+ LP QQ L+ V K
Sbjct: 643 ALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHATGK- 701
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV++++ +G I +A+ + + IL YPGQ GG A+ + FG YNP GRLP+T+Y
Sbjct: 702 PVVMVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVTFY 758
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 155/419 (36%), Positives = 220/419 (52%), Gaps = 74/419 (17%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
+ L F + L + R DLV R+TL EK+ + +SA ++ RLGIP Y+WW+EALHGV+
Sbjct: 22 SQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVARA 81
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------G 160
G AT FPQ I AA+++A+L I + +STEARA YN + G
Sbjct: 82 GL----------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDDHSRYRG 131
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LTFWSPNINIFRDPRWGRGQET GEDP L S+ A ++KG+Q D P+ KV A KH
Sbjct: 132 LTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGED---PHYYKVIATAKH 188
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
Y + + + R+ F+ + YL F+
Sbjct: 189 YAVH---SGPESSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRVDGIPACA 245
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
++VSDC +V ++ HY A+A ++ AG DL CG+
Sbjct: 246 STDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDLTCGNEYR 305
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+ AVK GL+ E I+ ++ F +LG FD P + P+ + +V + ++ +A
Sbjct: 306 ALVD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDSAGHRKIA 363
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
L+AAR+ IVLLKN G+LPL ++IK +AVIGP A+ + ++GNY G TPL G+
Sbjct: 364 LEAARKSIVLLKND-GTLPLK-SSIKKIAVIGPAADDAEALLGNYNGFSSLQVTPLAGI 420
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 64/119 (53%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A +A ++AD T+ +G + S+E E DR +L LP Q+ LI E A
Sbjct: 597 AIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLI-EAAIATGK 655
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV++++ SG ++FA +++L Y G+ G AIAD G NPSGRLP+T+Y
Sbjct: 656 PVVVVLASGSAVAMNFAAQ--HASALLETWYNGEETGTAIADTLAGINNPSGRLPVTFY 712
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 193/556 (34%), Positives = 264/556 (47%), Gaps = 96/556 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R D+ RL+ +EK A + RLG Y WWSE LHGV+ G AT
Sbjct: 16 RAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT----------AT 65
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDP 174
FPQ I AA F+ G+ S EARA YN A GLT WSPN+NIFRDP
Sbjct: 66 MFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLWSPNVNIFRDP 125
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY--------TAYDL 226
RWGRGQET GEDP L S Y KGLQ G L+ AAC KH+ T ++
Sbjct: 126 RWGRGQETYGEDPYLTSCLGVAYAKGLQ----GDGKVLRTAACAKHFAVHSGPEATRHEF 181
Query: 227 D---NWKG-TDRY--HFNAMV---------------------IYTYYLIKFK------YI 253
D N K T+ Y F A+V + + K + +
Sbjct: 182 DAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACASDFVMNKLEEWGFDGHF 241
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAID 312
VSDC ++ + + TKT E+AA ++ G DLNCG ++L H AA GL+NE +
Sbjct: 242 VSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGNTYL--HLLAAFNEGLINEEDLR 299
Query: 313 HAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+ T +RLG FD Y L V +++ +L + + +VLLKN G L
Sbjct: 300 RSCIKLMRTRVRLGMFD---KSTEYDGLDYDIVACDEHKEFSLRCSERSMVLLKNN-GIL 355
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVVATIYQAGCS--NV 428
PL + K + VIGPNA+ + GNY G +Y T L G+ A +Y G
Sbjct: 356 PLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREAHDGRVLYTEGSHLYKD 415
Query: 429 QC-GTAQVDDAKKAAASADATVLVMGA----DQSIEAE---------SHDRLDLLLPGQQ 474
+C G A DD A T+ G+ D +IE E S D+ DL LP Q
Sbjct: 416 RCMGLALPDDRLSEAEIITRTLRCSGSLCWLDATIEGEEGDTGNEFSSGDKNDLRLPESQ 475
Query: 475 QLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
+ L+ V +AKG PVI++ +G ++ + +++ YPGQ GG A+A++ FG
Sbjct: 476 RKLVKTV--MAKGKPVIIVTAAGSAINV-----EADCDALIQAWYPGQLGGRALANILFG 528
Query: 534 RYNPSGRLPMTWYPQS 549
+ +PSG+LP+T+Y +
Sbjct: 529 KVSPSGKLPVTFYEDA 544
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 201/584 (34%), Positives = 271/584 (46%), Gaps = 91/584 (15%)
Query: 42 VVSNPSL-ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDS--AGSVSRLGIPKYEWW 98
+VS P + A+ F NTSL + RV DLV RLTL E + + + A + RLGI Y W
Sbjct: 16 LVSFPLISAAYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWN 75
Query: 99 SEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL 158
SE LHGV V P AT+FPQ I AASF+ L + K + E RA +N +
Sbjct: 76 SECLHGV--VPPDGL-------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYV 126
Query: 159 --------AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN 210
GL+ +SP INI R P WGR QET GEDP L + + YV+GLQ G P
Sbjct: 127 QRGVYQEHTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQ---GDHPR 183
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKF-------------------- 250
+ A CKH+ + R+ F+A V + + F
Sbjct: 184 YVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAGVYSVMCSYN 243
Query: 251 -------------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
Y+VSD +V+ + S HYT + + A ++ AG
Sbjct: 244 RINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGC 303
Query: 286 DLNCGSFLGK----HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
+L+ +G AV AG + E + + F T MRLG FD P PY L
Sbjct: 304 NLDLAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFDP-PELNPYANLN 362
Query: 342 PKDVCTQANQDLALDAARQ-----GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMI 396
V ++ +++LA+ AA Q VLLK LPL T + LAVIGP A+ +
Sbjct: 363 LSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLD-TLVNKLAVIGPFADNPSYLF 421
Query: 397 GNYEGTPCK--YTTPLQGLA-AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG 453
G+Y P K TP +GL+ A T GC C T + K A AD V+ +G
Sbjct: 422 GDYSPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAVTGADLIVVCLG 481
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITS 512
IEAE DR DL LPG+Q L+ +V K A G P+IL++ + G DI +A +P I
Sbjct: 482 TGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQV 541
Query: 513 ILWVGYPGQAGGAAIADVCF-------GRYNPSGRLPMTWYPQS 549
I+ +P QA G A+ + G NP GRLP+TW P+S
Sbjct: 542 IVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW-PRS 584
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 183/564 (32%), Positives = 270/564 (47%), Gaps = 92/564 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L R L +T +E+ + L A +V RLGIP Y WW+E +HG++ G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ I AA F+ L + +V S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L +K V+GLQ G +K AAC KH+ +
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQ----GDGKVIKAAACAKHFAVH 169
Query: 225 D--------LD---NWKGTDRYH---FNAMV---------------------IYTYYLIK 249
D N K + + F A+V + + K
Sbjct: 170 SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACASNFLMDK 229
Query: 250 FK------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
K Y VSDC ++ + + T E+ A ++ AG D+NCG ++ A++
Sbjct: 230 LKEWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTY-QNLLVALEK 288
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
G V + I A + T +RLG FD K Y + V + ++ ++L+ A + +V
Sbjct: 289 GAVTKEDIRTACVHLMRTRIRLGMFD---KKTEYDDIPYDKVACKEHKAISLECAEKSLV 345
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV--ATIY 421
+L+N G LP+ + K +AVIGPNA+ + GNY G +YTT L G+ I+
Sbjct: 346 MLENN-GILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQDRFDGRVIF 404
Query: 422 QAGC-------SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAE---------SHDR 465
GC SN+ + +A AA AD T+L +G D +IE E S D+
Sbjct: 405 AEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDTGNEFSSGDK 464
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
L LP Q+ L+ ++ V K PV+ ++ +G + + K +++ YPG GG
Sbjct: 465 NGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAIN-----TESKPDALIHAFYPGAEGGK 518
Query: 526 AIADVCFGRYNPSGRLPMTWYPQS 549
A+A+V FG +PSG+LP+T+Y +
Sbjct: 519 ALAEVLFGDVSPSGKLPVTFYEDT 542
>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 883
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/420 (36%), Positives = 217/420 (51%), Gaps = 75/420 (17%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +T+L + R DLV RLTL EK LV SA + RLG+P Y++WSE LHG++ G
Sbjct: 35 LPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWSEGLHGIARSG- 93
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLT 162
AT FPQ + AA+F+ L IG+V+STEARA YN +A GLT
Sbjct: 94 ---------YATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVAHDLRSIFYGLT 144
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
WSPNINIFRDPRWGRGQET GEDP L ++ T +V+GLQ G PN + KH+
Sbjct: 145 IWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQ---GDDPNYYRAIGTPKHFA 201
Query: 223 AYDLDNWKGTDRYHFNA----MVIYTYYLIKFK--------------------------- 251
+ + ++R+ FNA ++ YL F+
Sbjct: 202 VH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAYNAIEGKPACASD 258
Query: 252 ---------------YIVSDCDSVDVLY--NSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
++ SDC ++D + + HY+K E+A+ I AG D NCG
Sbjct: 259 LLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGIRAGTDTNCGGTY- 317
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
++ +AV+ G++ ES +D + F +LG FD PS+ Y + + + ++ +LA
Sbjct: 318 RNLASAVRKGMIQESELDVPLRRLFLARFKLGLFDP-PSQVKYASMPITENMSSSHTELA 376
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L AAR+ +VLLKN +LPL +K +AVIGPNA+ ++ GNY P + G+A
Sbjct: 377 LQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNYNAIPKNPVMQVDGIA 435
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 67/119 (56%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAKVAKG 487
A +A ADA V +G +E E D R DL+LP QQ L+ E AK +
Sbjct: 606 AMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASGK 664
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P+++++++G +++A+ +IL YPGQAG AIA+ G+ NPSGRLP+T+Y
Sbjct: 665 PLVVVLLNGSALAVNWAQE--HADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFY 721
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 185/565 (32%), Positives = 259/565 (45%), Gaps = 117/565 (20%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA---GSVSRLGIPKYEWWSEALHGVSY 107
L FCNTSL + RV DL+ RL L EK L A G++S +G+P+Y W + +HGV
Sbjct: 32 LQFCNTSLPVSARVEDLLARLPLDEKAILLTARASPRGNMSSIGLPEYNWGANCVHGVRS 91
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
GT+ TSFP + N S+ +
Sbjct: 92 TC-GTNCP------TSFPNPV------NLSIHRR-------------------------- 112
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
RDPRWGR ETP EDPL+ SKY Y KGLQ+ P L+ KHY AY +
Sbjct: 113 ----RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQEGKHEDPRYLQAVVTLKHYVAYSYE 168
Query: 228 NWKGTDRYHFNAMVI-YTY---YLIKFK-------------------------------- 251
N+ G +R FNA+V Y + Y F+
Sbjct: 169 NYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACANNELENK 228
Query: 252 ----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT--EA 299
YI SD +++ + + HY T EAA +ILAG D+N G G +
Sbjct: 229 LLRGMLGFDGYITSDSGAIEAISDWLHYVPTRCEAARLAILAGTDVNSGRGFGYMACLKE 288
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
V++ ++ +D + + LG FD QPY ++ P DV T A + L+LD AR
Sbjct: 289 LVESNQLDVKVVDDVLRHTLKLRFELGLFD-PIEDQPYWKVTPNDVNTDAAKKLSLDLAR 347
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK--------YTTPLQ 411
+ IVLL+N LPL +K LAV+GP+A + ++GNY G C TP +
Sbjct: 348 KSIVLLQNNQPVLPLR-RGVK-LAVVGPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFE 405
Query: 412 GLAAV---VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDL 468
++A +T Y GC+ TA +A KA A+A VL +G D+S+EAE DR ++
Sbjct: 406 AVSASNGDSSTTYALGCNVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNI 465
Query: 469 LLPGQQQLLITEVAKVAKGPVILIIMSGGGF---DISFAKNDPKITSILWVGYPGQAGGA 525
LP Q L+ V V K P ++++M+GG DI + D + + YPG G
Sbjct: 466 DLPAIQVQLLQRVRAVGK-PTVVVLMNGGVLTAEDI-IGQTDALVEAF----YPGFFGAQ 519
Query: 526 AIADVCFGRYNPSGRLPMTWYPQSY 550
A+ D+ FG NP G+LP+T Y Y
Sbjct: 520 AMTDILFGDANPGGKLPVTMYRSDY 544
>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 517
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 172/512 (33%), Positives = 244/512 (47%), Gaps = 68/512 (13%)
Query: 26 QHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG 85
+ S QS P D V+ P L++ C+ +L R LV+ L+++EK+ LV +
Sbjct: 18 EDPESRRQSYP----DCVNGP-LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQ 72
Query: 86 SVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNV-VPGATSFPQVILTAASFNASLFQAI 142
R+G+P Y WWSEALHGV+Y PGT+F NV +TS+P +L AA F+ +L + I
Sbjct: 73 GAPRIGLPAYNWWSEALHGVAYA-PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKI 131
Query: 143 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ 202
G + EARA N G AG +W+PN+N F+DPRWGRG ETPGED L +YA +GL
Sbjct: 132 GTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGL- 190
Query: 203 QTDGGSPN-RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------ 251
DG PN + ++ + CKHY D ++W GT R+ FNA + + YYL F+
Sbjct: 191 --DGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDS 248
Query: 252 ---------------------------------------YIVSDCDSVDVLYNSQHYTKT 272
Y+ SDC++V + + HY T
Sbjct: 249 KVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPT 308
Query: 273 PEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP 332
A AG+D +C A GL+ E +D A+ + L+R G+FDG
Sbjct: 309 NAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEE 368
Query: 333 SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVT 392
+ Y L + V + Q LAL AA +G+VLLKN G+LPL +A+IG A+
Sbjct: 369 AM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAP 425
Query: 393 KTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ---VDDAKKAAASADATV 449
+ + G Y G +P + I A +Q A +A +AA+ AD +
Sbjct: 426 EKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYIL 485
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
G D S E+ DR DL P Q L+ V
Sbjct: 486 YFGGLDTSAAGETLDRTDLDWPEAQLTLVKVV 517
>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
Length = 515
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 171/509 (33%), Positives = 243/509 (47%), Gaps = 68/509 (13%)
Query: 26 QHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG 85
+ S QS P D V+ P L++ C+ +L R LV+ L+++EK+ LV +
Sbjct: 18 EDPESRRQSYP----DCVNGP-LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQ 72
Query: 86 SVSRLGIPKYEWWSEALHGVSYVGPGTHF--SNV-VPGATSFPQVILTAASFNASLFQAI 142
R+G+P Y WWSEALHGV+Y PGT+F NV +TS+P +L AA F+ +L + I
Sbjct: 73 GAPRIGLPAYNWWSEALHGVAYA-PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKI 131
Query: 143 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ 202
G + EARA N G AG +W+PN+N F+DPRWGRG ETPGED L +YA +GL
Sbjct: 132 GTAIGIEARAWGNSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGL- 190
Query: 203 QTDGGSPN-RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------ 251
DG PN + ++ + CKHY D ++W GT R+ FNA + + YYL F+
Sbjct: 191 --DGPVPNEQRRIISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDS 248
Query: 252 ---------------------------------------YIVSDCDSVDVLYNSQHYTKT 272
Y+ SDC++V + + HY T
Sbjct: 249 KVGSIMCAYNAVNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPT 308
Query: 273 PEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP 332
A AG+D +C A GL+ E +D A+ + L+R G+FDG
Sbjct: 309 NAAGTAICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEE 368
Query: 333 SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVT 392
+ Y L + V + Q LAL AA +G+VLLKN G+LPL +A+IG A+
Sbjct: 369 AM--YADLDWQHVNSAEAQSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAP 425
Query: 393 KTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ---VDDAKKAAASADATV 449
+ + G Y G +P + I A +Q A +A +AA+ AD +
Sbjct: 426 EKLQGGYSGRAHHLYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYIL 485
Query: 450 LVMGADQSIEAESHDRLDLLLPGQQQLLI 478
G D S E+ DR DL P Q L+
Sbjct: 486 YFGGLDTSAAGETLDRTDLDWPEAQLTLV 514
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 188/577 (32%), Positives = 271/577 (46%), Gaps = 105/577 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDS--------AGSVSRLGIPKYEWWSEALHG 104
F N SL RV DLV RLTL + + L A ++ LGI Y+W +E L G
Sbjct: 26 FQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGIGPYQWNTECLRG 85
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------V 156
G ATSFPQ I AA+F+ L + K +TE RA +N
Sbjct: 86 DVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEVRAKHNDFVKRGIFT 135
Query: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
GL+ +SP +NI R P WGR QET GEDP L+ YA+ +V+GLQ G ++ A
Sbjct: 136 DHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQ---GDHDRYIQANA 192
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--------------------- 251
CKH+ A+ R F+A V + +L F+
Sbjct: 193 GCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAGAYSLMCSYNSINGVP 252
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+VSD +++ + HY E+AAA S+ AG +L
Sbjct: 253 ACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAGSVNAGCNLELS- 311
Query: 292 FLGKHTEA-------AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
G TE AVK+G + ES + + + F T MRLG FD P PY +
Sbjct: 312 --GNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD-PPEMNPYSSVNLSV 368
Query: 345 VCTQANQDLALDAARQGIVLLKNTA--------GSLPLSPTAIKNLAVIGPNANVTKTMI 396
+ ++ +++L+L AA + +VLLK + G P + +AVIGP AN T +
Sbjct: 369 IQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFP-----SERMAVIGPMANNTDQIF 423
Query: 397 GNYEGT--PCKYTTPLQGLAAVVATI-YQAGCSN-VQCGTAQVDDAKKAAASADATVLVM 452
G+Y T P TPL+GL + ++ Y AGC + +C DD K A AD V+ +
Sbjct: 424 GDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKTALVGADLVVVCL 483
Query: 453 GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
G + +E+E+ DR D++LPG+Q L+ +V + V L++ S G +I++A+ ++
Sbjct: 484 GTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNITWAQESERVLI 543
Query: 513 ILWVGYPGQAGGAAIADVCF---GRYNPSGRLPMTWY 546
IL YP Q+ G AI GR+NP+GRLP TWY
Sbjct: 544 ILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWY 580
>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
Length = 236
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 111/200 (55%), Positives = 143/200 (71%), Gaps = 6/200 (3%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+ S FC+ SL + R DLV R+TLQEK+ V +A V RLG+P+Y WWSEALHG+S
Sbjct: 31 MKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISN 90
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
+GPG F +PGATSFP VIL+ A+FN +L++ +G+VVSTE RAMYN+G AGLTFWSPN
Sbjct: 91 LGPGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPN 150
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG------SPNRLKVAACCKHY 221
IN+ RD RWGR QET GEDP + ++A YV+GLQ +G + LKV++CCKHY
Sbjct: 151 INVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDLNSRPLKVSSCCKHY 210
Query: 222 TAYDLDNWKGTDRYHFNAMV 241
AYD+D+W DR+ F+A V
Sbjct: 211 AAYDIDSWLNVDRHTFDARV 230
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 157/440 (35%), Positives = 219/440 (49%), Gaps = 79/440 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G +P K+ A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYRKLDATA 203
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 SASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ ID A+ MRLG FD P + P+ Q+ + A+
Sbjct: 321 YST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSQIPASVNQSPAHDA 378
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP T LQG
Sbjct: 379 LARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 437
Query: 413 LAAV---VATIYQAGCSNVQ 429
+ A +Y G V+
Sbjct: 438 IRAAAPNAQVLYARGADLVE 457
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SA+ V V G +E E DR DL LP Q+ L+ +
Sbjct: 625 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHAT 684
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 685 GK-PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 741
Query: 545 WYPQS 549
+Y +S
Sbjct: 742 FYKES 746
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 200/615 (32%), Positives = 268/615 (43%), Gaps = 145/615 (23%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+ + + S +LR L+ +L+L+EK + ++ +V RLGI Y WWSEALHGV+ G
Sbjct: 28 EISYTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEALHGVARNG 87
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGL 161
AT FPQ I AASF+ + + VS EAR + AGL
Sbjct: 88 ----------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGRVYQYAGL 137
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAACCK 219
+FW+PNINIFRDPRWGRG ET GEDP L + V+GLQ G P+ LK AC K
Sbjct: 138 SFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQ----GDPDADVLKTHACAK 193
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
HY + + ++R+ F+A V + YL FK
Sbjct: 194 HYAVH---SGLESNRHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRFRGYPCA 250
Query: 252 ------------------YIVSDCDSV-DVLYNSQH-YTKTPEEAAAKSILAGLDLNCGS 291
+VSDC ++ D +H + T EEAAA ++ GLD+ CGS
Sbjct: 251 ASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVATGEEAAALAVANGLDVECGS 310
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
K AA+ GL+ E +D + RLG DG + P+ L P V ++
Sbjct: 311 TFSK-IPAAIDQGLLKEEDLDRNLLRVLTERFRLGEMDG---ESPWDDLDPAIVEGPEHR 366
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
L+LD AR+ +VLL+N G LPL A + +A+IGPNA+ + GNY P T LQ
Sbjct: 367 ALSLDIARETMVLLRNN-GVLPLK--AGEKIALIGPNADDAQMQWGNYNPVPKSTITLLQ 423
Query: 412 GLAAVV-ATIYQAGC--------------------SNVQCGTA------QVDDAK----- 439
+ A V +Y C S Q A V+D K
Sbjct: 424 AMQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDIKNYIRR 483
Query: 440 ------------------KAAASADATVLVMGADQSIEAESH----------DRLDLLLP 471
K D V G +E E DR D+ LP
Sbjct: 484 DEEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSGGDRTDIELP 543
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G Q+ L+ + K V+L+ SG I +IL YPGQ GG AIADV
Sbjct: 544 GVQRRLLKALHDAGK-KVVLVNFSGCA--IGLVPETESCDAILQAWYPGQEGGTAIADVL 600
Query: 532 FGRYNPSGRLPMTWY 546
FG NPSG+LP+T+Y
Sbjct: 601 FGDVNPSGKLPVTFY 615
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 164/454 (36%), Positives = 229/454 (50%), Gaps = 79/454 (17%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
+A++ + +T+L R DLV R+TL+EK L +A ++ RLG+PKY WW+E LHGV+
Sbjct: 18 VAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNEGLHGVAR 77
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------- 158
G AT FPQ I AA+++ + +G VVSTE RA Y +
Sbjct: 78 AGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVHPDGGTDWY 127
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
GLT WSPNINIFRDPRWGRGQET GEDP L S+ GY+ GLQ G P K A
Sbjct: 128 RGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQ---GNDPKFFKTVATS 184
Query: 219 KHY-----------------TAYDLDNW-------KGTDRYHFNAMVIY----------T 244
KH+ + +DL++ T+ ++ M +Y +
Sbjct: 185 KHFAVHSGPESNRHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYNAVYGVPGCAS 244
Query: 245 YYLIKFK---------YIVSDCDSVDVLYNSQ--HYTKTPEEAAAKSILAGLDLNCGSFL 293
+L++ K ++VSDC + ++ HYTKT EE A + AG+DL CG +
Sbjct: 245 DFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKAGMDLICGDYR 304
Query: 294 GKHTEA------AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
K + AVKAG + + +D A+ F +RLG FD P+ P+ + D T
Sbjct: 305 NKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPASLPFAHITADDSDT 363
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A+ +ALD A++ +VLLKN G LPL K +AVIGPNA+ ++GNY G P K
Sbjct: 364 PAHHAVALDMAKKSMVLLKND-GLLPLKAEP-KTIAVIGPNADSLDALVGNYYGKPSKPV 421
Query: 408 TPLQGLAAVVAT---IYQAGCSNVQCGTAQVDDA 438
T L G+ A T +Y G + V D+
Sbjct: 422 TVLDGIRARFPTAKIVYAEGTGLIGPAEPPVPDS 455
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 58/99 (58%), Gaps = 6/99 (6%)
Query: 451 VMGADQSIEAESH---DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
V G + +EAE DR + LP QQ L+ +V K P +L++MSG +++A D
Sbjct: 611 VEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIGTGK-PTVLVLMSGSALGVNWA--D 667
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ +I+ YPG GG A+A + G Y+P+GRLP+T+Y
Sbjct: 668 KHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY 706
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 156/440 (35%), Positives = 220/440 (50%), Gaps = 79/440 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ D P K+ A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRKLDATA 206
Query: 219 KHYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ + T EEAAA ++ G +L CG+
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AV+ GL++E+ +D+A+ + MRLG FD P K + Q+ + +
Sbjct: 324 YST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFDP-PEKLAWAQIPLSANQSPEHDA 381
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AR+ +VLLKN G LPLS IK +AV+GP A+ T ++GNY GTP T LQG
Sbjct: 382 LARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 413 L---AAVVATIYQAGCSNVQ 429
+ A +Y G V+
Sbjct: 441 IREAAPDAEVLYARGADLVE 460
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ DA AA AD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG+A+ADV FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 161/446 (36%), Positives = 221/446 (49%), Gaps = 91/446 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-------QTDGGSPNRLKVAAC 217
SPNINIFRDPRWGRGQET GEDP L ++ +V+GL+ + G P R K+ A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYR-KLDAT 203
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 AKHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
AAV+ GL++E+ ID A++ MRLG FD P GQL + NQ
Sbjct: 321 EYST-LPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD------PPGQLPWSTIPASVNQ 373
Query: 352 DLALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 374 SPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAP 432
Query: 407 TTPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 VTVLQGIRAAAPNAQVLYARGADLVE 458
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 65/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A+SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 626 LQEALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 545 WYPQS 549
+Y +S
Sbjct: 743 FYKES 747
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 163/435 (37%), Positives = 221/435 (50%), Gaps = 77/435 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L + R DLV ++TL+EK++ + +A ++ RLGIP+Y WW+E LHGV G
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN-------VGL-AGLTFW 164
AT FPQ I AA ++ I VS EARA ++ G+ GLTFW
Sbjct: 98 --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L + A Y+KGLQ G LK+ A KH+ +
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQ---GDDDRYLKLVATSKHFLVH 206
Query: 225 DLDNWKGTDRYHFNA-------MVIYTYYLIK---------------------------- 249
+ DR+HF+A ++ YT + K
Sbjct: 207 ---SGPEPDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPCCGSKPV 263
Query: 250 ---------FK-YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGKHT 297
FK YIVSDC +V Y H+ P EEAAA ++ AG DLNCG+
Sbjct: 264 ENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCGNSYPALV 323
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
+ AVK GLV+E ID + +RLG FD P PY + V ++ +++LAL A
Sbjct: 324 D-AVKQGLVSEEEIDVLVKRLMEARLRLGMFDP-PEMVPYTNIPYSVVDSKEHRELALIA 381
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
AR+ +VLLKN +LPL +KN+AVIGPNAN ++ NY G P TPL G+ +
Sbjct: 382 ARKSMVLLKNDNNTLPLDKN-VKNVAVIGPNANNLDVLLANYNGYPSNPVTPLDGIRQKL 440
Query: 418 ATI---YQAGCSNVQ 429
Y GC + +
Sbjct: 441 PNANVQYALGCRHAE 455
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 72/120 (60%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
+A + AA++D ++ MG ++E E DR+D+ LP Q L+ + + K
Sbjct: 609 EAIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK 668
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+L++++G I++ + + +IL YPGQAGG AIADV FG YNP+GRLP+T+Y
Sbjct: 669 -PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFY 725
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 181/563 (32%), Positives = 271/563 (48%), Gaps = 83/563 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDS-------AGSVSRLGIPKYEWWSEALHGV 105
F + +L R+ DL+ RLT E + + A + RLGI Y W +E L G
Sbjct: 29 FRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTECLRGD 88
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA------ 159
+ PG AT+FPQ + AA+F+ L + +TE RA +N +A
Sbjct: 89 AE-APGW--------ATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFVAAGRYDD 139
Query: 160 --GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL+ +SP +NI R P WGR QET GEDP L ++ AT +V+GLQ G P +K +A
Sbjct: 140 HTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQ---GQHPRYIKASAG 196
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVI----YTYYLIKFK---------------------- 251
CKH++ + R F+A V+ +T +L +F+
Sbjct: 197 CKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSYSFMCSYNRINGVPA 256
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSD +V+++ YT T E A S+ AGL+L
Sbjct: 257 CANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAGLNLELSYG 316
Query: 293 LGK----HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
+ H A+ G + + + F T +RLG FD P+ PY L V +
Sbjct: 317 MRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFDP-PAMNPYNALELSVVQSS 375
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-CKYT 407
+++L+L+AA + VLLKN +LPL K LAV+GP A+ + + G+Y P +Y
Sbjct: 376 EHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYAPVPEPQYI 435
Query: 408 -TPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
TP +GL + A + + AGC +C D+ + A AD ++ +G +E E+ DR
Sbjct: 436 YTPRRGLQTLPANVSFAAGCREPRCWVYSRDEVENAVRGADVVLVCLGTGIDVEMEARDR 495
Query: 466 LDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGG 524
DL LPG Q L+ + + A G PVIL++ + G D+S+A+ + +IL +P QA G
Sbjct: 496 KDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAILACFFPAQATG 555
Query: 525 AAIADVCFGRY--NPSGRLPMTW 545
AIA V G+ +P+GRLP TW
Sbjct: 556 LAIASVLLGKQGASPAGRLPATW 578
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 192/572 (33%), Positives = 264/572 (46%), Gaps = 86/572 (15%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDS-------AGSVSRLGIPKYEWWSE 100
L+ F NTSL + RV DLV RL L+E + + A + RL I Y W +E
Sbjct: 22 LSDFPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIGPYSWNTE 81
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV---- 156
L G GP ATSFPQ AA+F+A L + I + E RA YN
Sbjct: 82 CLRGDLSAGP----------ATSFPQAFGLAATFDAVLIKQIANATAYEVRAKYNNYTKH 131
Query: 157 ----GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
GL+ +SP INI R P WGR QET GEDP L+ A +V GLQ G P +
Sbjct: 132 KEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQ---GNHPRYV 188
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
A CKH+ AY + R F+A V + +L F
Sbjct: 189 TANAGCKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAGTYSLMCSYNSI 248
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
Y++SD +V+ +Y++ HYTK + A + +GL+L
Sbjct: 249 NGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNL 308
Query: 288 NCGSFLGKH----TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
S L + T AVK G V + +S F T MRLG FD P PY +L
Sbjct: 309 ELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFDP-PEMNPYSKLDLS 367
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ +Q +Q+L+L AA + VLLKN LPL I LAV+GP + + G+
Sbjct: 368 IIQSQEHQELSLKAAAKSFVLLKNENRFLPLK-EKIDKLAVVGPFGDNPIEIYGSKSPDV 426
Query: 404 CKYT-TPLQGLAAV--VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
T TP GL+ + +AT + +GC + C K+A D V+ +G +E
Sbjct: 427 SNLTVTPRYGLSKIARLATTFASGCLSPACTEYDPKSTKQAIDRVDMVVVCLGTGNEVEN 486
Query: 461 ESHDRLDLLLPGQQ-QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E+HDR +L LPGQQ +LL V A PVIL++ + G DI++A ++P I I+ +P
Sbjct: 487 EAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITWAVSNPAIPVIVECFFP 546
Query: 520 GQAGGAAIADVCFGR--YNPSGRLPMTWYPQS 549
Q G A+ + NP GRLP+TW P+S
Sbjct: 547 AQTTGTALYHLFVNSPGSNPGGRLPITW-PKS 577
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 155/440 (35%), Positives = 220/440 (50%), Gaps = 79/440 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRKLDATA 206
Query: 219 KHYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ + T EEAAA ++ G +L CG+
Sbjct: 264 SASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECGAE 323
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+AV+ GL++E+ +D A+ + MRLG FD P K + Q+ + +
Sbjct: 324 YST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFDP-PEKLAWAQIPLSANQSPEHDA 381
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AR+ +VLLKN G LPLS IK +AV+GP A+ T ++GNY GTP T LQG
Sbjct: 382 LARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 413 L---AAVVATIYQAGCSNVQ 429
+ A +Y G V+
Sbjct: 441 IREAAPDAEVLYARGADLVE 460
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ DA AA AD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG+A+ADV FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 162/446 (36%), Positives = 219/446 (49%), Gaps = 91/446 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-------QTDGGSPNRLKVAAC 217
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ + G P R K+ A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYR-KLDAT 203
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 AKHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
AAV GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 321 EYST-LPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD------PPGQLPWSTIPASVNQ 373
Query: 352 DLALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 374 SPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAP 432
Query: 407 TTPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 VTVLQGIRAAAPNAQVLYARGADLVE 458
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 626 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 545 WYPQS 549
+Y +S
Sbjct: 743 FYKES 747
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 169/293 (57%), Gaps = 49/293 (16%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C S + L FC+ +L R DLV RLT EKI L D A V RLG+P Y
Sbjct: 29 PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
+WW+EALHG++ G G HF V ATSFPQV+LTAA+F+ L+ IG+ + EARA++N
Sbjct: 89 KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148
Query: 156 VGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
VG A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+G+Q S + L+
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGN--SSSSLLQT 206
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNAMVI-----------YTYYLIKFK------------ 251
+ACCKH TAYDL++W G RY F A V + +++ K
Sbjct: 207 SACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAIN 266
Query: 252 -----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
Y+ SDCD+V ++ ++Q Y TPE+A A S+
Sbjct: 267 GVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSL 319
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 125/156 (80%), Positives = 138/156 (88%)
Query: 395 MIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA 454
MIGNYEGTP KYTTPLQGL A+VAT Y GCSNV CGTAQ+D+AKK AA+ADATVL++G
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVATTYLPGCSNVACGTAQIDEAKKIAAAADATVLIVGI 60
Query: 455 DQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSIL 514
DQSIEAE DR+++ LPGQQ LLITEVAK +KG VIL++MSGGGFDISFAKND KITSIL
Sbjct: 61 DQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSIL 120
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
WVGYPG+AGGAAIADV FG YNPSGRLP TWYPQSY
Sbjct: 121 WVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSY 156
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 216/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 207 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGEE 323
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 324 YST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD------PPGQLPWSKIPASVNQS 376
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 377 PAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPV 435
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 436 TVLQGIRAAAPNAQVLYARGADLVE 460
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G + +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAVDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 878
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 167/454 (36%), Positives = 227/454 (50%), Gaps = 85/454 (18%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
++LFLFL S S ++ P F NT L D RV DL+ RLT+
Sbjct: 6 TILFLFL---SFNFCFSQSEKYP----------------FQNTELPEDERVNDLINRLTV 46
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
EKI L+ + ++ RLGIP Y WW+E+LHGV+ G AT FPQ I AAS
Sbjct: 47 DEKIAQLLYQSPAIERLGIPAYNWWNESLHGVARAG----------YATVFPQSITIAAS 96
Query: 134 FNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWGRGQETPGE 185
++ L + V+S EARA ++ L GLTFWSPNINIFRDPRWGRG ET GE
Sbjct: 97 WDDELVAEVANVISDEARAKHHEYLRRGQHDIYQGLTFWSPNINIFRDPRWGRGHETYGE 156
Query: 186 DPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY---------------DLDNWK 230
DP L T YVKGLQ G + LKV A KH+ + D W+
Sbjct: 157 DPYLTGVLGTEYVKGLQ---GNNAKYLKVVATAKHFAVHSGPEPLRHEFDVAPSQRDLWE 213
Query: 231 G---------TDRYHFNAMVIYT-------------YYLIKFK-----YIVSDCDSVDVL 263
D ++ M Y Y +++ K Y+VSDC ++ +
Sbjct: 214 TYLPAFRTLVKDGNVYSIMTAYNRIYGEAASASNSLYSILRDKWGFNGYVVSDCGAIADM 273
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
+ + H K EA+A ++ G DLNCG+ K T+ A++ GL+ E+ +D A+
Sbjct: 274 WKTHHVAKDAAEASAMAVKEGCDLNCGNSYEKLTD-ALQDGLITEADLDVALHRLMRARF 332
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
+LG FD K PY ++ ++ LAL AA++ IVLLKN LPLS +KN+A
Sbjct: 333 KLGMFDSD-EKVPYAKIPFSVNNNPKHKVLALKAAQKSIVLLKNENAILPLSKN-LKNIA 390
Query: 384 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
VIGPNA+ +++ GNY G P T L+G+ V
Sbjct: 391 VIGPNADNIQSLWGNYNGMPKNPVTVLEGIKNKV 424
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 73/123 (59%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAK 483
Q++ A AA +D VL +G ++ +E E DR L LP +Q L+ EV
Sbjct: 588 QLEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEVVA 647
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K PV+L++++G I++A + I +I+ GYPGQ GG AIA+V FG YNP+GRLP+
Sbjct: 648 TGK-PVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPAGRLPV 704
Query: 544 TWY 546
T+Y
Sbjct: 705 TYY 707
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 216/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 207 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELECGEE 323
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 324 YST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD------PPGQLPWSKIPASVNQS 376
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 377 PAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPV 435
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 436 TVLQGIRAAAPNAQVLYARGADLVE 460
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 217/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+ Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G +P K+ A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDATA 204
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 205 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 261
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 262 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 321
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV GL+ E+ ID ++ MRLG FD P GQL + NQ
Sbjct: 322 YST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD------PPGQLPWSKIPASVNQS 374
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS T +K +AVIGP A+ T ++GNY GTP
Sbjct: 375 PAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 434 TVLQGIRAAAPNAQVLYARGADLVE 458
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SA+ V V G +E E DR DL LP Q+ L+ +
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 545 WYPQS 549
+Y +S
Sbjct: 743 FYKES 747
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 163/428 (38%), Positives = 222/428 (51%), Gaps = 72/428 (16%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
A +++ P++A+ NT D R LV +++ +EK+ ++ A ++ RLGIP YEW
Sbjct: 18 LALPLIAAPAVAA---KNTDATPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEW 74
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSE LHG++ G AT FPQ I AAS+N L Q +G VVSTEARA +N
Sbjct: 75 WSEGLHGIARNG----------YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQA 124
Query: 158 ---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D
Sbjct: 125 GGPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNH 184
Query: 209 PNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNAMVIYTY--------- 245
P + A KH +D+D + + T F A ++
Sbjct: 185 P---RTIATPKHIAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYN 241
Query: 246 -----------YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+L+ FK ++VSDCD+VD + ++ ++A ++ AG
Sbjct: 242 SLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGH 301
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG + A+ G V+E+ +D ++ FAT RLG + P K PY +LG KDV
Sbjct: 302 DLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDV 359
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
A++ LAL AA + IVLLKN A +LPL A LAVIGPNA+ + NY+GT
Sbjct: 360 DNAAHRALALQAAAESIVLLKNDANTLPLK--AGTRLAVIGPNADALAALEANYQGTSSA 417
Query: 406 YTTPLQGL 413
TPL GL
Sbjct: 418 PVTPLLGL 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 217/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+ Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G +P K+ A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDATA 204
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 205 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 261
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 262 SASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 321
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV GL+ E+ ID ++ MRLG FD P GQL + NQ
Sbjct: 322 YST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD------PPGQLPWSKIPASVNQS 374
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS T +K +AVIGP A+ T ++GNY GTP
Sbjct: 375 PAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 434 TVLQGIRAAAPNAQVLYARGADLVE 458
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SA+ V V G +E E DR DL LP Q+ L+ +
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742
Query: 545 WYPQS 549
+Y +S
Sbjct: 743 FYKES 747
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 190/604 (31%), Positives = 279/604 (46%), Gaps = 99/604 (16%)
Query: 9 RAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLV 68
R+ +S + L+ S +S+ ++P +V + + S I+ R+ DL+
Sbjct: 4 RSVALSTAAVLLSVASCVSASAQGSNAPASGGEV----------YRDMSRPIEDRITDLI 53
Query: 69 KRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVI 128
KR TLQEK L + V RLG+P + W++ LHGV P T FP
Sbjct: 54 KRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP----------TTLFPIPT 103
Query: 129 LTAASFNASLFQAIGKVVSTEARAMYNVGLAG------LTFWSPNINIFRDPRWGRGQET 182
A+++ L + +S EARA+YN G L + SP INI RDPRWGR QE
Sbjct: 104 AMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSPVINISRDPRWGRIQEV 163
Query: 183 PGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV- 241
EDPLL + YV+GLQ G LK+AA KH+ +++ + R H NA V
Sbjct: 164 FSEDPLLTGRMGVAYVRGLQ---GDDLQHLKLAATVKHFAVNNVE----SGRQHLNADVD 216
Query: 242 ---IYTYYLIKFKYIVSDCDSVDVL--YNS------------------------------ 266
++ ++L ++ + + + V+ YN+
Sbjct: 217 ERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVLRKKWGFDGFVTDD 276
Query: 267 -------------------QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
QH+++ P AAA +I AG D + F + AV+ GL+
Sbjct: 277 LGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEF-ETNLPLAVQRGLLT 335
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ N RLG +D P Y ++G V +QA++DL+ A + + LL N
Sbjct: 336 EKDVDGALRNVLRVGFRLGAYDP-PQASKYSRIGMDVVRSQAHRDLSQRVAEESMTLLLN 394
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQA 423
LPL +K++AVIGP A GNY GTP T+ +GL A+ V Y+
Sbjct: 395 RRQFLPLQRDQVKSVAVIGP-AGGEAYETGNYYGTPAVKTSVTEGLRALLGSGVKVEYEK 453
Query: 424 GCSNVQCG-TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
G V +++ A A +D VL +G + +EAE DR DL LPG QQ L+ E
Sbjct: 454 GAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLNLPGAQQRLL-EAV 512
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
A V L++M+ G +++A + + +IL YPG+ GGAAIA FG NP G LP
Sbjct: 513 YAANPKVALVLMNAGPLGVTWAHD--HVPAILSAWYPGELGGAAIARTLFGLNNPGGHLP 570
Query: 543 MTWY 546
T Y
Sbjct: 571 YTVY 574
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 159/445 (35%), Positives = 217/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDATA 203
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 SASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 321 YAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD------PPGQLPWSTIPASVNQS 373
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 374 PAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 TVLQGIRAAAPKAQVLYARGADLVE 457
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ E +
Sbjct: 625 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLL-EALQA 683
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 684 TGRPVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 741
Query: 545 WYPQS 549
+Y +S
Sbjct: 742 FYKES 746
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 157/436 (36%), Positives = 215/436 (49%), Gaps = 74/436 (16%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A+ F N L I RV DLV RLTL+EKI+ ++++A ++ RLGIP Y WW+E LHGV+
Sbjct: 22 ATCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVA-- 79
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAG 160
+ P TSFPQ I AA+++ + S E RA+Y+ G
Sbjct: 80 ------RSPYP-VTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRG 132
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LT+WSPNINIFRDPRWGRGQET GEDP L + +VKGLQ G P LK +AC KH
Sbjct: 133 LTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQ---GDDPVYLKSSACAKH 189
Query: 221 YTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------- 251
Y + W +R+ ++A V ++ YL FK
Sbjct: 190 YAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPCCG 246
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y+ SDC +V+ YN+ + A+A ++L G D CG+
Sbjct: 247 NDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNGAY 306
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
+ AV GL+ E ID ++ F RLG FD + PY + + A++ A
Sbjct: 307 RALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPD-DRVPYSNIPLSVLECDAHKAHA 365
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L ARQ IVLLKN LPL+ IK +AV+GPNA+ ++ NY G P TT L+G+
Sbjct: 366 LKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEGIQ 425
Query: 415 AVVA----TIYQAGCS 426
V +Y+ G +
Sbjct: 426 KKVGNQVEVVYEKGVN 441
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 15/125 (12%)
Query: 434 QVDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEV 481
+ D + AAA DA V++ G + +E E DR + +P QQ L+ E+
Sbjct: 586 KADYKQTAAAVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKEL 645
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K PV+ ++M+G + + + +IL Y GQAGG AIADV FG YNPSGRL
Sbjct: 646 YATGK-PVVFVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRL 702
Query: 542 PMTWY 546
P+T+Y
Sbjct: 703 PLTFY 707
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 159/445 (35%), Positives = 217/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDATA 203
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 SASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 321 YAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD------PPGQLPWSTIPASVNQS 373
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 374 PAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 TVLQGIRAAAPNAQVLYARGADLVE 457
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ E +
Sbjct: 625 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLL-EALQA 683
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 684 TGRPVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 741
Query: 545 WYPQS 549
+Y +S
Sbjct: 742 FYKES 746
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 159/445 (35%), Positives = 217/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYRKLDATA 203
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 204 KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 SASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 321 YAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD------PPGQLPWSTIPASVNQS 373
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS +K +AVIGP A+ T ++GNY GTP
Sbjct: 374 PAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 TVLQGIRAAAPNAQVLYARGADLVE 457
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 625 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 684
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 685 GK-PVVAVLTAGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 741
Query: 545 WYPQS 549
+Y +S
Sbjct: 742 FYKES 746
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 155/423 (36%), Positives = 215/423 (50%), Gaps = 76/423 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N SL + RV DLV RLTL+EK++ +++S+ ++RLGIP Y+WW+E LHGV+ T
Sbjct: 29 FQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVA----RT 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
F V +PQ I AA+F+ + + + E RA+YN + GLT+
Sbjct: 85 PFKTTV-----YPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGLTY 139
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L + +VKGLQ G P LK AAC KHY
Sbjct: 140 WTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQ---GDDPKYLKAAACAKHYAV 196
Query: 224 YDLDNWKGTD--RYHFNAMV----IYTYYLIKFK-------------------------- 251
+ G + R+ F+ V ++ YL F+
Sbjct: 197 HS-----GPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRTQPCCAS 251
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
Y+ SDC ++D + + E AAA ++ G D++CG+ K
Sbjct: 252 DILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDCGTDAYK 311
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AVK G ++E ID ++ F RLG FD S Y Q + ++ +Q AL
Sbjct: 312 ALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDP-VSMVKYAQTPSSVLESKEHQLHAL 370
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
ARQ IVLLKN LPL+ +K + V+GPNA+ +++GNY GTP K TT LQG+
Sbjct: 371 KMARQSIVLLKNEKNILPLNKN-LKKIVVLGPNADNAISILGNYNGTPSKLTTVLQGIKE 429
Query: 416 VVA 418
V+
Sbjct: 430 KVS 432
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 59/114 (51%), Gaps = 15/114 (13%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+ADA + G +E E DR +L P Q L+ + K PV+ +
Sbjct: 605 NADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLLKALQSSGK-PVVFAM 663
Query: 494 MSGGGFDISF-AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
M+G I + A+N P I +I W G GQ+ G A ADV FG YNP+GRLP+T+Y
Sbjct: 664 MTGSAIAIPWEAENIPAILNI-WYG--GQSAGTAAADVIFGDYNPAGRLPVTFY 714
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/302 (45%), Positives = 170/302 (56%), Gaps = 55/302 (18%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P F+C PSL L FCNT L R DLV R+T EK + L D A V RLG+P Y
Sbjct: 84 PPFSCG--GGPSLG-LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSY 140
Query: 96 EWWSEALHGVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
+WW+EALHGV+ G G H V ATSFPQV+LTAASFN +L+ IG+ EARA Y
Sbjct: 141 KWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFY 200
Query: 155 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS---PN 210
N+G A GLT WSPN+NIFRDPRWGRGQETPGEDP +AS+YA +V+GLQ + + P
Sbjct: 201 NIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPP 260
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNA---------------------------MVIY 243
L +ACCKH TAYDL++WKG RY F A M Y
Sbjct: 261 VLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAY 320
Query: 244 T--------------------YYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
T + + +Y+ +DCD+V ++ NSQ Y T E+ A ++ A
Sbjct: 321 TSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKA 380
Query: 284 GL 285
G+
Sbjct: 381 GM 382
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 162/420 (38%), Positives = 216/420 (51%), Gaps = 69/420 (16%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P LA ++ D R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG+
Sbjct: 20 PCLALAAPAKSAGSPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGI 79
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG-------- 157
+ G AT FPQ I AAS+N SL Q +G VVSTEARA +N
Sbjct: 80 ARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQ 129
Query: 158 -LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D P + A
Sbjct: 130 RYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIA 186
Query: 217 CCKHYT----------AYDLD--------------------NWKGTDRYHFNAM----VI 242
KH +D+D G+ +NA+ V
Sbjct: 187 TPKHIAVHSGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 246
Query: 243 YTYYLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+L+ FK ++VSDCD+VD + ++ ++A ++ AG DLNCG
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ A+ G V+E+ +D ++ FA RLG + P K PY +LG KDV A++ L
Sbjct: 307 -RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AA + IVLLKN A +LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 365 ALQAAAESIVLLKNDANTLPLR--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAPQQTLL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY 723
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 185/532 (34%), Positives = 258/532 (48%), Gaps = 82/532 (15%)
Query: 32 AQSSPVFACDVV-------SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA 84
A +S +FA V+ +N LA C+ R LVK L ++EK+ LVD +
Sbjct: 10 ALASALFANGVIGFQYPDCANGPLADNTVCDPKATPPERAAALVKALNIEEKLQNLVDMS 69
Query: 85 GSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG----ATSFPQVILTAASFNASLFQ 140
RLG+P Y WWSEALHGV+ PG F+ G ATSF I +A+F+ L
Sbjct: 70 KGAERLGLPAYAWWSEALHGVA-ASPGVRFNRTAGGRFSSATSFANSITLSAAFDDELVY 128
Query: 141 AIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKG 200
+ +STEARA N GLAGL +W+PNIN ++DPRWGRG ETPGEDP+ Y + G
Sbjct: 129 KVADTISTEARAFANAGLAGLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAG 188
Query: 201 LQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----- 251
L+ G P+ KV A CKHY AYDL+ W+GT R+ F+A+V + YYL F+
Sbjct: 189 LE---GDDPSIRKVVATCKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARD 245
Query: 252 ----------------------------------------YIVSDCDSV-DVL----YNS 266
YI SDC+++ D L +++
Sbjct: 246 SKVGSFMCSYNALNGTPACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHN 305
Query: 267 QHYTKTPEEAAAKSILAGLDLNC---GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
T+T EAAA + AG D C G A L++E ID A+ + L+
Sbjct: 306 FSSTQTEAEAAAVAYQAGTDTVCEVPGWPPYTDVIGAYNQTLLSEEVIDTALKRLYEGLV 365
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
R+G+FD S PY +G +DV T Q+LAL + G+VLLKN G+LPL+ K +A
Sbjct: 366 RVGYFD-PASGSPYRSIGWEDVNTPEAQELALQSGTDGLVLLKND-GTLPLN-LEDKTVA 422
Query: 384 VIGPNANVTK--TMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDD--- 437
+IG AN T ++G Y G P +P+ + T Y +G A +DD
Sbjct: 423 LIGFWANSTNGGRILGGYSGFPPYIHSPVDAAEKLNLTYHYASGPLAENITQAAIDDWVA 482
Query: 438 -AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGP 488
A + A ++ + G D SI AE DR + P Q +I ++ + + P
Sbjct: 483 KALEPAKKSNVILYFGGTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAP 534
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 166/443 (37%), Positives = 221/443 (49%), Gaps = 77/443 (17%)
Query: 23 CSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVD 82
C L + S + P+ A VS NT + R DLV ++ +EK+ ++
Sbjct: 8 CRLAAALSVGLALPLIAAPAVSAK--------NTDATPEQRAADLVAHMSREEKVAQAMN 59
Query: 83 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAI 142
A ++ RLGIP YEWWSE LHG++ G AT FPQ I AAS+N L Q +
Sbjct: 60 DAPAIPRLGIPAYEWWSEGLHGIARNG----------YATVFPQAIGLAASWNTHLMQQV 109
Query: 143 GKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
G VVSTEARA +N AGLT WSPNINIFRDPRWGRG ET GEDP L +
Sbjct: 110 GTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQM 169
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNA 239
A G++ GLQ D P + A KH +D+D + + T F A
Sbjct: 170 AVGFIHGLQGEDLDHP---RTIATPKHLAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRA 226
Query: 240 MVIYTY--------------------YLIK--------FK-YIVSDCDSVDVLYNSQHYT 270
++ +LI FK ++VSDCD+VD + ++
Sbjct: 227 AIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFR 286
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
++A ++ AG DLNCG + A+ G V+E+ +D ++ FA RLG +
Sbjct: 287 PDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA 345
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P K PY +LG KDV ++ LAL AA + IVLLKN A +LPL+ A LAVIGPNA+
Sbjct: 346 -PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPLN--AGTRLAVIGPNAD 402
Query: 391 VTKTMIGNYEGTPCKYTTPLQGL 413
+ NY+GT TPL GL
Sbjct: 403 ALAALEANYQGTSSAPVTPLLGL 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 166/443 (37%), Positives = 221/443 (49%), Gaps = 77/443 (17%)
Query: 23 CSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVD 82
C L + S + P+ A VS NT + R DLV ++ +EK+ ++
Sbjct: 8 CRLAAALSVGLALPLIAAPAVSAK--------NTDATPEQRAADLVAHMSREEKVAQAMN 59
Query: 83 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAI 142
A ++ RLGIP YEWWSE LHG++ G AT FPQ I AAS+N L Q +
Sbjct: 60 DAPAIPRLGIPAYEWWSEGLHGIARNG----------YATVFPQAIGLAASWNTHLMQQV 109
Query: 143 GKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
G VVSTEARA +N AGLT WSPNINIFRDPRWGRG ET GEDP L +
Sbjct: 110 GTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQM 169
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNA 239
A G++ GLQ D P + A KH +D+D + + T F A
Sbjct: 170 AVGFIHGLQGDDLDHP---RTIATPKHLAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRA 226
Query: 240 MVIYTY--------------------YLIK--------FK-YIVSDCDSVDVLYNSQHYT 270
++ +LI FK ++VSDCD+VD + ++
Sbjct: 227 AIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFR 286
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
++A ++ AG DLNCG + A+ G V+E+ +D ++ FA RLG +
Sbjct: 287 PDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA 345
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P K PY +LG KDV ++ LAL AA + IVLLKN A +LPL+ A LAVIGPNA+
Sbjct: 346 -PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPLN--AGTRLAVIGPNAD 402
Query: 391 VTKTMIGNYEGTPCKYTTPLQGL 413
+ NY+GT TPL GL
Sbjct: 403 ALAALEANYQGTSSAPVTPLLGL 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 177/566 (31%), Positives = 268/566 (47%), Gaps = 88/566 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL--------VDSAGSVSRLGIPKYEWWSEALHG 104
F NTSL D RV DLV RLT++E + + A +V RLG+ + W +E L G
Sbjct: 28 FRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWNTECLRG 87
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-------- 156
Y G ATSFPQ + AA+F+ + + S E RA +N
Sbjct: 88 DVYAG----------NATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQRRKIYG 137
Query: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
G++ +SP INI R P WGR QET GEDP L+ + A +VK LQ G P ++ A
Sbjct: 138 DHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQ---GDDPTYIRANA 194
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKF-------------------------- 250
CKH+ + R+ F+A V + + F
Sbjct: 195 GCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSYSLMCSFNRINGVP 254
Query: 251 -------------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
Y+VSD ++++ + HYT + AA + AG +L +
Sbjct: 255 ACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKAGCNLELST 314
Query: 292 FLGKHTE----AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
K T A+KAG +++ + ++S F T MRLG FD P PY + + +
Sbjct: 315 NEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFDP-PDHNPYNFIDLSVIQS 373
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCK 405
+ ++ ++L+AA + VLLKN G LP++ ++V+GP A+ IG+Y + P
Sbjct: 374 EEHRAISLNAAMKSFVLLKNKGGFLPIT-KLFDTISVLGPMADNKYQQIGSYAPDVMPSY 432
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
TTPLQGL+ + + Y AGC++ C + ++A S+D + +G IE E HD
Sbjct: 433 TTTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNSSDIFFVCLGTGPMIENEDHD 492
Query: 465 RLDLLLPGQQ-QLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
R + LPGQQ QLL + AKG P++L++ +GG +I++A ++ +I+ +P Q
Sbjct: 493 RASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAIMECFFPAQE 552
Query: 523 GGAAIADVCF---GRYNPSGRLPMTW 545
G A+ V NP+GRLP TW
Sbjct: 553 TGEAVLRVVTNTGNSSNPAGRLPYTW 578
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 153/419 (36%), Positives = 211/419 (50%), Gaps = 78/419 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L RV DLVKR+TL+EK++ +++S+ ++ RL IP Y+WW+E LHGV+ T
Sbjct: 28 YQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVAR----T 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
F T +PQ I AA+F+ + + E RA++N L GLT+
Sbjct: 84 PFK-----VTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEKYLGLTY 138
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L + +V GLQ G P LK AAC KHY
Sbjct: 139 WTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQ---GNDPKYLKAAACAKHYAV 195
Query: 224 YDLDNWKGTD--RYHFNAMV----IYTYYLIKFK-------------------------- 251
+ G + R+ FNA + ++ YL FK
Sbjct: 196 H-----SGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYNAFKTQPCCGS 250
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
Y+ SDC +D + + T E+A+ ++L G D+ CG+ K
Sbjct: 251 DLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHATAEDASTDAVLHGTDIECGTDAYK 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLA 354
AAVK G ++E+ ID ++ F RLG FD PS Y Q + + +Q A
Sbjct: 311 SLVAAVKEGKISETQIDISVKRLFMIRFRLGMFD--PSDVVKYAQTPVSVLESPEHQAHA 368
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
L ARQ +VLLKN +LPLS T I+ + V+GPNA+ ++GNY GTP TT QG+
Sbjct: 369 LKMARQSVVLLKNANHTLPLSKT-IRKIVVLGPNADNPIAILGNYNGTPSNLTTVYQGI 426
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 61/118 (51%), Gaps = 15/118 (12%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K A ADA V V G +E E DR + LP Q L+ + K PV
Sbjct: 598 KRVADADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLMKTLQATGK-PV 656
Query: 490 ILIIMSGGGFDISF-AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ ++M+G + A+N P I + W G GQA G A+ADV FG YNP+GRLP+T+Y
Sbjct: 657 VFVMMTGSALATPWEAENIPAIVNA-WYG--GQAAGTAVADVLFGDYNPAGRLPVTFY 711
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 161/428 (37%), Positives = 221/428 (51%), Gaps = 72/428 (16%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
A +++ P++A+ NT + R LV +++ +EK+ ++ A ++ RLGIP YEW
Sbjct: 3 LALPLIAAPAVAA---KNTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEW 59
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSE LHG++ G AT FPQ I AAS+N L Q +G VVSTEARA +N
Sbjct: 60 WSEGLHGIARNG----------YATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQA 109
Query: 158 ---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D
Sbjct: 110 GGPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNH 169
Query: 209 PNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNAMVIYTY--------- 245
P + A KH +D+D + + T F A ++
Sbjct: 170 P---RTIATPKHIAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYN 226
Query: 246 -----------YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+L+ FK ++VSDCD+VD + ++ ++A ++ AG
Sbjct: 227 SLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGH 286
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG + A+ G V+E+ +D ++ FA RLG + P K PY +LG KDV
Sbjct: 287 DLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDV 344
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
A++ LAL AA + IVLLKN A +LPL A LAVIGPNA+ + NY+GT
Sbjct: 345 DNAAHRALALQAAAESIVLLKNDANTLPLK--AGTRLAVIGPNADALAALEANYQGTSSA 402
Query: 406 YTTPLQGL 413
TPL GL
Sbjct: 403 PVTPLLGL 410
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 632 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 688
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFY 711
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 162/461 (35%), Positives = 223/461 (48%), Gaps = 91/461 (19%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
AC + + + +T+ + R DLV R+TL+EK + ++A ++ RL +P Y+W
Sbjct: 22 LACGTGATHAATPPPYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDW 81
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
W+EALHGV+ G GAT FPQ I AA+F+ L + +S EARA ++
Sbjct: 82 WNEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQF 131
Query: 158 L--------AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ------- 202
L GLTFWSPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ
Sbjct: 132 LRQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAP 191
Query: 203 QTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------- 251
+ G P R K+ A KH+ + + +R+HF+A +Y YL F+
Sbjct: 192 KNAQGEPYR-KLDATAKHFAVH---SGPEAERHHFDARPSQRDLYETYLPAFEALVKDGK 247
Query: 252 -----------------------------------YIVSDCDSVDVLYNSQHYTKTPEEA 276
Y+VSDC ++ ++ T E+A
Sbjct: 248 VDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQA 307
Query: 277 AAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP 336
AA ++ G +L CG AAV GL++E+ ID A+ MRLG FD P
Sbjct: 308 AALAVTHGTELECGEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD------P 360
Query: 337 YGQLGPKDVCTQANQDLALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
GQL + NQ A DA AR+ +VLLKN G LPLS +K +AVIGP A+
Sbjct: 361 PGQLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADD 419
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAV---VATIYQAGCSNVQ 429
T ++GNY GTP T LQG+ A +Y G V+
Sbjct: 420 TMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVE 460
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 160/434 (36%), Positives = 223/434 (51%), Gaps = 84/434 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L RV DL++RLT++EKI L+ ++ ++ RLGIP+Y WW+E+LHGV+ G
Sbjct: 29 FQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARAG--- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA++++ L + + +S EARA ++ + GLTFW
Sbjct: 86 -------YATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQGLTFW 138
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRG ET GEDP L + YVKGLQ G PN LK+ A KH+ +
Sbjct: 139 SPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQ---GNDPNYLKLVATAKHFAVH 195
Query: 225 DLDNWKGTD--RYHFNA----MVIYTYYLIKFK--------------------------- 251
G + R+ F+ ++ YL F+
Sbjct: 196 -----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNRVYGEAASASD 250
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT 297
Y+VSDC ++ ++ K EA+A +++ G DLNCG K
Sbjct: 251 TLFTILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVIEGCDLNCGDSYEKLN 310
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCT-QANQDLAL 355
+ A + G+V E ID A+S ++LG FD P + PY Q+ P +V T + + LAL
Sbjct: 311 Q-AYQQGMVTEKDIDIALSRLMEARIKLGMFD--PEQLVPYAQI-PFNVNTSEKHNQLAL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG--- 412
AA++ IVLLKN LPLS +K++AVIGPNA+ +++ GNY G P T LQG
Sbjct: 367 KAAKESIVLLKNQGDLLPLS-KDLKSVAVIGPNADNIQSLWGNYNGNPKDPITVLQGIQN 425
Query: 413 -LAAVVATIYQAGC 425
L +YQ G
Sbjct: 426 ALGPQTTVVYQEGS 439
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 75/123 (60%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEVAK 483
Q++DA AA A+ V+V+G ++ +E E D R L LP Q+ L+ EVAK
Sbjct: 591 QLEDAVAAAEKAEVVVMVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAK 650
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P++L++++G I++A + I +I+ GY GQ GG A+A+V FG YNP+ RLP+
Sbjct: 651 TGK-PIVLVLLNGSALSINWAAEN--IPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPV 707
Query: 544 TWY 546
T+Y
Sbjct: 708 TYY 710
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 159/403 (39%), Positives = 211/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 40 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N L Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 90 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH +
Sbjct: 150 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLDHP---RTIATPKHIAVHSGPEPGRHS 206
Query: 224 YDLD----NWKGTDRYHFNAMVI--------------------YTYYLIK--------FK 251
+D+D + + T F A +I +L+ FK
Sbjct: 207 FDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFK 266
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A S+ AG DLNCG + + A++ G V+E+
Sbjct: 267 GFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCG-YAYRALGTAIERGEVDEAL 325
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY LG KD+ AN+ LAL AA Q IVLLKN A
Sbjct: 326 LDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIVLLKNDAN 384
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 385 TLPLKAGA--RLAVIGPNADALAALEANYQGTSSTPVTPLLGL 425
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 159/445 (35%), Positives = 215/445 (48%), Gaps = 89/445 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDATA 203
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KH + + DR+HF+A +Y YL F+
Sbjct: 204 KHLAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ T E+AAA ++ G +L CG
Sbjct: 261 SASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEE 320
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ ID A+ MRLG FD P GQL + NQ
Sbjct: 321 YAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD------PPGQLPWSTIPASVNQS 373
Query: 353 LALDA-----ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
A DA AR+ +VLLKN G LPLS K +AVIGP A+ T ++GNY GTP
Sbjct: 374 PAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPAAPV 432
Query: 408 TPLQGLAAV---VATIYQAGCSNVQ 429
T LQG+ A +Y G V+
Sbjct: 433 TVLQGIRAAAPNAQVLYARGADLVE 457
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ E +
Sbjct: 625 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLL-EALQA 683
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 684 TGRPVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 741
Query: 545 WYPQS 549
+Y +S
Sbjct: 742 FYKES 746
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 161/428 (37%), Positives = 221/428 (51%), Gaps = 72/428 (16%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
A +++ P++A+ NT + R LV +++ +EK+ ++ A ++ RLGIP YEW
Sbjct: 18 LALPLIAAPAVAA---KNTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEW 74
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSE LHG++ G AT FPQ I AAS+N L Q +G VVSTEARA +N
Sbjct: 75 WSEGLHGIARNG----------YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQA 124
Query: 158 ---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D
Sbjct: 125 GGPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNH 184
Query: 209 PNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNAMVIYTY--------- 245
P + A KH +D+D + + T F A ++
Sbjct: 185 P---RTIATPKHIAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYN 241
Query: 246 -----------YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+L+ FK ++VSDCD+VD + ++ ++A ++ AG
Sbjct: 242 SLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGH 301
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
DLNCG + A+ G V+E+ +D ++ FA RLG + P K PY +LG KDV
Sbjct: 302 DLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDV 359
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
A++ LAL AA + IVLLKN A +LPL A LAVIGPNA+ + NY+GT
Sbjct: 360 DNAAHRALALQAAAESIVLLKNDANTLPLK--AGTRLAVIGPNADALAALEANYQGTSSA 417
Query: 406 YTTPLQGL 413
TPL GL
Sbjct: 418 PVTPLLGL 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 155/440 (35%), Positives = 218/440 (49%), Gaps = 79/440 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T + R DLV R+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
GAT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACC 218
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ +P K+ A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRKLDATA 206
Query: 219 KHYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFK----------------------- 251
KH+ + + DR+HF+A +Y YL F+
Sbjct: 207 KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGESA 263
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y+VSDC ++ ++ + T E+AAA ++ G +L CG
Sbjct: 264 SASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTELECGEE 323
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
AAV+ GL++E+ +D A+ + MRLG FD P + Q+ + +
Sbjct: 324 YST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQSPEHDA 381
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AR+ +VLLKN G LPLS IK +AVIGP A+ T ++GNY GTP T LQG
Sbjct: 382 LARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 413 L---AAVVATIYQAGCSNVQ 429
+ A +Y G V+
Sbjct: 441 IREAAPDAEVLYARGADLVE 460
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ DA AA AD V V G +E E DR DL LP Q+ L+ +
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGT 687
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG+A+ADV FG NP GRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744
Query: 545 WYPQS 549
+Y +S
Sbjct: 745 FYKES 749
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 165/451 (36%), Positives = 226/451 (50%), Gaps = 79/451 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
VF C + + F N L ++ R+ DLV R+TL+EKI+ L+ A ++ RL IPKY
Sbjct: 11 VFCC-CIQIEAQKQFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYN 69
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
WW+E+LHGV+ G AT FPQ I AAS++A L + + +S EARA ++
Sbjct: 70 WWNESLHGVARAG----------YATVFPQSISIAASWDAQLVREVATAISDEARAKHHE 119
Query: 157 GL--------AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
L GLT WSPNINIFRDPRWGRG ET GEDP L YVKGLQ G
Sbjct: 120 YLRRDQHDIYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQ---GDD 176
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------- 251
P LKV A KH+ + + R++F+A ++ YL F+
Sbjct: 177 PEYLKVVATAKHFAVH---SGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMT 233
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y+VSDC +++ ++ T A+A ++
Sbjct: 234 AYNRFRGEAASSNKLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALALET 293
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G DLNCG+ K + A+ GL+ E I+ AI F ++LG FD + Y + P
Sbjct: 294 GTDLNCGATY-KSLKEAIANGLITEEKINIAIERLFRARLKLGMFDTEENLS-YATI-PF 350
Query: 344 DVCTQANQD-LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
V T A+ LA AA++ IVLLKN A LPLS +K +AVIGPNA+ +++ GNY GT
Sbjct: 351 SVNTNASHTALARKAAQESIVLLKNEAHMLPLSKD-LKQIAVIGPNAHNVQSLWGNYNGT 409
Query: 403 PCKYTTPLQGL----AAVVATIYQAGCSNVQ 429
P T +QG+ A V +Y+ G S Q
Sbjct: 410 PKNPVTVVQGIRNKVAPQVKVVYEEGASLAQ 440
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 74/123 (60%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAK 483
+++ A A +D T+LV+G ++ +E E DR L LP +Q+ L+ +
Sbjct: 588 KLERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELMRALVA 647
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P++L++++G I++A+ + +IL GYPGQ GG AIADV FG YNP+GRLP+
Sbjct: 648 TGK-PIVLVLLNGSALAINYAQE--HVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPV 704
Query: 544 TWY 546
T+Y
Sbjct: 705 TYY 707
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 181/553 (32%), Positives = 271/553 (49%), Gaps = 91/553 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS--VSRLGIPKYEWWSEALHGVSYVGP 110
+ N ++ I+ R+ DL+ RLTL+EK+ L DS GS ++RL IP +E LHG SY
Sbjct: 72 YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHGQSYA-- 128
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+T FP I ++F+ L Q +GK + EA+A N+ ++ WSP +++
Sbjct: 129 --------TGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NLRVS----WSPVLDV 175
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY------ 224
RD RWGR +ET GEDP L + ++KG Q + AC KH+ +
Sbjct: 176 ARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGE--------HMFACPKHFAGHGQPVGG 227
Query: 225 -DLDNWKGTDRYH-----------------FNAMVIYTYY--------------LIKFK- 251
D ++ +DR F M Y + +++ +
Sbjct: 228 RDSHDYGLSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDNGSKELLQKILREEW 287
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
++VSDC + + Q T EEAAA ++ AG+D+ CGS K +AVK G++
Sbjct: 288 GFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMAVRAGVDIECGSAYKKALASAVKKGIIK 347
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ES +D + F MRLG FD PS + + T ++ LA A + VLLKN
Sbjct: 348 ESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHRALARKVAVKSTVLLKN 406
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TPCKYTTPLQGLAAVVA----TIY 421
LPL IK +AVIGPNA+ +T G+Y P + + L+G+ V+ +Y
Sbjct: 407 ENNLLPLDKN-IKTIAVIGPNADQGQT--GDYSAKYAPGQIISVLEGVKNHVSPSTKVLY 463
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD---------QSIEAESHDRLDLLLPG 472
GC+ + T +A A ADA +LV+G + +S E+ D L +PG
Sbjct: 464 AQGCTQLDMDTTGFAEAVNIAKQADAVILVVGDNSNRHENGNKKSTTGENVDGATLEIPG 523
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q+ LI V K PV+L++++G F +++ D I SIL YPG+ GG A AD+ F
Sbjct: 524 VQRQLIKAVEATGK-PVVLVLVNGKPFTLTW--EDENIESILETWYPGEEGGNATADIIF 580
Query: 533 GRYNPSGRLPMTW 545
G NPSGRLP+++
Sbjct: 581 GDENPSGRLPISF 593
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 158/403 (39%), Positives = 210/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIATPKHIAVHSGPEPGRHG 203
Query: 224 YDLD--------------------NWKGTDRYHFNAM----VIYTYYLIK--------FK 251
+D+D G+ +NA+ V +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A+ G V+E+
Sbjct: 264 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLR--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY 723
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 165/443 (37%), Positives = 220/443 (49%), Gaps = 77/443 (17%)
Query: 23 CSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVD 82
C L + S + P+ A VS + T + R DLV ++ +EK+ ++
Sbjct: 8 CRLAAALSVGLALPLIAAPAVSAKT--------TDATPEQRAADLVAHMSREEKVAQAMN 59
Query: 83 SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAI 142
A ++ RLGIP YEWWSE LHG++ G AT FPQ I AAS+N L Q +
Sbjct: 60 DAPAIPRLGIPAYEWWSEGLHGIARNG----------YATVFPQAIGLAASWNTHLMQQV 109
Query: 143 GKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
G VVSTEARA +N AGLT WSPNINIFRDPRWGRG ET GEDP L +
Sbjct: 110 GTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQM 169
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD----NWKGTDRYHFNA 239
A G++ GLQ D P + A KH +D+D + + T F A
Sbjct: 170 AVGFIHGLQGDDLDHP---RTIATPKHLAVHSGPEPGRHGFDVDVSPRDVEATYTPAFRA 226
Query: 240 MVIYTY--------------------YLIK--------FK-YIVSDCDSVDVLYNSQHYT 270
++ +LI FK ++VSDCD+VD + ++
Sbjct: 227 AIVEGQAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFR 286
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
++A ++ AG DLNCG + A+ G V+E+ +D ++ FA RLG +
Sbjct: 287 PDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA 345
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P K PY +LG KDV ++ LAL AA + IVLLKN A +LPL A LAVIGPNA+
Sbjct: 346 -PRKDPYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPLK--AGTRLAVIGPNAD 402
Query: 391 VTKTMIGNYEGTPCKYTTPLQGL 413
+ NY+GT TPL GL
Sbjct: 403 ALAALEANYQGTSSAPVTPLLGL 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 647 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY 726
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 158/403 (39%), Positives = 210/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 25 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 74
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 75 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 134
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 135 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIATPKHIAVHSGPEPGRHG 191
Query: 224 YDLD--------------------NWKGTDRYHFNAM----VIYTYYLIK--------FK 251
+D+D G+ +NA+ V +L+ FK
Sbjct: 192 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFK 251
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A+ G V+E+
Sbjct: 252 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEAL 310
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 311 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 369
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 370 TLPLR--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 632 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 688
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFY 711
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 159/420 (37%), Positives = 217/420 (51%), Gaps = 69/420 (16%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P LA + + R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG+
Sbjct: 20 PCLALAAPTERAATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGI 79
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG-------- 157
+ G AT FPQ I AAS+N +L Q +G VVSTEARA +N
Sbjct: 80 ARNG----------HATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHK 129
Query: 158 -LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D P + A
Sbjct: 130 RYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIA 186
Query: 217 CCKHYT----------AYDLD----NWKGTDRYHFNAMVIYTY----------------- 245
KH +D+D + + T F A ++
Sbjct: 187 TPKHLAVHSGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPAC 246
Query: 246 ---YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+L+ FK ++VSDCD+VD + ++ ++A ++ AG DLNCG
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ A++ G V+E+ +D ++ FA RLG + P K PY +LG KDV A++ L
Sbjct: 307 -RELGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AA + IVLLKN A +LPL+ A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 365 ALQAAAESIVLLKNDANTLPLN--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 61/114 (53%), Gaps = 13/114 (11%)
Query: 443 ASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A +DA V +G +E E DR D+ LP QQ L+ E AK + P++++
Sbjct: 613 AQSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALL-ERAKASGKPLVVV 671
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+MSG +++AK W YPGQ+GG AIA + G NP GRLP+T+Y
Sbjct: 672 LMSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY 723
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 241 bits (615), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 158/403 (39%), Positives = 210/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------AT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIATPKHIAVHSGPEPGRHG 203
Query: 224 YDLD--------------------NWKGTDRYHFNAM----VIYTYYLIK--------FK 251
+D+D G+ +NA+ V +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A+ G V+E+
Sbjct: 264 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLR--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAAQQTLL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY 723
>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 881
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 149/426 (34%), Positives = 215/426 (50%), Gaps = 85/426 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL + R DLV R+T++EK++ L + + +V RL +P Y+WWSEALHGV+
Sbjct: 30 YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQ----- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
PG T +PQ + AA+F+ Q + + + E R + G+ GL FW
Sbjct: 85 ------PGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L ++ YVKGLQ G P + KHY +
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQ---GDDPKYYLAISTPKHYAVH 195
Query: 225 DLDNWKGTDRYHFNAMVIYTY-----YLIKFK---------------------------- 251
+ T R HF + + + YL F+
Sbjct: 196 ---SGPETTR-HFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVNEF 251
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT 297
Y+VSDC+++ +Y +TKT EA+A ++ G+D C F GK
Sbjct: 252 LLQDQLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDF-GKQK 310
Query: 298 E--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
+ A K G++ ES ID A+ F M+LG FD P PY ++ PK++ +
Sbjct: 311 DDHDYRPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESAE 369
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+++LA A + +VLLKN G+LPL + +K +AVIGP A T+ ++GNY GTP +
Sbjct: 370 HRELARTLANESMVLLKND-GTLPLKKSGLK-IAVIGPLAEQTRYLLGNYNGTPSHTVSV 427
Query: 410 LQGLAA 415
L+GL A
Sbjct: 428 LEGLRA 433
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 18/146 (12%)
Query: 416 VVATIYQAGCSNVQCGTAQVDD-----AKKAAASADATVLVMGADQSIEAESH------- 463
V +I QAG Q ++ D A AA +AD + V+G +E E
Sbjct: 576 VRYSIRQAGPMQAQLIWSKFDPTPNPAAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGF 635
Query: 464 ---DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
DR L LP +Q L+ ++ K PV+L++ +G +++A+ +IL YPG
Sbjct: 636 NGGDRTSLDLPKPEQQLLESISAAGK-PVVLVLSNGSALSVNWAQQ--HANAILEGWYPG 692
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
+ GG AIA G+ NP+GRLP+T+Y
Sbjct: 693 EEGGTAIAQTLSGKNNPAGRLPVTFY 718
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 161/435 (37%), Positives = 216/435 (49%), Gaps = 79/435 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T+ R DLV R+TL EK+ + +SA ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 43 YLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAGE-- 100
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN----VGL----AGLTFW 164
AT FPQ I AA+F+ +L +S EARA YN G+ GLTFW
Sbjct: 101 --------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTFW 152
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN NIFRDPRWGRGQET GEDP L S+ +V+GL+ G P K+ A KH+ +
Sbjct: 153 SPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLE---GDDPTYQKLDATAKHFAVH 209
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ ++R+ F+ ++ YL F+
Sbjct: 210 ---SGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVPATASHRL 266
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+VSDCD+V +Y T E+AAA ++ G DLNCG+ +
Sbjct: 267 LQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGTTYATLVK 326
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDA 357
AV GLVNE ID A++ RLG FD P + P+ L P V D LAL
Sbjct: 327 -AVHDGLVNEHTIDTAVTRLMLARFRLGMFDP-PGRVPWSTL-PMSVVQSPQHDALALRT 383
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A++ +VLLKN G LPLS ++ +AVIGP A+ ++GNY GTP T LQG+ V
Sbjct: 384 AQESMVLLKND-GLLPLSHN-VRRIAVIGPTADNVTALLGNYHGTPKAPVTILQGIREAV 441
Query: 418 ---ATIYQAGCSNVQ 429
Y G VQ
Sbjct: 442 PNAQVTYVQGTELVQ 456
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 61/119 (51%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A AA AD + G +E E DR L LP Q+ L+ + +V
Sbjct: 626 ALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLL-QALQVTGK 684
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+L++ +G I +AK + +IL YPGQ GG A+AD FG +P+GRLP+T+Y
Sbjct: 685 PVVLVLTTGSALAIDWAKQ--HLPAILLAWYPGQDGGHAVADALFGNVDPAGRLPVTFY 741
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 156/403 (38%), Positives = 211/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV ++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N +L Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIATPKHLAVHSGPEPGRHG 203
Query: 224 YDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK--------FK 251
+D+D + + T F A ++ +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A++ G V+E+
Sbjct: 264 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIERGDVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL+ A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLN--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 61/114 (53%), Gaps = 13/114 (11%)
Query: 443 ASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A +DA V +G +E E DR D+ LP QQ L+ E AK + P++++
Sbjct: 613 AQSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALL-ERAKASGKPLVVV 671
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+MSG +++AK W YPGQ+GG AIA + G NP GRLP+T+Y
Sbjct: 672 LMSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY 723
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 150/411 (36%), Positives = 210/411 (51%), Gaps = 67/411 (16%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ +T +EKI L + A S+ RLG+ Y +W+E+LHGV + GAT
Sbjct: 116 RLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGV-----------LAEGAT 164
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQET 182
SFPQ I A+++ L + VS EARA+ + GLT+WSP INI RDPRWGR +E+
Sbjct: 165 SFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPRWGRNEES 224
Query: 183 PGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI 242
EDP L S+ ++KG+Q G P LK A KH+ A + + + T + +
Sbjct: 225 YSEDPYLLSRMGVAFIKGMQ---GDHPYYLKTVATPKHFIANNEEERRHTGSSDVDMRNL 281
Query: 243 YTYYLIKFK------------------------------------------YIVSDCDSV 260
Y YYL FK Y+VSDC ++
Sbjct: 282 YEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRRQWGFEGYVVSDCGAI 341
Query: 261 -DVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
D+LY + + KT EA A+SILAG DLNCG + + A+ GL+ E ID A+
Sbjct: 342 HDMLYGHKFF-KTGAEAVARSILAGCDLNCGQAYREFIKDALDEGLLREKDIDSALFRVL 400
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
+ RLG FD P PY +G + ++ N+ LALDAAR+ IVLLKN LP+ + I
Sbjct: 401 SARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARKSIVLLKNN-DILPIDKSKI 458
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-----YQAGC 425
K++AVIGPNA + +G Y G P +PL+G+ ++ Y GC
Sbjct: 459 KSIAVIGPNAR--EAQLGIYSGFPNVLISPLEGIKNKADSLDIRVGYVKGC 507
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 66/117 (56%), Gaps = 3/117 (2%)
Query: 430 CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPV 489
G + AKK AA D +LV+G I E DR ++ LP Q+ L+ + A+V P
Sbjct: 653 LGQKDFEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEV--NPN 710
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I+I++ GG ++ A + +I+ Y G+ GG A+ADV FG YNP G+LP T+Y
Sbjct: 711 IVIVLVNGG-PVALAGAEKYAKAIVENWYNGEFGGQALADVLFGDYNPGGKLPQTFY 766
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 153/423 (36%), Positives = 216/423 (51%), Gaps = 76/423 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F NT L + RV +LV +LTL+EK+ ++++A ++ RLGIP Y+WW+E LHGV+ T
Sbjct: 29 FQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR----T 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
F V FPQ I AA+F+ + + + E RA+YN + GLT+
Sbjct: 85 PFKTTV-----FPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGLTY 139
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L + +VKGLQ G P LK AAC KHY
Sbjct: 140 WTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQ---GDDPKYLKAAACAKHYAV 196
Query: 224 YDLDNWKGTD--RYHFNAMV----IYTYYLIKFK-------------------------- 251
+ G + R+ F+ V ++ YL FK
Sbjct: 197 HS-----GPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAFRTQPCCAS 251
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
Y+ SDC ++D + + A+A ++L G D++CG+ K
Sbjct: 252 DILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDIDCGTDAYK 311
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AVK G + E ID ++ F RLG FD S Y Q + ++ +++ AL
Sbjct: 312 SLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFD-PVSMVKYAQTPSSVLESEEHKEHAL 370
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
ARQ IVLLKN +LPLS +K + V+GPNA+ + +++GNY GTP K TT LQG+
Sbjct: 371 KMARQSIVLLKNEKNTLPLS-KKLKKIVVLGPNADNSISILGNYNGTPSKLTTVLQGIKE 429
Query: 416 VVA 418
++
Sbjct: 430 KIS 432
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 61/114 (53%), Gaps = 15/114 (13%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+ADA + G +E E DR +LLP Q L+ + K PV+ +I
Sbjct: 605 NADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLLKALQSSGK-PVVFLI 663
Query: 494 MSGGGFDISF-AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
M+G + + A+N P I +I W G GQ+ G A ADV FG YNP+GRLP+T+Y
Sbjct: 664 MTGSAIAVPWEAENIPAILNI-WYG--GQSAGTASADVIFGDYNPAGRLPVTFY 714
>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
4393]
Length = 883
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 152/415 (36%), Positives = 214/415 (51%), Gaps = 74/415 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +TS + R LV ++TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G
Sbjct: 24 WQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 82 --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGRYQGLTFW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P K+ A KH+ +
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQ---GDDPVYRKLDATAKHFAVH 190
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+A +Y YL F+
Sbjct: 191 ---SGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYGESASASQFL 247
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+VSDC ++ ++ H + E AAA ++ G +L CG
Sbjct: 248 LRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELECGQEYAT-LP 306
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
AAV+ GL+ E+ ID A++ F MRLG FD P + + ++ A+ LAL AA
Sbjct: 307 AAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVPAHDALALQAA 365
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
++ +VLLKN G LPLS T +K +AV+GP A+ T ++GNY GTP T LQG+
Sbjct: 366 QESLVLLKND-GVLPLSRT-LKRIAVVGPTADDTMALLGNYFGTPAAPVTILQGI 418
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 62/121 (51%), Gaps = 13/121 (10%)
Query: 436 DDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVA 485
D+A AA +AD V V G +E E DR DL LP Q+ L+ +
Sbjct: 606 DEALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHATG 665
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PV++++ G + +A+ + +IL YPGQ GG A+ FG NP+GRLP+T+
Sbjct: 666 K-PVVMVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVTF 722
Query: 546 Y 546
Y
Sbjct: 723 Y 723
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 182/569 (31%), Positives = 275/569 (48%), Gaps = 89/569 (15%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS--------VSRLGIPKYEWWSE 100
+ L F N SL D R+ DL+ RL + + +T+ + G+ + RL I KY W +E
Sbjct: 28 SQLPFWNQSLPWDQRLDDLLSRLKVDD-MTYQLARGGADPNGPAPAIGRLQIGKYVWNTE 86
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA- 159
L G + G AT+FPQ + +A+F+ L + E RA YN L
Sbjct: 87 CLRGDAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQK 136
Query: 160 -------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
GL +SP INI R P WGR QET GEDP L + A +V GLQ G P L
Sbjct: 137 GDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQ---GNHPRYL 193
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
A CKH+ AY + R+ F+A V + + FK
Sbjct: 194 LTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTYSVMCSYNSV 253
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL 287
Y+VSD ++++ + +YT + + A KS+ AG +L
Sbjct: 254 NGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKAGCNL 313
Query: 288 NCGSF---LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+ G+ + + AV+ G++ + +I+ F T +RLG FD P PY +L
Sbjct: 314 DLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-PPDHNPYVKLNVDQ 372
Query: 345 VC-TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
V + +Q++AL AA + VL+KN +LP+ T I LAV+GP AN +K + G+Y P
Sbjct: 373 VVESPEHQEIALKAALKSFVLVKNDGSTLPIEGT-IHTLAVVGPFANNSKLLFGDYAPNP 431
Query: 404 CK--YTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA 460
TT L+GL+ + T + +GC + +C T A AD V+ +G +E+
Sbjct: 432 DPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQGVLNAVTGADVVVVCLGTGIELES 491
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E +DR D+LLPG+Q+ L+ + A+ A G PVIL++ + G +I++A + P + +I+ +P
Sbjct: 492 EGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQAIVECFFP 551
Query: 520 GQAGGAAIADVCFGR---YNPSGRLPMTW 545
QA G A+ + F NP GRLP TW
Sbjct: 552 AQATGVALR-MMFQNAPGANPGGRLPSTW 579
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 157/403 (38%), Positives = 211/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHP---RTIATPKHIAVHSGPEPGRHG 203
Query: 224 YDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK--------FK 251
+D+D + + T F A ++ +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A++ G V+E+
Sbjct: 264 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RDLGTAIERGDVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLK--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY 723
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 157/403 (38%), Positives = 211/403 (52%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------AT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHP---RTIATPKHIAVHSGPEPGRHG 203
Query: 224 YDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK--------FK 251
+D+D + + T F A ++ +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A ++ AG DLNCG + A++ G V+E+
Sbjct: 264 GFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RDLGTAIERGDVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLK--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+A + G NP GRLP+T+Y
Sbjct: 701 GTAMARMLAGDDNPGGRLPVTFY 723
>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
SP1PR4]
Length = 885
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 159/437 (36%), Positives = 225/437 (51%), Gaps = 81/437 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L R DLV R+TL+EK ++++A ++ RLG+P Y++WSE LHGV+ G
Sbjct: 30 YLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSEGLHGVARSG--- 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA+++ L IG VVSTEARA YN + GLT W
Sbjct: 87 -------YATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQHGVHSIYFGLTIW 139
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ T +V+G+Q G PN + A KH+ +
Sbjct: 140 SPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQ---GDDPNYFRTIATPKHFAVH 196
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ + R+ FN V ++ YL F+
Sbjct: 197 ---SGPESTRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNRIDGQPACASDLL 253
Query: 252 -------------YIVSDCDSVDVLYNS--QHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
++ SDC ++D Y H++K E+A+A + AG D CG ++LG
Sbjct: 254 LKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKEKEDASAAGVKAGTDTACGKTYLG- 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+AVK+GL+ E +D ++ F +RLG FD P++ PY +L +V + A++ LAL
Sbjct: 313 -LTSAVKSGLITEHEMDISLERLFEARIRLGLFD-DPARMPYARLTMAEVNSPAHRALAL 370
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AAR+ IVLLKN LPL +KN+AVIGPNA + GNY P+ G+AA
Sbjct: 371 RAARESIVLLKNANNLLPLH--GVKNIAVIGPNAASLDALEGNYNAIARDPAMPVDGIAA 428
Query: 416 V---VATIYQAGCSNVQ 429
+Y G V+
Sbjct: 429 AFPGAKVVYAQGAPYVE 445
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 15/111 (13%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D V +G +E E DR D+ LP Q QL + K P+I+++M+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELP-QTQLELLRAVKATGKPLIVVLMN 678
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G + A D + ++L YPG+AG AIA+ G+ NPSGRLP+T+Y
Sbjct: 679 GS----AIALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFY 725
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 160/420 (38%), Positives = 217/420 (51%), Gaps = 69/420 (16%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P +A + S + R LV +++ EK+ ++ A ++ RL IP YEWWSE LHG+
Sbjct: 8 PCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGI 67
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG-------- 157
+ G AT FPQ I AAS+N +L Q +G VVSTEARA +N
Sbjct: 68 ARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHK 117
Query: 158 -LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D P + A
Sbjct: 118 RYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHP---RTIA 174
Query: 217 CCKHYT----------AYDLD----NWKGT----------DRYHFNAMVIYTY------- 245
KH +D+D + + T D ++ M Y
Sbjct: 175 TPKHIAVHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPAC 234
Query: 246 ---YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+L+ FK ++VSDCD+VD + ++ ++A ++ AG DLNCG
Sbjct: 235 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 294
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ A++ G V+E+ +D ++ FA RLG + P K PY +LG KDV A++ L
Sbjct: 295 -RELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRAL 352
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AA + IVLLKNTA +LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 353 ALQAAAESIVLLKNTATTLPLK--AGTRLAVIGPNADALAALEANYQGTSATPITPLLGL 410
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 61/114 (53%), Gaps = 13/114 (11%)
Query: 443 ASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A +DA V +G +E E DR D+ LP QQ L+ E AK + P++++
Sbjct: 601 AQSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALL-ERAKASGKPLVVV 659
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+MSG +++AK + W YPGQ+GG AIA G NP GRLP+T+Y
Sbjct: 660 LMSGSAVALNWAKANADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFY 711
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 118/237 (49%), Positives = 155/237 (65%), Gaps = 14/237 (5%)
Query: 14 SVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTL 73
S + +FL L H ++P + C P + FCN SL I R L+ LTL
Sbjct: 7 STIIIFLFSLLLIHLPKFF-TTPDYPC----KPPHSHYPFCNISLPISTRTTSLISLLTL 61
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
+KI L ++A S+S LGIP Y+WWSEALHG++ GPG +F+ V AT+FPQVI++AA+
Sbjct: 62 SDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAA 121
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
FN SL+ IG V E RAM+NVG AGL+FW+PN+N+FRDPRWGRGQETPGEDP++ S Y
Sbjct: 122 FNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAY 181
Query: 194 ATGYVKGLQQTDG---------GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
A +V+G+Q DG + L V+ACCKH+TAYDL+ W RY+FNA+V
Sbjct: 182 AVEFVRGIQGVDGIKKVLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVV 238
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 126/271 (46%), Positives = 171/271 (63%), Gaps = 22/271 (8%)
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+D+NCG+F+ +HTE+A++ GLV E +D A+ N F+ MRLG F+G P K +G+LGP+D
Sbjct: 334 MDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQD 393
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
VCT ++ LAL+AARQGIVLLKN LPL +LA+IGP A T + G Y G PC
Sbjct: 394 VCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTSELGGGYSGIPC 452
Query: 405 KYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDD----AKKAAASADATVLVMGADQSIE 459
+ GL V TI Y GCS+V+C + DD A A AD V+V G D ++E
Sbjct: 453 SPRSLYDGLKEYVKTISYAFGCSDVKCDS---DDGFAVAIDIAKQADFVVIVAGLDTTLE 509
Query: 460 AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E DR+ LLLPG+Q L++ VA +K PVIL++ GG D+SFA+++ ITSILW+GYP
Sbjct: 510 TEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYP 569
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
++ +GRLPMTWYP+S+
Sbjct: 570 VD-------------FDAAGRLPMTWYPESF 587
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 156/403 (38%), Positives = 209/403 (51%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N SL Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHP---RTIATPKHIAVHSGPEPGRHG 203
Query: 224 YDLD--------------------NWKGTDRYHFNAM----VIYTYYLIK--------FK 251
+D+D G+ +NA+ V +L+ FK
Sbjct: 204 FDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFK 263
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD++D + ++ ++ ++ AG DLNCG + A+ G V+E+
Sbjct: 264 GFVVSDCDAIDDMTQFHYFRPDNAGSSVAALKAGHDLNCGHAY-RELGTAIARGEVDEAL 322
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG + P K PY +LG KDV A++ LAL AA + IVLLKN A
Sbjct: 323 LDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAHRALALQAAAESIVLLKNDAN 381
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT TPL GL
Sbjct: 382 TLPLR--AGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 422
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 644 DRNDIALPAPQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIMAAW--YPGQSG 700
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA + G NP GRLP+T+Y
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY 723
>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
tundricola MP5ACTX9]
Length = 892
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 157/421 (37%), Positives = 218/421 (51%), Gaps = 77/421 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + + +L RV DLV R+TL+EK++ ++SA ++SRL +P+Y++WSE LHG++ G
Sbjct: 32 LPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARSG- 90
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLT 162
AT FPQ I AA+++A L Q IG V+S EARA +N + GLT
Sbjct: 91 ---------YATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHSIYYGLT 141
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY- 221
WSPNINIFRDPRWGRGQET GEDP L + +VKG+Q G PN + A KH+
Sbjct: 142 IWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQ---GPDPNYFRAIATPKHFA 198
Query: 222 ----------------TAYDLDN-----WKGTDRYHFNAMVIYTYYLIK----------- 249
T +DL + ++ T ++ Y ++
Sbjct: 199 VHSGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPACASKLLL 258
Query: 250 ---------FK-YIVSDCDSVDVLYN----SQHYTKTPEEAAAKSILAGLDLNCGS---F 292
FK ++ SDC ++D Y S H + E AAA I AG D NCG
Sbjct: 259 QDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAGIKAGTDSNCGQTYLT 318
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
LG +AVK GLV E+ ID A+ + F +LG FD +K + + +V + A+Q
Sbjct: 319 LG----SAVKKGLVTEAEIDTALKHLFTARFQLGLFDP-AAKVAFNAIPFSEVNSPAHQA 373
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LAL AA + IVLLKN A +LP P+ ++ +AVIGP+A + GNY P PL G
Sbjct: 374 LALKAAEESIVLLKNDAHTLPFKPS-VRTIAVIGPSAATLNNLEGNYNAIPLHPVLPLDG 432
Query: 413 L 413
+
Sbjct: 433 I 433
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 54/83 (65%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ ++ VA K P+++++++G +++A + +IL YPGQAG
Sbjct: 652 DRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAVNWAND--HAAAILEAWYPGQAG 708
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA+ G+ NP+GRLP+T+Y
Sbjct: 709 GTAIAETLAGKNNPAGRLPVTFY 731
>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
Length = 864
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 153/432 (35%), Positives = 215/432 (49%), Gaps = 74/432 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV DLV LTL+EKI+ ++++A ++ RLGIP Y WW+E LHGV+
Sbjct: 26 FQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVA------ 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
+ P TSFPQ I AA+++ + + S E RA+Y+ GLT+W
Sbjct: 80 --RSPYP-VTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGLTYW 136
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L + +VKGLQ G P LK +AC KHY +
Sbjct: 137 SPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQ---GDDPVYLKSSACAKHYAVH 193
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
W +R+ +NA V ++ YL F+
Sbjct: 194 SGPEW---NRHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPCCGNDLL 250
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+ SDC +++ YN+ + + EA+A ++L G D CG+ +
Sbjct: 251 MMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNGAYRALA 310
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
A+ GL+ E +D ++ F RLG FD + PY + + A++ AL A
Sbjct: 311 DAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPD-DRVPYSDIPISVLECDAHKAHALKMA 369
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
RQ IVLLKN LPL IK +AV+GPNA+ ++ NY G P TT L+G+ V
Sbjct: 370 RQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVTTVLEGIKGKVG 429
Query: 419 ----TIYQAGCS 426
IY+ G +
Sbjct: 430 DQVEVIYEKGVN 441
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 68/129 (52%), Gaps = 16/129 (12%)
Query: 431 GTAQVDDAKKAAAS---ADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLL 477
GT + D ++ A+S AD V V G +E E DR + +P QQ L
Sbjct: 582 GTLRKADYRQTASSVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNL 641
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ E+ K PVI I+M+G + + + +IL Y GQAGG AIADV FG YNP
Sbjct: 642 LKELYATGK-PVIFILMTGSAVGLEWESE--HLPAILNAWYGGQAGGQAIADVLFGDYNP 698
Query: 538 SGRLPMTWY 546
SGRLP+T+Y
Sbjct: 699 SGRLPLTFY 707
>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
Length = 902
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 165/473 (34%), Positives = 226/473 (47%), Gaps = 94/473 (19%)
Query: 15 VLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQ 74
VL +F ++ ++ SA S PV+ + S R DLV +TL+
Sbjct: 24 VLPVFPSHAEGADAAPSAASEPVYR---------------DLSRSFHDRAADLVAHMTLE 68
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
EK + ++A ++ RLG+ Y+WW+E LHGV+ G AT FPQ I AA+F
Sbjct: 69 EKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ----------ATVFPQAIGLAATF 118
Query: 135 NASLFQAIGKVVSTEARAMYNV--------GLAGLTFWSPNINIFRDPRWGRGQETPGED 186
+ L + +S EARA YN GLT+WSPNINIFRDPRWGRGQET GED
Sbjct: 119 DVPLMHEVATAISDEARAKYNEFQRKGSHGRYEGLTYWSPNINIFRDPRWGRGQETYGED 178
Query: 187 PLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA----MVI 242
P L + +V GLQ G +P K+ A KH+ + + DR+HF+ +
Sbjct: 179 PYLTERMGVAFVTGLQ---GDNPTYRKLDATAKHFAVH---SGPEADRHHFDVHPSERDL 232
Query: 243 YTYYLIKFK------------------------------------------YIVSDCDSV 260
Y YL F+ Y+VSDC +V
Sbjct: 233 YETYLPAFQTLVQEADVDAVMSAYNRVNGEPATGSPRLLGQILRKDWGFKGYVVSDCGAV 292
Query: 261 DVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
+ +Y T E A+A ++ G+DL+CG+ + AV GL+ ES ID A++
Sbjct: 293 EDIYKHHKVVDTVEAASALAVKNGVDLDCGTEYAALVK-AVHDGLIKESEIDAALTRLMQ 351
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDAARQGIVLLKNTAGSLPLSPTAI 379
MRLG FD SK P+ + P V D LA AAR+ +VLLKN G LPLS I
Sbjct: 352 ARMRLGMFDP-ASKVPWSDV-PYSVNQSPQHDALARRAARESMVLLKND-GVLPLSKD-I 407
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL---AAVVATIYQAGCSNVQ 429
K++AVIGP A+ ++GNY GTP T L+G+ A +Y G V+
Sbjct: 408 KHIAVIGPTADDVMALVGNYHGTPADPVTILRGIREAAPQAKVVYARGVDLVE 460
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 53/86 (61%), Gaps = 3/86 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP Q+ L+ + K PV+L++ SG + +A + ++L YPGQ G
Sbjct: 667 DRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSALAVDWANQ--HLPAVLLAWYPGQRG 723
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G A+ADV FG+ +P+GRLP+T+Y S
Sbjct: 724 GNAVADVLFGKADPAGRLPVTFYKAS 749
>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 906
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 153/424 (36%), Positives = 218/424 (51%), Gaps = 76/424 (17%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
F + + RV LV +++L+EK++ +++++ ++ RL +P+Y WW+E LHGV+ G
Sbjct: 50 FSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVARAGY 109
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLT 162
AT FPQ I AASF+ +L + IG V+S EARA ++ + GL
Sbjct: 110 ----------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKRGIYTGLD 159
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSPNINIFRDPRWGRG ET GEDP L + A+ +++GLQ +DG LK A KH+
Sbjct: 160 FWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQDSDG---KYLKTIATSKHFA 216
Query: 223 AYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK------------------------- 251
+ G + R+ F+ V +Y YL F+
Sbjct: 217 VH-----SGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRGESCSG 271
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLG 294
Y+VSDC ++ ++ T EAAA + G DLNCG++
Sbjct: 272 HDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGVSGGCDLNCGNYYT 331
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
TE AV GL++E ID A+ F RLG FD + Y Q+ VC++A+ LA
Sbjct: 332 HLTE-AVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVS-YAQIPFGIVCSEAHNTLA 389
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
AA++ +VLLKN LPLS IK +AVIGPNA+ ++++GNY G P K T L G+
Sbjct: 390 RQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKKPVTFLDGIK 449
Query: 415 AVVA 418
V
Sbjct: 450 HKVG 453
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 77/125 (61%), Gaps = 13/125 (10%)
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEAESHD----------RLDLLLPGQQQLLITEV 481
+++D+A A SAD V+V+G Q +E ES D R + LP QQ+ L+ V
Sbjct: 614 VSKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAV 673
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
+ K PVIL++ +G I++AK + + +I+ GYPG+ GG A+ADV FG YNP+GRL
Sbjct: 674 KETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNPAGRL 730
Query: 542 PMTWY 546
P+T+Y
Sbjct: 731 PITYY 735
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 155/431 (35%), Positives = 213/431 (49%), Gaps = 81/431 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R DLV R+TL+EK+ + ++A ++ RLG+P Y+WW+E LHGV+ G GAT
Sbjct: 44 RADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG----------GAT 93
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDP 174
FPQ I AA+F+ L + +S EARA ++ L GLTFWSPNINIFRDP
Sbjct: 94 VFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFWSPNINIFRDP 153
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR------LKVAACCKHYTAYDLDN 228
RWGRGQET GEDP L ++ +V+G+Q +P K+ A KH+ + +
Sbjct: 154 RWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRKLDATAKHFAVH---S 210
Query: 229 WKGTDRYHFNAMV----IYTYYLIKFK--------------------------------- 251
++R+HF+A +Y YL F+
Sbjct: 211 GPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGESASASKFLLRDV 270
Query: 252 ---------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVK 302
Y+VSDC ++ ++ + T E+AAA ++ G L CG AAV+
Sbjct: 271 LRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVKNGTQLECGQEYAT-LPAAVQ 329
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDAARQG 361
GL+ E+ ID A+ MRLG FD P + + QL P V D LA AR+
Sbjct: 330 QGLIGETDIDAALRTLMTARMRLGMFDP-PGQLRWAQL-PISVNQSPEHDALARRTARES 387
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA--- 418
+VLLKN G LPLS K +AVIGP A+ T ++GNY GTP T LQG+ A
Sbjct: 388 LVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVTILQGIRAAAPDAD 446
Query: 419 TIYQAGCSNVQ 429
+Y G V+
Sbjct: 447 VLYARGADLVE 457
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 65/125 (52%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A AD V V G +E E DR DL LP Q+ L+ ++
Sbjct: 625 LQEALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSAT 684
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG+A+ADV FG NP GRLP+T
Sbjct: 685 GK-PVVAVLTTGSALAIDWAQE--HVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVT 741
Query: 545 WYPQS 549
+Y S
Sbjct: 742 FYKAS 746
>gi|255545662|ref|XP_002513891.1| hypothetical protein RCOM_1034290 [Ricinus communis]
gi|223546977|gb|EEF48474.1| hypothetical protein RCOM_1034290 [Ricinus communis]
Length = 359
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 142/363 (39%), Positives = 196/363 (53%), Gaps = 70/363 (19%)
Query: 36 PVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKY 95
P FACD SNPS FC T L RV DLV RLTL EKI+ LV++ ++ LGIP Y
Sbjct: 52 PPFACDS-SNPSTEEYLFCQTFLSTKERVKDLVSRLTLDEKISRLVNTEAAIPLLGIPAY 110
Query: 96 EWWSEALHGVSYV-----GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEA 150
+WWSEAL GV+++ G HF+ + ATSFPQVI TAASF+ L+ IGK
Sbjct: 111 QWWSEALQGVAFLPHVAKTQGIHFNGTITKATSFPQVIHTAASFDTRLWYRIGK------ 164
Query: 151 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTDGGS 208
YA +V+GLQ +GG
Sbjct: 165 ------------------------------------------YAVSFVRGLQGDSFEGGK 182
Query: 209 P-NRLKVAACCKH--YTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYN 265
+L+ +ACCKH + A+DLDNWK +R+ FNA V Y I+ ++ S ++ +
Sbjct: 183 LGEQLQASACCKHKHFIAHDLDNWKAVNRFIFNAKV----YNIRLRF-----KSSMMIKD 233
Query: 266 SQHYTKTPEEAAAKSI--LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
+Y K + K + G+D+ CG++L +TE AV + E+ ID A+SN F+ +M
Sbjct: 234 MPNYQKMRVQMCLKQSWQIVGMDVECGNYLKTYTELAVSKKKLQETEIDTALSNLFSVIM 293
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
RLG F+G+P+K PY ++ VC+Q +Q LAL+AAR G +LLKN+ LPL + I +LA
Sbjct: 294 RLGLFNGNPTKLPYSKISANQVCSQEHQALALEAARDGTILLKNSDKFLPLWKSKITSLA 353
Query: 384 VIG 386
VIG
Sbjct: 354 VIG 356
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 177/554 (31%), Positives = 257/554 (46%), Gaps = 94/554 (16%)
Query: 55 NTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF 114
N + I+ RV DL+ +TL+EKI++ + RLGIP +EW+ EALHG+
Sbjct: 27 NPNASIEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGI--------- 77
Query: 115 SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDP 174
+ T FPQ I A++N L + +S EARA+ N G + +SP +N+ RDP
Sbjct: 78 --ISWNCTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDP 135
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGR E EDP L S+ A YV+G+Q G P +K KHY A +++ +
Sbjct: 136 RWGRNGECYAEDPHLMSEMARMYVRGMQ---GNDPKYVKTVTTVKHYVANNVETKREWIH 192
Query: 235 YHFNAMVIYTYYLIKFK------------------------------------------Y 252
+ +Y YY +K Y
Sbjct: 193 SNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNGIPCSAHDWLVNGVLRNEWGFKGY 252
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA--------AVKAG 304
+++D +V L Y + +AAA +I AG+D C F K +A A++ G
Sbjct: 253 VIADWAAVQGLEKRMKYASSQAQAAAMAIKAGVDQEC--FRNKVRQAPMVQALPDALQQG 310
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGH---PSKQPYGQLGPKDVCTQANQDLALDAARQG 361
L+ E +D + L+RL F G PS PY + + A++ LAL AA Q
Sbjct: 311 LITEKELDVTVKR----LLRLRFMTGDFDDPSLNPYSAIPTSVLECDAHKQLALKAAEQS 366
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV--AT 419
IVLLKN A LPL +K++A+IGP A+ + +G Y G P +PL G+ A
Sbjct: 367 IVLLKNDA-VLPLKKD-LKSIAMIGPFAD--RCWMGIYSGHPKSKVSPLDGIKAYTNAKV 422
Query: 420 IYQAGCSNVQCGTAQVDDAKKAAAS------ADATVLVMGADQSIEAESHDRLDLLLPGQ 473
+ GC TA+ DD +K A + ++ +LV+G D++ E+ DR + LPG
Sbjct: 423 SFAQGCE----VTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTSTENTDRKSIKLPGN 478
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP-GQAGGAAIADVCF 532
Q LI V V K +++++ SG KN P I +P GQ G A+A V F
Sbjct: 479 QHQLIKAVQAVNKNVILVLVPSGPTAVTWEQKNIPGIV----CAWPNGQEQGTALAKVLF 534
Query: 533 GRYNPSGRLPMTWY 546
G NP G+L TWY
Sbjct: 535 GDVNPGGKLNATWY 548
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 157/439 (35%), Positives = 219/439 (49%), Gaps = 84/439 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ID R+ DL+KRLT +EK+ ++++ ++ RLGIP+Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ GLTFW
Sbjct: 86 --------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L + VKGLQ G P K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQ---GDDPKYFKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSD------------------CDS--- 259
W +R+ F+ V ++ YL F+ +V + C S
Sbjct: 195 SGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y +TP E A+A ++L G DL CG+
Sbjct: 252 LIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQ 351
+ A+K G ++E+ +D ++ LG FD P +Q PY Q+ V + +
Sbjct: 312 Y-RALVKALKDGKISENDLDVSLRRLLKGRFELGMFD--PDEQVPYAQIPYNVVESPEHV 368
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
AL+ A + +VLLKN +LPLS T I+ +AV+GPNA + + NY G P T L+
Sbjct: 369 AQALEMAHKSMVLLKNKNNTLPLSKT-IRKIAVVGPNAADSTMLWANYNGFPTHTVTILE 427
Query: 412 GLAAVV---ATIYQAGCSN 427
G+ V IY+ GC++
Sbjct: 428 GIRNKVPDTEVIYELGCNH 446
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 69/133 (51%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQ-VDDAKKAAASADATVLVM--GADQSIEAESH----------DRLDLLLPGQ 473
N Q GT + VD A AA DA V+V G +E E DR ++ LP
Sbjct: 584 NFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKV 643
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ ++ + K PV+ ++ +G +++ + I +IL Y GQ G A+AD+ FG
Sbjct: 644 QQEMVKALKATGK-PVVYVLCTGSALALNW--EEANIDAILNAWYGGQEAGTAVADILFG 700
Query: 534 RYNPSGRLPMTWY 546
YNPSGRLP+T+Y
Sbjct: 701 DYNPSGRLPVTFY 713
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 156/409 (38%), Positives = 209/409 (51%), Gaps = 69/409 (16%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV ++T EKI ++ A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 55 RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------AT 104
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N L A+G V STEARA +N+ GLT WSPNINIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G++ GLQ G P + A KH
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQ---GDDPTHPRTIATPKHLAVHSGPESGRHG 221
Query: 224 YDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK--------FK 251
+D+D +++ T F A ++ + +LI FK
Sbjct: 222 FDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAADWLIDGRVRGNWGFK 281
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD++D + +Y +AA ++ AG DLNCG + + A+ G E+
Sbjct: 282 GFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRDLGTALDRGEAEEAM 340
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG SK PY +LG KD+ + ++ LAL AA+Q +VLL+N
Sbjct: 341 LDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQAAQQSLVLLQNRND 399
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
+LPL P LAVIGPNA+ + NY+GT TPLQGL A T
Sbjct: 400 TLPLRPGL--RLAVIGPNADALAALEANYQGTSVAPVTPLQGLRARFGT 446
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 52/83 (62%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP QQ L+ E AK + P+I+++MSG +++AK +IL YPGQ+G
Sbjct: 662 DRNDLSLPAAQQALL-ERAKASGKPLIVVLMSGSAVALNWAKQ--HADAILAAWYPGQSG 718
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP GRLP+T+Y
Sbjct: 719 GTAIAQALAGDINPGGRLPVTFY 741
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 154/405 (38%), Positives = 210/405 (51%), Gaps = 69/405 (17%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
+ R LV +++ +EK+ +++A ++ RLGIP YEWWSE LHG++ G
Sbjct: 37 EQRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------Y 86
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIF 171
AT FPQ I AAS+N L Q +G VVSTEARA +N AGLT WSPNINIF
Sbjct: 87 ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT--------- 222
RDPRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHP---RTIATPKHIAVHSGPEPGR 203
Query: 223 -AYDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK-------- 249
+D+D + + T F A ++ +L+
Sbjct: 204 HGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWG 263
Query: 250 FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
FK ++VSDCD+VD + ++ ++A S+ AG DLNCG+ + A++ G V+E
Sbjct: 264 FKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIERGEVDE 322
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ +D ++ FA RLG P K Y +LG KD+ N+ LAL AA + IVLLKN
Sbjct: 323 ALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLKNA 381
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT + TPL GL
Sbjct: 382 NATLPLK--AGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 646 DRNDIALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP GRLP+T+Y
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFY 725
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 154/403 (38%), Positives = 209/403 (51%), Gaps = 69/403 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R LV +++ +EK+ +++A ++ RLGIP YEWWSE LHG++ G AT
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRD 173
FPQ I AAS+N L Q +G VVSTEARA +N AGLT WSPNINIFRD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------A 223
PRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHP---RTIATPKHIAVHSGPEPGRHG 205
Query: 224 YDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK--------FK 251
+D+D + + T F A ++ +L+ FK
Sbjct: 206 FDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFK 265
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
++VSDCD+VD + ++ ++A S+ AG DLNCG+ + A++ G V+E+
Sbjct: 266 GFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIERGEVDEAL 324
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
+D ++ FA RLG P K Y +LG KD+ N+ LAL AA + IVLLKN
Sbjct: 325 LDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLKNANA 383
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT + TPL GL
Sbjct: 384 TLPLK--AGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 646 DRNDIALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP GRLP+T+Y
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFY 725
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 216/438 (49%), Gaps = 82/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ID R+ DL+KRLT +EKI ++++ ++ RLGIP+Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ GLTFW
Sbjct: 86 --------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L + VKGLQ G P K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQ---GDDPKYFKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIV----------------------SDCD 258
W +R+ F+ V ++ YL F+ +V SD
Sbjct: 195 SGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKL 251
Query: 259 SVDVLYNS------------------QHYTKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Q +TP E A+A ++L G DL CG+
Sbjct: 252 LIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDAESASADAVLNGTDLECGNS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
K A+K G ++E+ +D ++ LG FD + PY Q+ V + +
Sbjct: 312 Y-KALIKALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
AL+ A + +VLLKN +LPLS T I+ +AV+GPNA + + NY G P T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKT-IRKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428
Query: 413 LAAVV---ATIYQAGCSN 427
+ V IY+ GC++
Sbjct: 429 IRNKVPDTEVIYELGCNH 446
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 69/133 (51%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQ-VDDAKKAAASADATVLVM--GADQSIEAESH----------DRLDLLLPGQ 473
N Q GT + VD A AA DA V+V G +E E DR ++ +P
Sbjct: 584 NFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKV 643
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ ++ + K PV+ ++ +G +++ D I +IL Y GQ G A+AD+ FG
Sbjct: 644 QQEMVKALKATGK-PVVYVLCTGSALALNW--EDANIDAILNAWYGGQEAGTAVADILFG 700
Query: 534 RYNPSGRLPMTWY 546
YNPSGRLP+T+Y
Sbjct: 701 DYNPSGRLPVTFY 713
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 163/461 (35%), Positives = 226/461 (49%), Gaps = 90/461 (19%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T EKI +++A ++ RLG+P YEWWSE LHG++ G AT FPQ
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWG 177
I AA++N L +G V STEARA +N+ AGLT WSPNINIFRDPRWG
Sbjct: 90 AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD 227
RG ET GEDP L + A G++ GLQ D P + A KH +D+D
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDDPAHP---RTIATPKHLAVHSGPEPGRHGFDVD 206
Query: 228 ----NWKGTDRYHFNAMVI--------------------YTYYLIK--------FK-YIV 254
+++ T F A ++ +LI FK ++V
Sbjct: 207 VSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRGDWGFKGFVV 266
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD++D + +Y ++A ++ AG DLNCG+ + A G +E+ +D +
Sbjct: 267 SDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCGTAY-RELGIAFDRGEADEALLDRS 325
Query: 315 ISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
+ FA RLG P + PY +LG +D+ + A++ LAL AA+Q +VLLKN +LP
Sbjct: 326 LVRLFAARYRLGEL--QPRRNDPYARLGARDIDSAAHRALALQAAQQSLVLLKNANATLP 383
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
L P LAV+GPNA+ + NY+GT + TPLQGL + G A
Sbjct: 384 LRPGL--RLAVLGPNADALAALEANYQGTSVQPVTPLQGL-------------RTRFGAA 428
Query: 434 QVDDAKKAAASADA------TVLVMGADQSIEAESHDRLDL 468
QV A+ A +A T L + E D LDL
Sbjct: 429 QVAYAQGAPLAAGVPGMIPETALRSDGKPGLRGEYFDNLDL 469
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP QQ L+ E AK + P+++++MSG +++A+ W YPGQ+G
Sbjct: 643 DRNDLALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAEQHADAIIAAW--YPGQSG 699
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP GRLP+T+Y
Sbjct: 700 GTAIAQALAGDINPGGRLPVTFY 722
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 196/609 (32%), Positives = 287/609 (47%), Gaps = 119/609 (19%)
Query: 16 LFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQE 75
LFL L+Y S +++ P F N SL I+ R+ DLV RLT++E
Sbjct: 6 LFLLLSYFHFASSKVTSEEFP----------------FRNFSLPIEERLEDLVNRLTIEE 49
Query: 76 KITFLVDS-------AGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVI 128
I L A +++RLGI Y+W +E L G + G AT FPQ I
Sbjct: 50 VILQLSRGGVRDNGPAPAITRLGIGPYQWNTECLRGYAMNG----------DATCFPQPI 99
Query: 129 LTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFWSPNINIFRDPRWGRGQ 180
AA+F+ L + K V+ EARA +N GL+ +SP INI R P WGR Q
Sbjct: 100 GLAATFDQGLIYKMAKTVALEARAKHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQ 159
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
ET GEDP+L S A YV GLQ G L A CKH+ AY T R+ F+A
Sbjct: 160 ETYGEDPVLTSLMARAYVTGLQ----GDEIYLPATAVCKHFVAYGGPENIPTTRFSFSAN 215
Query: 241 V----IYTYYLIKFK----------------------------------------YIVSD 256
V I T + F+ Y+VSD
Sbjct: 216 VSDHDIGTTFYPAFRECVHAGAQGVMCSYNAINGVPSCANPMLETTLRKKFHFDGYVVSD 275
Query: 257 ---CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE-----AAVKAGLVNE 308
+++D+ +N +TK+ E AA ++ AG+DL F GK AV+ GLV E
Sbjct: 276 ENALENIDLYFN---FTKSKLETAAVALNAGVDLELTGF-GKTNRYSLLNQAVEQGLVTE 331
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC-TQANQDLALDAARQGIVLLKN 367
+A+ + F T M LG FD P + + P DV + A++ A++ A + VLLKN
Sbjct: 332 AALRRSAKRLFRTRMALGEFD--PPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKN 389
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPL---QGLAAVVATIYQ 422
G LPL K ++++GP N ++ + G+Y E +++PL L++ +
Sbjct: 390 D-GILPLKQLYDK-VSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFT 447
Query: 423 AGC--SNVQ----CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
GC +N Q C T + K+ +D ++ +G + +EAES+DR D+ LPG+Q
Sbjct: 448 TGCVGTNNQNLPICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQ 507
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI +V K A GPVI+++ + G D+S+ + +++ + Q G A+ +V G N
Sbjct: 508 LIQDVVKYANGPVIVVLFNAGPLDVSWVMGN--TAAVIACHFSAQMTGEAMLEVLTGVVN 565
Query: 537 PSGRLPMTW 545
P+GRLP TW
Sbjct: 566 PAGRLPNTW 574
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 151/420 (35%), Positives = 213/420 (50%), Gaps = 68/420 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L + RV DLV RLTL+EK++ +++S+ ++ RL IP Y+WW+E LHGV+ T
Sbjct: 31 FQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR----T 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
F T +PQ I AA+F+ + + + E RA+YN + GLT+
Sbjct: 87 PFK-----VTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNERYLGLTY 141
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L +VKGLQ G P LK AAC KHY
Sbjct: 142 WTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQ---GDDPKYLKAAACAKHYAV 198
Query: 224 Y------------DLDNWKGTDRY------------------HFNAM---------VIYT 244
+ D+ ++ D Y +NA ++ T
Sbjct: 199 HSGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCAYNAFRTQPCCASDILMT 258
Query: 245 YYL---IKFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAA 300
L KF+ Y+ SDC ++D + + E A+A ++ G D++CG+ K A
Sbjct: 259 DILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDAESASADAVFHGTDIDCGTDAYKALVQA 318
Query: 301 VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAAR 359
VK G ++E ID ++ F RLG FD P + Y Q + ++ AL AR
Sbjct: 319 VKDGKISEKQIDISVKRLFMIRFRLGMFD--PVEMVKYAQTPTSVLENDEHKAHALKMAR 376
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
Q IVLL+N +LPLS +K + V+GPN + ++GNY GTP K TT L+G+ V +
Sbjct: 377 QSIVLLRNENKTLPLS-KKLKKIVVLGPNVDNAIAILGNYNGTPSKLTTVLEGIKEKVGS 435
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 61/113 (53%), Gaps = 15/113 (13%)
Query: 445 ADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
ADA V V G +E E DR +LLP Q L+ + K P++ ++M
Sbjct: 608 ADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTTGK-PIVFVMM 666
Query: 495 SGGGFDISF-AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+G I + A+N P I + W G GQA G A+ADV FG YNP+GRLP+T+Y
Sbjct: 667 TGSAIAIPWEAENIPAIANA-WYG--GQAAGTAVADVLFGNYNPAGRLPVTFY 716
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 154/405 (38%), Positives = 210/405 (51%), Gaps = 69/405 (17%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
+ R LV +++ +EK+ +++A ++ RLGIP YEWWSE LHG++ G
Sbjct: 37 EQRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------Y 86
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIF 171
AT FPQ I AAS+N L Q +G VVSTEARA +N AGLT WSPNINIF
Sbjct: 87 ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT--------- 222
RDPRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHP---RTIATPKHIAVHSGPEPGR 203
Query: 223 -AYDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK-------- 249
+D+D + + T F A ++ +L+
Sbjct: 204 HGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWG 263
Query: 250 FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
FK ++VSDCD+VD + ++ ++A S+ AG DLNCG+ + A++ G V+E
Sbjct: 264 FKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIERGEVDE 322
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ +D ++ FA RLG P K Y +LG KD+ N+ LAL AA + IVLLKN
Sbjct: 323 ALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLKNA 381
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT + TPL GL
Sbjct: 382 NATLPLK--AGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 646 DRNDIALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP GRLP+T+Y
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFY 725
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 216/438 (49%), Gaps = 82/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ID R+ DL+KRLT +EKI ++++ ++ RLGIP+Y+WW+EALHGV+ G
Sbjct: 26 FRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARAGK-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ GLTFW
Sbjct: 84 --------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKGLTFW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L + VKGLQ G P K AC KHY +
Sbjct: 136 TPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQ---GDDPKYFKTHACAKHYAVH 192
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIV----------------------SDCD 258
W +R+ F+ V ++ YL F+ +V SD
Sbjct: 193 SGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSDKL 249
Query: 259 SVDVLYNS------------------QHYTKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Q +TP E A+A ++L G DL CG+
Sbjct: 250 LIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDAESASADAVLNGTDLECGNS 309
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
K A+K G ++E+ +D ++ LG FD + PY Q+ V + +
Sbjct: 310 Y-KALIKALKEGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 367
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
AL+ A + +VLLKN +LPLS T I+ +AV+GPNA + + NY G P T L+G
Sbjct: 368 QALEMAHKSMVLLKNKNNTLPLSKT-IRKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 426
Query: 413 LAAVV---ATIYQAGCSN 427
+ V IY+ GC++
Sbjct: 427 IRNKVPDTEVIYELGCNH 444
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 69/133 (51%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQ-VDDAKKAAASADATVLVM--GADQSIEAESH----------DRLDLLLPGQ 473
N Q GT + VD A AA DA V+V G +E E DR ++ +P
Sbjct: 582 NFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKV 641
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ ++ + K PV+ ++ +G +++ D I +IL Y GQ G A+AD+ FG
Sbjct: 642 QQEMVKALKATGK-PVVYVLCTGSALALNW--EDANIDAILNAWYGGQEAGTAVADILFG 698
Query: 534 RYNPSGRLPMTWY 546
YNPSGRLP+T+Y
Sbjct: 699 DYNPSGRLPVTFY 711
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 156/428 (36%), Positives = 212/428 (49%), Gaps = 91/428 (21%)
Query: 71 LTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILT 130
+TL+EK + ++A ++ RLG+P Y+WW+EALHGV+ G GAT FPQ I
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50
Query: 131 AASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWGRGQET 182
AA+F+ L + +S EARA ++ L GLTFWSPNINIFRDPRWGRGQET
Sbjct: 51 AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110
Query: 183 PGEDPLLASKYATGYVKGLQ-------QTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY 235
GEDP L ++ +V+GLQ + G P R K+ A KH+ + + DR+
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYR-KLDATAKHFAVH---SGPEADRH 166
Query: 236 HFNAMV----IYTYYLIKFK---------------------------------------- 251
HF+A +Y YL F+
Sbjct: 167 HFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGF 226
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNES 309
Y+VSDC ++ ++ T E+AAA ++ G +L CG AAV+ GL++E+
Sbjct: 227 KGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAVRQGLIDEA 285
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA-----ARQGIVL 364
ID A++ MRLG FD P GQL + NQ A DA AR+ +VL
Sbjct: 286 QIDTALTTLMTARMRLGMFD------PPGQLPWSTIPASVNQSPAHDALARRTARESLVL 339
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV---VATIY 421
LKN G LPLS +K +AVIGP A+ T ++GNY GTP T LQG+ A +Y
Sbjct: 340 LKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLY 398
Query: 422 QAGCSNVQ 429
G V+
Sbjct: 399 ARGADLVE 406
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A A SAD V V G +E E DR DL LP Q+ L+ +
Sbjct: 574 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 633
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV+ ++ +G I +A+ + +IL YPGQ GG A+AD FG NP GRLP+T
Sbjct: 634 GK-PVVAVLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 690
Query: 545 WYPQS 549
+Y +S
Sbjct: 691 FYKES 695
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 157/433 (36%), Positives = 209/433 (48%), Gaps = 76/433 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N S ID RV DLV RLTL+EK+ L D + ++ RL IP+Y WW+EALHGV+ G
Sbjct: 34 YLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAGK-- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA+F+ L +G +S E RA ++ LA GLT+W
Sbjct: 92 --------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMYTGLTYW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ A ++ GLQ G + LK A KHY +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQ---GDNTEYLKSVATLKHYAVH 200
Query: 225 D-LDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------------- 251
+ + +D Y + + YL FK
Sbjct: 201 SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTPACGNDELIQN 260
Query: 252 ----------YIVSDCDSVDVLYN--SQHYTKTPEEAAAKSILAGLDLNCGSFLGK---H 296
YIVSDC ++ Y+ S + T +AAA ++ G DLNCG G +
Sbjct: 261 KLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMALKTGTDLNCGDHHGNTYSY 320
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
AVK GLV E +D A+ +LG FD +P PY V + + L +
Sbjct: 321 LSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDIVGSNKHLALTQE 379
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AA++ +VLLKN LPL + +A+IGPNA+ ++GNY G P TP L
Sbjct: 380 AAKKSLVLLKNEQ-VLPLKGN--EKVALIGPNADNEAILLGNYNGMPIVPITPKLALEQR 436
Query: 417 VAT---IYQAGCS 426
+ Y AG S
Sbjct: 437 LGKNNLTYTAGSS 449
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 65/115 (56%), Gaps = 13/115 (11%)
Query: 442 AASADATVLVMGADQSIEAE---------SH-DRLDLLLPGQQQLLITEVAKVAKGPVIL 491
A AD V V G ++E E SH DR ++ LP Q L+ ++ + K P++L
Sbjct: 603 ANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTGK-PIVL 661
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ MSG +++ + I +I+ YPG+A G+A+ + +G Y+PSG+LP+T+Y
Sbjct: 662 VNMSGSAMALNWENEN--IDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITFY 714
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 155/438 (35%), Positives = 217/438 (49%), Gaps = 82/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ID R+ DL+KRLT +EK+ ++++ ++ RLGIP+Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARAGK-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ GLTFW
Sbjct: 86 --------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L + VKGLQ G P K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQ---GDDPKYFKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSD------------------CDS--- 259
W +R+ F+ V ++ YL F+ +V + C S
Sbjct: 195 SGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y +TP E A+A ++L G DL CG+
Sbjct: 252 LIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDAESASADAVLNGTDLECGNS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+K G ++E+ +D ++ LG FD + PY Q+ V + +
Sbjct: 312 Y-RALVKALKDGKISENDLDVSLRRLLKGRFELGMFDPD-ERVPYAQIPYNVVESPEHVA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
AL+ A + +VLLKN +LPLS T I+ +AV+GPNA + + NY G P T L+G
Sbjct: 370 QALEMAHKSMVLLKNKNNTLPLSKT-IRKIAVVGPNAADSTMLWANYNGFPTHTVTILEG 428
Query: 413 LAAVV---ATIYQAGCSN 427
+ V IY+ GC++
Sbjct: 429 IRNKVPDTEVIYELGCNH 446
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 69/133 (51%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQ-VDDAKKAAASADATVLVM--GADQSIEAESH----------DRLDLLLPGQ 473
N Q GT + VD A AA DA V+V G +E E DR ++ LP
Sbjct: 584 NFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKV 643
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ ++ + K PV+ ++ +G +++ + I +IL Y GQ G A+AD+ FG
Sbjct: 644 QQEMVKALKATGK-PVVYVLCTGSALALNW--EEANIDAILNAWYGGQEAGTAVADILFG 700
Query: 534 RYNPSGRLPMTWY 546
YNPSGRLP+T+Y
Sbjct: 701 DYNPSGRLPVTFY 713
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 160/434 (36%), Positives = 222/434 (51%), Gaps = 78/434 (17%)
Query: 37 VFACDVVSNPSLA--SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
VF + ++ + A + F N +L + RV DLV R+++ EKI+ L+DS+ ++ RLG+P+
Sbjct: 8 VFMLNFLALNTFAQENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPE 67
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY 154
Y WW+E+LHGV+ G AT FPQ I A+S++ L + V+S EARA +
Sbjct: 68 YNWWNESLHGVARAG----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKH 117
Query: 155 NVGL--------AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
+ L GLTFWSPN+NIFRDPRWGRG ET GEDP L + YV GLQ G
Sbjct: 118 HEYLRRGQHGMYQGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQ---G 174
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA----MVIYTYYLIKFK----------- 251
+ LKV A KHY + + R+ FNA + +Y YL F+
Sbjct: 175 TNEKYLKVIATAKHYAVH---SGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSV 231
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
YIVSDC +V ++ T A+A ++
Sbjct: 232 MGAYNRFRGESCSASPFLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALAL 291
Query: 282 LAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQL 340
GLDL CGS K + A+ L++E+ ID A+ F +LG FD P + Y Q+
Sbjct: 292 KDGLDLECGSSF-KSLKEAIDRKLISEADIDIAVKRLFTARFKLGMFD--PEEIVSYAQI 348
Query: 341 GPKDVCTQANQD-LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
P V + D LA A+++ IVLLKN +LPLS IK +AVIGPNAN +++ GNY
Sbjct: 349 -PYSVNNNSAHDWLARVASQKSIVLLKNQNNTLPLS-RDIKTVAVIGPNANDVQSLWGNY 406
Query: 400 EGTPCKYTTPLQGL 413
G P T L+G+
Sbjct: 407 SGVPSNPITVLKGI 420
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 74/122 (60%), Gaps = 13/122 (10%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
+ +A + A ADA VLV+G ++ +E E DR L LP Q+ L+ +
Sbjct: 589 LQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEELMKAMTAT 648
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PVIL++++G I++A ND + +IL GYPGQ GG AIADV FG YNP+GRLP+T
Sbjct: 649 GK-PVILVLINGSALSINWA-ND-HVPAILTAGYPGQQGGNAIADVLFGDYNPAGRLPVT 705
Query: 545 WY 546
+Y
Sbjct: 706 YY 707
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 152/405 (37%), Positives = 210/405 (51%), Gaps = 69/405 (17%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
+ R LV +++ +EK+ +++A ++ RLGIP YEWW+E LHG++ G
Sbjct: 37 EQRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------Y 86
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIF 171
AT FPQ I AAS+N L Q +G VVSTEARA +N AGLT WSPNINIF
Sbjct: 87 ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT--------- 222
RDPRWGRG ET GEDP L + A G+++GLQ D P + A KH
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHP---RTIATPKHIAVHSGPEPGR 203
Query: 223 -AYDLD----NWKGTDRYHFNAMVIYTY--------------------YLIK-------- 249
+D+D + + T F A ++ +L+
Sbjct: 204 HGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWG 263
Query: 250 FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
FK ++VSDCD+VD + ++ ++A ++ AG DLNCG+ + A++ G V+E
Sbjct: 264 FKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAY-RALGTAIERGEVDE 322
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ +D ++ FA RLG P K Y +LG KD+ N+ LAL AA + IVLLKN
Sbjct: 323 ALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESIVLLKNA 381
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+LPL A LAVIGPNA+ + NY+GT + TPL GL
Sbjct: 382 NATLPLK--ASTRLAVIGPNADALAALEANYQGTSSQPVTPLLGL 424
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)
Query: 443 ASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A +DA V +G +E E DR D+ LP QQ L+ E AK + P++++
Sbjct: 615 AQSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALL-ERAKASGKPLVVV 673
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+MSG +++AK W YPGQ+GG AIA G NP GRLP+T+Y
Sbjct: 674 LMSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFY 725
>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
Length = 882
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 150/399 (37%), Positives = 211/399 (52%), Gaps = 69/399 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++TLQEKIT +++A ++ RLGIP Y+WWSE LHG++ G AT FPQ
Sbjct: 37 LVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWG 177
I AAS+N L Q +G V STEARA +N+ AGLT WSPNINIFRDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY---------------- 221
RG ET GEDP L + A +++GLQ G P+ + A KH+
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQ---GNIPDHPRTIATPKHFAVHSGPEPGRHSFDVD 203
Query: 222 -TAYDL-------------DNWKGTDRYHFNAM----VIYTYYLIKFK---------YIV 254
+AYDL D G+ +NA+ + +L+ + ++V
Sbjct: 204 VSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVV 263
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD++D + + + A+A ++ +G DLNCG+ + A+ G ++E+ +D A
Sbjct: 264 SDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGNTY-RDLNQAIARGDIDEALLDQA 322
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ FA RLG PY +G K + T A++ LAL AA Q +VLLKN+ +LPL
Sbjct: 323 LIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRALALQAAVQSLVLLKNSGNTLPL 381
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+P LAV+GP+A+ + NY+GT TPL GL
Sbjct: 382 TPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 67/123 (54%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
Q+ +A++A A ADA V +G +E E DR + LP Q+ L+ V
Sbjct: 600 QLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKT 659
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P+I+++MSG +++A++ +IL YPGQ+GG AIA G NP GRLP+
Sbjct: 660 TGK-PLIVVLMSGSAVALNWAQH--HANAILAAWYPGQSGGTAIAQALAGDVNPGGRLPV 716
Query: 544 TWY 546
T+Y
Sbjct: 717 TFY 719
>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
Length = 897
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 156/435 (35%), Positives = 217/435 (49%), Gaps = 79/435 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + S + R LV ++TL EK + ++A ++ RLG+P Y+WW+E LHGV+ G
Sbjct: 38 WLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 96 --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTFW 147
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRGQET GEDP L ++ +V+GLQ G P K+ A KH +
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ---GDDPVYRKLDATAKHLAVH 204
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+A +Y YL F+
Sbjct: 205 ---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFL 261
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+VSDC ++ ++ H T E AAA ++ G +L CG
Sbjct: 262 LRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECGQEYAT-LP 320
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-LALDA 357
+AV+ GL++E+ ID A++ F MRLG FD P + + ++ P V + D LAL A
Sbjct: 321 SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARI-PASVNQAPSHDALALKA 378
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL---A 414
A+ +VLLKN G LPLS IK +AV+GP A+ T ++GNY GTP T LQG+ A
Sbjct: 379 AQASLVLLKND-GILPLS-RDIKRIAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAA 436
Query: 415 AVVATIYQAGCSNVQ 429
V Y G V+
Sbjct: 437 KGVEVRYARGVDLVE 451
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 64/127 (50%), Gaps = 13/127 (10%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVA 482
A D+A AA AD V V G +E E DR DL LP Q+ L+ +
Sbjct: 617 APFDEALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALH 676
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV++++ G + +A++ + +IL YPGQ GG A+ FG NP+GRLP
Sbjct: 677 ATGK-PVVMVLTGGSAIAVDWAQS--HLPAILMSWYPGQRGGTAVGQALFGDVNPAGRLP 733
Query: 543 MTWYPQS 549
+T+Y S
Sbjct: 734 VTFYKAS 740
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 221/458 (48%), Gaps = 91/458 (19%)
Query: 37 VFACDVVSNPSLASLG---FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+F C +S+ S+ + G F N L ++ RV DLV+RLTL EK+ + D + SV RLGI
Sbjct: 5 LFFCLFLSSLSVGAQGILPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIK 64
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
+Y WW+EALHGV G AT FPQ I AA+F+ + + + VS EARA
Sbjct: 65 QYNWWNEALHGVGRAGL----------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAK 114
Query: 154 YNVG--------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD 205
Y+ GLTFW+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ
Sbjct: 115 YHHSENKDGSERYRGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQ--- 171
Query: 206 GGSPNRL-KVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------- 251
G S ++ K+ AC KHY + W +R+ F+ I + YL FK
Sbjct: 172 GPSESKYDKLHACAKHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGV 228
Query: 252 ----------------------------------YIVSDCDSVDVLYNSQHYT--KTPEE 275
+VSDC ++ Y H+ T E
Sbjct: 229 KEVMCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKES 288
Query: 276 AAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP--- 332
A A ++ AG DL+CG + E AV+ G++ E ID ++S LG D
Sbjct: 289 AVAAAVKAGTDLDCGVDY-QSLEKAVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVS 347
Query: 333 -SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
S PY V ++ ++ AL+ AR+ + LLKN G+LPLS K + VIGPNAN
Sbjct: 348 WSDIPYTV-----VDSEKHRAKALEVARKSMTLLKNKNGTLPLSKHCGK-IVVIGPNAND 401
Query: 392 TKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAGCS 426
+ M GNY G P T L+G+ + IY GC
Sbjct: 402 SIMMWGNYNGFPSHTVTILEGITHKLDAGQVIYDKGCE 439
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 65/114 (57%), Gaps = 13/114 (11%)
Query: 443 ASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
+ A+A V V G +E E DR + LP Q+ L+ E+ K K P+ILI
Sbjct: 598 SDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILI 656
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ SG +S A+ D +I+ YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 657 LCSGSAIGLS-AEVDLA-DAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFY 708
>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
Length = 869
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 155/399 (38%), Positives = 208/399 (52%), Gaps = 71/399 (17%)
Query: 68 VKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQV 127
+ R+T+++K + + A + G+ YEWW+E LHGV+ G AT FPQ
Sbjct: 40 IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGRG 179
I AA++N +L + +G VVSTEARA +N GLT WSPNINIFRDPRWGRG
Sbjct: 90 IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY-----------------T 222
QET GEDP L S+ A G+V GLQ G P KV A KH +
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQ---GPDPQHPKVVASVKHLAVHSGPEAGRHGFAASVS 206
Query: 223 AYDLD-NWKGTDRYH------------FNA------------MVIYTYYLIKFK-YIVSD 256
YDL+ + RY +NA + Y FK Y+V+D
Sbjct: 207 PYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAWGFKGYVVTD 266
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
CD++ + Y E++A+S+ AG+DLNCG+ E AV+ GL+ ES +D +++
Sbjct: 267 CDAIYDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAALPE-AVQKGLIPESLMDQSLN 325
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
RLG DG PS P+ ++ P+ + T Q LAL AA Q +VLLKN G LPL P
Sbjct: 326 RLLDVRKRLG-IDGAPS--PWARISPEAINTPQAQGLALQAAEQSLVLLKNN-GVLPLKP 381
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ +AVIGPNA+ +T+ GNY G + TPL GL A
Sbjct: 382 G--QTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRA 418
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 66/123 (53%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
Q+D+A KAA +ADA V +G IE E DR DL LP Q+ L+ V
Sbjct: 589 QIDEAVKAAQAADAIVAFVGLSPDIEGEELQILVPGFDRGDRTDLGLPRTQEDLLKAVKA 648
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P++++++SG +++A W YPG+AGG AIA G NPSGRLP+
Sbjct: 649 TGK-PLVVVLLSGSAVALNWADAHADAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPV 705
Query: 544 TWY 546
T+Y
Sbjct: 706 TFY 708
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 191/590 (32%), Positives = 273/590 (46%), Gaps = 98/590 (16%)
Query: 37 VFACDV-VSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS-------VS 88
+FAC + +S+ A F NTSL I+ RV D+V LTL+E + + + +
Sbjct: 9 LFACCLAISDHVEALFPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIP 68
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG-ATSFPQVILTAASFNASLFQAIGKVVS 147
RL I Y+W +E L G NV G ATSFP I AASFN L + + +
Sbjct: 69 RLHINPYQWGTECLSG-----------NVSAGDATSFPMPIGMAASFNYDLLKRVTNATA 117
Query: 148 TEARAMYNVGL--------AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVK 199
E RA + + GL+ WSP +NI RDPRWGR QET GEDP L+ YV
Sbjct: 118 YEVRAKHAAAVKDGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVN 177
Query: 200 GLQQTDGGSPNRLKVA-ACCKHYTAYDLDNWKGTDRYHFNAMVIY----TYYLIKFK--- 251
GLQ G+ +R +A A CKH+ + T R+ F+A V +L +FK
Sbjct: 178 GLQ----GNNSRYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACV 233
Query: 252 --------------------------------------YIVSDCDSVDVLYNSQHYTKTP 273
Y+VSD +++ + HY
Sbjct: 234 EAGALSLMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDF 293
Query: 274 EEAAAKSILAGLDLNCGSF------LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
+AAA + AG L G+ + +H AVK LV+ + +A+S F M+LG
Sbjct: 294 MKAAADAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGE 353
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS----LPLSPTAIKNLA 383
FD P PY + + + A+ +L+L AA + IVL+KN G LP++ +K
Sbjct: 354 FDP-PDNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPIT-NEVKKAC 411
Query: 384 VIGPNANVTKTMIGNYEGTPCK--YTTPLQGLA-AVVATI---YQAGCSN-VQCGTAQVD 436
++GP ++ + + G+Y T + T L GL A + T Y GC + C
Sbjct: 412 MVGPFSDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSA 471
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK-GPVILIIMS 495
+ A + ++ G + +E+E D D+ LPG Q L+ + +K VILI+ +
Sbjct: 472 KVRSACDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFN 531
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
DI +AK DP+I IL YPGQ G AIA+V G YNPSGRLP TW
Sbjct: 532 ASPLDIRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW 581
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 154/429 (35%), Positives = 213/429 (49%), Gaps = 85/429 (19%)
Query: 50 SLGFCNTSLGIDLRVGD-------LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEAL 102
+ GF + +D R D LV ++TL EK+ + + A ++ RLG+P Y+WWSE L
Sbjct: 12 AFGFALVAGCVDARTPDAHSRAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGL 71
Query: 103 HGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG----- 157
HG++ G AT FPQ I AAS++ SL A+G V+STEARA +N
Sbjct: 72 HGIARNG----------YATVFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRA 121
Query: 158 ---LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKV 214
GLT WSPNINIFRDPRWGRGQET GEDP L + A +V+G+Q G P +
Sbjct: 122 HGLFQGLTLWSPNINIFRDPRWGRGQETYGEDPYLTGQLAVAFVRGIQ---GDDPQHPRA 178
Query: 215 AACCKHYTAYDLDNWKGTDRYHFNA--MVIYTYYLIKFK--------------------- 251
A KH+ A+ G D + + + YL F+
Sbjct: 179 IATPKHFVAHSGPE-AGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGT 237
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
Y+VSDCD+V + + ++ +A+ ++ AG DL+CG
Sbjct: 238 PACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCG 297
Query: 291 SFLGKHTEA----AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
HT A AV+ G + ESA+D ++ F RLG G PY ++G +
Sbjct: 298 -----HTYASLAQAVRQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQID 351
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
+ A++ LAL AA + +VLLKN +LPL A LAVIGP+A+ +T+ NY GT
Sbjct: 352 SPAHRKLALQAALESLVLLKNAHSTLPLH--AGMRLAVIGPDADALETLEANYHGTARHP 409
Query: 407 TTPLQGLAA 415
TPLQGL A
Sbjct: 410 VTPLQGLRA 418
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 64/118 (54%), Gaps = 13/118 (11%)
Query: 439 KKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGP 488
++ ADA V +G +E E DR D+ LP Q+ L+ E A+ + P
Sbjct: 592 ERVLHDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALL-ERARASGKP 650
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+I++++SG + +A+ +IL YPGQAGG AIA V G YNP GRLP+T+Y
Sbjct: 651 LIVVLLSGSAVALDWAQQ--HADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFY 706
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 144/415 (34%), Positives = 212/415 (51%), Gaps = 59/415 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NTS + R DL+ RLTL+EK + L +S ++ RLGI WSEALHG+ +G G
Sbjct: 33 YLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEALHGI--LG-GA 89
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
+ S + G TSFP + ++++ +L Q ++ EARA+ G GLT+WSP + R
Sbjct: 90 NQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLTYWSPVVEPIR 149
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR E+ GEDP LA++ A G+V+G+ G P LK C KHY A + + +
Sbjct: 150 DPRWGRTGESYGEDPFLAAEIAGGFVRGMV---GNDPTYLKSVPCAKHYFANNSEFDRHV 206
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ ++ + +YL +K
Sbjct: 207 SSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYLDTIARRTYGLK 266
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
YI DC +++ +Y +Y KT EEA AK + AG+D +CGS ++ AA+K GL+ +
Sbjct: 267 GYITGDCAAIEDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIYQRYAIAALKKGLITMAD 326
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
ID A+ N F MR G FD P+K Y Q P V + AN+ LA + A + VLLKN
Sbjct: 327 IDRALLNIFIVRMRTGEFD-PPAKVLYAQFQPNIVNSPANKALAKEIATKTPVLLKNNIS 385
Query: 371 ------SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK--YTTPLQGLAAVV 417
+LPL+P +K +A+IGP+A+ K +G Y G P + TP G+ +
Sbjct: 386 LKTNRKALPLNPADLKKIALIGPHAD--KVELGPYSGRPAQENMITPFAGIKKYI 438
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 53/94 (56%), Gaps = 1/94 (1%)
Query: 453 GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
G D+ E DRL LLLPG Q LI VA V I+++ + G ++ KN I
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPN-TIVVMQTLGCVEVEEFKNLQNIPG 677
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I+WVGY GQA G AIA V FG NP G+L TWY
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWY 711
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 164/480 (34%), Positives = 228/480 (47%), Gaps = 103/480 (21%)
Query: 18 LFLTYC-SLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEK 76
F+T+C S ++AQ+ P + N +L R DL+KR+TL+EK
Sbjct: 5 FFVTFCLSAVALVATAQNEP----------------YRNPNLSPSERAWDLLKRMTLEEK 48
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
I+ + + + ++ RLGIP Y+WW+EALHGV+ G AT FPQ I AA+F+
Sbjct: 49 ISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK----------ATVFPQAIGLAATFDN 98
Query: 137 SLFQAIGKVVSTEARAMYNV--------GLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 188
+VS EARA Y+ G GLTFW+PNINI+RDPRWGRG ET GEDP
Sbjct: 99 QAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTFWTPNINIYRDPRWGRGMETYGEDPY 158
Query: 189 LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----Y 243
L S VKGLQ G + K AC KHY + W +R+ F++ I +
Sbjct: 159 LTSLMGLAVVKGLQGNGAGKYD--KAHACAKHYAVHSGPEW---NRHSFDSKNISQRDLW 213
Query: 244 TYYLIKFK------------------------------------------YIVSDCDSVD 261
YL FK +VSDC ++
Sbjct: 214 ETYLPAFKTLVTEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIG 273
Query: 262 VLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
Y H+ P E A+A ++++G DL CG E AVK GL+ E I+ ++
Sbjct: 274 DFYYPNHHETHPSAEAASADAVVSGTDLECGGSYSSLNE-AVKKGLITEDKINESVFRLL 332
Query: 320 ATLMRLGFFDGHP----SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+LG FD S+ PY + K+ + D AL+ AR+ +VLL N SLPLS
Sbjct: 333 RARFQLGMFDDDTLVSWSEIPYSVVESKE-----HVDKALEMARKSMVLLTNKNNSLPLS 387
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAGCSNVQCGT 432
+I+ +AV+GPNAN + + NY G P K T L+G+ + + A Y+ GC V T
Sbjct: 388 -KSIRKVAVLGPNANDSVMLWANYNGFPTKSVTILEGIRSKLPEGAVYYEKGCDFVSTQT 446
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 19/147 (12%)
Query: 416 VVATIYQAGCSNV---QCGTAQVDDAKKAAASADAT---VLVMGADQSIEAESH------ 463
VV YQA V GT + D K AA A + V G ++E E
Sbjct: 566 VVLEFYQASGEAVLKFDIGTKKEIDYNKVAAKAAEADAIIFVGGLSSALEGEEMPVDLPG 625
Query: 464 ----DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
DR ++ LP Q+ ++ + K K PVI ++ SG + + + + ++L YP
Sbjct: 626 FKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGSTLALPWEAEN--LDAMLEAWYP 682
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQ GG A+ADV FG YNP+GRLP+T+Y
Sbjct: 683 GQQGGTAVADVLFGDYNPAGRLPLTFY 709
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 182/562 (32%), Positives = 261/562 (46%), Gaps = 86/562 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG-----SVSRLGIPKYEWWSEALHGVSY 107
F + SL D R DLV RLTL+E S G ++ RLGI Y W +E L G
Sbjct: 22 FQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAG--- 78
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-------- 159
V AT++PQ I AASF+ L + + +S E RA +N A
Sbjct: 79 --------QVNTNATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYSTKV 130
Query: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCK 219
GL+ +SP INI R P WGR QET GEDPLL+ A +V+GLQ G P L+ A CK
Sbjct: 131 GLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQ---GDDPRYLRANAGCK 187
Query: 220 HYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
H+ + R+ F+A V +L +FK
Sbjct: 188 HFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSYSLMCSYNRINGIPACA 247
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS--- 291
YIVSD ++ + HYT + +I AG +L G
Sbjct: 248 NKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLELGGGSN 307
Query: 292 -FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
+ K +A +K GL+ E I + T +RLG FD + Y ++G + + +
Sbjct: 308 MYYPKQLDA-MKQGLLTEKEIRDNVRPLLYTRLRLGEFDPE-AMVDYNKIGVDVIQSPEH 365
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG-TPCKYT-T 408
++ A+ AA G VLLKN LP+ K LA++GP N T + G Y K+T T
Sbjct: 366 REQAVKAAYMGFVLLKNHNNLLPIKKQYSK-LAIVGPFTNATSELFGTYSSEVNLKFTST 424
Query: 409 PLQGLAAVVATIYQA-GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
+GL+ + + A GC+N C DD + A A AD ++ +G+ Q E+E +DR
Sbjct: 425 IFEGLSPLGGSTRSANGCTNSACSGYVRDDVETAVAGADLVIVALGSGQRFESEGNDRAY 484
Query: 468 LLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
L L G Q ++ + + G PVIL++++ G DI++AK DP +T+IL GYP Q+ G A
Sbjct: 485 LDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCGYPAQSTGEA 544
Query: 527 IAD---VCFGRYNPSGRLPMTW 545
+ + + P+GRL TW
Sbjct: 545 LRRSLTMSEPQAAPAGRLQATW 566
>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 883
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 155/426 (36%), Positives = 213/426 (50%), Gaps = 73/426 (17%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
NP L S + +T+ + R DLV R++L+EK L++ A ++ RLG+ +Y WW+E LHG
Sbjct: 30 NP-LESPAYQDTTKTAEQRAADLVSRMSLEEKAAQLINDAPAIPRLGVREYNWWNEGLHG 88
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG------- 157
V+ G AT FPQ + AA+F+ L + +S E RA Y
Sbjct: 89 VAAHG----------YATVFPQAVGMAATFDEPLIHRVADTISVEFRAKYVASRHRFGGS 138
Query: 158 --LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
GLT WSPNINIFRDPRWGRGQET GEDP L ++ +VKGLQ G P +
Sbjct: 139 DWFRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAFVKGLQ---GEDPVYYRTI 195
Query: 216 ACCKHY-----------------TAYDLDN-----WKGTDRYHFNAMVIYTYYLIK---- 249
A KHY + YDL++ ++ T ++ Y I
Sbjct: 196 ATPKHYAVHSGPEASRHRDNINPSRYDLEDTYLPAFRATIVEGKAVSIMCAYNAIDGQPA 255
Query: 250 ----------------FK-YIVSDCDSV-DVLYN-SQHYTKTPEEAAAKSILAGLDLNCG 290
FK ++VSDCD+V D+ Y S HY TPEE + AG DL CG
Sbjct: 256 CANDDLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHHYRPTPEEGVTVAYQAGTDLICG 315
Query: 291 SF-LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
+ H +AV+ G++ ES +D A+ F+ +LG FD P Q + + D TQA
Sbjct: 316 NANEADHVASAVRKGILPESLVDTALVRLFSARFKLGQFD--PPAQVFPAITADDYDTQA 373
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
N+D + A +VLLKN G LPL + + +AVIGPNA+ +++GNY G P T
Sbjct: 374 NRDFSQHVAESAMVLLKND-GLLPLK-SEPRTIAVIGPNADTMDSLVGNYNGDPSHPVTV 431
Query: 410 LQGLAA 415
L G+ A
Sbjct: 432 LAGIKA 437
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
+A AA +D + V G Q +E E DR L LP QQ ++ +V+ K
Sbjct: 599 EAVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQVSATGK 658
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+L++++G +++A D + +I+ YPG GGAA+A + G ++P+GRLP+T+Y
Sbjct: 659 -PVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSPAGRLPVTFY 715
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 158/435 (36%), Positives = 215/435 (49%), Gaps = 83/435 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DLV RLTL+EK + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 42 YKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL-- 99
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-----GLA---GLTFW 164
AT FPQ I ASFN L + +S EARA GL GLT W
Sbjct: 100 --------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKRYQGLTMW 151
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L S+ V+GLQ +G ++L AC KHY +
Sbjct: 152 TPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYDKLH--ACAKHYAVH 209
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 210 SGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNR 266
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGK 295
+VSDC ++ YN + P++ A+AK++L+G D+ CG G
Sbjct: 267 LLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDIECGDSYGS 326
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK GL++E ID ++ LG D PS+ + Q+ V ++ +++LAL
Sbjct: 327 LPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELAL 384
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNL--AVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AR+ +VLL+N LPL+ KNL AV+GPNAN + GNY G P T L+G+
Sbjct: 385 RMARESLVLLQNNQSLLPLN----KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGI 440
Query: 414 AAVV---ATIYQAGC 425
+ IY+ GC
Sbjct: 441 REYLPESQIIYEPGC 455
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
AD + G ++E E DR + LP Q L+ E+ K K ++ +
Sbjct: 615 EADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-KIVFVN 673
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I+ +IL YPGQAGG AIA+V FG YNP+GRLP+T+Y
Sbjct: 674 FSGSA--IALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFY 724
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 149/442 (33%), Positives = 217/442 (49%), Gaps = 70/442 (15%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
+ +V S +T+L ++RV DLV RLTL+EK+ ++++A ++ RLGIP Y+
Sbjct: 26 ILIANVCQAQSFKGFPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYD 85
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
WW+E LHGV+ T F T +PQ I AA ++++ + + E RA++N
Sbjct: 86 WWNEVLHGVAR----TPFH-----VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNK 136
Query: 157 GLA---------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
A GLT+W+PNINIFRDPRWGRGQET GEDP L S +V+GLQ G
Sbjct: 137 ATALGRNNERYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQ---GD 193
Query: 208 SPNRLKVAACCKHYTAYD-LDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
P LK AAC KH+ + + + +D + + ++ YL FK
Sbjct: 194 DPKYLKAAACAKHFAVHSGPEPSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAY 253
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+ SDC ++D + A+ ++L G
Sbjct: 254 NAFHGQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLHG 313
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
D+ CG+ + K VK G++ E+ +D ++ F T RLG FD S Y Q
Sbjct: 314 TDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFD-PVSMVKYAQTPESI 372
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+ T ++ +L A+Q IVLLKN +LPLS IK +AV+GPNA+ ++GNY G P
Sbjct: 373 LETAEHKAHSLKMAQQSIVLLKNEGNTLPLSKN-IKKIAVLGPNADNRIVVLGNYNGQPS 431
Query: 405 KYTTPLQG----LAAVVATIYQ 422
+ T LQG L V IY+
Sbjct: 432 EIITALQGIKNKLGQEVELIYE 453
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 62/115 (53%), Gaps = 19/115 (16%)
Query: 445 ADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG---PVIL 491
ADA V V G +E E DR +LLP Q TE+ K+ KG P++
Sbjct: 619 ADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQ----TELLKMLKGTGKPLVF 674
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++M+G I+ D I +I+ Y GQ+ G AIADV FG YNP+GRLP+T+Y
Sbjct: 675 VVMTGSA--IALPYEDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLPVTFY 727
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 130/312 (41%), Positives = 170/312 (54%), Gaps = 75/312 (24%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P S FCN SL + R L+ LTL+EKI L D+A + R GIP YEWWSE+LHG+
Sbjct: 35 PLHNSYTFCNQSLSVPTRAHSLISLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGI 94
Query: 106 SYVGPGTHFS-NVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFW 164
+ GPG F+ V AT FPQVI++AA+FN +L+ IG ++ EARAM+NVG +GLTFW
Sbjct: 95 AINGPGVSFTIGPVSAATGFPQVIISAAAFNRTLWFLIGSAIAIEARAMHNVGQSGLTFW 154
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS---------------- 208
+PN+NIFRDPRWGRGQETPGEDP+L S YA +VKG Q GG+
Sbjct: 155 APNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQ---GGNWKSGVSGSGSGRYGFG 211
Query: 209 ----------PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK--- 251
+ L ++ACCKH TAYDL+ W RY FNA+V + Y F+
Sbjct: 212 EKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCI 271
Query: 252 --------------------------------------YIVSDCDSVDVLYNSQHYTKTP 273
YIVSDCD+V ++ Q+Y+K+
Sbjct: 272 EEGKASCLMCSYNEVNGVPACAREDLLQKAREEWGFEGYIVSDCDAVATIFEYQNYSKSA 331
Query: 274 EEAAAKSILAGL 285
E+A A ++ AG+
Sbjct: 332 EDAVAIALKAGM 343
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 155/433 (35%), Positives = 208/433 (48%), Gaps = 76/433 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N S ID RV DLV RLTL+EK+ L D + ++ RL +P+Y WW+EALHGV+ G
Sbjct: 34 YLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAGK-- 91
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA+F+ L +G +S E RA ++ L GLT+W
Sbjct: 92 --------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMYTGLTYW 143
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ A ++ GLQ G + LK A KHY +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQ---GDNAEYLKSVATLKHYAVH 200
Query: 225 D-LDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------------- 251
+ + +D Y + + YL FK
Sbjct: 201 SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTPACGNDELIQN 260
Query: 252 ----------YIVSDCDSVDVLYN--SQHYTKTPEEAAAKSILAGLDLNCGSFLGK---H 296
YIVSDC ++ Y+ S + T +AAA ++ G DLNCG G +
Sbjct: 261 KLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTDLNCGDHHGNTYSY 320
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
AVK GLV E +D A+ +LG FD +P PY V + + L +
Sbjct: 321 LTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDVVGSNKHLALTQE 379
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AA++ +VLLKN LPL + +A+IGPNA+ ++GNY G P TP L
Sbjct: 380 AAQKSLVLLKNEQ-VLPLKGN--EKIALIGPNADNEAILLGNYNGMPIVPITPKLALEQR 436
Query: 417 VAT---IYQAGCS 426
+ Y AG S
Sbjct: 437 LGKNNLTYTAGSS 449
>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
Length = 884
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 161/476 (33%), Positives = 228/476 (47%), Gaps = 94/476 (19%)
Query: 13 VSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLT 72
+ L YC L +S+ +Q P F N L ++ R+ +L+ LT
Sbjct: 6 IRFLVTCFFYCLLGNSNLKSQEIP--------------YKFRNPDLPVNERIENLLGLLT 51
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
L+EK+ +++S+ V RLGIP Y+WW+EALHGV+ G AT FPQ I AA
Sbjct: 52 LEEKVGLMMNSSKPVGRLGIPAYDWWNEALHGVARSGK----------ATVFPQAIGMAA 101
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGRGQETPG 184
++N S + ++S EARA YN + GL+FW+PNINIFRDPRWGRGQET G
Sbjct: 102 TWNESGHKQTFDLISDEARAKYNEAIRNGERGRYYGLSFWTPNINIFRDPRWGRGQETYG 161
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV--- 241
EDP L ++ V+GLQ G P K AC KH+ + W +R+ ++A
Sbjct: 162 EDPYLTARLGVAAVRGLQ---GDDPKYFKTHACAKHFAVHSGPEW---NRHSYDATASGR 215
Query: 242 -IYTYYLIKFK------------------------------------------YIVSDCD 258
++ YL FK +VSDC
Sbjct: 216 DLWETYLPAFKALVKEANVQEVMCAYNAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCW 275
Query: 259 SVDVLYNSQHYT--KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
++D + H+ K AAA +++ DL CGS E AV+ GL+++ ID ++
Sbjct: 276 AIDDFFRKGHHETHKDAAAAAADAVIHSTDLECGSAYTNLLE-AVRQGLISQQQIDISLR 334
Query: 317 NNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
LG D P+++ P+ QL + V ++ + AL AR+ + LLKN LPLS
Sbjct: 335 RVLRGWFELGMLD--PAERLPWSQLPYQIVASKEHVQQALKVARESMTLLKNNGSILPLS 392
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAGCSNV 428
+IK +AVIGPNA + + GNY GTP T LQG+ + IY GC V
Sbjct: 393 -KSIKKIAVIGPNAADSVMLWGNYNGTPNSTVTILQGIKNKLPHAEIIYDKGCDWV 447
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K DA V G +E E D++ + LP Q+ L++ + K PV
Sbjct: 603 KRLKEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTGK-PV 661
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ ++ +G ++ +++ ++L Y GQ G A+ADV FG YNP+GRLP+T+Y
Sbjct: 662 VFVLCTGSS--LALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFY 716
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 210/435 (48%), Gaps = 85/435 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NTSL + R DL++RLTL+EK++ + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 27 YQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRAGI-- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AASF+ L + VS EARA Y GLTFW
Sbjct: 85 --------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQGLTFW 136
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L S+ V+GLQ D ++L AC KHY +
Sbjct: 137 TPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLH--ACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 195 SGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEGEPCCGSNR 251
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
IVSDC ++ + + P E A+A ++L+G DL CG+ K
Sbjct: 252 LLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDLECGNNY-K 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP--SKQPYGQLGPKDVCTQANQDL 353
AV+ GL++E ID ++ LG D H PY V ++A++DL
Sbjct: 311 SLPEAVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYSV-----VDSKAHKDL 365
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL+ AR+ IVLL+N LPL +A+IGPNAN + GNY G P +T + L
Sbjct: 366 ALEIARKSIVLLQNRNNILPLKEDM--KIALIGPNANDSVMQWGNYNGFPSHTSTLYEAL 423
Query: 414 AAVVAT---IYQAGC 425
+ IY GC
Sbjct: 424 KERIPANQLIYDFGC 438
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 65/127 (51%), Gaps = 17/127 (13%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVA 482
A +D K +AD V G S+E E DR + LP Q+ LI+E+
Sbjct: 593 ASIDKVK----AADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELK 648
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K+ K P+I + SG + +IL YPGQAGG A+ADV FG YNPSG+LP
Sbjct: 649 KLGK-PIIFVNYSGSA--VGLEPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLP 705
Query: 543 MTWYPQS 549
+T+Y +
Sbjct: 706 VTFYKHT 712
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 169/524 (32%), Positives = 251/524 (47%), Gaps = 76/524 (14%)
Query: 84 AGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG-ATSFPQVILTAASFNASLFQAI 142
A + RLGI Y W +E L G PG AT+FPQ + AA+F+ L +
Sbjct: 6 APPIPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRV 55
Query: 143 GKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
+TE RA +N A GL+ +SP +NI R P WGR QET GEDP L+ + A
Sbjct: 56 ANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELA 115
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-------------LDNWKGTDRYHFNAMV 241
+V+GLQ G P +K +A CKH++ + +W+ T F A V
Sbjct: 116 RSFVQGLQ---GPHPRYVKASAGCKHFSVHGGHENILLYLLTVLERDWRMTFLPQFQACV 172
Query: 242 IYTYYLIKFKY----------------------------IVSDCDSVDVLYNSQHYTKTP 273
Y Y +VSD +V+++ HYT++
Sbjct: 173 RAGSYSFMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIMLGHHYTRSF 232
Query: 274 EEAAAKSILAGLDLNCGSFLGKHT----EAAVKAGLVNESAIDHAISNNFATLMRLGFFD 329
E A S+ AG +L + + A+ G + + + F T MRLG FD
Sbjct: 233 LETAVASVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFYTRMRLGEFD 292
Query: 330 GHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI--KNLAVIGP 387
P+ PY L V + +++L+L+AA + VLLKN G+LPL + ++LAV+GP
Sbjct: 293 P-PAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSSQHLAVVGP 351
Query: 388 NANVTKTMIGNYEGTP-CKYT-TPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAAS 444
A+ + + G+Y P +Y TP +GL + A + + AGCS +C + K +
Sbjct: 352 FADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRCQRYSRAELVKVVGA 411
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISF 503
AD ++ +G +E E+ DR DL LPG Q L+ + + A G PVIL++ + G D+S+
Sbjct: 412 ADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVILLLFNAGPLDVSW 471
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGR--YNPSGRLPMTW 545
A+ + +IL +P QA G AIA V G +P+GRLP TW
Sbjct: 472 AQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATW 515
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 157/469 (33%), Positives = 227/469 (48%), Gaps = 82/469 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DLV R+TL+EK + +A ++ RLG+P Y WW+E LHGV+ G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT FPQ I AA+++ + V+ TE RA Y + GLT
Sbjct: 96 --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
WSPNINIFRDPRWGRGQET GEDP L + +++GLQ D PN K A KHY
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQD---PNFFKTIATAKHYAV 204
Query: 224 YD-LDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
+ ++ + + H +A + YL F+
Sbjct: 205 HSGPESNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACASEDLMD 264
Query: 252 -----------YIVSDCDSVDVLY--NSQHYTKTPEEAAAKSILAGLDLNCGSFLGK-HT 297
++VSDC + +Y +S Y KTPEE +++ AG+DL CG + +T
Sbjct: 265 QRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDYRADWNT 324
Query: 298 EA-----AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
EA AV+ G+++E+ +D A+ FA +RLG FD P++ P+ ++ T ++
Sbjct: 325 EAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQNDTPEHRA 383
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
++L+ A+ + LLKN G LPL + +AV+GPNA+ +IGNY GTP T L G
Sbjct: 384 MSLEMAKASMTLLKND-GVLPLKGEP-RRIAVVGPNADSVDALIGNYYGTPSNPVTVLAG 441
Query: 413 LAAV---VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
+ A +Y G V + V D A ADA G Q +
Sbjct: 442 IRARFPKAEVVYAEGTGLVGPASLPVPD---AVLCADAACRTKGLKQEV 487
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 64/120 (53%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
DA AA +AD V V G +E E DR L LP QQ L+ + K
Sbjct: 604 DAVAAAKAADLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK 663
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+L++M+G +++A D + +I+ YPG GG A+A + G Y+P+GRLP+T+Y
Sbjct: 664 -PVVLVLMNGSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFY 720
>gi|62733930|gb|AAX96039.1| Similar to F28K19.27 [Oryza sativa Japonica Group]
Length = 244
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 109/191 (57%), Positives = 135/191 (70%), Gaps = 7/191 (3%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ L + R DLV LTL EK++ L D A V+RLG+P YEWWSE LHG+S G G
Sbjct: 50 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 109
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIF 171
F+ V TSFPQVILTAA+F+A L++ +G+ V EARA+YN+G A GLT WSPN+NIF
Sbjct: 110 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 169
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQETPGEDP+ AS+YA +V GLQ G + +ACCKH TAYDLD W
Sbjct: 170 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------EASACCKHATAYDLDYWNN 223
Query: 232 TDRYHFNAMVI 242
RY++++ VI
Sbjct: 224 VVRYNYDSKVI 234
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 154/439 (35%), Positives = 209/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ FNA +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 154/439 (35%), Positives = 209/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ FNA +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 154/439 (35%), Positives = 209/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ FNA +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 154/439 (35%), Positives = 209/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ FNA +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPSKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 213/435 (48%), Gaps = 83/435 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DLV RLTL+EK + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 26 YKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-----GLA---GLTFW 164
AT FPQ I ASFN L + VS EARA GL GLT W
Sbjct: 84 --------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTMW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L + V+GLQ +G ++L AC KHY +
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLH--ACAKHYAVH 193
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 194 SGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNR 250
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGK 295
+VSDC ++ YN + P++ A+AK++L+G D+ CG
Sbjct: 251 LLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVLSGTDVECGDSYAS 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK GL++E ID ++ LG D PS+ + Q+ V ++ +++LAL
Sbjct: 311 LPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNL--AVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AR+ +VLL+N LPL+ KNL AV+GPNAN + GNY G P T L+G+
Sbjct: 369 RMARESLVLLQNNQSLLPLN----KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGI 424
Query: 414 AAVV---ATIYQAGC 425
+ IY+ GC
Sbjct: 425 REYLPESQIIYEPGC 439
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 60/122 (49%), Gaps = 17/122 (13%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
VD K+A D + G ++E E DR + LP Q L+ E+ K
Sbjct: 594 VDKVKEA----DVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K ++ + SG I+ +IL YPGQAGG AIA+V FG YNP+GRLP+T
Sbjct: 650 GK-KIVFVNFSGSA--IALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 545 WY 546
+Y
Sbjct: 707 FY 708
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 154/444 (34%), Positives = 216/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 144/396 (36%), Positives = 206/396 (52%), Gaps = 69/396 (17%)
Query: 70 RLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVIL 129
++T +EK+ +++A ++ RLG+P YEWW+E LHG++ G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQ 180
AA++N +L + +G V STEARA +N+ AGLT WSPNINIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD--- 227
ET GEDP L + A G+++GLQ D P + A KH +D+D
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQGDDLTHP---RTIATPKHLAVHSGPEPGRHGFDVDVSP 233
Query: 228 -NWKGTDRYHFNAMVI--------------------YTYYLIKFK---------YIVSDC 257
+ + T F A ++ +L+ + ++VSDC
Sbjct: 234 HDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWGFTGFVVSDC 293
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
D+VD + ++ ++A ++ AG DLNCG + + A+ G +E+ +D ++
Sbjct: 294 DAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEAVLDQSLVR 352
Query: 318 NFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
FA RLG K PY +LG KDV + A++ LAL AA+Q IVLL+N +LPL P
Sbjct: 353 LFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLRPG 411
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LAVIGPNA+ + NY+GT TPL GL
Sbjct: 412 L--RLAVIGPNADALAALEANYQGTSAAPVTPLLGL 445
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 667 DRNDLALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKQHADAIVAAW--YPGQSG 723
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA V G NP GRLP+T+Y
Sbjct: 724 GTAIAQVLAGDVNPGGRLPVTFY 746
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 228 bits (580), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 216/438 (49%), Gaps = 81/438 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK+ + +++ ++ RLGI +Y+WW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + VS EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFSENGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + G ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++L+G DL CG
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVLSGTDLECGGEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
G + AVKAGL++E ID ++ LG D P+ + ++ + ++ +QDL
Sbjct: 309 GSLAD-AVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPA---WAEIPASTLNSKEHQDL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNDILPLN-TDLK-VAVMGPNANDSVMQWGNYNGIPGHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNV 428
+ + +Y+ GC
Sbjct: 423 RSKLPEGQVMYEPGCDRT 440
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 57/119 (47%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A + AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI DV FG YNP+GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFY 706
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 228 bits (580), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 214/438 (48%), Gaps = 84/438 (19%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A L + N L + R DL++RLT++EKI+ + +++ + RLGI YEWW+EALHGV+
Sbjct: 12 AQLPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARA 71
Query: 109 GPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAG 160
G AT FPQ I AASFN SL Q + VS EARA G
Sbjct: 72 GL----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQG 121
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LT W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ D ++L AC KH
Sbjct: 122 LTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLH--ACAKH 179
Query: 221 YTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK------------------------ 251
+ + W +R+ FNA I + YL FK
Sbjct: 180 FAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCC 236
Query: 252 ------------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGS 291
+VSDC ++ + ++ + P+ A+A ++L+G DL CGS
Sbjct: 237 GSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLECGS 296
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG-FFDGHPSKQPYGQLGPKDVCTQAN 350
K T+ AVKAG+++E ID ++ LG + HP PY + C + +
Sbjct: 297 NYRKLTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYSIVD----CPE-H 350
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
+ LAL A + + LL+N LPL A +AVIGPNAN + GNY GTP +T L
Sbjct: 351 RHLALQIAHETMTLLQNKENILPLDKHA--KVAVIGPNANDSVMQWGNYNGTPSHTSTLL 408
Query: 411 QGLAAVVAT---IYQAGC 425
L + + IY+ C
Sbjct: 409 SALRSKLPAAQLIYEPVC 426
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 63/123 (51%), Gaps = 6/123 (4%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---ESHDRLDLLLPGQQQLLITEVAK 483
N+ ++ D + + + L+ G + + A + DR D+ LP Q+ ++ + K
Sbjct: 576 NMNATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKK 635
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K VI + SG ++ +IL YPGQ GG A+ADV FG YNP+GRLP+
Sbjct: 636 AGK-KVIFVNFSGSA--MALTPETENCDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPV 692
Query: 544 TWY 546
T+Y
Sbjct: 693 TFY 695
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 213/435 (48%), Gaps = 83/435 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DLV RLTL+EK + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 26 YKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-----GLA---GLTFW 164
AT FPQ I ASFN L + VS EARA GL GLT W
Sbjct: 84 --------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTMW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRGQET GEDP L + V+GLQ +G ++L AC KHY +
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLH--ACAKHYAVH 193
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 194 SGPEW---NRHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCGSNR 250
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGK 295
+VSDC ++ YN + P++ A+AK++L+G D+ CG
Sbjct: 251 LLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDVECGDSYAS 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK GL++E ID ++ LG D PS+ + Q+ V ++ +++LAL
Sbjct: 311 LPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNL--AVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AR+ +VLL+N LPL+ KNL AV+GPNAN + GNY G P T L+G+
Sbjct: 369 RMARESLVLLQNNQSLLPLN----KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEGI 424
Query: 414 AAVV---ATIYQAGC 425
+ IY+ GC
Sbjct: 425 REYLPESQIIYEPGC 439
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 60/122 (49%), Gaps = 17/122 (13%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKV 484
VD K+A D + G ++E E DR + LP Q L+ E+ K
Sbjct: 594 VDKVKEA----DVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K ++ + SG I+ +IL YPGQAGG AIA+V FG YNP+GRLP+T
Sbjct: 650 GK-KIVFVNFSGSA--IALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 545 WY 546
+Y
Sbjct: 707 FY 708
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/444 (34%), Positives = 216/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/415 (37%), Positives = 208/415 (50%), Gaps = 77/415 (18%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S+A + N + I R+ DL+ +LT++EKI+ L ++ + R+GI KY +EALHG+
Sbjct: 17 SMAQDLYKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGI- 75
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG------ 160
+ PG T FPQ I A+ +N L I V+S EARA +N G
Sbjct: 76 -IRPGKF--------TVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQ 126
Query: 161 ----LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G P LK +
Sbjct: 127 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQ---GDHPRYLKSVS 183
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMVIYT----YYLIKFK--------------------- 251
KH+ A N + +R++ +A + T YYL F+
Sbjct: 184 TPKHFAA----NNEEHNRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGV 239
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
YIVSDC + +L Y KTPE AA +I AGLDL CG
Sbjct: 240 PCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECG 299
Query: 291 SFL-GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
++ G A K +V+ + ID A + MRLG FD P K PY L P+ V +
Sbjct: 300 DYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEK 358
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+++LAL+AARQ IVLLKN +LPL+ IK++AV+G NA G+Y GTP
Sbjct: 359 HKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSGTPV 411
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
DA K +D + VMG +QSIE E DR + LP QQ+ I E K A I+++++G
Sbjct: 589 DASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYK-ANPNTIVVLVAG 647
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ + D I +I+ YPG+ GG AIA+V FG YNP+GRLP+T+Y
Sbjct: 648 SSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY 695
>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 895
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 145/396 (36%), Positives = 205/396 (51%), Gaps = 69/396 (17%)
Query: 70 RLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVIL 129
++T +EK+ +++A ++ RLG+P YEWW+E LHG++ G AT FPQ I
Sbjct: 53 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 102
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQ 180
AA++N +L + +G V STEARA +N+ AGLT WSPNINIFRDPRWGRG
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD--- 227
ET GEDP L + A G++ GLQ D P + A KH +D+D
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQGDDLTHP---RTIATPKHLAVHSGPEPGRHGFDVDVSP 219
Query: 228 -NWKGTDRYHFNAMVI--------------------YTYYLIKFK---------YIVSDC 257
+ + T F A ++ +L+ + ++VSDC
Sbjct: 220 HDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVSDC 279
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
D+VD + ++ ++A ++ AG DLNCG + + A+ G +E+ +D ++
Sbjct: 280 DAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDQSLVR 338
Query: 318 NFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
FA RLG K PY QLG KDV + A++ LAL AA+Q IVLL+N +LPL P
Sbjct: 339 LFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLRPG 397
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LAVIGPNA+ + NY+GT TPL GL
Sbjct: 398 L--RLAVIGPNADALAALEANYQGTSAAPVTPLLGL 431
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 653 DRNDLALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKQHADAIVAAW--YPGQSG 709
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA V G NP GRLP+T+Y
Sbjct: 710 GTAIAQVLAGDVNPGGRLPVTFY 732
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/444 (34%), Positives = 216/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/444 (34%), Positives = 216/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
Length = 897
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 151/434 (34%), Positives = 214/434 (49%), Gaps = 77/434 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + S + R LV ++TL+EK + ++A ++ RLG+P Y+WW+E LHGV+ G
Sbjct: 38 WLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFW 164
AT FPQ I AA+F+ L + +S EARA ++ L GLTFW
Sbjct: 96 --------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTFW 147
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPN+NIFRDPRWGRGQET GEDP L ++ +V+GLQ G P K+ A KH +
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ---GDDPVYRKLDATAKHLAVH 204
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ DR+HF+A +Y YL F+
Sbjct: 205 ---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASASRFL 261
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE 298
Y+VSDC ++ ++ T E AAA ++ G +L CG
Sbjct: 262 LRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECGQEYAT-LP 320
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
+AV+ GL++E+ ID A++ F MRLG FD P + + ++ A+ LAL AA
Sbjct: 321 SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHDALALKAA 379
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL---AA 415
+ +VLLKN G LPLS + +AV+GP A+ T ++GNY GTP T LQG+ A
Sbjct: 380 QASLVLLKND-GILPLSRN-TRRIAVVGPTADDTMALLGNYFGTPAAPVTILQGIREAAK 437
Query: 416 VVATIYQAGCSNVQ 429
V Y G V+
Sbjct: 438 GVEVRYARGVDLVE 451
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 48/83 (57%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP Q+ L+ + K PV++++ G + +A+ + +IL YPGQ G
Sbjct: 658 DRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSAIAVDWAQA--HLPAILMSWYPGQRG 714
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ FG NPSGRLP+T+Y
Sbjct: 715 GTAVGQALFGDVNPSGRLPVTFY 737
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 212/438 (48%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N++L + R DL+++LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF I VS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S VKGLQ TD K+ AC KH+ +
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYD-KIHACAKHFAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I + YL+ F+ +V
Sbjct: 193 SGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCGSDR 249
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++D Y + + P E A+A ++L+G DL CGS
Sbjct: 250 LLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGSSYKA 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E+A K GL++E ID ++ LG D PSK + ++ VC+ + L+L
Sbjct: 310 LVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDSLSL 367
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N LPL + +AV+GPNAN + GNY GTP T L+G+ +
Sbjct: 368 DIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITLLEGIRS 426
Query: 416 VVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 427 AMGENDKLIYEQGCSWVE 444
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 199/608 (32%), Positives = 278/608 (45%), Gaps = 120/608 (19%)
Query: 43 VSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEAL 102
V +PS A+ +C+ +L I RV DL R T+ E I+ + A +V RLG+P + EAL
Sbjct: 333 VESPSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEAL 392
Query: 103 HGVSYVGPGTHFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN----- 155
HGV +S G T FP ASF+ L++A+G EARA++
Sbjct: 393 HGV--------WSTCAAGRCPTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRH 444
Query: 156 ---------VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
G GLTF++PN+N+ RDPRWGR +E P EDPLL Y +V+G Q DG
Sbjct: 445 NASDCARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQG-DG 503
Query: 207 GSPNRLKVA-ACCKHYTAYDLD---------NWKGT-------DRYHFNAMV----IYTY 245
+VA A KH+ Y+L+ +W G+ DR+ F+A V
Sbjct: 504 A----YRVANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEET 559
Query: 246 YLIKFK-----------------------------------------YIVSDCDSV-DVL 263
Y+ F + +DC ++ D +
Sbjct: 560 YVGPFVAPVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAV 619
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
+ Y EAAA +I AG+D NCG L A+ AGLV A+ + +
Sbjct: 620 ARHKRYATE-AEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARL 678
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA 383
RLG D + P + V + A++ LAL AAR+G+VLL+N LPL LA
Sbjct: 679 RLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLD--GRGTLA 736
Query: 384 VIGPNANVTKTMIGNYEGTPCK--YTTPLQGLAAVV---ATIYQAGCSNVQCGTAQVDDA 438
VIGPNAN + ++ Y GTP +PLQ L A +Y GC+ TA +D+A
Sbjct: 737 VIGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAATAALDEA 796
Query: 439 KKAAASADATVLVMG------------ADQSI----EAESHDRLDLLLPGQQQLLITEVA 482
A +AD VL +G D + EAES DR L LPG Q+ L +++
Sbjct: 797 VDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIW 856
Query: 483 KVAKG-PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
+ K V + ++S G D SFAK+ ++L GY G+ GG A+AD G YNP G L
Sbjct: 857 ALGKPVAVAVFLVSAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNPGGAL 913
Query: 542 PMTWYPQS 549
T P +
Sbjct: 914 TATMLPDA 921
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ G DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-TNLK-IAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDRKTLQ 446
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 875
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 148/450 (32%), Positives = 209/450 (46%), Gaps = 76/450 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L + RV DLV RLTL+EK+ ++++A + RL IP Y+WW+E LHGV+
Sbjct: 30 FQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLHGVA------ 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
T FPQ I AA+++ + + + E R ++N +A GLT+
Sbjct: 84 ---RTPYNVTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKAIAAGKEKDRYLGLTY 140
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L + A +V+GLQ G P LK AAC KHY
Sbjct: 141 WTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQ---GNDPKYLKAAACAKHYAV 197
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ + R+ F+ V ++ YL FK
Sbjct: 198 H---SGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPCCASDV 254
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT 297
Y+ SDC ++D Y + A+A ++ G D++CG+ +
Sbjct: 255 LMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNEAYRAL 314
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
AVK + E ID ++ F RLG FD PS Y Q ++ + A+ AL
Sbjct: 315 VQAVKENKITEKQIDISVKRLFMIRFRLGMFDP-PSMVKYAQTPATELESAAHAKHALLM 373
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL---- 413
A + IVLLKN +LPL +K + V+GPNA +GNY GTP K T QG+
Sbjct: 374 AHESIVLLKNANNTLPLK-KGLKKIVVLGPNATNVIAPLGNYSGTPSKLITLFQGIKEKA 432
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKKAAA 443
A +Y+ + + DD A +
Sbjct: 433 GAATQVVYEKAVNYTNNNVLEYDDISNACS 462
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K + ADA + G +E E DR +LLP Q L+ + K PV
Sbjct: 602 KKYSDADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELMKALQASGK-PV 660
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ ++M+G + + I +I+ Y GQA G A+ADV FG YNPSGRLP+T+Y
Sbjct: 661 VFVMMTGSALATPWESEN--IPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVTFY 715
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 150/437 (34%), Positives = 204/437 (46%), Gaps = 75/437 (17%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S L + N +L + R DLV RLTL+EK+ + D + +V RLGI K+ WWSEALHG +
Sbjct: 18 SAQQLPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYA 77
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GL 158
GP T FP+ + AASF+ + VS EARA N
Sbjct: 78 NQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRF 127
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ K+ AC
Sbjct: 128 HDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--ADAKYRKLLACA 185
Query: 219 KHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------- 251
KHY + W R+ N + + YL FK
Sbjct: 186 KHYAVHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEP 242
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS 291
+VSDC ++ YNS H + A+AK++L+G D+ C
Sbjct: 243 CCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVECVG 302
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ AV GL+ E I+ ++ LG D P+ ++ V ++ +Q
Sbjct: 303 YAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMD-KDELVPWTKIPLSVVNSEDHQ 361
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALD AR+ + LL+N LPLS +I LAVIGPNAN ++ + GNY GTP + L+
Sbjct: 362 KLALDMARETMTLLQNNNNILPLS-KSIGKLAVIGPNANDSQMLSGNYNGTPLRTINILE 420
Query: 412 GLAAVVA---TIYQAGC 425
G+ + IY AGC
Sbjct: 421 GIKTKLGADHVIYDAGC 437
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 55/117 (47%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
+ AD V V G +E E DR D+ LP Q+ I + K K
Sbjct: 594 EKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKAGKK-- 651
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++ ++ G I+ +IL Y G++GG A+ADV FG YNPSG LP+T+Y
Sbjct: 652 -IVFVNCSGSAIAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVTFY 707
>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 865
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 148/425 (34%), Positives = 200/425 (47%), Gaps = 74/425 (17%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L F N L + R DL+ RLT+QEK L D + ++ RLGI K+ WWSEALHG
Sbjct: 22 LPFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEALHG------ 75
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY---------NVGLAGL 161
++N T FPQ I AASFN L I +S EARA Y N L
Sbjct: 76 ---YANN-DSVTVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQAQRRGEENRRFLSL 131
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + +L AC KHY
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYRKL--LACAKHY 189
Query: 222 TAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK------------------------- 251
T + W R+ N + Y Y+ FK
Sbjct: 190 TVHSGPEWS---RHELNINDVSPREFYETYMPAFKALVQKADVRQVMCAYHRLDDEPCCS 246
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC--GSF 292
+V+DC ++ Y + + TP AAA +LAG DL C ++
Sbjct: 247 NTRILQRILRDEWGYEHMVVADCGAISDFYTTHGISSTPVHAAATGLLAGTDLECIWDNY 306
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
K A++ L+ E ID ++ LG D + S P+ Q+ P + + ++
Sbjct: 307 HYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMDDN-SLVPWAQIPPSVLNCEKHRQ 365
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA A+Q IVLL+N LPL ++I +AV+GPNA+ + GNY GTP + T L G
Sbjct: 366 LAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEVVLWGNYNGTPIRTITVLDG 425
Query: 413 LAAVV 417
+ + V
Sbjct: 426 IKSKV 430
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR ++ LP Q+ + + + K VI + SG I+ +I+ Y G++G
Sbjct: 632 DRTNIELPSVQRNCLKALKEAGK-TVIFVNCSGSA--IALEPETESCDAIIQAWYGGESG 688
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G AIADV FG YNPSG+LP+T+Y S
Sbjct: 689 GQAIADVLFGDYNPSGKLPVTFYRNS 714
>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1212
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 183/600 (30%), Positives = 268/600 (44%), Gaps = 111/600 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL-VDSAGSVSRLGIPKYEWWSEALHGVSYVGPG 111
+ T R DLV R+TL EK+ L +SA ++ RLG+ Y +WSE HG++ +G
Sbjct: 45 YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104
Query: 112 THFSNVV--PGATSFPQVILTAASFNASLFQAIGKVVSTEAR----------AMYNVG-- 157
++ P ATSFP + + S++ +L +S E R A N+G
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164
Query: 158 ---LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR-- 211
LT+W+P +N+ RDPRWGR E GEDP L K A +V G Q +T G+P
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPY 224
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------- 251
LKVAA KH+ + +N + D + I YY +F+
Sbjct: 225 LKVAATAKHFALNNNENDRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYNAING 284
Query: 252 ----------------------YIVSDCDSV-DVLYNSQH-----------------YTK 271
YI SDC +V DV +S H +T
Sbjct: 285 TPSPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTSTWTN 344
Query: 272 T------PEEAA--AKSILAGLDLNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFAT 321
T P +A A ++ AG D NC G + EAA+KA +++E IDHA+ F
Sbjct: 345 TATGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQLFTV 404
Query: 322 LMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN------TAGSLPLS 375
M+ G FD +K Y ++ + + +Q LA A +VLLKN A LP +
Sbjct: 405 RMQTGEFD-PANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAKVLPAN 463
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV------ATI---YQAGCS 426
P ++ N+ V+G AN +G Y G P +QG+ + V AT+ +
Sbjct: 464 PASLNNVVVVGDLANTV--TLGGYSGDPTLQVNAVQGITSAVKAANPNATVTFDACGTST 521
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAK 486
+ + A +AD V+ +G D S ES+DR L +PG LI++VA +
Sbjct: 522 TATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQVAALGN 581
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+L + + G DI K D +I++ Y G++ G A+ADV FG+ NPSG L TWY
Sbjct: 582 PRTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNPSGHLDFTWY 639
>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 909
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 144/396 (36%), Positives = 205/396 (51%), Gaps = 69/396 (17%)
Query: 70 RLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVIL 129
++T +EK+ +++A ++ RLG+P YEWW+E LHG++ G AT FPQ I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWGRGQ 180
AA++N +L + +G V STEARA +N+ AGLT WSPNINIFRDPRWGRG
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD--- 227
ET GEDP L + A G++ GLQ D P + A KH +D+D
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQGDDLTHP---RTIATPKHLAVHSGPEPGRHGFDVDVSP 233
Query: 228 -NWKGTDRYHFNAMVI--------------------YTYYLIKFK---------YIVSDC 257
+ + T F A ++ +L+ + ++VSDC
Sbjct: 234 HDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTGFVVSDC 293
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISN 317
D+VD + ++ ++A ++ AG DLNCG + + A+ G +E+ +D ++
Sbjct: 294 DAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALLDKSLVR 352
Query: 318 NFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
FA RLG K PY +LG KDV + A++ LAL AA+Q IVLL+N +LPL P
Sbjct: 353 LFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNATLPLRPG 411
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LAVIGPNA+ + NY+GT TPL GL
Sbjct: 412 L--RLAVIGPNADALAALEANYQGTSAAPVTPLLGL 445
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR DL LP QQ L+ E AK + P+++++MSG +++AK W YPGQ+G
Sbjct: 667 DRNDLALPAAQQALL-ERAKASGKPLVVVLMSGSAVALNWAKQHADAIVAAW--YPGQSG 723
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA V G NP GRLP+T+Y
Sbjct: 724 GTAIAQVLAGDVNPGGRLPVTFY 746
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 152/439 (34%), Positives = 208/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ F+ +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 165/568 (29%), Positives = 260/568 (45%), Gaps = 84/568 (14%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGS----VSRLGIPKYEWWSEALHG 104
A F + +L D RV DLV+RLT++E + V G V RLG+ Y++ +E + G
Sbjct: 18 AKFPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITG 77
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-------G 157
V + +T+FPQ I ASF+ L + + ++ E R YN G
Sbjct: 78 VRW-----------ENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYG 126
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
G+ ++P INI R P WGR QET GEDP L+ + + G+VKGLQ G P ++ +
Sbjct: 127 HRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQ---GDHPRYIQASGG 183
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------- 251
CKH+ ++ R+ F+A V +L +FK
Sbjct: 184 CKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSINIMCSYNRINGVPA 243
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
Y++SD +++ + YTKT EAAA S+ AG ++
Sbjct: 244 CANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELTGA 303
Query: 293 LGK-----HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
G + AVK L++E + + + MR G FD P+ ++ V +
Sbjct: 304 TGSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEFD-PVDMNPFTKIDMSVVLS 362
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCK 405
Q +QDLA+ A+ VL+KN LPL LA+IGP A+ +T+ G+Y P
Sbjct: 363 QEHQDLAVKASAMSFVLMKNLNRVLPLK-KRFDRLAIIGPFADNAETLFGDYIPNWDPKF 421
Query: 406 YTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHD 464
+TP +GL ++ + Y +GC + C +KA A + +G ++E E HD
Sbjct: 422 VSTPYEGLKSLGDDVRYASGCDDPSCTNYDPKAIEKAVKGAQFVFVCLGVGSNLEREGHD 481
Query: 465 RLDLLLPGQQQLLITEVAKVAK-GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
R DL LPG Q ++ + ++ P++L++ + G D+++ K P++ I+ YP
Sbjct: 482 RADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIECFYPAMGT 541
Query: 524 GAAIADVCFGRYN---PSGRLPMTWYPQ 548
G A+ V + P+ RLP TW Q
Sbjct: 542 GKALYQVVTATGDDGVPAARLPSTWPAQ 569
>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 896
Score = 225 bits (573), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 150/430 (34%), Positives = 211/430 (49%), Gaps = 78/430 (18%)
Query: 55 NTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF 114
N + I RV +LV ++TLQE+ ++++A ++ RLG+P Y WWSE LHG++ G
Sbjct: 41 NPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSG----- 95
Query: 115 SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSP 166
AT FPQ I +A+F+ + +G VSTEARA YN + GLT W+P
Sbjct: 96 -----YATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTLWAP 150
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL 226
NINI RDPRWGRGQET GEDP L A YV GLQ G +P LK A KH++ Y
Sbjct: 151 NINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQ---GNNPKYLKTVATPKHFSVY-- 205
Query: 227 DNWKGTDRYHFN----AMVIYTYYLIKFK------------------------------- 251
N + R+ N A + YL F+
Sbjct: 206 -NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCANKLLAD 264
Query: 252 ----------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGKHTEA 299
YI SDC ++ Y + +P+ AAA ++LAG D +CG+ K
Sbjct: 265 VVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTGY-KVLPQ 323
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
+V+ GL++++AID A+ F RLG FD + PY + V + A++ AL+ A
Sbjct: 324 SVQQGLISKAAIDRAVERLFTARFRLGMFDPK-ADVPYNSIPYSVVDSAAHRAQALEDAS 382
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA- 418
+ +VLLKN G LPL + +AV+GPNA ++ GNY P + P+ G+ A
Sbjct: 383 KSMVLLKNEGGILPLR--NARTIAVVGPNAANLNSIEGNYNAIPSHPSLPVDGIEAAFPQ 440
Query: 419 --TIYQAGCS 426
+Y G S
Sbjct: 441 AHVVYAQGSS 450
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LP QQ L+ + K PV+L++++G I +AK + IL YPG+AG
Sbjct: 655 DRTRLSLPQTQQDLLHALVATGK-PVVLVLLNGSALSIDWAKQ--HVQGILEAWYPGEAG 711
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AI + G+ +P G+LP+T+Y
Sbjct: 712 GEAIGETLSGQNDPGGKLPITFY 734
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 225 bits (573), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 152/439 (34%), Positives = 208/439 (47%), Gaps = 82/439 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ DL+ RLT +EKI +++ ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 28 FRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG--- 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ + +VS EARA Y+ GLTFW
Sbjct: 85 -------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLTFW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINIFRDPRWGRG ET GEDP L K +GLQ G PN K AC KHY +
Sbjct: 138 TPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ---GDDPNYYKTHACAKHYAVH 194
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSDCDS--------------------- 259
W +R+ F+ +Y YL F+ +V + D
Sbjct: 195 SGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSDKL 251
Query: 260 -VDVLYNSQHY------------------TKTP--------EEAAAKSILAGLDLNCGSF 292
+D+L NS Y TP E A+A ++L G DL CG
Sbjct: 252 LIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECGGS 311
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ A+ G ++E +D ++ LG FD + PY ++ V + +
Sbjct: 312 Y-RALNKALADGKISEKDLDVSLRRLLKGRFELGMFDPD-ERVPYSKIPYSVVESPEHIA 369
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
ALD AR+ IVLLKN LPL IK +AV+GPNA + + NY G P K T ++G
Sbjct: 370 KALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVGPNAADSTMLWANYNGFPTKTVTIVEG 428
Query: 413 LAAVV---ATIYQAGCSNV 428
+ V IY+ GC++
Sbjct: 429 IRNKVPNAEVIYELGCNHT 447
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 68/124 (54%), Gaps = 15/124 (12%)
Query: 435 VDDAKKAAASADATVLVM---------GADQSIEAESH---DRLDLLLPGQQQLLITEVA 482
VD A+ DA V+V G + ++AE DR ++ +P Q+ ++ +
Sbjct: 593 VDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PV+ ++ +G +++ +ND + +IL Y GQ GG A+ADV FG YNP+GRLP
Sbjct: 653 ATGK-PVVYVVCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 ITFY 713
>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
Length = 882
Score = 225 bits (573), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 152/429 (35%), Positives = 212/429 (49%), Gaps = 72/429 (16%)
Query: 42 VVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEA 101
+ NP L + G+ + S + R DLV R+TL+EK L++ A ++ RL + +Y WW+E
Sbjct: 25 LAQNP-LDAPGYQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEG 83
Query: 102 LHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY------- 154
LHGV+ G AT FPQ + AA+++ L + + +S E RA Y
Sbjct: 84 LHGVAAAG----------YATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRF 133
Query: 155 --NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
+ GLT WSPNINIFRDPRWGRGQET GEDP L ++ +V+GLQ G P
Sbjct: 134 GGSDWFGGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ---GDDPVYY 190
Query: 213 KVAACCKHY-----------------TAYDL-------------DNWKGTDRYHFNAM-- 240
+ A KHY + YDL + G+ +NA+
Sbjct: 191 RTVATPKHYAVHSGPEAGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAING 250
Query: 241 -------VIYTYYLIK---FK-YIVSDCDSV-DVLYNSQH-YTKTPEEAAAKSILAGLDL 287
+ YL K FK Y+VSDCD+V D+ Y + H Y TPEE + G DL
Sbjct: 251 QPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDL 310
Query: 288 NCGSF-LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
CG+ H AV+ GL+ E +D A+ F +LG FD P + + ++ +D
Sbjct: 311 ICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFD--PPAKVFPKITAEDYD 368
Query: 347 TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 406
T AN+D + A +VLLKN LPL + +AVIGPNA+ +++GNY G P
Sbjct: 369 TPANRDFSQKVAESAMVLLKNENNLLPLKGEP-RQIAVIGPNADSMDSLVGNYNGDPSHP 427
Query: 407 TTPLQGLAA 415
T L G+ A
Sbjct: 428 VTVLSGIRA 436
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A AA AD V V G Q +E E DR L LP QQ ++ +V+ K
Sbjct: 599 AVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK- 657
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+L++++G I++A D + +I+ YPG GGAA+A + G Y+P+GRLP+T+Y
Sbjct: 658 PVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSPAGRLPVTFY 714
>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
Length = 861
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 152/432 (35%), Positives = 207/432 (47%), Gaps = 71/432 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + +L + R DL+ RLTL+EK+ + D++ V+RLG+ K+ WWSEALHGV+ G
Sbjct: 23 YKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQG--- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
G T FP+ I AASFN L + +S EARA ++ GL+
Sbjct: 80 -------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDNGLSV 132
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ V GLQ +L AC KHY
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYKKL--LACAKHYAV 190
Query: 224 YDLDNW-------KGTDRYHF------------------NAMVIY----------TYYLI 248
+ W D H M Y +L+
Sbjct: 191 HSGPEWNRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDDPCCGNNHLL 250
Query: 249 K--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA 299
K FK +VSDC ++ Y S + +A K +LAG D+ CG H
Sbjct: 251 KRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVECGFGYTYHELV 310
Query: 300 -AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
AV GL+ E+ ID ++ RLG FD + S P+ + + + +Q LAL+ A
Sbjct: 311 DAVSRGLIYEADIDKSVLRLLTERFRLGDFDDN-SIVPWANIPDTIINCKKHQALALEMA 369
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
RQ + LL+N LPLS + K +AVIGPNA+ K M GNY G P K T L+G+ ++
Sbjct: 370 RQSMTLLQNKNNILPLS--SKKKIAVIGPNADDAKLMWGNYNGIPVKTVTILEGIKSIAG 427
Query: 419 T--IYQAGCSNV 428
Y+ GC V
Sbjct: 428 KDIFYEKGCDIV 439
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 48/83 (57%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP Q+ I + K K VI++ SG I +IL Y GQ+G
Sbjct: 628 DRTDIELPASQRNCIKALKKAGK-RVIMVNCSGSA--IGLMPESESCEAILQAWYGGQSG 684
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA+V FG+YNPSG+LP+T+Y
Sbjct: 685 GQAIAEVLFGKYNPSGKLPITFY 707
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ G DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D P+ + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 55/117 (47%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K AD + G S+E E DR D+ LP Q+ L+ + KV K V
Sbjct: 593 KRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK-KV 651
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 652 VFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 224 bits (572), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
49814]
Length = 897
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 159/473 (33%), Positives = 219/473 (46%), Gaps = 105/473 (22%)
Query: 19 FLTYCSL-QHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKI 77
F T CS Q + +AQ++ + F + SL R DLV +TL+EK
Sbjct: 17 FATACSSSQTDTETAQTT--------EEAKSSEFRFMDPSLSPKERALDLVSHMTLEEKA 68
Query: 78 TFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNAS 137
+ D A ++ RLG+ +Y WW+EALHGV+ G AT FPQ I AA+++
Sbjct: 69 AQMYDKAAAIPRLGLHEYNWWNEALHGVARAG----------HATVFPQAIGMAATWDED 118
Query: 138 LFQAIGKVVSTEARAMYN--------VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL 189
L + V+S E RA ++ GLTFWSPNINIFRDPRWGRGQET GEDP L
Sbjct: 119 LMLEVANVISDEGRAKHHFYANEDVYAMYGGLTFWSPNINIFRDPRWGRGQETYGEDPYL 178
Query: 190 ASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-------LDNWKGTDRYHFNAMVI 242
+ A ++ GLQ G K A KHY + DN+ TD +
Sbjct: 179 TGRMAVNFINGLQ---GDDDKYFKSVATVKHYAVHSGPEPSRHRDNYIATD------ADL 229
Query: 243 YTYYLIKFK------------------------------------------YIVSDCDSV 260
Y YL FK Y+VSDC ++
Sbjct: 230 YETYLPAFKTAFDETEVASVMCAYNAVWGDPACGSERLMKDLLREELGFDGYVVSDCGAI 289
Query: 261 -DVLYNSQH------------YTKTPEEAAAKSILAGLDLNCGSFLGKHTEA---AVKAG 304
D Y+ + + T +AAA S+ G DLNCG G +A AVK G
Sbjct: 290 GDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSVNMGTDLNCGDGEGNKMDALPQAVKEG 349
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ E ID ++ ++ L +LG +D PS P+ + V + ++ + + +AAR +VL
Sbjct: 350 LITEETIDQSVVRLYSALFKLGMYD-DPSLVPWSNISIDTVASPSHLEKSEEAARASLVL 408
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
LKN G LPL P +AVIGPNA+ T++ NY G P T L+G+ A +
Sbjct: 409 LKND-GILPLKPDT--KVAVIGPNADNWWTLVANYYGQPTAPVTALKGIKAKI 458
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ L+ E+ K PV+L+ SG +++ D + +I+ YPG+
Sbjct: 659 DRTHINLPAPQEKLLKELHATGK-PVVLVNFSGSAMALNW--EDENLPAIVQAFYPGEKS 715
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIAD+ +G ++PSGRLP+T+Y
Sbjct: 716 GTAIADLLWGEFSPSGRLPVTFY 738
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 215/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ AG DL CGS
Sbjct: 249 NRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 54/117 (46%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K AD + G S+E E DR D+ LP Q+ L+ + K K V
Sbjct: 593 KRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-V 651
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 652 VFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 151/433 (34%), Positives = 213/433 (49%), Gaps = 79/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DLV RLTL+EK + + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 26 YKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM-----YNVGLA---GLTFW 164
AT FPQ I ASFN L + VS EARA GL GLT W
Sbjct: 84 --------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKRYQGLTMW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L + V+GLQ +GG ++L AC KH+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYDKLH--ACAKHFAVH 193
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ F+A + + YL FK
Sbjct: 194 SGPEW---NRHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNR 250
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
IVSDC +++ +N + P E A+AK++L G D+ CG
Sbjct: 251 LLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPDKEHASAKAVLTGTDVECGESYAS 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ AVKAGL++E ID ++ LG D +P + Q+ V ++ +++LAL
Sbjct: 311 LPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMD-NPELVSWAQIPYSVVDSKEHRELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AR+ +VLL+N LPL+ + +AV+GPNAN + GNY G P T L+G+
Sbjct: 369 RMARESLVLLQNNQNVLPLNKSL--KVAVVGPNANDSVMQWGNYNGFPGHTVTLLEGIRQ 426
Query: 416 VVAT---IYQAGC 425
+ IY+ GC
Sbjct: 427 YLPEAQLIYEPGC 439
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 56/112 (50%), Gaps = 13/112 (11%)
Query: 445 ADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
AD V G ++E E DR + LP Q L+ E+ K K V+ +
Sbjct: 600 ADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-KVVFVNF 658
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I+ +IL YPGQAGG AIA+V FG YNP+GRLP+T+Y
Sbjct: 659 SGSA--IALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFY 708
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 157/451 (34%), Positives = 214/451 (47%), Gaps = 85/451 (18%)
Query: 47 SLASLGFCN-------TSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
SL L CN +SL + R DL+++LTL+EK+ ++D++ V RLGI Y WW+
Sbjct: 11 SLVMLSGCNNQPAYKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWN 70
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM---YNV 156
EALHGV+ G AT FPQ I AASF I VS EARA Y+
Sbjct: 71 EALHGVARSGL----------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSA 120
Query: 157 G-----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
GLT W+P +NI+RDPRWGRG ET GEDP L S VKGLQ D
Sbjct: 121 AGSYERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYD 180
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------ 254
K+ AC KH+ + W +R+ FNA I + YL+ F+ +V
Sbjct: 181 -KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAY 236
Query: 255 ------------------------------SDCDSVDVLYNSQHYTKTP--EEAAAKSIL 282
SDC ++D Y + + P E A+A ++L
Sbjct: 237 NRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL 296
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+G DL CGS E+A K GL++E ID ++ LG D P K + ++
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPY 354
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
VC+ + L+LD AR+ + LL N LPL + +AV+GPNAN + GNY GT
Sbjct: 355 SVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGT 413
Query: 403 PCKYTTPLQGLAAVVAT----IYQAGCSNVQ 429
P T L+G+ + + IY+ GCS V+
Sbjct: 414 PKHTITLLEGIRSAMGENDKLIYEQGCSWVE 444
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 874
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 148/407 (36%), Positives = 206/407 (50%), Gaps = 73/407 (17%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
ID R+ +L+ ++T+ E+I L D A ++ RLG+P Y WW+E LHG++ G
Sbjct: 35 IDARIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDG---------- 84
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARA-MYNVG------LAGLTFWSPNINIFR 172
AT FPQ I AA+++A L +G VVSTEARA Y+ G GLT WSPNINIFR
Sbjct: 85 YATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPNINIFR 144
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGRGQET GEDP L + T +V+G+Q G P LK A KH+ A+ +
Sbjct: 145 DPRWGRGQETYGEDPFLTATLGTQFVEGVQ---GNDPFYLKADATPKHFAAH---SGPEE 198
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
R FNA+V + YL F
Sbjct: 199 GRDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGNNLQDLVRER 258
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLV 306
Y+VSDCD+V + H+ AA ++ AG+DL+CG+ ++ ++ L
Sbjct: 259 WGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNTYAALSK-SLDQNLT 317
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E+ ++ A+ +RLG D S PY +G +++ + A+ LAL AA + IVLLK
Sbjct: 318 TEAKLNQALHRLLLARVRLGMLDPL-SCSPYRDIGAEELDSPAHHTLALRAAEESIVLLK 376
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
N G LPL + K ++VIGP A++ K + NY GT TPL G
Sbjct: 377 ND-GVLPLQASTQK-VSVIGPTADMVKVLEANYHGTALHPITPLDGF 421
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 66/124 (53%), Gaps = 13/124 (10%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVA 482
A +D A + AA +D V +G +E E+ DR L LP Q+ L++ +
Sbjct: 593 ALLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLT 652
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
++ K PVI+++ SG G ++ +L YPG+AGG A+A + G NPSGRLP
Sbjct: 653 QLHK-PVIIVLTSGSG--VALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGRLP 709
Query: 543 MTWY 546
+T+Y
Sbjct: 710 VTFY 713
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 157/451 (34%), Positives = 214/451 (47%), Gaps = 85/451 (18%)
Query: 47 SLASLGFCN-------TSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
SL L CN +SL + R DL+++LTL+EK+ ++D++ V RLGI Y WW+
Sbjct: 11 SLVMLSGCNNQPAYKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWN 70
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM---YNV 156
EALHGV+ G AT FPQ I AASF I VS EARA Y+
Sbjct: 71 EALHGVARSGL----------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSA 120
Query: 157 G-----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
GLT W+P +NI+RDPRWGRG ET GEDP L S VKGLQ D
Sbjct: 121 AGSYERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYD 180
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------ 254
K+ AC KH+ + W +R+ FNA I + YL+ F+ +V
Sbjct: 181 -KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAY 236
Query: 255 ------------------------------SDCDSVDVLYNSQHYTKTP--EEAAAKSIL 282
SDC ++D Y + + P E A+A ++L
Sbjct: 237 NRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL 296
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+G DL CGS E+A K GL++E ID ++ LG D P K + ++
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPY 354
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
VC+ + L+LD AR+ + LL N LPL + +AV+GPNAN + GNY GT
Sbjct: 355 SVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGT 413
Query: 403 PCKYTTPLQGLAAVVAT----IYQAGCSNVQ 429
P T L+G+ + + IY+ GCS V+
Sbjct: 414 PKHTITLLEGIRSAMGENDKLIYEQGCSWVE 444
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 62/130 (47%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 157/451 (34%), Positives = 214/451 (47%), Gaps = 85/451 (18%)
Query: 47 SLASLGFCN-------TSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWS 99
SL L CN +SL + R DL+++LTL+EK+ ++D++ V RLGI Y WW+
Sbjct: 11 SLVMLSGCNNQPAYKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWN 70
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM---YNV 156
EALHGV+ G AT FPQ I AASF I VS EARA Y+
Sbjct: 71 EALHGVARSGL----------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSA 120
Query: 157 G-----LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
GLT W+P +NI+RDPRWGRG ET GEDP L S VKGLQ D
Sbjct: 121 AGSYERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYD 180
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------ 254
K+ AC KH+ + W +R+ FNA I + YL+ F+ +V
Sbjct: 181 -KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAY 236
Query: 255 ------------------------------SDCDSVDVLYNSQHYTKTP--EEAAAKSIL 282
SDC ++D Y + + P E A+A ++L
Sbjct: 237 NRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL 296
Query: 283 AGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+G DL CGS E+A K GL++E ID ++ LG D P K + ++
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPY 354
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
VC+ + L+LD AR+ + LL N LPL + +AV+GPNAN + GNY GT
Sbjct: 355 SVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGT 413
Query: 403 PCKYTTPLQGLAAVVAT----IYQAGCSNVQ 429
P T L+G+ + + IY+ GCS V+
Sbjct: 414 PKHTITLLEGIRSAMGENDKLIYEQGCSWVE 444
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 63/130 (48%), Gaps = 20/130 (15%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQL 476
N++ A+V DA D + G S+E E DR D+ LP Q+
Sbjct: 591 NIKNTVAKVKDA-------DVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRE 643
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI + K VI + SG I+ +IL YPGQ+GG A+A+V FG YN
Sbjct: 644 LIKALCDAGK-KVIFVNFSGS--PIAMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYN 700
Query: 537 PSGRLPMTWY 546
P+GRLP+T+Y
Sbjct: 701 PAGRLPVTFY 710
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 182/571 (31%), Positives = 260/571 (45%), Gaps = 92/571 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDS-AGS------VSRLGIPKYEWWSEALHGV 105
F + SL I+ RV D+V +L+L + + + AGS + + I Y+W +E L G
Sbjct: 29 FRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLSGD 88
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL------- 158
G ATSFP I AASFN L + + + E RA +
Sbjct: 89 VNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSYAF 138
Query: 159 -AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL+ WSP +NI RDPRWGR QET GEDP L+ +V GLQ G P + A
Sbjct: 139 HTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQ---GDDPTYVIANAG 195
Query: 218 CKHYTAY-----------------DLDNWKGTDRYHFNAMV-------IYTYYLIK---- 249
CKH+ + + +W+ T F A V + +Y I
Sbjct: 196 CKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGALSLMCSYNRINGVPA 255
Query: 250 ----------------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
FK Y+VSD +++ + HY AAA + AG L G+
Sbjct: 256 CANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAANAGTCLEDGNS 315
Query: 293 LGK------HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC 346
GK + + AV+ GLV+ + A+S F +LG FD + PY + +
Sbjct: 316 EGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPYANIPLSIIQ 375
Query: 347 TQANQDLALDAARQGIVLLKN-TAGS--LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ + L++ AA + IVL+KN GS LPL+ K V+GP TM G+Y T
Sbjct: 376 SDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADTMFGDYSPTM 435
Query: 404 CK--YTTPLQGLAAVVA----TIYQAGCSN-VQCGTAQVDDAKKAAASADATVLVMGADQ 456
TPL G+ Y+ GC++ C + A D ++ G +
Sbjct: 436 MTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRTACEGVDLVIVTAGLSR 495
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKG--PVILIIMSGGGFDISFAKNDPKITSIL 514
+E E HD D+ LPG Q L+T+ A+ A G P+IL++ + DIS+AK++P+ +IL
Sbjct: 496 YLEHEGHDISDIYLPGHQMSLLTD-AESASGSAPIILLLFNANPLDISYAKSNPRFAAIL 554
Query: 515 WVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
YPGQ G AIA+V G YNP+GRLP TW
Sbjct: 555 EAYYPGQEAGVAIANVLTGSYNPAGRLPNTW 585
>gi|116312033|emb|CAJ86398.1| OSIGBa0125M19.1 [Oryza sativa Indica Group]
Length = 247
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 112/200 (56%), Positives = 144/200 (72%), Gaps = 2/200 (1%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
++P+ ++ FCN +L R LV LTL EKI L ++A RLG+P +EWWSE+LH
Sbjct: 31 ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90
Query: 104 GVSYVGPGTHFSN-VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT 162
GV GPG +FS+ V AT FPQVIL+AA+FN SL++A + ++ EARAM+N G AGLT
Sbjct: 91 GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLT 150
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PNIN+FRDPRWGRGQETPGEDP + S Y+ YVKG Q+ D G R+ ++ACCKHY
Sbjct: 151 FWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQR-DYGEEGRMMLSACCKHYI 209
Query: 223 AYDLDNWKGTDRYHFNAMVI 242
AYDL+ W+G RY FNA V+
Sbjct: 210 AYDLEKWRGFTRYTFNAKVM 229
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 151/409 (36%), Positives = 201/409 (49%), Gaps = 77/409 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N + I RV DL+ +LT++EK++ L ++ + R+GI KY +EALHG+ + PG
Sbjct: 14 YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGI--IRPGK 71
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I A+ +N L I V+S EARA +N G LT
Sbjct: 72 F--------TVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLT 123
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G P LK A KH+
Sbjct: 124 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQ---GDHPRYLKAVATPKHFA 180
Query: 223 AYDLDNWKGTDRYHFNAMVIYT----YYLIKFK--------------------------- 251
A N + +R++ +A + T YY F+
Sbjct: 181 A----NNEEHNRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANN 236
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL-GK 295
YIVSDC + +L Y KTPE AA +I AGLD+ CG ++
Sbjct: 237 WLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFAN 296
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
A K +V+ + ID A MRLG FD P K PY L P+ V + + DLAL
Sbjct: 297 PLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFDD-PEKNPYNHLSPEIVGCKKHHDLAL 355
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+AARQ IVLLKN +LPL+ IK++AV+G NA G+Y GTP
Sbjct: 356 EAARQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSGTPV 402
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
DA K +D + VMG +QSIE E DR + LP QQ+ I E K A I+++++G
Sbjct: 580 DASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYK-ANPNTIVVLVAG 638
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I + D I +I+ YPG+ GG AIA+V FG YNP+GRLP+T+Y
Sbjct: 639 SSMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY 686
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 164/466 (35%), Positives = 229/466 (49%), Gaps = 83/466 (17%)
Query: 9 RAPKVSVLFLFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASL--GFCNTSLGIDLRVGD 66
R P V L LT SL + S ++PV VS S A+ + +T+L ID RV D
Sbjct: 3 RNPLNPVPILGLTLASLLFTGCSPDNNPV--PKPVSERSTANEQPAYMDTTLDIDTRVDD 60
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV R+ L EKI+ + + + ++ LGI +Y+WW+EALHGV+ G AT FPQ
Sbjct: 61 LVSRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAG----------KATVFPQ 110
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL--------AGLTFWSPNINIFRDPRWGR 178
I AA ++ I + VS EARA ++ + GLTFWSPNINIFRDPRWGR
Sbjct: 111 AIGMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGR 170
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-LDNWKGTDRYHF 237
GQET GEDP L + A Y+ GLQ G +P LK AA KH+ + + + +D Y
Sbjct: 171 GQETYGEDPYLTGELALPYISGLQ---GENPKYLKTAAMAKHFAVHSGPEKSRHSDNYIA 227
Query: 238 NAMVIYTYYLIKFK------------------------------------------YIVS 255
+ + YL F+ ++VS
Sbjct: 228 SPKDLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVS 287
Query: 256 DCDSVDVLY--NSQHYTKTPEEAAAKSILAGLDLNCG-----SFLGKHTEAAVKAGLVNE 308
DC ++ Y + H P AAA ++ +G DLNCG +F H A++ ++ +
Sbjct: 288 DCGAIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHF--ALQREMITQ 345
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ID ++ T +LG FD P Q PY ++ V +QA+ L AA + VLLKN
Sbjct: 346 DEIDQSVKRLMKTRFKLGMFD--PDDQVPYSKIPMDVVGSQAHLALTQKAAEKSFVLLKN 403
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+ G LPL ++ +A+IGPNA ++GNY G P K TPL G+
Sbjct: 404 S-GILPLKKSS--KVAIIGPNATNPTVLVGNYFGDPIKPVTPLDGI 446
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 60/97 (61%), Gaps = 6/97 (6%)
Query: 453 GADQSIEAESHD---RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPK 509
G + S+E E D R D+ LP Q+ L+ + K+ K P++L+ SG +++A N+
Sbjct: 636 GEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN-- 692
Query: 510 ITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ +IL YPG+A G A+A + +G +PSGRLP+T+Y
Sbjct: 693 VDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY 729
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 152/444 (34%), Positives = 214/444 (48%), Gaps = 81/444 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + +TSL + R DL+ RLTL+EK++ + +++ ++ RLGI +YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I ASFN SL + S EAR + GLT
Sbjct: 84 ----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLT 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L + V+GLQ + ++L AC KH+
Sbjct: 134 FWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLH--ACAKHFA 191
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ F+A I + YL FK
Sbjct: 192 VHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGS 248
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P E A+A ++ G DL CGS
Sbjct: 249 NRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECGSEY 308
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ AVKAGL++E ID ++ LG D + + ++ + ++ +Q L
Sbjct: 309 ASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSKEHQAL 364
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL AR+ +VLL+N LPL+ T +K +AV+GPNAN + GNY G P T L+ +
Sbjct: 365 ALRMARESLVLLQNKNNILPLN-THLK-VAVMGPNANDSVMQWGNYNGIPAHTVTLLEAV 422
Query: 414 AAVV---ATIYQAGCSNVQCGTAQ 434
A + IY+ GC V T Q
Sbjct: 423 RAKLPEGQIIYEPGCDRVDGKTLQ 446
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K AD + G S+E E DR D+ LP Q+ L+ + K K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ I SG I +IL YPGQAGG AI D +G YNP GRLP+T+Y
Sbjct: 651 -VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY 706
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 152/441 (34%), Positives = 214/441 (48%), Gaps = 86/441 (19%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
+ + L + + +L + R DL++RLT++EK++ + +++ + RLGI YEWW+EALHGV+
Sbjct: 24 TYSQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVA 83
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM---YNV-----GL 158
G AT FPQ I AASF+ +L + +S EARA +N
Sbjct: 84 RAGL----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRY 133
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
GLT W+PNINIFRDPRWGRGQET GEDP L S+ VKGLQ D N+L AC
Sbjct: 134 QGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLH--ACA 191
Query: 219 KHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------- 251
KH+ + W +R+ FNA I + YL FK
Sbjct: 192 KHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDP 248
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNC 289
+VSDC ++ + ++ + P+ A+A+++L G DL C
Sbjct: 249 CCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLEC 308
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG-HPSKQPYGQL-GPKDVCT 347
GS K TE A+KAG+++E I+ ++ LG + HP PY + PK C
Sbjct: 309 GSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPYSIVDSPKHRC- 366
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
LAL A + + LL+N LPL A +A+IGPNAN + GNY GTP +
Sbjct: 367 -----LALKMAHETMTLLQNKGKVLPLDKQA--RIAIIGPNANDSVMQWGNYNGTPSHTS 419
Query: 408 TPLQGLAA---VVATIYQAGC 425
T L + IY+ C
Sbjct: 420 TLLSAFRKRLPISHLIYEPVC 440
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D + G S+E E DR D+ P Q+ ++ + + K VIL+ S
Sbjct: 602 DIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGK-KVILVNFS 660
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G ++ +IL YPG+ GG AI +V FG YNP+GRLP+T+Y
Sbjct: 661 GSA--MALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFY 709
>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 155/433 (35%), Positives = 213/433 (49%), Gaps = 78/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DL+ RLTL+EK+T + + + + RLGI Y+WW+EALHGV+ G
Sbjct: 25 YRNPNLSPEERAEDLLGRLTLKEKVTLMQNESFPIERLGIAHYDWWNEALHGVARAGI-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAGLTF 163
AT FP + A+SF+ + I VS E RA Y+ V GLTF
Sbjct: 83 --------ATVFPITMGMASSFDDKAIEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L S+ VKGLQ ++L AC KHY
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPADAKYDKLH--ACAKHYAV 192
Query: 224 YDLDNWKGTDRYHFNAMVI-----YTYYLIKFK--------------------------- 251
+ K R+ FNA I + YL FK
Sbjct: 193 HSGPEAK---RHSFNAENISPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCGSN 249
Query: 252 ---------------YIVSDCDSV-DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGSFLG 294
+VSDC ++ D Y+ +H T K +A+A ++L+G DL CG
Sbjct: 250 RLLTQILRDDWGYKHVVVSDCGAISDFFYSDRHATHKNAADASAAAVLSGTDLECGIEYA 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
H + AVK GL++E I+ ++ LG D + P+ ++ V Q ++ +A
Sbjct: 310 -HLDKAVKEGLISEERINTSLFRLLKARFELGEMDDD-ALVPWSKISIDTVDCQMHKQMA 367
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
LD R+ +VLL N G LPL T + +AV+GPNAN + GNYEGTP T L+G+
Sbjct: 368 LDITRKSMVLLHNN-GVLPLGKTGAR-IAVMGPNANDSVMQWGNYEGTPSHTVTVLEGIR 425
Query: 415 AVVAT-IYQAGCS 426
+ IY+ GC
Sbjct: 426 NKIGNVIYEKGCE 438
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 57/117 (48%), Gaps = 14/117 (11%)
Query: 444 SADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVILI 492
AD + V G ++E E DR + LP + Q I + K A VI +
Sbjct: 598 EADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELP-EVQRNILKALKKAGKKVIFV 656
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
SG ++ +IL YPGQAGG A+AD+ FG YNPSG+LP+T+Y +
Sbjct: 657 NCSGSA--MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTFYKNT 711
>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1343
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 183/596 (30%), Positives = 267/596 (44%), Gaps = 107/596 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL-VDSAGSVSRLGIPKYEWWSEALHGVSYVGPG 111
+ +T R DLV R+TL EK L +SA ++ RLG+ +Y +WSE HGV+ +G
Sbjct: 49 YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108
Query: 112 THFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG--------- 160
++ +V G ATSFP S++ +L VS E R + L G
Sbjct: 109 SNRGDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGPS 168
Query: 161 ------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR-- 211
LTFW+PN+N+ RDP WGR E+ GEDP L S A +V G Q Q+ G
Sbjct: 169 ASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMTGQQQTPY 228
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------- 251
LKVAA KHY+ ++++ + T I YY +F
Sbjct: 229 LKVAATAKHYSLNNIEDSRHTGSSDTTDANIRDYYTKQFASLVRDAHVSGIMTSYNAVNG 288
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPE--------------- 274
Y SDC ++ +Y + + P
Sbjct: 289 TPSPADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPGWTSNGTSWTNNATG 348
Query: 275 ---EAAAK----SILAGLDLNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRL 325
AAA +I AG LNC G ++ AA+ GL++ +D ++ F M
Sbjct: 349 RQISAAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVVDATLTRLFTVRMET 408
Query: 326 GFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS------LPLSPTAI 379
G FD K Y ++ + + A+Q LA A IVLL+N A S LP+ P
Sbjct: 409 GEFD-PAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQNGAVSGTSAKLLPVDPAKT 467
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL-AAVVATIYQAGCSNVQCGT------ 432
++ ++G AN K +G Y G P +QG+ AAV A A + CGT
Sbjct: 468 DSVVIVGDLAN--KVTLGGYSGEPTHEVNAVQGITAAVQAANPSATVTFDACGTGTQITT 525
Query: 433 -AQVDDAKKAAASADATVLVM-GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
A A +AA + + VLV+ G+D S+ E++DR L LPG LI++V+ +
Sbjct: 526 PASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYDSLISQVSALGNPRTA 585
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
L++ + G +DI A+ D +I++ GY GQ+ G A+A V FG+ NP+G L TWY
Sbjct: 586 LVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQNPAGHLDFTWY 639
>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 902
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 155/426 (36%), Positives = 208/426 (48%), Gaps = 88/426 (20%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R DLV+R+TL EK L D A ++ RLG+P Y+ WSEALHGV+ G AT
Sbjct: 48 RAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAG----------HAT 97
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDP 174
FPQ I AA+++ + + +G V+STEAR YN GLTFWSPNINIFRDP
Sbjct: 98 VFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIFWGLTFWSPNINIFRDP 157
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGRGQET GEDP L K ++ G+Q D P K A KH+ + + R
Sbjct: 158 RWGRGQETYGEDPFLTGKMGIAFIDGVQGPDAAHP---KAVATSKHFAVHSGPE---SLR 211
Query: 235 YHFNAMV----IYTYYLIKFK--------------------------------------- 251
+ F+ V + YL F+
Sbjct: 212 HGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMGACANKMLLEEHLKQAWG 271
Query: 252 ---YIVSDCDSV-DVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFL-GKHTEA-AVKA 303
++VSDC ++ DV +Q + P+ AAA S+ AG DL+C + G +T A AV+
Sbjct: 272 FKGFVVSDCGAIMDV---TQGHKNAPDIVHAAAISLAAGTDLSCSIWEPGFNTLADAVRK 328
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
GLV E + A +A LG FD P P ++ V ++ ++ AL AA + IV
Sbjct: 329 GLVTEDMVTRAAERLYAARFELGMFD-EPGSNPNDKIDMSQVASEEHRAEALKAAEESIV 387
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQA 423
LLKN G LPL K +AVIGP A + ++ GNY G P + TPL G+ + Q
Sbjct: 388 LLKND-GLLPLK--NAKTIAVIGPTAELLASLEGNYNGQPVRPVTPLDGI------VKQF 438
Query: 424 GCSNVQ 429
G NV+
Sbjct: 439 GAENVR 444
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ L+ + K PV+++ +SG +++A +IL YPG G
Sbjct: 661 DRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWANQ--HAGAILQAWYPGVEG 717
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA G NP+GRLP+T+Y
Sbjct: 718 GTAIAKTLAGESNPAGRLPVTFY 740
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 152/438 (34%), Positives = 211/438 (48%), Gaps = 79/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L D RV DL++RLT++EK+ + D + + RLGI +Y WW+EALHGV+ G
Sbjct: 26 FQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAGL-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF+ + VS EARA +N L+ GLT W
Sbjct: 84 --------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGRYEGLTMW 135
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P INIFRDPRWGRG ET GEDP L + VKGLQ G ++L AC KH+ +
Sbjct: 136 TPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDKLH--ACAKHFAVH 193
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ F+A I Y YL F+
Sbjct: 194 SGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGDPCCGSDR 250
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + P+ A+A ++ +G DL+CGS
Sbjct: 251 LLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLDCGSSYKA 310
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
TE AVK GL+ E ID ++ RLG D S P+ ++ V ++A+ +AL
Sbjct: 311 LTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQ-SLVPWSKISYNVVASKAHNQIAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ I LL+N LPL +K +AV+GPNA + GNY GTP T L+G+ A
Sbjct: 369 DMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGNYNGTPANTITILEGIKA 427
Query: 416 VVA----TIYQAGCSNVQ 429
+ IY+ C V+
Sbjct: 428 ALGPKDKLIYEQACGLVE 445
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 64/124 (51%), Gaps = 13/124 (10%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVA 482
A ++ + K A AD V V G S+E E DR D+ LP Q+ + +
Sbjct: 591 ANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQRQFVKALK 650
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
+ K VI I SG I A +I+ YPGQAGG A+ADV FG+YNPSGRLP
Sbjct: 651 EAGK-RVIFINCSGS--PIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKYNPSGRLP 707
Query: 543 MTWY 546
+T+Y
Sbjct: 708 ITFY 711
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 154/444 (34%), Positives = 215/444 (48%), Gaps = 86/444 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KR+TL+EK++ + + + ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 24 YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ G GLTFW
Sbjct: 82 --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINI+RDPRWGRG ET GEDP L S VKGLQ GG+ K AC KHY +
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQ--GGGTGKYDKAHACAKHYAVH 191
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ F+A I + YL FK
Sbjct: 192 SGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGEPCCSNKQ 248
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC ++ Y H+ P A+A ++++G DL CG
Sbjct: 249 LLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECGGSYSS 308
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP----SKQPYGQLGPKDVCTQANQ 351
E AV+ GL++E I+ ++ +LG FD S+ PY + K+ T+
Sbjct: 309 LNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEHVTK--- 364
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
AL+ AR+ +VLL N +LPLS +I+ +AV+GPNAN + + NY G P K T L+
Sbjct: 365 --ALEMARKSMVLLTNKNHTLPLS-KSIRKVAVLGPNANDSVMLWANYNGFPTKSVTILE 421
Query: 412 GLAAVV---ATIYQAGCSNVQCGT 432
G+ + + Y+ GC V T
Sbjct: 422 GIKSKLPEGTVYYEKGCDYVNTQT 445
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 64/118 (54%), Gaps = 13/118 (11%)
Query: 442 AASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVIL 491
AA ADA + V G ++E E DR ++ LP Q ++ + K K PVI
Sbjct: 597 AAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PVIF 655
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++ SG + + + + +IL YPGQ GG A+ADV FG YNP+GRLP+T+Y S
Sbjct: 656 VLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/421 (35%), Positives = 208/421 (49%), Gaps = 73/421 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
V +C ++ L + N + + RV DL+ RLT++EKI+ L ++ + RLGI KY
Sbjct: 18 VMSCSLIHAQEL----YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYY 73
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
+EALHGV V PG T FPQ I AA++N L + + V+S EARA +N
Sbjct: 74 HGNEALHGV--VRPGRF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNE 123
Query: 157 GLAG----------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
G LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G
Sbjct: 124 LDQGREQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---G 180
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
P LK+ + KH+ A + ++ + + + YY F+
Sbjct: 181 DDPRYLKIVSTPKHFAANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 240
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+VSDC +L N+ Y KT E AA SI AG
Sbjct: 241 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAG 300
Query: 285 LDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
LDL CG ++ A K +V+++ ID A + M+LG FDG + PY ++ P
Sbjct: 301 LDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPS 359
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ ++ +Q +ALDAAR+ IVLLKN LPL+ +K++AV+G NA K G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINAG--KCEFGDYSGAP 417
Query: 404 C 404
Sbjct: 418 V 418
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 596 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVA 653
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D I +I+ YPG+ GG A+ADV FG YNP+GRLP+T+Y
Sbjct: 654 GS---SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY 702
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/421 (35%), Positives = 208/421 (49%), Gaps = 73/421 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
V +C ++ L + N + + RV DL+ RLT++EKI+ L ++ + RLGI KY
Sbjct: 15 VMSCSLIHAQEL----YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYY 70
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
+EALHGV V PG T FPQ I AA++N L + + V+S EARA +N
Sbjct: 71 HGNEALHGV--VRPGRF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNE 120
Query: 157 GLAG----------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
G LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G
Sbjct: 121 LDQGREQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---G 177
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
P LK+ + KH+ A + ++ + + + YY F+
Sbjct: 178 DDPRYLKIVSTPKHFAANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+VSDC +L N+ Y KT E AA SI AG
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAG 297
Query: 285 LDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
LDL CG ++ A K +V+++ ID A + M+LG FDG + PY ++ P
Sbjct: 298 LDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPS 356
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ ++ +Q +ALDAAR+ IVLLKN LPL+ +K++AV+G NA K G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINAG--KCEFGDYSGAP 414
Query: 404 C 404
Sbjct: 415 V 415
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 593 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVA 650
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D I +I+ YPG+ GG A+ADV FG YNP+GRLP+T+Y
Sbjct: 651 GS---SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY 699
>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 852
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/394 (37%), Positives = 199/394 (50%), Gaps = 69/394 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ LV+ A ++ RLGI KY +EALHGV V PG T
Sbjct: 39 RIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPGDF--------T 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------GLAG----LTFWSPNINIFR 172
FPQ I AA +N L I +S EAR + +AG LTFWSP +N+ R
Sbjct: 89 VFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLTFWSPTVNMAR 148
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ +VKGLQ G P LK + KH+ + ++ + +
Sbjct: 149 DPRWGRTPETYGEDPYLSGVLGVAFVKGLQ---GNHPRYLKTVSTPKHFAVNNEEHNRSS 205
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 206 CNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLIKNVLRGDWGFN 265
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS-FLGKHTEAAVKAGLVNES 309
YIVSDC + + + HY KT E AA ++ AGLDL CG+ G+ A + +V+E+
Sbjct: 266 GYIVSDCSAPEWMITKHHYVKTREAAATLAVKAGLDLECGNQVYGEGLLKAYRQYMVSEA 325
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M LG FD PS+ PY Q+ P V +A+QDLAL+AARQ +VLLKN
Sbjct: 326 DIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPSVVGCKAHQDLALEAARQSMVLLKNKD 384
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
LPL+P +K++AV+G +A + G+Y GTP
Sbjct: 385 NFLPLNPQKVKSIAVVGISAGHCE--FGDYSGTP 416
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 68/114 (59%), Gaps = 3/114 (2%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A+ DA K AA D TV V+G ++SIE E DR L LP QQ I E+ KV V+++
Sbjct: 592 ARYGDAGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVVVL 651
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ G ++ D + +IL YPG+ GG A+A+V FG YNP GRLP+T+Y
Sbjct: 652 V---AGSSLAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY 702
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/421 (35%), Positives = 208/421 (49%), Gaps = 73/421 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
V +C ++ L + N + + RV DL+ RLT++EKI+ L ++ + RLGI KY
Sbjct: 15 VMSCSLIHAQEL----YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYY 70
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
+EALHGV V PG T FPQ I AA++N L + + V+S EARA +N
Sbjct: 71 HGNEALHGV--VRPGRF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNE 120
Query: 157 GLAG----------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
G LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G
Sbjct: 121 LDQGREQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---G 177
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
P LK+ + KH+ A + ++ + + + YY F+
Sbjct: 178 DDPRYLKIVSTPKHFAANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+VSDC +L N+ Y KT E AA SI AG
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAG 297
Query: 285 LDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
LDL CG ++ A K +V+++ ID A + M+LG FDG + PY ++ P
Sbjct: 298 LDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDG-TERNPYTRISPS 356
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ ++ +Q +ALDAAR+ IVLLKN LPL+ +K++AV+G NA K G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINAG--KCEFGDYSGAP 414
Query: 404 C 404
Sbjct: 415 V 415
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 593 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVA 650
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D I +I+ YPG+ GG A+ADV FG YNP+GRLP+T+Y
Sbjct: 651 GS---SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY 699
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/436 (33%), Positives = 213/436 (48%), Gaps = 76/436 (17%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
FA +PS + + N SL +R DLV R+TL+EK + LV++A ++ RL +P Y W
Sbjct: 26 FAQSQTQSPSTPA--YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNW 83
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG 157
WSEALHGV+ V G T FP+ I A+F+ + + TE R +Y
Sbjct: 84 WSEALHGVA-----------VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEEN 132
Query: 158 --------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP 209
GL FW+PN+NIFRDPRWGRGQET GEDP L K +V G+Q G +P
Sbjct: 133 EKDGSSKIFHGLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQ---GDNP 189
Query: 210 NRLKVAACCKHYTAY------------DLDNWKGTDRYH--FNAMVIY--------TYYL 247
+V A KH+ + D+ D Y F A ++ +Y
Sbjct: 190 KYYRVIATPKHFDVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNA 249
Query: 248 IK--------------------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
I FK Y+VSDCD+V +Y+ Y T +AAA S+ G+D
Sbjct: 250 INGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMD 309
Query: 287 LNCGSFLGKHTE-------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
+C F + AV+ G +++ A+D A+ F ++LG FD PY
Sbjct: 310 NDCADFAQPKGDDDYKAYIDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYAD 368
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
++ + A++ A A + +VLLKN G+LPL P ++ ++AV+GP A+ T ++GNY
Sbjct: 369 TPHSELNSPAHRAYARKLADESMVLLKND-GTLPLKPGSVHSIAVVGPLADQTAVLLGNY 427
Query: 400 EGTPCKYTTPLQGLAA 415
G P + L+GL A
Sbjct: 428 NGVPTHTVSFLEGLRA 443
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 52/83 (62%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR +L +P ++ L+ VAK K PV++++M+G +++ ++L Y G+ G
Sbjct: 652 DRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSALAVNWISQ--HANAVLEAWYSGEEG 708
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
GAAIAD G+ +P+GRLP+T+Y
Sbjct: 709 GAAIADTLSGKNDPAGRLPVTFY 731
>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 908
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 154/436 (35%), Positives = 215/436 (49%), Gaps = 78/436 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L R DLV R+TL+EK +V+ A ++ RL +P Y++W+E LHGV+ G
Sbjct: 24 YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSG--- 80
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AA+++A L + IG V++TEARA N L GLTFW
Sbjct: 81 -------YATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINIFRDPRWGRGQET GEDP L ++ +++GLQ TD P KV A KH+ +
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTD---PKFYKVIATPKHFAVH 190
Query: 225 DLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------------- 251
+ R+ F+ ++ YL +F+
Sbjct: 191 ---SGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACGSKLL 247
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGKH 296
++ SDC ++D + + P E A ++LAG D NCGS K
Sbjct: 248 LVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGSTYRKL 307
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+ AVK+GL+ ES ID ++ F +RLG FD S PY Q+ V + AN +A
Sbjct: 308 GD-AVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSV-PYAQIPFSQVNSPANAAVAKR 365
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
AA + +VLLKN G LPL K +AVIGPN ++ GNY G P+ L +
Sbjct: 366 AAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDALRSA 424
Query: 417 VA---TIYQAGCSNVQ 429
++ +Y G V+
Sbjct: 425 LSGTNVVYAPGAPYVE 440
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 65/120 (54%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
+A +AA +D V ++G +E E DR D+ LP QQ L+ + K
Sbjct: 623 EALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLIATGK 682
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P I+++++G I+ A D K +IL YPG+AG A+AD GR NPSGRLP+T+Y
Sbjct: 683 -PTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLPITFY 739
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 209/443 (47%), Gaps = 82/443 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL RLTL+EK + +S+ ++ RLGIP++EWWSEALHG++ G
Sbjct: 25 YQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG--- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ AAS++ L + S EA A N+ G++ W
Sbjct: 82 -------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--------QTDGGSPNRLKVAA 216
+PNINIFRDPRWGRGQET GEDP L S+ V GLQ + P K A
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLA 194
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV-----IYTYYLIKFK-------------------- 251
C KHY + W +R+ F+ ++ YL FK
Sbjct: 195 CAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDG 251
Query: 252 ----------------------YIVSDCDSVDVLYNSQHY--TKTPEEAAAKSILAGLDL 287
+VSDC ++ Y H+ +TP EA+A + AG D+
Sbjct: 252 SPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDV 311
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
CG+ AV+ GL++ AID ++ +G FD P+ GP+ + +
Sbjct: 312 ECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSE-KLVPWKLTGPEVIAS 369
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+ ++ LALD AR+ + LL+N LPLS ++ +AV+GPNAN + + GNY G P T
Sbjct: 370 ETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTT 428
Query: 408 TPLQGLAAVV-ATIYQAGCSNVQ 429
T L+G+ + V A + GC ++
Sbjct: 429 TILKGIRSKVPAARFVEGCGYIR 451
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 427 NVQCGTAQVDDAKKA----------AASADATVLVMGADQSIEAESH----------DRL 466
N Q D A+K+ A AD V V G +E E DR
Sbjct: 583 NKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRT 642
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ LP Q+ +I + + K V+ + SGG ++ ++L Y G+AGG A
Sbjct: 643 SIELPEAQREVIRLLRQAGK-LVVFVNCSGGA--VALVPEAEACDAVLQAWYAGEAGGQA 699
Query: 527 IADVCFGRYNPSGRLPMTWY 546
+ADV FG YNPSG+LP+T+Y
Sbjct: 700 VADVLFGDYNPSGKLPVTFY 719
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 209/443 (47%), Gaps = 82/443 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL RLTL+EK + +S+ ++ RLGIP++EWWSEALHG++ G
Sbjct: 25 YQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG--- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ AAS++ L + S EA A N+ G++ W
Sbjct: 82 -------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--------QTDGGSPNRLKVAA 216
+PNINIFRDPRWGRGQET GEDP L S+ V GLQ + P K A
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLA 194
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV-----IYTYYLIKFK-------------------- 251
C KHY + W +R+ F+ ++ YL FK
Sbjct: 195 CAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDG 251
Query: 252 ----------------------YIVSDCDSVDVLYNSQHY--TKTPEEAAAKSILAGLDL 287
+VSDC ++ Y H+ +TP EA+A + AG D+
Sbjct: 252 SPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDV 311
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
CG+ AV+ GL++ AID ++ +G FD P+ GP+ + +
Sbjct: 312 ECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSE-KLVPWKLTGPEVIAS 369
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+ ++ LALD AR+ + LL+N LPLS ++ +AV+GPNAN + + GNY G P T
Sbjct: 370 ETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTT 428
Query: 408 TPLQGLAAVV-ATIYQAGCSNVQ 429
T L+G+ + V A + GC ++
Sbjct: 429 TILKGIRSKVPAARFVEGCGYIR 451
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 427 NVQCGTAQVDDAKKA----------AASADATVLVMGADQSIEAESH----------DRL 466
N Q D A+K+ A AD V V G +E E DR
Sbjct: 583 NKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRT 642
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ LP Q+ +I + + K V+ + SGG ++ ++L Y G+AGG A
Sbjct: 643 SIELPEAQREVIRLLRQAGK-LVVFVNCSGGA--VALVPEAEACDAVLQAWYAGEAGGQA 699
Query: 527 IADVCFGRYNPSGRLPMTWY 546
+ADV FG YNPSG+LP+T+Y
Sbjct: 700 VADVLFGDYNPSGKLPVTFY 719
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 159/454 (35%), Positives = 215/454 (47%), Gaps = 78/454 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
+ AC VVS+ SLA + + S R DLVK+LTL+EK+ ++D++ V RLGI Y
Sbjct: 8 LLACLVVSSGSLAQSIYKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYN 67
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
WW+EALHGV+ G AT FPQ I AASF+ VS EARA
Sbjct: 68 WWNEALHGVARSG----------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAA 117
Query: 157 GLA--------GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
A GLT W+P +NI+RDPRWGRG ET GEDP LAS VKGLQ D
Sbjct: 118 YSAEGSYKRYQGLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENE 177
Query: 209 PNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK------------ 251
KV AC KH+ + W +R+ FNA I Y YL F+
Sbjct: 178 KYD-KVHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEVM 233
Query: 252 ------------------------------YIVSDCDSVDVLYNSQ-HYTK-TPEEAAAK 279
+V+DC ++ +N + H T A++
Sbjct: 234 CAYNRFEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSA 293
Query: 280 SILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
++L+G DL CGS TE VK G ++E+ ID ++ LG D P + + Q
Sbjct: 294 AVLSGTDLECGSNYRSLTE-GVKKGFIDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQ 351
Query: 340 LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
+ VC+ + L+LD AR+ + LL N +LPL +AV+GPNAN + GNY
Sbjct: 352 IPYSVVCSDKHDSLSLDMARKSMTLLLNKNNALPLERGGT-TIAVMGPNANDSVMQWGNY 410
Query: 400 EGTPCKYTTPLQGLAAVVAT----IYQAGCSNVQ 429
G P + T L G+ + + IY+ GCS V+
Sbjct: 411 NGLPKRTITILDGIRSAMGKDDKLIYEQGCSWVE 444
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 62/133 (46%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAAS---ADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +++ A AD + G +E E DR D+ LP
Sbjct: 581 NFDLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAV 640
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ +I + K +I ++ G I+ +IL YPGQ+GG A+A+V FG
Sbjct: 641 QREMIKALHDAGKK---VIFVNCSGSPIAMEPETEYCQAILQAWYPGQSGGKAVAEVLFG 697
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP T+Y
Sbjct: 698 DYNPAGRLPATFY 710
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 153/455 (33%), Positives = 217/455 (47%), Gaps = 87/455 (19%)
Query: 47 SLASLGFCNTSLGID------LRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
+LA +G CNT + D R DL+K LTL+EK++ +VD ++ RLGI +Y WW+E
Sbjct: 15 TLAGVG-CNTEIWKDNSYSPEERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNE 73
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV---- 156
ALHGV+ G AT FPQ + AA+F+ + + S EARA ++
Sbjct: 74 ALHGVARAGQ----------ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKER 123
Query: 157 ----GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
GLT W+PNIN+FRDPRWGRG E GEDP + T VKGLQ G ++L
Sbjct: 124 GERGRYQGLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDRSGKYDKL 183
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------- 251
AC KHY + W +R+ FNA I + YL FK
Sbjct: 184 H--ACAKHYAVHSGPEW---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYN 238
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILA 283
+VSDC +++ +N + P+ A+ ++LA
Sbjct: 239 RFEGEPCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLA 298
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGP 342
G DLNCG E AV+ GL+ E +D ++ LG D P ++ + ++
Sbjct: 299 GTDLNCGDSYPSLVE-AVEQGLITEEQLDISLRRLLIARFELGEMD--PDEEVEWSKIPH 355
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
V + + ++AL+AAR+ + LL N G+LPL + +AV+GPNAN + GNY GT
Sbjct: 356 SVVSSPTHSEMALEAARKSMTLLMNKNGALPLKKEGL-TVAVMGPNANDSLMQWGNYNGT 414
Query: 403 PCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDD 437
P TT LQG+ + Q + GT VDD
Sbjct: 415 PATTTTILQGIRNALGNDDQV---IYEQGTQWVDD 446
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 68/126 (53%), Gaps = 15/126 (11%)
Query: 433 AQVDDAKKAAASADATVLVMGADQS--IEAESH----------DRLDLLLPGQQQLLITE 480
A+VD A ADA V+V + S +E E DR D+ LP Q+ ++
Sbjct: 589 AKVDIPSSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKA 648
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
+ K K +IL+ SG I F + ++IL YPGQAGG A+A+V FG YNP+GR
Sbjct: 649 LHKAGK-EIILVNCSGSA--IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGR 705
Query: 541 LPMTWY 546
LP+T+Y
Sbjct: 706 LPVTFY 711
>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
CL02T12C01]
Length = 862
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 151/433 (34%), Positives = 214/433 (49%), Gaps = 78/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L + R DL+ RLTL+EK+ + + + V RLGI Y+WW+EALHGV+ G
Sbjct: 24 YKNPDLSPEQRAEDLLGRLTLEEKVRLMQNESFPVERLGIAHYDWWNEALHGVARAGI-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAGLTF 163
AT FP + A++F+ L + + VVS EARA Y+ V GLTF
Sbjct: 82 --------ATVFPITMGMASTFDDKLVEDVFTVVSDEARAKYHNAHREGRRGVRCEGLTF 133
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ V GLQ ++L AC KHY
Sbjct: 134 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGPADAKYDKLH--ACAKHYAV 191
Query: 224 YDLDNWKGTDRYHFNAMVI-----YTYYLIKFK--------------------------- 251
+ K R++FNA I + YL FK
Sbjct: 192 HSGPEAK---RHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPCCGSN 248
Query: 252 ---------------YIVSDCDSV-DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGSFLG 294
+VSDC ++ D Y +H T K +A+A ++L+G DL CG
Sbjct: 249 RLLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECGIEYA 308
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
H E AVK GL++E I+ ++ LG D + P+ ++ V + ++ +A
Sbjct: 309 -HLEEAVKKGLISEERINTSLRRLLKARFELGEMDDD-ALVPWSKISIDTVDCETHKQMA 366
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
LD R+ +VLL N G LPL+ T + +AV+GPNA + GNY+GTP +T L+G+
Sbjct: 367 LDVTRKSMVLLHNN-GVLPLAKTGTR-IAVMGPNAVDSVMQWGNYKGTPSHTSTILEGIR 424
Query: 415 AVVATI-YQAGCS 426
+ + Y+ GC
Sbjct: 425 NKIGNVPYEKGCE 437
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 14/118 (11%)
Query: 443 ASADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVIL 491
+ AD + V G +E E DR + LP Q+ ++ + + K VI
Sbjct: 596 SDADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKALKQAGK-KVIF 654
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ SG ++ +IL YPGQAGG A+ADV FG YNPSG+LP+T+Y +
Sbjct: 655 VNCSGSA--VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTFYKNT 710
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 209/443 (47%), Gaps = 82/443 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL RLTL+EK + +S+ ++ RLGIP++EWWSEALHG++ G
Sbjct: 25 YQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG--- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FPQ AAS++ L + S EA A N+ G++ W
Sbjct: 82 -------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIKRYQGVSIW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--------QTDGGSPNRLKVAA 216
+PNINIFRDPRWGRGQET GEDP L S+ V GLQ + P K A
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLA 194
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMV-----IYTYYLIKFK-------------------- 251
C KHY + W +R+ F+ ++ YL FK
Sbjct: 195 CAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDG 251
Query: 252 ----------------------YIVSDCDSVDVLYNSQHY--TKTPEEAAAKSILAGLDL 287
+VSDC ++ Y H+ +TP EA+A + AG D+
Sbjct: 252 SPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDV 311
Query: 288 NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT 347
CG+ AV+ GL++ AID ++ +G FD P+ GP+ + +
Sbjct: 312 ECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSE-KLVPWKLTGPEVIAS 369
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
+ ++ LALD AR+ + LL+N LPLS ++ +AV+GPNAN + + GNY G P T
Sbjct: 370 ETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTT 428
Query: 408 TPLQGLAAVV-ATIYQAGCSNVQ 429
T L+G+ + V A + GC ++
Sbjct: 429 TILKGIRSKVPAARFVEGCGYIR 451
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 427 NVQCGTAQVDDAKKA----------AASADATVLVMGADQSIEAESH----------DRL 466
N Q D A+K+ A AD V V G +E E DR
Sbjct: 583 NKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRT 642
Query: 467 DLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAA 526
+ LP Q+ +I + + K V+ + SGG ++ ++L Y G+AGG A
Sbjct: 643 SIELPEAQREVIRLLRQAGK-LVVFVNCSGGA--VALVPETEACDAVLQAWYAGEAGGQA 699
Query: 527 IADVCFGRYNPSGRLPMTWY 546
+ADV FG YNPSG+LP+T+Y
Sbjct: 700 VADVLFGDYNPSGKLPVTFY 719
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 154/448 (34%), Positives = 207/448 (46%), Gaps = 101/448 (22%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N L + R DLV RLTL+EK + + +++ ++ RLGI Y+WW+EALHGV G
Sbjct: 25 LPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGI 84
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------VGLAGLT 162
AT FPQ I AASF+ L + VS EARA Y GLT
Sbjct: 85 ----------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQGLT 134
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FW+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ + ++L AC KH+
Sbjct: 135 FWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYDKLH--ACAKHFA 192
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ FNA I + Y+ FK
Sbjct: 193 VHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPCCGN 249
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFL 293
+VSDC ++ + + P++ A+A ++L+G DL CGS
Sbjct: 250 NRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLECGSNY 309
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC------- 346
K AVKAGL+ ES +D ++ LG D KDVC
Sbjct: 310 -KSLPEAVKAGLIAESQLDISVKRLLKARFELGEMD-------------KDVCWDTIPYS 355
Query: 347 ---TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
QA++DLAL AR+ IVLL+N LPL +A++GPNAN + GNY G P
Sbjct: 356 VVDCQAHKDLALRMARESIVLLQNRNNILPLRKDM--KIALVGPNANDSIMHWGNYNGFP 413
Query: 404 CKYTTPLQGLAAVVAT---IYQAGCSNV 428
T + L + IY+ GC
Sbjct: 414 SHTETLYEALKKRLPASQLIYEFGCDRT 441
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 66/123 (53%), Gaps = 6/123 (4%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH---DRLDLLLPGQQQLLITEVAK 483
N+Q +V DA + + + G + ++AE DR + LP Q+ L+ E+ K
Sbjct: 590 NLQATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKK 649
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
+ K P++ I SG + A ++ YPGQAGG AIADV FG YNP+G+LP+
Sbjct: 650 LGK-PIVFINYSGSA--MGLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPV 706
Query: 544 TWY 546
T+Y
Sbjct: 707 TFY 709
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 156/435 (35%), Positives = 205/435 (47%), Gaps = 82/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N L ++R DL+ RLTL EK + +++ ++ RLGI YEWW+EALHGV G
Sbjct: 22 LPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRSGV 81
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-----GLA---GLT 162
AT FPQ I AASFN L VS EARA N GL GLT
Sbjct: 82 ----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQGLT 131
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
+W+PN+NIFRDPRWGRGQET GEDP L S VKGLQ D ++L AC KH+
Sbjct: 132 YWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLH--ACAKHFA 189
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ FNA I + YL FK
Sbjct: 190 VHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFEDEPCCGS 246
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFL 293
+VSDC ++ Y + P+ AAA ++L G DL CGS
Sbjct: 247 NRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDLECGSDF 306
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
++ AVKAGL+ E ID ++ L++ F G + + V ++ +Q+L
Sbjct: 307 -RNLPEAVKAGLIEEKRIDVSLKR----LLKARFELGEMNSDQVWPISYSVVNSEKHQNL 361
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL A + IVLL+N LPLS +AV+GPNAN + GNY G P T L+ +
Sbjct: 362 ALRMAEESIVLLQNNNNILPLSKKL--KIAVMGPNANDSVMQWGNYNGFPAHTVTLLEAM 419
Query: 414 AAV---VATIYQAGC 425
IY+ GC
Sbjct: 420 RKSFPGAQLIYEPGC 434
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 55/112 (49%), Gaps = 13/112 (11%)
Query: 445 ADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
AD V G S+E E DR D+ LP Q+ L+ + K A V+ +
Sbjct: 595 ADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLL-QALKDAGKKVVFVNF 653
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG + +IL YPGQAGG A+A+V G YNPSGRLP+T+Y
Sbjct: 654 SGSA--MGLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVTFY 703
>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
Length = 851
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 146/394 (37%), Positives = 198/394 (50%), Gaps = 69/394 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ LV+ A ++ RLGI KY +EALHGV V PG T
Sbjct: 39 RIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPGDF--------T 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------GLAG----LTFWSPNINIFR 172
FPQ I AA +N L I +S EAR + +AG LTFWSP +N+ R
Sbjct: 89 VFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLTFWSPTVNMAR 148
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ +VKGLQ G P LK + KH+ + ++ + +
Sbjct: 149 DPRWGRTPETYGEDPYLSGVLGVAFVKGLQ---GDHPRYLKTVSTPKHFAVNNEEHNRSS 205
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 206 CNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTYLIKNVLRGDWGFN 265
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGS-FLGKHTEAAVKAGLVNES 309
YIVSDC + + + HY KT E AA ++ GLDL CG+ G+ A + +V+E+
Sbjct: 266 GYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGNQVYGEGLLKAYRQYMVSEA 325
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M LG FD PS+ PY Q+ P V +A+QDLAL+AARQ +VLLKN
Sbjct: 326 DIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQDLALEAARQSMVLLKNKD 384
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
LPL+P +K++AV+G +A + G+Y GTP
Sbjct: 385 NFLPLNPKKVKSIAVVGISAGHCE--FGDYSGTP 416
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 72/124 (58%), Gaps = 6/124 (4%)
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVA 482
AG NV A+ DA K AA D TV V+G ++SIE E DR L LP QQ I E+
Sbjct: 584 AGSQNV---LARFGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELY 640
Query: 483 KVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
KV V++++ G ++ D + +IL YPG+ GG A+A+V FG YNP GRLP
Sbjct: 641 KVNPNTVVVLV---AGSSMAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLP 697
Query: 543 MTWY 546
+T+Y
Sbjct: 698 LTYY 701
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 176/566 (31%), Positives = 272/566 (48%), Gaps = 94/566 (16%)
Query: 43 VSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSE 100
++N L + N +L ++ RV DL+ +T+++K+ L + G + LG+P E
Sbjct: 160 IANVPLKERAYMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VE 218
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
A+HG SY G+ GAT FPQ I A++N L +A + E V
Sbjct: 219 AIHGFSY---GS-------GATIFPQSIGMGATWNKRLIEAAAMAIGDET-----VSANA 263
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
+ WSP +++ +D RWGR +ET GEDP+L ++ ++KG Q + KH
Sbjct: 264 VQAWSPVLDVAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQSKG--------LMTTPKH 315
Query: 221 YTAY-------DLDNWKGTDRYHFNAMVI--------YTYYLIKFKY------------- 252
+ A+ D + ++R ++ Y Y I Y
Sbjct: 316 FAAHGAPLGGRDSHDIGLSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKE 375
Query: 253 ---------------IVSDCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCG-SFLGK 295
IVSDC ++ L +HYT + EAA +++ AG+ NCG ++
Sbjct: 376 LLKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDP 435
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLA 354
AA K G +N +D TL R G F+ +P K + ++ P + +Q LA
Sbjct: 436 DVIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYP-GWNSPEHQALA 494
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP--CKYTTPLQG 412
A++ IVLL+N LPLS ++K +AVIGP A+ + G+Y P + + L G
Sbjct: 495 RKTAQESIVLLENKGNILPLS-KSLKTIAVIGPGADNLQP--GDYTSKPQPGQLKSVLTG 551
Query: 413 LAAVVAT----IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-------- 460
+ A V + +Y+ GC + + A KAA +AD VLV+G + EA
Sbjct: 552 IKAAVNSSTKVLYEEGCRFIGTEGTDIAKAVKAAENADVAVLVLGDCSTSEALKGITNTS 611
Query: 461 -ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E+HD L+LPG+QQ L+ V K K PV+LI+ +G +++S+A + + + W+ P
Sbjct: 612 GENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNWL--P 668
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
GQ GG A ADV FG YNP+GRLPMT+
Sbjct: 669 GQEGGYATADVLFGDYNPAGRLPMTF 694
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 214/444 (48%), Gaps = 86/444 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KR+TL+EK++ + + + ++ RLGIP Y+WW+EALHGV+ G
Sbjct: 24 YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ G GLTFW
Sbjct: 82 --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINI+RDPRWGRG ET GEDP L S VKGLQ GG+ K AC KHY +
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQ--GGGTGKYDKAHACAKHYAVH 191
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ F+A I + YL FK
Sbjct: 192 SGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGEPCCSNKQ 248
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC ++ Y H+ P A+A ++++G DL CG
Sbjct: 249 LLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECGGSYSS 308
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP----SKQPYGQLGPKDVCTQANQ 351
E AV+ GL++E I+ ++ +LG FD S+ PY + K+ +
Sbjct: 309 LNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEHVAK--- 364
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
AL+ AR+ +VLL N +LPLS +I+ +AV+GPNAN + + NY G P K T L+
Sbjct: 365 --ALEMARKSMVLLTNKNHTLPLS-KSIRKVAVLGPNANDSVMLWANYNGFPTKSVTILE 421
Query: 412 GLAAVV---ATIYQAGCSNVQCGT 432
G+ + + Y+ GC V T
Sbjct: 422 GIKSKLPEGTVYYEKGCDYVNTQT 445
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 64/118 (54%), Gaps = 13/118 (11%)
Query: 442 AASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVIL 491
AA ADA + V G ++E E DR ++ LP Q ++ + K K PVI
Sbjct: 597 AAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PVIF 655
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++ SG + + + + +IL YPGQ GG A+ADV FG YNP+GRLP+T+Y S
Sbjct: 656 VLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 154/441 (34%), Positives = 214/441 (48%), Gaps = 88/441 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L D R DL+ R+TL+EK + ++ G + RLG+ Y+WW+EALHG++ G
Sbjct: 26 YKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNEALHGIARAGK-- 83
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT FPQ I AA+F+ + + +VS E RA MYN G GLTF
Sbjct: 84 --------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQRKGMYN-GYKGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRG ET GEDP L +K VKGLQ G+ K AC KHY
Sbjct: 135 WTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGD--GTQKYDKAHACAKHYAV 192
Query: 224 YDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIVSD------------------CDSV 260
+ W +R+ +NA I YL FK +V++ C +
Sbjct: 193 HSGPEW---NRHSYNAENISIRDLRETYLPAFKALVTEGKVKEVMCAYNRFEGEPCCSNK 249
Query: 261 DVLYN-------------------SQHYTKTPEEA-------AAKSILAGLDLNCGSFLG 294
+L N + YTK E +A ++++G DL CG
Sbjct: 250 TLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASADAVISGTDLECGGSYW 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP----SKQPYGQLGPKDVCTQAN 350
E A++ GL+ E+ I+ ++ LG FD S PY VC +
Sbjct: 310 ALDE-ALEKGLITETKINESVFRLLRARFELGMFDDDSLVSWSSIPYSV-----VCCDKH 363
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
+ AL+ AR+ +VLL N +LPLS +IK +AV+GPNAN + + NY GTP + T L
Sbjct: 364 KAKALEMARKSMVLLSNKNNTLPLS-KSIKKVAVMGPNANDSVMLWANYNGTPDRSVTIL 422
Query: 411 QGLAAVV---ATIYQAGCSNV 428
+G+ A + + IY+ GC V
Sbjct: 423 EGIKAKLPEGSVIYEKGCDYV 443
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 75/150 (50%), Gaps = 19/150 (12%)
Query: 416 VVATIYQAGCS---NVQCGTAQVDDAKKAAAS---ADATVLVMGADQSIEAESH------ 463
++ YQ G G ++ D K A ADA + V G S+E E
Sbjct: 567 ILIEYYQGGGKGALKFDVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPG 626
Query: 464 ----DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
DR ++ LP Q+ ++ + + K PVI ++ SG +S+ D + +IL YP
Sbjct: 627 FRNGDRTNIDLPQVQKNMMKALKETGK-PVIFVLCSGSTMALSW--EDKNMDAILQAWYP 683
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
GQ GG A+ADV FG YNP+GRLP+T+Y S
Sbjct: 684 GQEGGTAVADVLFGDYNPAGRLPLTFYASS 713
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 149/419 (35%), Positives = 209/419 (49%), Gaps = 70/419 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N + + RV DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV V PG
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGR 84
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L Q + V+S EARA +N G LT
Sbjct: 85 F--------TVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLT 136
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G P LK+ + KH+
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---GEDPRYLKIVSTPKHFV 193
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YY F+
Sbjct: 194 ANNEEHNRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAWLLQ 253
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG ++
Sbjct: 254 KVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGDDVYDEYLLN 313
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A K +V+++ ID A + A M+LG FD + PY ++ P + ++ +Q +ALDAAR
Sbjct: 314 AYKQYMVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVALDAAR 372
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC-KYTTPLQGLAAVV 417
+ IVLLKN LPL+ +K++AV+G NA + G+Y G P + + LQG+ V
Sbjct: 373 ECIVLLKNQKNMLPLNVDKLKSIAVVGINAGTCE--FGDYSGAPVIEPVSVLQGIKNRV 429
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 69/112 (61%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + + V VMG ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 593 EAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVA 650
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D I +I+ YPG+ GG A+ADV FG YNP+GRLP+T+Y
Sbjct: 651 GS---SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY 699
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 149/405 (36%), Positives = 210/405 (51%), Gaps = 69/405 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T QEKI +++A ++ RLGIP Y+WWSE LHG++ G AT FPQ
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWG 177
I AAS+N L Q +G V STEARA +N+ AGLT WSPNINIFRDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY---------------- 221
RG ET GEDP L S+ A +++GLQ G +P+ + A KH+
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQ---GDTPDHPRTIATPKHFAVHSGPEQGRHSFDVD 203
Query: 222 -TAYDL-------------DNWKGTDRYHFNAM----VIYTYYLIKFK---------YIV 254
+AYDL D G+ +NA+ + +L+ + ++V
Sbjct: 204 VSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVV 263
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ + + + A+A ++ +G DLNCG+ + A+ G ++ES +D A
Sbjct: 264 SDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGNTY-RDLNQAIARGDIDESTLDQA 322
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ F RLG PY +G K + T A++ LAL AA Q +VLLKN+ +LPL
Sbjct: 323 LIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNSGNTLPL 381
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
P LAV+GP+A+ + NY+GT TPL GL T
Sbjct: 382 PPET--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGT 424
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 68/125 (54%), Gaps = 13/125 (10%)
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEV 481
T Q+ +A++A A ADA V +G +E E DR + LP Q+ L+ V
Sbjct: 598 TPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV 657
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K P+I+++MSG +++A++ +IL YPGQ+GG AIA G NP GRL
Sbjct: 658 KTTGK-PLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRL 714
Query: 542 PMTWY 546
P+T+Y
Sbjct: 715 PVTFY 719
>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 862
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 211/442 (47%), Gaps = 71/442 (16%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+L + N L R DLVKRLTL+EK + D + ++ RLGI K+ WWSEALHGV+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAG 160
T FP+ + AASFN L I VS E RA +N V
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + + K+ AC KH
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPE--NEKYRKLLACAKH 187
Query: 221 YTAY----------DLDNWKGTDRYH-----FNAM--------VIYTYYLIK-------- 249
Y + +L+N D + F A+ V+ Y +
Sbjct: 188 YAVHSGPEWSRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCGNT 247
Query: 250 ------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
FKY +VSDC ++ + S + AA K +AG D+ CG + +
Sbjct: 248 RLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYAYQ 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV GL+ E +D + LG D PS + ++ V +A++DL+L
Sbjct: 308 KLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDLSL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ +RQ + LL+N LPLS +I+ +AVIGPNA+ + GNY GTP + T L G +
Sbjct: 367 NMSRQTMTLLQNKNNVLPLS-KSIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGFKS 425
Query: 416 VVAT---IYQAGCSNVQCGTAQ 434
+ +Y GC V T +
Sbjct: 426 KLKKNQIVYMKGCDLVNDQTLE 447
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP Q+ + + K ++ ++ G ++ +IL Y G+ G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGKQ---VVFVNCSGSSMALLPETESCDAILQAWYGGELG 685
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV FG YNPSG+LP+T+Y
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFY 708
>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 877
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 149/424 (35%), Positives = 203/424 (47%), Gaps = 86/424 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV +L+ LTLQEK++ +++ + ++ RLGIP Y WW EA HG+
Sbjct: 19 FQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGL------- 71
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------------GLAG 160
+ G T FPQ I AA+F+ S +VS EARA YN +
Sbjct: 72 ----IAGGVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSAIPN 127
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LTFW+PNINIFRDPRWGRGQET GEDP L S+ V G+Q G + K AC KH
Sbjct: 128 LTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQ---GDDEHYYKTHACAKH 184
Query: 221 YTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK----------------------- 251
Y + G + R+ FNA+V ++ YL F+
Sbjct: 185 YGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYEGEPC 239
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCG 290
+VSDCD+++ Y + P A+A ++L G DL CG
Sbjct: 240 CASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTDLECG 299
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQA 349
E AV+ G++ E +D ++ RLG D P+K PY + + Q
Sbjct: 300 RSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYSTIPGSVIDCQE 356
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
++D AL A + VLLKN LPL IK++AV+GPN N + M GNY G+P T
Sbjct: 357 HRDHALKMAHESQVLLKNEGNILPLDKN-IKSIAVVGPNINDSIMMRGNYSGSPTHCITI 415
Query: 410 LQGL 413
LQGL
Sbjct: 416 LQGL 419
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 3/114 (2%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A ++ K+ +A + V G + E E H+R + LP Q+ + + + K PVI +
Sbjct: 608 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 666
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I+ A D ++L YPGQ GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 667 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFY 718
>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 888
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 148/424 (34%), Positives = 203/424 (47%), Gaps = 86/424 (20%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV +L+ LTLQEK++ +++ + ++ RLGIP Y WW EA HG+
Sbjct: 30 FQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGL------- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------------GLAG 160
+ G T FPQ I AA+F+ S +VS EARA YN +
Sbjct: 83 ----IAGGVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSAIPN 138
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
LTFW+PNINIFRDPRWGRGQET GEDP L S+ V G+Q G + K AC KH
Sbjct: 139 LTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQ---GDDEHYYKTHACAKH 195
Query: 221 YTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK----------------------- 251
Y + G + R+ FNA+V ++ YL F+
Sbjct: 196 YGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYEGEPC 250
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCG 290
+VSDCD+++ Y + P A+A ++L G DL CG
Sbjct: 251 CASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTDLECG 310
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQA 349
E AV+ G++ E +D ++ RLG D P+K PY + + Q
Sbjct: 311 RSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYSTIPGSVIDCQE 367
Query: 350 NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
++D AL A + VLLKN LPL IK++A++GPN N + M GNY G+P T
Sbjct: 368 HRDHALKMAHESQVLLKNEGNILPLDKN-IKSIAIVGPNINDSIMMRGNYSGSPTHCITI 426
Query: 410 LQGL 413
LQGL
Sbjct: 427 LQGL 430
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 3/114 (2%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A ++ K+ +A + V G + E E H+R + LP Q+ + + + K PVI +
Sbjct: 619 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 677
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I+ A D ++L YPGQ GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 678 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFY 729
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 155/509 (30%), Positives = 243/509 (47%), Gaps = 94/509 (18%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I +R+ ++++++TL+EK+ L + + S+ RL +P Y +W+E LHGV+ G
Sbjct: 56 ISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE--------- 106
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIFRDPRWGR 178
T FPQ I A++++ L + I +STEAR Y ++G GLT+W+P IN+ RDPRWGR
Sbjct: 107 -VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIG-KGLTYWAPTINMARDPRWGR 164
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN 238
+ET GEDP L S+ +VKGLQ G PN LK A KH+ A + +N + +
Sbjct: 165 NEETYGEDPYLTSRLGVAFVKGLQ---GDHPNYLKTVATVKHFVANNQENDRFSSSSQIP 221
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+Y YY ++ ++VSD
Sbjct: 222 TKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWLLEDVLRKEWGFDGFVVSD 281
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
C ++ V+ + EEAAA I +G DL CG ++ AAV+ GLV+E AID A++
Sbjct: 282 CGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYRENLVAAVQRGLVSEYAIDRALT 341
Query: 317 NNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+LG FD P + PY K + + + LA +AA + I+LLKN LP+
Sbjct: 342 RVLTMRFKLGEFD--PIELVPYNHYDKKLLAGEQFRRLAYEAAVKSIILLKNEDNFLPID 399
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV 435
++++A++GP A+ +G Y G P + LQG+ +V + S ++ GT+ V
Sbjct: 400 KKDVRSIAIVGPFAD--NNYLGGYSGKPVHNISLLQGVKKMVGE--EVEISYIE-GTSVV 454
Query: 436 DDAKKAAASADATVLVM--GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+ D++ L+ G + + A+ D DL P L + + K
Sbjct: 455 -------SPVDSSYLLASDGVNNGLTADYIDGHDLNSP----FLFSRIDKTV-------- 495
Query: 494 MSGGGFDISFAKNDPKIT----SILWVGY 518
GFD D ++T S+ W GY
Sbjct: 496 ----GFDWGDGTPDQRLTKNDFSVRWSGY 520
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 64/113 (56%), Gaps = 3/113 (2%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
Q+D KK + AD ++ +G D + E+ D + LP Q+LL+ E+ KV + LI+
Sbjct: 598 QIDKVKKIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKV-NPRIALIL 656
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+G +A + SIL YPGQ GGAA+A + FG NPSG+LPMT Y
Sbjct: 657 QTGNPLTSQWAAE--HVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIY 707
>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 207/442 (46%), Gaps = 71/442 (16%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+L + N L R DLV RLTL+EK + D + ++ RLGI K+ WWSEALHGV+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAG 160
T FP+ + AASFN L I S E RA +N
Sbjct: 80 ----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFHS 129
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + K+ AC KH
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPE--DEKYRKLLACAKH 187
Query: 221 YTAY----------DLDNWKGTDRYH-----FNAM--------VIYTYYLIK-------- 249
Y + +L+N D + F A+ V+ Y +
Sbjct: 188 YAVHSGPEWSRHSANLNNISPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCGST 247
Query: 250 ------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
FKY +VSDC ++ + S + AA K +AG D+ CG + +
Sbjct: 248 RLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYAYQ 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV GL+ E ID + LG D PS + Q+ V ++A++DL+L
Sbjct: 308 KLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMD-DPSLVKWSQIPMSVVNSKAHKDLSL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ +RQ + LL+N LPLS +I+ +AVIGPNA+ + GNY GTP + T L G
Sbjct: 367 NMSRQTMTLLQNKNNVLPLS-KSIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGFKT 425
Query: 416 VVAT---IYQAGCSNVQCGTAQ 434
+ IY GC V T +
Sbjct: 426 KLKKNQIIYMKGCDLVNDKTLE 447
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 13/121 (10%)
Query: 436 DDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVA 485
D++ + D V V G +E E DR D+ LP Q+ + + + +
Sbjct: 591 DESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEAS 650
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K ++ ++ G ++ +IL Y G+ GG A+ADV FG YNPSG+LP+T+
Sbjct: 651 KQ---VVFVNCSGSSMALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVTF 707
Query: 546 Y 546
Y
Sbjct: 708 Y 708
>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
Length = 885
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 211/442 (47%), Gaps = 71/442 (16%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+L + N L R DLVKRLTL+EK + D + ++ RLGI K+ WWSEALHGV+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAG 160
T FP+ + AASFN L I VS E RA +N V
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + + K+ AC KH
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPE--NEKYRKLLACAKH 187
Query: 221 YTAY----------DLDNWKGTDRYH-----FNAM--------VIYTYYLIK-------- 249
Y + +L+N D + F A+ V+ Y +
Sbjct: 188 YAVHSGPEWSRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCGNT 247
Query: 250 ------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
FKY +VSDC ++ + S + AA K +AG D+ CG + +
Sbjct: 248 RLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYAYQ 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV GL+ E +D + LG D PS + ++ V +A++DL+L
Sbjct: 308 KLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDLSL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ +RQ + LL+N LPLS +I+ +AVIGPNA+ + GNY GTP + T L G +
Sbjct: 367 NMSRQTMTLLQNKNNVLPLS-KSIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGFKS 425
Query: 416 VVAT---IYQAGCSNVQCGTAQ 434
+ +Y GC V T +
Sbjct: 426 KLKKNQIVYMKGCDLVNDQTLE 447
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 13/111 (11%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D V V G +E E DR D+ LP Q+ + + K ++ ++
Sbjct: 601 DVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDAGKQ---VVFVN 657
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G ++ +IL Y G+ GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 658 CSGSSMALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVTFY 708
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 159/455 (34%), Positives = 218/455 (47%), Gaps = 91/455 (20%)
Query: 40 CDVVSNPSLASLG---FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
C +S+ S+ + G F N+ L ++ R DL++RLTLQEK+ + D + + RLGI +Y
Sbjct: 8 CLFLSSLSIGAQGILPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYN 67
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
WW+EALHGV G AT FPQ I AA+F+ + + VS EARA Y+
Sbjct: 68 WWNEALHGVGRAGL----------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHH 117
Query: 157 G--------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
GLTFW+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ G S
Sbjct: 118 SENKEGSERYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQ---GPS 174
Query: 209 PNRL-KVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK----------- 251
++ K+ AC KHY + W +R+ F+ I + YL FK
Sbjct: 175 ESKYDKLHACAKHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEV 231
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTP--EEAAA 278
+VSDC ++ Y H+ P E A A
Sbjct: 232 MCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVA 291
Query: 279 KSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP----SK 334
++ AG DL+CG + AV+ G++ E ID ++ LG D S
Sbjct: 292 AAVKAGTDLDCGVDY-YALQKAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSD 350
Query: 335 QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKT 394
PY V ++ +++ AL+ AR+ + LLKN G+LPLS K +AVIGPNAN +
Sbjct: 351 IPYTV-----VDSEKHREKALEMARKSMTLLKNDHGTLPLSKHCGK-IAVIGPNANDSVM 404
Query: 395 MIGNYEGTPCKYTTPLQGLAAVVAT---IYQAGCS 426
M GNY G P T L+G+ + IY GC
Sbjct: 405 MWGNYNGFPSHTVTILEGITHKLGAEQIIYDKGCE 439
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 69/127 (54%), Gaps = 15/127 (11%)
Query: 432 TAQVDDAKKAAASADATVLVM--GADQSIEAES----------HDRLDLLLPGQQQLLIT 479
T + + + AA DA V+V G +E E DR + LP Q+ L+
Sbjct: 585 TKRFNPNEIAARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQ 644
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
E+ K K PVILI+ SG +S A+ D +I+ Y GQAGG A+ADV FG YNP+G
Sbjct: 645 ELHKTGK-PVILILCSGSAIGLS-AEVDLA-DAIIQAWYLGQAGGTAVADVLFGDYNPAG 701
Query: 540 RLPMTWY 546
RLP+T+Y
Sbjct: 702 RLPVTFY 708
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 151/431 (35%), Positives = 213/431 (49%), Gaps = 74/431 (17%)
Query: 37 VFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
V +C + L + N + + RV DL+ RLT++EKI+ L ++ + RLGI KY
Sbjct: 16 VMSCSFIHAQEL----YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYY 71
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV 156
+EALHGV V PG T FPQ I AA++N L + + V+S EARA +N
Sbjct: 72 HGNEALHGV--VRPGRF--------TVFPQAIGLAATWNPELQRRVATVISDEARARWNE 121
Query: 157 GLAG----------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
G LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G
Sbjct: 122 LDQGRAQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---G 178
Query: 207 GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
P+ LK+ + KH+ A + ++ + + + YY F+
Sbjct: 179 DDPHYLKIVSTPKHFAANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 238
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
Y+VSDC +L N+ Y KT E AA S+ AG
Sbjct: 239 NALNDVPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAG 298
Query: 285 LDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
LDL CG A K +V+++ ID A + M+LG FDG + PY ++ P
Sbjct: 299 LDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPS 357
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+ ++ +Q +ALDAARQ IVLLKN LPL+ + +K++AV+G NA K G+Y G P
Sbjct: 358 VIGSKEHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVGINAG--KCEFGDYSGAP 415
Query: 404 -CKYTTPLQGL 413
+ + LQG+
Sbjct: 416 VVEPVSILQGI 426
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + + VMG ++SIE E DR D+ LP Q+ + E+ KV +I+I+++G
Sbjct: 594 EAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILVAG 652
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D + +I+ YPG+ GG A+A+V FG YNP+GRLP+T+Y
Sbjct: 653 SSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY 700
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 137/406 (33%), Positives = 198/406 (48%), Gaps = 60/406 (14%)
Query: 55 NTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF 114
N S I RV +L+ ++TL+EK+ L + S+ RL +P Y +W+E LHGV+ G
Sbjct: 51 NLSQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE---- 106
Query: 115 SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDP 174
T FPQ I A++++ L + + +STEAR Y GLT+WSP IN+ RDP
Sbjct: 107 ------VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDP 160
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR 234
RWGR +ET GEDP L S+ +VKGLQ G P+ LK A KH+ A + +N + +
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQ---GDHPDYLKTVATIKHFVANNQENDRFSSS 217
Query: 235 YHFNAMVIYTYYLIKFK------------------------------------------Y 252
+Y YY ++ +
Sbjct: 218 SQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLRKEWGFDGF 277
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID 312
+VSDC ++ V+ + EEAAA I +G DL CG + AAVK GLV+E AID
Sbjct: 278 VVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREKLVAAVKMGLVSEQAID 337
Query: 313 HAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A++ +LG FD P + PY K + + LA +AA + IVLLKN
Sbjct: 338 KALTRVLTARFKLGEFD--PIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVLLKNDNDF 395
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
LP+ I+++A++GP A+ +G Y G P + LQG+ +V
Sbjct: 396 LPVDKKKIRSVAIVGPFAD--NNYLGGYSGKPVHNVSLLQGVKDLV 439
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 3/113 (2%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
Q+D K+ + AD ++ +G D+ + E+ D + LP Q+LL+ E+ KV LI+
Sbjct: 598 QIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKV-NPRTALIL 656
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+G +A + + +IL YPGQ GG A+A + FG NPSG+LPMT Y
Sbjct: 657 HTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIY 707
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 151/425 (35%), Positives = 211/425 (49%), Gaps = 70/425 (16%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S A + N + + RV DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV
Sbjct: 23 SYAQELYKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV- 81
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG------ 160
V PG T FPQ I AA++N L + I V+S EARA +N G
Sbjct: 82 -VRPGRF--------TVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQ 132
Query: 161 ----LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
LTFWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G P+ LK+ +
Sbjct: 133 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQ---GDDPHYLKIVS 189
Query: 217 CCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------- 251
KH+ A + ++ + + + YY F+
Sbjct: 190 TPKHFAANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTL 249
Query: 252 -----------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFL 293
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 250 NSWLLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVY 309
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
++ A K + +E+ ID A + M+LG FDG + PY ++ P + ++ +Q +
Sbjct: 310 DEYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTV 368
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-CKYTTPLQG 412
AL+AAR+ IVLLKN LPL+ +K++AV+G NA K G+Y G P + + LQG
Sbjct: 369 ALNAARECIVLLKNQKNMLPLNVKKLKSIAVVGINAG--KCEFGDYSGAPVVEPVSILQG 426
Query: 413 LAAVV 417
+ V
Sbjct: 427 IKNRV 431
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV +IL++++G
Sbjct: 595 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLVAG 653
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+++ + + +I+ YPG+ GG A+A+V FG YNP+GRLP+T+Y
Sbjct: 654 SSLAVNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY 701
>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
Length = 231
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/228 (50%), Positives = 142/228 (62%), Gaps = 48/228 (21%)
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIK 249
A YV+GLQQ G +RLKVAACCKHYTAYDLDNWKG DR+HFNA V + + +
Sbjct: 1 AASYVRGLQQP-YGRDSRLKVAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTFNVP 59
Query: 250 FK------------------------------------------YIVSDCDSVDVLYNSQ 267
F+ YIV+DCDS+ VLY++Q
Sbjct: 60 FRECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLYDTQ 119
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
HYT TPEE+AA +I AG+DL+CG FL HT+ A+ +++E +D A++N A MRLG
Sbjct: 120 HYTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMRLGM 179
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
FDG PS QP+G LGP+DVCT A+Q LAL+AARQGIVL+KN GSLPLS
Sbjct: 180 FDGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKNQ-GSLPLS 226
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 147/437 (33%), Positives = 208/437 (47%), Gaps = 82/437 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N +L R DL+ RLTL EK++ ++D++ ++ RLGIP+++WW+EALHG+ G
Sbjct: 23 LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNG- 81
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLT 162
AT FP + AAS++ +L + VS EAR L+
Sbjct: 82 ---------FATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLS 132
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT-----DGGSPNRLKVAAC 217
FW+PNINIFRDPRWGRGQET GEDP L +K V+GLQ D G K+ AC
Sbjct: 133 FWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLLAC 192
Query: 218 CKHYTAYDLDNWKGTDRYHFN-----AMVIYTYYLIKFKYIV------------------ 254
KH+ + W +R+ FN ++ YL FK +V
Sbjct: 193 AKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRIDGQ 249
Query: 255 ------------------------SDCDSV-DVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
SDC ++ D L + +K EA+AK++LAG D+ C
Sbjct: 250 ACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDVEC 309
Query: 290 GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQA 349
GS KH AV+ G V E+ ID ++ LG D + ++ V +QA
Sbjct: 310 GSEY-KHLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVASQA 367
Query: 350 NQDLALDAARQGIVLLKNTAGSLPL------SPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
++DLAL A + IVLL+N LPL + K++ V+GPNAN + M GNY G P
Sbjct: 368 HKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAGYP 427
Query: 404 CKYTTPLQGLAAVVATI 420
T L G+ + T+
Sbjct: 428 THTVTALDGITRMAKTL 444
Score = 46.6 bits (109), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ L+ + K K VI + SG ++ A +IL Y G+ G
Sbjct: 649 DRTSIELPQAQRDLLAVLHKAGK-KVIFVNCSGSA--MALAPELETCDAILQWWYGGEQG 705
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
GAA+A FG PSG+LP+T+Y
Sbjct: 706 GAALATTLFGMVAPSGKLPVTFY 728
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/405 (36%), Positives = 209/405 (51%), Gaps = 69/405 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T QEKI +++A ++ RLGIP Y+WWSE LHG++ G AT FPQ
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWG 177
I AAS+N L Q +G V STEARA +N+ AGLT WSPNINIFRDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY---------------- 221
RG ET GEDP L + A +++GLQ G +P+ + A KH+
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQ---GDTPDHPRTIATPKHFAVHSGPEQGRHSFDVD 203
Query: 222 -TAYDL-------------DNWKGTDRYHFNAM----VIYTYYLIKFK---------YIV 254
+AYDL D G+ +NA+ + +L+ + ++V
Sbjct: 204 VSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVV 263
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ + + + A+A ++ +G DLNCG+ + A+ G ++ES +D A
Sbjct: 264 SDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTY-RDLNQAIARGDIDESTLDQA 322
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ F RLG PY +G K + T A++ LAL AA Q +VLLKN+ +LPL
Sbjct: 323 LIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNSGNTLPL 381
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
P LAV+GP+A+ + NY+GT TPL GL T
Sbjct: 382 PPET--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGT 424
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 67/123 (54%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
Q+ +A++A A ADA V +G +E E DR + LP Q+ L+ V
Sbjct: 600 QLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKT 659
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P+I+++MSG +++A++ +IL YPGQ+GG AIA G NP GRLP+
Sbjct: 660 TGK-PLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPV 716
Query: 544 TWY 546
T+Y
Sbjct: 717 TFY 719
>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
Length = 882
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 149/405 (36%), Positives = 210/405 (51%), Gaps = 69/405 (17%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T QEKI +++A ++ RLGIP Y+WWSE LHG++ G AT FPQ
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGLTFWSPNINIFRDPRWG 177
I AAS+N L Q +G V STEARA +N+ AGLT WSPNINIFRDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY---------------- 221
RG ET GEDP L S+ A +++GLQ G P+ + A KH+
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQ---GNIPDHPRTIATPKHFAVHSGPEPGRHSFDVD 203
Query: 222 -TAYDL-------------DNWKGTDRYHFNAM----VIYTYYLIKFK---------YIV 254
+AYDL D G+ +NA+ + +L+ + ++V
Sbjct: 204 VSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVV 263
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHA 314
SDCD+++ + + + A+A ++ +G DLNCG+ + A+ G ++ES +D A
Sbjct: 264 SDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTY-RDLNQAIARGDIDESTLDQA 322
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ F RLG PY +G K + T A++ LAL AA Q +VLLKN+ +LPL
Sbjct: 323 LIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSLVLLKNSGNTLPL 381
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
+P LAV+GP+A+ + NY+GT TPL GL T
Sbjct: 382 TPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLIGLRTRFGT 424
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 67/123 (54%), Gaps = 13/123 (10%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
Q+ +A++A A ADA V +G +E E DR + LP Q+ L+ V
Sbjct: 600 QLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKT 659
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K P+I+++MSG +++A++ +IL YPGQ+GG AIA G NP GRLPM
Sbjct: 660 TGK-PLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPM 716
Query: 544 TWY 546
T+Y
Sbjct: 717 TFY 719
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 206/438 (47%), Gaps = 79/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL + R LVK LTL+EK ++D + SV RLGI Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AASFN + + VS EARA GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ G ++L AC KH+ +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLH--ACAKHFAVH 190
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I Y YL F+
Sbjct: 191 SGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDR 247
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC ++ YN + + P E A+A ++++G DL CGS
Sbjct: 248 LLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKA 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E +VK GL++E +D ++ LG D P K + ++ V + A+ LAL
Sbjct: 308 LIE-SVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLAL 365
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ AR+ + LL N LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 366 NMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRN 424
Query: 416 VVAT----IYQAGCSNVQ 429
++ T IY+ GC V+
Sbjct: 425 LLGTDDKLIYEQGCPWVE 442
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +K+ AD + G S+E E DR D+ LP
Sbjct: 579 NFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K +I+++ G I K +IL YPGQ GG A+A+V FG
Sbjct: 639 QRELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
YNP+G+LP+T+Y
Sbjct: 696 DYNPAGKLPVTFY 708
>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1073
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 147/421 (34%), Positives = 202/421 (47%), Gaps = 71/421 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F +T+L R+ DL+ RL + EKI+ L ++ ++ RLGI KY +EALHGV V PG
Sbjct: 29 FRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGNEALHGV--VRPGK 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I A+ +N Q + +S EAR +N G LT
Sbjct: 87 F--------TVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNELNQGKDQTAGASDLLT 138
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP IN+ RDPRWGR ET GEDP L T +V+GLQ G P +KV + KH+
Sbjct: 139 FWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQ---GNDPKYIKVVSTPKHFA 195
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + + YY F+
Sbjct: 196 ANNEEHNRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGIPCTLNKWLLT 255
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC + + + + HY T EEAA+ I AGLDL CG +
Sbjct: 256 DVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECGDNVYITPLLN 315
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A G+V S ID A MRLG FD P++ PY ++ P V + +++LAL+AAR
Sbjct: 316 AYNRGMVTMSEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSIVGCEKHRELALEAAR 374
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
Q +VLLKN LP+ IK++AV+G NA G+Y GTP TP+ L + A
Sbjct: 375 QSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSGTPV--NTPISVLEGIKAR 430
Query: 420 I 420
I
Sbjct: 431 I 431
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
DA + +D T+ V+G D++IE E DR + LP QQ+ I E K A ++++++G
Sbjct: 735 DAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYK-ANPNTVVVLVAG 793
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I ++L YPG+ GG A+A+ FG YNP GRLP+T+Y
Sbjct: 794 SSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFY 841
>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 869
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 155/447 (34%), Positives = 214/447 (47%), Gaps = 94/447 (21%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L RV +L+ LT +EK+ +++ + S+ RLGIP Y WWSEA HGV
Sbjct: 25 YLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQ----- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA-----------MYNV----- 156
G T +PQ I AA+FNA LF + VS EARA ++NV
Sbjct: 80 ------DGYTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMGVT 133
Query: 157 ---GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK 213
G LTFW PN+NIFRDPRWGRGQET GEDP L + V G+Q G + K
Sbjct: 134 YYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQ---GNNDKYFK 190
Query: 214 VAACCKHYTAY---------------DLDNW------------KGTDRYHFNAMVIYTYY 246
AC KHY + + D W KG R M Y +
Sbjct: 191 THACAKHYAVHSGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVR---EVMCAYQRF 247
Query: 247 --------------LIKFKY-----IVSDCDSVDVLYNS-QHYT-KTPEEAAAKSILAGL 285
+++ K+ +++DCD+++ +N QH T K A+ ++L G
Sbjct: 248 EGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFNRGQHETHKDGLSASVDAVLNGT 307
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKD 344
DL CG E +K GL+ ES +D+ + LG FD P+ P+ +LG
Sbjct: 308 DLECGKVFMSLVE-GLKKGLIKESDLDNHLRKTLMGRFELGMFD--PADMLPWAKLGADV 364
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP- 403
+ ++ N +A+ AAR+ +VLL+N AG LPLS + IK LAV+GPNA+ + GNY GTP
Sbjct: 365 ISSEKNDAMAVQAARESMVLLENKAGILPLSKS-IKTLAVLGPNADDVNMLNGNYGGTPT 423
Query: 404 -CKYTTPLQGLAAVV---ATIYQAGCS 426
+ L G+ A V IY C
Sbjct: 424 AAHQHSLLSGIKAAVPGAKIIYNKACE 450
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 5/119 (4%)
Query: 430 CGTAQVD--DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG 487
C VD D ADA V++ G +E E D+ D+ LP QQ+L+ + K K
Sbjct: 599 CERNLVDFTDLANQVKEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK- 657
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PVI + SG I+F + + ++L Y GQ G A+A+V FG YNP G+LP+T+Y
Sbjct: 658 PVIFVNCSGSA--IAFGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFY 714
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 160/530 (30%), Positives = 242/530 (45%), Gaps = 76/530 (14%)
Query: 71 LTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILT 130
LT +E G+VSR+G+P+Y+W A+HGV V TSFP +
Sbjct: 56 LTAREGGGGSPGPPGNVSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVNY 114
Query: 131 AASFNASLFQAIGKVVSTEARAMYNVGLA-----------GLTFWSPNINIFRDPRWGRG 179
++N S + +G+++ E RA++ G GL WSPNINI R P WGR
Sbjct: 115 GFTWNYSAYLELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRN 174
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNA 239
QE PGEDP + ++ Y GLQ G L+ KH+ AY L++ G R++FNA
Sbjct: 175 QEVPGEDPFMNGQFGKAYTLGLQ---GDDDTYLQAIVTLKHWDAYSLEDSDGATRHNFNA 231
Query: 240 MV-----IYTYY---------------------------------------LIKFK-YIV 254
+V + TY+ L KF Y+
Sbjct: 232 IVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHPLLRTVLRDLWKFDGYVS 291
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAG-LDLNCGSFLGKHTEAAVKAGLVNESAIDH 313
SD +V+ + ++ YT + AA +I G D++ G+ K V G +D+
Sbjct: 292 SDTGAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRMEDVDN 351
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N LG FD Q Y + V T A++ + + +VLL+N LP
Sbjct: 352 ALRNTLRLRFELGLFDP-VENQSYWHVPLAAVNTNASRATNMLHTLESMVLLQNKNNVLP 410
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK------YTTPLQGLAAVVAT---IYQAG 424
L+ +A+IGP+A + M+GNY G C +P L +++ T Y G
Sbjct: 411 LASNT--KVALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAVTYAPG 468
Query: 425 CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKV 484
+ C + +D+A A +AD VL++G D+SIEAES+DR + LP Q L + + V
Sbjct: 469 TNVTTCSQSHIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLASAIFAV 528
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
K P ++++++GG I K + +I+ GYPG GG AIA G+
Sbjct: 529 GK-PTVIVLLNGGMLAIENEKQ--QADAIIEAGYPGFYGGTAIAQTLTGQ 575
>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 876
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 148/440 (33%), Positives = 206/440 (46%), Gaps = 77/440 (17%)
Query: 37 VFACDVVSNPSLA---SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIP 93
+FA ++S SLA F N SL + RV DL+ RLT++EK L D + ++ RLGI
Sbjct: 16 LFAGVILSISSLAQGYDYPFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIE 75
Query: 94 KYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM 153
K+ WWSEALHG ++N T FPQ + AASFN L I S E RA
Sbjct: 76 KFNWWSEALHG---------YANN-DSVTVFPQPVGMAASFNDELVFDIFNATSDEGRAK 125
Query: 154 Y---------NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT 204
Y N L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ
Sbjct: 126 YHQAQRRGEENRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP 185
Query: 205 DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIVSDCDS 259
+ +L AC KHYT + W R+ N + Y Y+ FK +V D D
Sbjct: 186 EDAKYRKL--LACAKHYTVHSGPEWS---RHELNVNDVSPREFYETYMPAFKALVQDADV 240
Query: 260 VDVL-----------------------------------------YNSQH-YTKTPEEAA 277
V+ +++ H + TP AA
Sbjct: 241 RQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHLVAADCGAITDFHTTHGISSTPVHAA 300
Query: 278 AKSILAGLDLNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
A +LAG DL C ++ K A++ L+ E ID ++ LG D +
Sbjct: 301 ATGLLAGTDLECIWDNYHYKMLPEALEKDLIKEEHIDSSLFRVLMGRFELGEMDDD-AIV 359
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
P+ Q + + +++LA A+Q + LL+N LPL ++ + +AVIGPNA+ + +
Sbjct: 360 PWAQTPASVLNNKEHRELAYKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGPNADDEEVL 419
Query: 396 IGNYEGTPCKYTTPLQGLAA 415
GNY GTP + T L G+ +
Sbjct: 420 WGNYNGTPIRTITILDGMKS 439
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 56/123 (45%), Gaps = 13/123 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
D K + V G S+E E DR + LP Q+ + + + K
Sbjct: 606 DLLKKLEGIETVVFAGGLSGSLEGEEMPVSYPGFKGGDRTHIELPSVQRNALKALKEAGK 665
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
VI + SG I+ +IL Y G++GG AIADV FG YNPSG+LP+T+Y
Sbjct: 666 -TVIFVNFSGSA--IALEPETESCDAILQAWYAGESGGQAIADVLFGDYNPSGKLPLTFY 722
Query: 547 PQS 549
S
Sbjct: 723 RNS 725
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 212/444 (47%), Gaps = 86/444 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KR+TL+EKI+ + + + ++ RLGIP Y WW+EALHGV+ G
Sbjct: 25 YKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAGK-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AA+F+ +VS EARA Y+ G GLTFW
Sbjct: 83 --------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDGYKGLTFW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PNINI+RDPRWGRG ET GEDP L S VKGLQ G + K AC KHY +
Sbjct: 135 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGDGTGKYD--KTHACAKHYAVH 192
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ F+A I + YL FK
Sbjct: 193 SGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEPCCSNKQ 249
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC ++ Y H+ P A+A ++++G DL CG
Sbjct: 250 LLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECGGSYSS 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP----SKQPYGQLGPKDVCTQANQ 351
E AV+ GL++E I+ ++ +LG FD + S+ PY + K+ +
Sbjct: 310 LNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDNTLVSWSEIPYSVVESKEHVAK--- 365
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
AL+ AR+ +VLL N LPLS +++ +AV+GPNAN + + NY G P K T L+
Sbjct: 366 --ALEMARKSMVLLTNKNNILPLS-KSVRKVAVLGPNANDSVMLWANYNGFPTKSVTILE 422
Query: 412 GLAAVV---ATIYQAGCSNVQCGT 432
G+ + A Y+ GC V T
Sbjct: 423 GIRNKLPEGAVYYEKGCDFVNTQT 446
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 64/118 (54%), Gaps = 13/118 (11%)
Query: 442 AASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVIL 491
AA AD + V G S+E E DR ++ LP Q+ ++ + K K PV+
Sbjct: 598 AAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PVVF 656
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++ SG + + + + +I+ YPGQ GG A+ADV FG YNP+GRLP+T+Y S
Sbjct: 657 VLCSGSTLALPWEAEN--LDAIIEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 712
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 206/438 (47%), Gaps = 79/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL + R LVK LTL+EK ++D + SV RLGI Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AASFN + + VS EARA GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ G ++L AC KH+ +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLH--ACAKHFAVH 190
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I Y YL F+
Sbjct: 191 SGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCGSDR 247
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC ++ YN + + P E A+A ++++G DL CGS
Sbjct: 248 LLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSSYKA 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E +VK GL++E +D ++ LG D P K + ++ V + A+ LAL
Sbjct: 308 LIE-SVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDSLAL 365
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ AR+ + LL N LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 366 NMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVTILDGVRN 424
Query: 416 VVAT----IYQAGCSNVQ 429
++ T IY+ GC V+
Sbjct: 425 LLGTDDKLIYEQGCPWVE 442
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +K+ AD + G S+E E DR D+ LP
Sbjct: 579 NFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K +I+++ G I K +IL YPGQ GG A+A+V FG
Sbjct: 639 QRELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
YNP+G+LP+T+Y
Sbjct: 696 DYNPAGKLPVTFY 708
>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
17393]
gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 151/442 (34%), Positives = 211/442 (47%), Gaps = 71/442 (16%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
+L + N L R DLVKRLTL+EK + D + ++ RLGI K+ WWSEALHGV+ G
Sbjct: 20 TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79
Query: 110 PGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---------VGLAG 160
T FP+ + AASFN L I VS E RA +N V
Sbjct: 80 ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + + K+ AC KH
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPE--NEKYRKLLACAKH 187
Query: 221 YTAY----------DLDNWKGTDRYH-----FNAM--------VIYTYYLIK-------- 249
Y + +L+N D + F A+ V+ Y +
Sbjct: 188 YAVHSGPEWSRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDDDPCCGNT 247
Query: 250 ------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
FKY +VSDC ++ + S + AA K +AG D+ CG + +
Sbjct: 248 RLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTMAGTDVECGYGYAYQ 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV GL+ E ++ + LG D PS + ++ V +A++DL+L
Sbjct: 308 KLPEAVSRGLITEEEVNKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVVNCKAHKDLSL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ +RQ + LL+N LPLS +I+ +AVIGPNA+ + GNY GTP + T L G +
Sbjct: 367 NMSRQTMTLLQNKNNVLPLS-KSIRKIAVIGPNADDKPMLWGNYNGTPNQTITILDGFKS 425
Query: 416 VVAT---IYQAGCSNVQCGTAQ 434
+ +Y GC V T +
Sbjct: 426 KLKKNQIVYMKGCDLVNDQTLE 447
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 13/111 (11%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D V V G +E E DR D+ LP Q+ + + K ++ ++
Sbjct: 601 DVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDAGKQ---VVFVN 657
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G ++ +IL Y G+ GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 658 CSGSSMALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVTFY 708
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 210/433 (48%), Gaps = 81/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L ++ R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L+G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
T+ AVK GL++E I+ ++ LG + S P+ + + +++LAL
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G+ A
Sbjct: 369 KMAHESLVLLQNNNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGIRA 426
Query: 416 VV---ATIYQAGC 425
+ IY+ C
Sbjct: 427 KLPDAQIIYEPVC 439
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
SAD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 SADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 134/329 (40%), Positives = 171/329 (51%), Gaps = 73/329 (22%)
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
+ VSTEARAMYN+G GLT+WSPNIN+ RDPRWGR ETPGEDP + +YA +V+G+Q
Sbjct: 916 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975
Query: 204 TD-------GGSPNR--LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--- 251
GG PN LK +ACCKHY AYDLD+W R+ F+A V + F+
Sbjct: 976 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035
Query: 252 -------------------------------------------YIVSDCDSVDVLY-NSQ 267
YIVSDCD+V V+ N+
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095
Query: 268 HYTKTPEEAAAKSILAGLDLNCGS-------------FLGKHTEAAVKAGLVNESAIDHA 314
T EA+A ++ AGLDL+CG FL + AV G + ES ID+A
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
++N + TLMRLG+FD Y LG +D+CT ++ LALD ARQGIVLLKN LPL
Sbjct: 1156 LTNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPL 1212
Query: 375 SPTAIKNLAVIGPNANV-TKTMIGNYEGT 402
+ + V GP+ K M G+Y GT
Sbjct: 1213 DANKVGFVNVRGPHVQAPEKIMDGDYTGT 1241
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 145/395 (36%), Positives = 191/395 (48%), Gaps = 69/395 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV V PG T
Sbjct: 40 RIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGNF--------T 89
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I A+ +N L I +S EAR +N G LTFWSP +N+ R
Sbjct: 90 VFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKGFFSDLLTFWSPTVNMAR 149
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ K +VKGLQ G P LK+ + KH+ A + ++ +
Sbjct: 150 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQ---GNDPRYLKIVSTPKHFAANNEEHNRFE 206
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
H + + YYL F+
Sbjct: 207 CNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPCTLNPWLLTQVLRKEWGFN 266
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y+VSDC L Y KTPE AA SI AGLDL CG + + A K +V ++
Sbjct: 267 GYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDNVYIEPLMNAYKQCMVTDA 326
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M LG FD P K PY + P V + ++ LAL+AARQ +VLLKN
Sbjct: 327 DIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHRQLALEAARQSLVLLKNEK 385
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
LPL+P +K++AV+G NA + G+Y GTP
Sbjct: 386 NFLPLNPKKVKSIAVVGINAGNCE--FGDYSGTPV 418
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A +A D TV V+G ++SIE E DR + LP QQL I E K A ++++++G
Sbjct: 596 EAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYK-ANPNTVVVLVAG 654
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +IL YPG+ GG A+A+ FG YNP GRLP+T+Y
Sbjct: 655 SSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY 702
>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
Length = 858
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 149/444 (33%), Positives = 215/444 (48%), Gaps = 71/444 (15%)
Query: 39 ACDVVSNPSLASLGF--CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
A ++ PSL + + N +L R DL+ RLTL+EK ++D + ++ RLGI K+
Sbjct: 17 ASSALAQPSLTGMAYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFF 76
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-- 154
WWSEALHG + +G T FP+ I AASFN +L + S E RA Y
Sbjct: 77 WWSEALHGAANMG----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHH 126
Query: 155 -------NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
+ L+ W+PN+NIFRDPRWGRGQET GEDP L + T V+GLQ +
Sbjct: 127 RIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDS 186
Query: 208 SPNRLKVAACCKHYTAY----------DLDNWKGTDRYH-----FNAM--------VIYT 244
+L AC KHY + +L+N D + F + V+
Sbjct: 187 KYRKL--WACAKHYAVHSGPEYTRHTANLNNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244
Query: 245 YYLIK--------------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y + F+Y +VSDC +V ++ + + A AK+ LA
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVHATAKAALA 304
Query: 284 GLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
G D+ CG ++ K AV+ GL++E +D + LG D P+ P+ ++
Sbjct: 305 GTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDD-PALVPWSKIPY 363
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+ ++A++ L+LD ARQ IVLL+N LPL + +AVIGPNA+ M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422
Query: 403 PCKYTTPLQGLAAVVATI-YQAGC 425
P + T L G+ A + Y GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 56/120 (46%), Gaps = 13/120 (10%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
+A + V V G ++E E DR D+ LP Q+ I + K
Sbjct: 603 RALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ-- 660
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
L+ ++ G I+ +I+ Y GQ GG A+ADV FG YNPSG+LP+T+Y S
Sbjct: 661 -LVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYKNS 719
>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
Length = 861
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 150/435 (34%), Positives = 209/435 (48%), Gaps = 70/435 (16%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S +L + N +L R DL RLTL+EK ++D + ++ RLGI K+ WWSEALHG +
Sbjct: 18 SAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAA 77
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN----------V 156
+G T+FP+ + AASFN L + + STE RA YN +
Sbjct: 78 NMG----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDM 127
Query: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
+ L+ W+PN+NIFRDPRWGRGQET GEDP L S VKGLQ + +L A
Sbjct: 128 KMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGPEDARYRKL--WA 185
Query: 217 CCKHYTAYD----------------LDNWKG---------TDRYHFNAMVIY-------- 243
C KHY + D W+ D M Y
Sbjct: 186 CAKHYAVHSGPEYTRHTANLTDVSARDFWETYMPAFKTLVKDAKVREVMCAYQRLDDDPC 245
Query: 244 --TYYLIK--------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-S 291
+ L++ F+Y +VSDC +V Y + + +K++LAG D+ CG +
Sbjct: 246 CGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVLAGTDVECGFN 305
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ K AV+ GL++E +D + LG D PS + ++ + T+A+
Sbjct: 306 YAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMD-DPSLVEWSKIPYSAMSTKASA 364
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
++ALD ARQ IVLL+N LPL A K +A+IGPNA+ M GNY GTP T L
Sbjct: 365 NVALDMARQTIVLLQNKNNILPLKKNAEK-IAIIGPNAHNEPMMWGNYNGTPNHTVTILD 423
Query: 412 GLAAVVAT-IYQAGC 425
G+ A +Y GC
Sbjct: 424 GVKAKQKKLVYIPGC 438
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ + + K A VI + SG I+ +I+ YPGQ G
Sbjct: 631 DRTSIELPKVQREFL-KALKAAGKQVIYVNCSGSA--IALQPETESCDAIVQAWYPGQEG 687
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV FG YNP G+L +T+Y
Sbjct: 688 GTAVADVLFGDYNPGGKLSVTFY 710
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 146/434 (33%), Positives = 204/434 (47%), Gaps = 79/434 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N +L ++ R+ +L+ LT+ EKI ++D++ +V RL IP Y WW+EALHGV+ G
Sbjct: 25 FRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVARAGT-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT FPQ I AA+++ +++S EARA YN GLTF
Sbjct: 83 --------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKTGRYEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L S VKGLQ G P K AC KH+
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQ---GNDPKYFKTHACAKHFAV 191
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ W +R+ +NA V +Y YL FK
Sbjct: 192 HSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYNAFDGQPCCASNT 248
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGK 295
+VSDC ++ Y +++ P+E AA ++ DL CG
Sbjct: 249 LLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAADALKHSTDLECGDTYN- 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ ++ GL+ E ID ++ LG D S + Q+ V + ++ AL
Sbjct: 308 NLNKSLAGGLITEKDIDISMRRILKGWFELGMLDPKSSVL-WNQIPYSVVDSDEHKKQAL 366
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
A++ IVL+KN LP + IK +AV+GPNA+ +GNY GTP T L+G+ A
Sbjct: 367 KMAQKSIVLMKNENNILPFNKN-IKKIAVVGPNADDEMMQLGNYNGTPSSIVTILEGIKA 425
Query: 416 V---VATIYQAGCS 426
IY+ G
Sbjct: 426 KFPNTEIIYEKGSE 439
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 13/118 (11%)
Query: 439 KKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGP 488
K+ AD V G S+E E D+ + LP Q+ L+ E+ K K P
Sbjct: 596 KEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKTGK-P 654
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ ++ +G + +++ +L Y GQ+GG A+ADV G YNPSGRLP+T+Y
Sbjct: 655 VVFVLCTGSS--LGLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLPVTFY 710
>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
Length = 858
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 149/444 (33%), Positives = 215/444 (48%), Gaps = 71/444 (15%)
Query: 39 ACDVVSNPSLASLGF--CNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYE 96
A ++ PSL + + N +L R DL+ RLTL+EK ++D + ++ RLGI K+
Sbjct: 17 ASSALAQPSLTGMAYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFF 76
Query: 97 WWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-- 154
WWSEALHG + +G T FP+ I AASFN +L + S E RA Y
Sbjct: 77 WWSEALHGAANMG----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHH 126
Query: 155 -------NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
+ L+ W+PN+NIFRDPRWGRGQET GEDP L + T V+GLQ +
Sbjct: 127 RIRNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDS 186
Query: 208 SPNRLKVAACCKHYTAY----------DLDNWKGTDRYH-----FNAM--------VIYT 244
+L AC KHY + +L+N D + F + V+
Sbjct: 187 KYRKL--WACAKHYAVHSGPEYTRHTANLNNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244
Query: 245 YYLIK--------------------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y + F+Y +VSDC +V ++ + + A AK+ LA
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVHATAKAALA 304
Query: 284 GLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
G D+ CG ++ K AV+ GL++E +D + LG D P+ P+ ++
Sbjct: 305 GTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDD-PALVPWSKIPY 363
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+ ++A++ L+LD ARQ IVLL+N LPL + +AVIGPNA+ M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422
Query: 403 PCKYTTPLQGLAAVVATI-YQAGC 425
P + T L G+ A + Y GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 56/120 (46%), Gaps = 13/120 (10%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
+A + V V G ++E E DR D+ LP Q+ I + K
Sbjct: 603 RALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ-- 660
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
L+ ++ G I+ +I+ Y GQ GG A+ADV FG YNPSG+LP+T+Y S
Sbjct: 661 -LVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYKNS 719
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 151/401 (37%), Positives = 197/401 (49%), Gaps = 81/401 (20%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ +LT++EKI+ L ++ + RL I KY +EALHGV V PG T
Sbjct: 38 RILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPGNF--------T 87
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I AA +N L I V+S EARA +N G LTFWSP +N+ R
Sbjct: 88 VFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMAR 147
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ K +VKGLQ G P LK+ + KH+ A N +
Sbjct: 148 DPRWGRTPETYGEDPFLSGKLGVSFVKGLQ---GDDPRYLKIVSTPKHFAA----NNEEH 200
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
+R+ N ++ + YYL F+
Sbjct: 201 NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLLKKVLRHD 260
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA---AVKA 303
Y+VSDC L Y KT E AAA SI AGLDL CG + + E A K
Sbjct: 261 WGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGDEV--YMEPLLNAYKQ 318
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
+V+E+ ID A + MRLG FD P+ PY ++ P V + + LAL+AARQ IV
Sbjct: 319 YMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSKLALEAARQSIV 377
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
LLKN LPL IK++AV+G NA ++ G+Y GTP
Sbjct: 378 LLKNEKKFLPLDSKKIKSIAVVGINAGNSE--FGDYSGTPV 416
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 64/101 (63%), Gaps = 3/101 (2%)
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
D TV V+G ++SIE E DR + LP QQ+ I E K+ V++++ +G I++
Sbjct: 603 DLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV-AGSSLAINWM- 660
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
D I +I+ YPG+AGG A+A+V FG YNP G+LP+T+Y
Sbjct: 661 -DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY 700
>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
[Novosphingobium sp. Rr 2-17]
Length = 846
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 147/397 (37%), Positives = 204/397 (51%), Gaps = 69/397 (17%)
Query: 68 VKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQV 127
+ + L EK+ L +A + +G+P Y+WW+E LHG++ G AT FPQ
Sbjct: 17 IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNGE----------ATVFPQA 66
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGRG 179
I AA+F+ L + +G V+STEARA +N GLT WSPNINIFRDPRWGRG
Sbjct: 67 IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT----------AYDLD-- 227
QET GEDP L+ A G++KGLQ TD P +V A KH+ ++D+D
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTDPAHP---RVIATPKHFAVHSGPEAGRDSFDVDVS 183
Query: 228 ------------NWKGTDRYHFNAMVIYTYY----------LIK--------FK-YIVSD 256
T+ + M Y LI FK ++VSD
Sbjct: 184 PQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADDALINQRLRKDWGFKGFVVSD 243
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAIS 316
CD++ ++ HY EA+A SI AG D +CG+ + AV+ GLV+E+ ID A++
Sbjct: 244 CDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAALPQ-AVERGLVDEATIDRALA 302
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+ LG G + P+G++ P V T A++ LAL+AAR+ IVLL+N LPL P
Sbjct: 303 RSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALEAARKSIVLLQNDNARLPLKP 360
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LAV+G NA+ + +Y GT TPLQG+
Sbjct: 361 GT--KLAVVGANADDLSVIEASYHGTAADPITPLQGI 395
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 65/124 (52%), Gaps = 17/124 (13%)
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAK 483
Q +A A +ADA V V+G +E E+ DR D+ LP Q L+ +
Sbjct: 561 QRAEAIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEALHA 620
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKIT-SILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K PVI+++ SG I DPK+ ++L YPG+ GG A+A+ G NPSGRLP
Sbjct: 621 TGK-PVIVVLTSGSAIAI-----DPKLADAVLAAWYPGEEGGTALAETLGGINNPSGRLP 674
Query: 543 MTWY 546
+T+Y
Sbjct: 675 VTFY 678
>gi|125541196|gb|EAY87591.1| hypothetical protein OsI_09002 [Oryza sativa Indica Group]
Length = 194
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 104/191 (54%), Positives = 131/191 (68%), Gaps = 12/191 (6%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
FC+ L + R DLV LTL EK++ L D A V+RLG+P YEWWSE LHG+S G G
Sbjct: 5 FCDARLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 64
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIF 171
F+ V TSF TAA+F+A L++ +G+ V EARA+YN+G A GLT WSPN+NIF
Sbjct: 65 RFNGTVRAVTSF-----TAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 119
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQETPGEDP+ AS+YA +V GLQ+ G + +ACCKH TAYDLD W
Sbjct: 120 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQEIGG------EASACCKHATAYDLDYWNN 173
Query: 232 TDRYHFNAMVI 242
RY++++ VI
Sbjct: 174 VVRYNYDSKVI 184
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 179/579 (30%), Positives = 265/579 (45%), Gaps = 88/579 (15%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFL---------VDSAGSVSRLGIPKY 95
N +L +C+ SL ID R+ DL+ LTL EK+ + + +VSR+G+P Y
Sbjct: 64 NDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDY 123
Query: 96 EWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN 155
W E + VG N AT F + AASFN S + G V TE RA+ N
Sbjct: 124 YWLVET---NTAVGSACIAENKC--ATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMN 178
Query: 156 VG----------LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD 205
V GLT + PNIN RDPR+GR E PGEDP L+ +YA V+G+Q+ D
Sbjct: 179 VHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQERD 238
Query: 206 G-GSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------- 251
G P KV A KH+TAY + +G D Y+ + ++ YL +++
Sbjct: 239 ANGYP---KVLAYLKHFTAYSREEGRGNDDYNISMYDLFDTYLPQYEMGMVQGGATGVMC 295
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
++ +DC +V+ L +AAA ++
Sbjct: 296 SYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNLRGKPIQAADEAQAAAMAL 355
Query: 282 LAGLDLNCGSFLGKHT-EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
+ G D+ GS L H A+ G E A++ AI ++ G FD P+ + L
Sbjct: 356 MNGADIEMGSTLFVHNLTTAITLGYATEEAVNQAIRRSYRPHFIAGRFD-DPTLSEWFSL 414
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
G D+ ++ +Q++ L+AA QG+VLLK+ LP++ A LAV+GP ++ +YE
Sbjct: 415 GLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPIA--AGTKLAVLGPLGMTRSGLMSDYE 472
Query: 401 --------GTPCKYT--TPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVL 450
G C T + + T+ AG T+ V+ + AA D VL
Sbjct: 473 SDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQLAADRDLIVL 532
Query: 451 VMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKI 510
+G ++ E E DR D LPGQQ L V + K PV+L++++GG + P
Sbjct: 533 CLGNTKTQEQEGFDRKDTALPGQQYALFEAVLTLRK-PVVLVLVNGGQIALDGMTGYP-- 589
Query: 511 TSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++I+ P GG A+A FG+ N G+LP T YP S
Sbjct: 590 SAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYS 628
>gi|270340289|ref|ZP_06007688.2| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270331970|gb|EFA42756.1| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 867
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 142/432 (32%), Positives = 205/432 (47%), Gaps = 78/432 (18%)
Query: 48 LASLGFCNT----------SLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEW 97
+A L FCN +L R DL RLTL+EK + ++D + ++ RLGI +++W
Sbjct: 12 MAVLPFCNMPAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQW 71
Query: 98 WSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN-- 155
WSEALHGV+ +G T FPQ I AASFN +L + S E RA +N
Sbjct: 72 WSEALHGVANMG----------DVTVFPQPIGMAASFNDNLVYQVFNATSDEMRAKWNEL 121
Query: 156 -------VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGS 208
GL+ W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ + +
Sbjct: 122 QASGKEVTRFHGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPE--T 179
Query: 209 PNRLKVAACCKHYTAYDLDNW-KGTDRY-HFNAMVIYTYYLIKFK--------------- 251
K+ AC KHY + W + TD + ++ Y+ FK
Sbjct: 180 AKYRKLWACAKHYAIHSGPEWARHTDNVTDVSPRDLWETYMPAFKSLVQDANVREVMCAY 239
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
+VSDC +V + S + AA K++LAG
Sbjct: 240 QRWDDDPCCGNNRLLQRILRDEWGFKYLVVSDCGAVSDFWTSHKSSSNARNAATKAVLAG 299
Query: 285 LDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
D+ CG +++ K AVK G + E+ +D + LG D + S P+ +
Sbjct: 300 TDVECGYNYVYKSIPEAVKYGAMTEAEMDEHVVRLLEGRFDLGEMDDN-SIVPWSSIPSS 358
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
+C++ ++ L+LD + Q + LL+N+ LPL+ K +AVIGPN + M GNY GTP
Sbjct: 359 ALCSKKHRQLSLDMSLQTMTLLQNSNDVLPLNKKE-KKIAVIGPNVDNEPMMWGNYNGTP 417
Query: 404 CKYTTPLQGLAA 415
T L G+ +
Sbjct: 418 RSTVTILDGIKS 429
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 56/122 (45%), Gaps = 13/122 (10%)
Query: 438 AKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKG 487
A K + V V G +E E DR D+ LP Q+ + + + K
Sbjct: 597 AIKQLKDCETVVFVGGISPQLEGEEMPINVDGFKGGDRTDIELPKVQREFLKALKQAGKS 656
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYP 547
V+ + SG I+ +IL YPGQ GG A+A V FG YNPSG+LP+T+Y
Sbjct: 657 -VVFVCCSGSA--IALTPETKTCDAILQAWYPGQEGGEAVARVLFGEYNPSGKLPITFYK 713
Query: 548 QS 549
+
Sbjct: 714 NT 715
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 209/433 (48%), Gaps = 81/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L+G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
T+ AVK GL++E I+ ++ LG + S P+ + + +++LAL
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G+ A
Sbjct: 369 KMAHESLVLLQNNNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGIRA 426
Query: 416 VV---ATIYQAGC 425
+ IY+ C
Sbjct: 427 KLPDAQIIYEPVC 439
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
SAD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 SADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1357
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 172/603 (28%), Positives = 258/603 (42%), Gaps = 114/603 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL-VDSAGSVSRLGIPKYEWWSEALHGVSYVGPG 111
+ NTS + R DLV R+TL EK L SA ++ RLG+ +Y + +EA HG++Y+G
Sbjct: 49 YLNTSYPFEARAADLVSRMTLAEKAAQLNTTSAPAIPRLGVQQYTYQAEAQHGINYLGGD 108
Query: 112 THFSNVV---PGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG-------- 160
+ +V P ATSFP ++ S++ +L VS EAR + + L G
Sbjct: 109 QNSGSVAGNPPVATSFPTNFASSMSWDPALVYQETTAVSDEARGLVDKSLFGTGQNNLGP 168
Query: 161 -------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---QTDGGSPN 210
LTFW+P +N+ RDPRWGR E GEDP L + A +V G Q T
Sbjct: 169 SASDYGSLTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNGFQGNSMTGQSLDG 228
Query: 211 RLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------- 251
LK AA KHY D++ + + + + YY +F
Sbjct: 229 YLKAAATAKHYALNDVEQNRTGISSNVSDTDLRDYYTKQFADLIENSHVAGLMTSYNAIN 288
Query: 252 -----------------------YIVSDCDSVDVLY--------------------NSQH 268
Y+ SDC +V Y +
Sbjct: 289 GTPSVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGHAWAPPGWTTDGGDTNSI 348
Query: 269 YTKTPEEA--------AAKSILAGLDLNCG--SFLGKHTEAAVKAGLVNESAIDHAISNN 318
+T T A A S+ AG +NCG F ++ +AA+ AG+++E ID ++
Sbjct: 349 WTNTSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAISAGILSEGVIDSDLTKL 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN------TAGSL 372
F M G FD SK PY + + + A+Q LA A +VLLKN +A L
Sbjct: 409 FTIRMETGEFD-PASKVPYTSITKAQIQSPAHQALATSVADNSLVLLKNANVSGTSAPLL 467
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAG-------- 424
P S + + N+ ++G AN + +G+Y G P +QGL + +
Sbjct: 468 PASASKLANVVILGDMAN--QVTLGDYSGAPSLQVNAVQGLTTAIKAANPSANILFDAAG 525
Query: 425 -CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
S + A AD V+ +G +Q+ E +DR L +PG LIT+
Sbjct: 526 TSSTTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDRTTLNMPGNYDSLITQTTA 585
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
+ L++ S G IS + + ++++ GY G++ G A+ADV G+ NPSG L
Sbjct: 586 LGNPKTALVVQSDGPVKISDVQG--SVPAVVFSGYNGESQGTALADVLLGKQNPSGHLNF 643
Query: 544 TWY 546
TWY
Sbjct: 644 TWY 646
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 147/446 (32%), Positives = 205/446 (45%), Gaps = 84/446 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N +L + R DLV RLTL+EK + + D + ++ RLGI K+ WWSEALHG
Sbjct: 14 YQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHG-------- 65
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG---------LTF 163
F+N T FP+ + AASF+ L + S E RA Y+ L L+
Sbjct: 66 -FANN-DDVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLSV 123
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ + +L AC KHY
Sbjct: 124 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYKKL--LACAKHYAV 181
Query: 224 YDLDNWKGTDRYHFN-----AMVIYTYYLIKFK--------------------------- 251
+ W R+ N ++ YL FK
Sbjct: 182 HSGPEWS---RHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPCCGSD 238
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH 296
+VSDC ++ Y S + + AAAK++LAG D+ C KH
Sbjct: 239 RLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVECQ--WDKH 296
Query: 297 T----EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQ 351
AV+ GLV E ID ++ LG D P + PY Q+ + + ++
Sbjct: 297 NYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMD--PDEIVPYAQIPASVINNEEHR 354
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LAL AR+ + LL+N LPLS + +AVIGPNA+ + GNY GTP + + L
Sbjct: 355 QLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVRTISILD 413
Query: 412 GLAAVV---ATIYQAGCSNVQCGTAQ 434
G+ + + + +Y C V+ Q
Sbjct: 414 GITSKIGEKSIVYDKACDLVEDKVTQ 439
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 67/168 (39%), Gaps = 22/168 (13%)
Query: 398 NYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGT---------AQVDDAKKAAASADAT 448
N+ P K ++ I+ A +N Q +D K +
Sbjct: 536 NWRTLPSKIPYKVEKGKKYTIEIHYAQLNNWQANLEFNFGKEIPVNYEDLLKKLKGIETV 595
Query: 449 VLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
+ V G +E E DR D+ LP Q+ + + K +I ++ G
Sbjct: 596 IFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGKK---VIFVNNSG 652
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I +IL Y G++GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 653 SAIGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFY 700
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 147/415 (35%), Positives = 205/415 (49%), Gaps = 70/415 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N + + RV DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV V PG
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGR 87
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L + + V+S EARA +N G LT
Sbjct: 88 F--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +V GLQ G P+ LK+ + KH+
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQ---GDDPHYLKIVSTPKHFA 196
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YY F+
Sbjct: 197 ANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSWLLQ 256
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 257 KVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLLN 316
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A K +V+++ ID A + M+LG FD + PY ++ P + ++ +Q +ALDAAR
Sbjct: 317 AYKQYMVSDADIDSAACHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAAR 375
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-CKYTTPLQGL 413
Q IVLLKN LPL+ +K++AV+G NA K G+Y G P + + LQG+
Sbjct: 376 QCIVLLKNQKNRLPLNADKLKSIAVVGINAG--KCEFGDYSGAPVVEPVSILQGI 428
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 596 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVA 653
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D + +I+ YPG+ GG A+A+V FG YNP+GRLP+T+Y
Sbjct: 654 GS---SLAVNWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY 702
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 145/399 (36%), Positives = 196/399 (49%), Gaps = 77/399 (19%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ + RL I KY +EALHG+ V PG T
Sbjct: 37 RIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGI--VRPGNF--------T 86
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I A+ +N I V+S EARA +N G LTFWSP +N+ R
Sbjct: 87 VFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLTFWSPTVNMAR 146
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ K +VKGLQ G P LKV + KH+ A N +
Sbjct: 147 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQ---GNDPRYLKVVSTPKHFAA----NNEEH 199
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
+R+ N + + YYL F+
Sbjct: 200 NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTWLLKKVLRTD 259
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGL 305
Y+VSDC + +L Y KTPE AA ++ AGLDL CG + + A K +
Sbjct: 260 WGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEPLMNAYKQYM 319
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
V+E+ ID A M LG FD P+K PY L P V + ++++AL+AARQ +VLL
Sbjct: 320 VSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALEAARQSLVLL 378
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
KN LP++P IK++AV+G NA + G+Y G P
Sbjct: 379 KNENNFLPINPKKIKSIAVVGINAGNCE--FGDYSGKPV 415
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
DAKKA D T+ VMG ++SIE E DR + LP Q+L I E K+ P + +++
Sbjct: 593 DAKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQELFIEEAYKL--NPKMAVVLVA 650
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D + +IL YPG+ GG A+A+ FG YNP+GRLP+T+Y
Sbjct: 651 GS---SLAVNWMDEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYY 699
>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
Length = 871
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 144/437 (32%), Positives = 204/437 (46%), Gaps = 73/437 (16%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N +L + R DL+ RLTL EK T + D + +V RLGI K+ WWSEALHG + G
Sbjct: 30 LPYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFANNG- 88
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG---------LAGL 161
T FP+ + AASF+ +L I VS EARA YN L
Sbjct: 89 ---------NVTVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFLSL 139
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+ W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ K+ AC KH+
Sbjct: 140 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--ADAKYRKLLACAKHF 197
Query: 222 TAYDLDNW--KGTDRYHFNAMVIYTYYLIKFK---------------------------- 251
+ W + + + ++ YL FK
Sbjct: 198 AVHSGPEWSRHSLNINNLDPRDLWETYLPAFKSLVQKSDVRQVMCAYQRLDDEPCCGNTQ 257
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC--GSFLGK 295
+V+DC +V Y S + AA+K + AG D+ C + + K
Sbjct: 258 LLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSSDAVHAASKGVWAGTDVECQWDNHIYK 317
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV GL+ E+ I+ + N LG D + P+ ++ V + ++ LAL
Sbjct: 318 QLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMDDD-ALVPWSKIPMSVVNNEEHRKLAL 376
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D A + + LL+N LPLS + K +AV+GPNAN + GNY GTP + T L G+ +
Sbjct: 377 DMALKSMTLLQNKNNILPLSKS--KKIAVVGPNANDKPMLWGNYNGTPVRTITILDGITS 434
Query: 416 VVA---TIYQAGCSNVQ 429
++ +Y+ GC V+
Sbjct: 435 KLSANKVLYEKGCDLVE 451
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP Q+ + + + K VI + SG I+ +IL Y G++G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGK-QVIFVNCSGSA--IALIPETESCDAILQAWYGGESG 694
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV FG YNPSG+LP+T+Y
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFY 717
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 147/415 (35%), Positives = 205/415 (49%), Gaps = 70/415 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N + + RV DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV V PG
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGR 87
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L + + V+S EARA +N G LT
Sbjct: 88 F--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +V GLQ G P+ LK+ + KH+
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQ---GDDPHYLKIVSTPKHFA 196
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YY F+
Sbjct: 197 ANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQ 256
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 257 KVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLLN 316
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A K +V+++ ID A + M+LG FD + PY ++ P + ++ +Q +ALDAAR
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAAR 375
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-CKYTTPLQGL 413
Q IVLLKN LPL+ +K++AV+G NA K G+Y G P + + LQG+
Sbjct: 376 QCIVLLKNQKNRLPLNADKLKSIAVVGINAG--KCEFGDYSGAPVVEPVSILQGI 428
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV +I+++++G
Sbjct: 596 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAG 654
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG+ GG A+A+V FG YNP+GRLP+T+Y
Sbjct: 655 SSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY 702
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 143/433 (33%), Positives = 209/433 (48%), Gaps = 81/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L ++ R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L+G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
T+ AVK L++E I+ ++ LG + S P+ + + +++LAL
Sbjct: 313 ITD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPKHKELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G+ A
Sbjct: 369 KMAHESLVLLQNNNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGIRA 426
Query: 416 VV---ATIYQAGC 425
+ IY+ C
Sbjct: 427 KLPDAQIIYEPVC 439
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
SAD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 SADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 142/381 (37%), Positives = 196/381 (51%), Gaps = 67/381 (17%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P +A + S + R LV +++ EK+ ++ A ++ RL IP YEWWSE LHG+
Sbjct: 18 PCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGI 77
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG-------- 157
+ G AT FPQ I AAS+N +L Q +G VVSTEARA +N
Sbjct: 78 ARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHK 127
Query: 158 -LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAA 216
AGLT WSPNINIFRDPRWGRG ET GEDP L + A G+++GLQ D P + A
Sbjct: 128 RYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHP---RTIA 184
Query: 217 CCKHYT----------AYDLD----NWKGT----------DRYHFNAMVIYTY------- 245
KH +D+D + + T D ++ M Y
Sbjct: 185 TPKHIAVHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPAC 244
Query: 246 ---YLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFL 293
+L+ FK ++VSDCD+VD + ++ ++A ++ AG DLNCG
Sbjct: 245 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 304
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
+ A++ G V+E+ +D ++ FA RLG + P K PY +LG KDV A++ L
Sbjct: 305 -RELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRAL 362
Query: 354 ALDAARQGIVLLKNTAGSLPL 374
AL AA + IVLLKNTA +LPL
Sbjct: 363 ALQAAAESIVLLKNTATTLPL 383
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 149/399 (37%), Positives = 194/399 (48%), Gaps = 77/399 (19%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ +LT++EKI+ L ++ + RL I KY +EALHGV V PG T
Sbjct: 38 RILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGV--VRPGNF--------T 87
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I AA +N L I +S EARA +N G LTFWSP +N+ R
Sbjct: 88 VFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMAR 147
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ K +VKGLQ G P LK+ + KH+ A N +
Sbjct: 148 DPRWGRTPETYGEDPFLSGKLGVSFVKGLQ---GDDPRYLKIVSTPKHFAA----NNEEH 200
Query: 233 DRYHFNAMV----IYTYYLIKFK------------------------------------- 251
+R+ N ++ + YYL F+
Sbjct: 201 NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNWLLKKVLRHD 260
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGL 305
Y+VSDC + D L Y KT E AA SI AGLDL CG + + A K +
Sbjct: 261 WGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQAGLDLECGDNVYMEPLLNAYKQYM 320
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
V E+ ID A + MRLG FD P+ PY ++ P V + + LAL+AARQ IVLL
Sbjct: 321 VTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKISPSVVGCEKHSQLALEAARQSIVLL 379
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
KN LPL IK++AV+G NA + G+Y GTP
Sbjct: 380 KNEKKFLPLDLKKIKSIAVVGINAGNCE--FGDYSGTPV 416
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 66/109 (60%), Gaps = 3/109 (2%)
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A A D T+ V+G ++SIE E DR + LP QQ+ I E K+ V++++ +G
Sbjct: 595 AGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV-AGS 653
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG+AGG A+A+V FG YNP G+LP+T+Y
Sbjct: 654 SLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY 700
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 145/420 (34%), Positives = 208/420 (49%), Gaps = 80/420 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NT L R DL RLTL+EK + + + ++ RLGIP++EWWSEALHG+ G
Sbjct: 25 YQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALHGIGRNG--- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFW 164
AT FP + A+S++ +L Q + VS E R GL+FW
Sbjct: 82 -------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTIKRYQGLSFW 134
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL-KVAACCKHYTA 223
+PNINIFRDPRWGRGQET GEDP L S+ V+GLQ G S ++ K+ AC KH+
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQ---GPSDSKYRKLLACAKHFAV 191
Query: 224 YDLDNWKGTDRYHFNA-----MVIYTYYLIKFK--------------------------- 251
+ W +R+ FN ++ YL FK
Sbjct: 192 HSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRIDGQPCCGNN 248
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLG 294
+VSDC +V + H+ +P+ A+AK++L+G D+ CGS
Sbjct: 249 RFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTDVECGSDYS 308
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDL 353
E AV+AG++ E+ +D ++ LG FD P + P+ ++ V ++A++ L
Sbjct: 309 NLPE-AVRAGIIKEADVDVSVRRLLEARFALGDFD--PDELVPWTKISESVVASKAHKQL 365
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
ALD AR+ +VLL+N LPL + K + V+G NA + M GNY G P + T LQGL
Sbjct: 366 ALDMARKSMVLLQNN-DILPLKRSGQK-IVVVGANAIDSTMMWGNYSGYPTQTVTILQGL 423
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 20/144 (13%)
Query: 413 LAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESH--------- 463
LA + I G S Q D + A AD + V G +E E
Sbjct: 574 LAMIKFDITHTGLSTPQ-------DIVRKTAGADVVIFVGGISPRLEGEEMEVSDPGFKG 626
Query: 464 -DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
DR + LP Q+ +I +++ + ++ ++ G I+ ++ +IL YPG+
Sbjct: 627 GDRTTIELPQAQREVIKALSEAGRR---IVFVNCSGSAIALTPESQRVDAILQAWYPGEQ 683
Query: 523 GGAAIADVCFGRYNPSGRLPMTWY 546
GG A+ADV FG YNPSG+LP+T+Y
Sbjct: 684 GGTAVADVLFGDYNPSGKLPVTFY 707
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 146/415 (35%), Positives = 205/415 (49%), Gaps = 70/415 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N + + RV DL+ RLT++EKI+ L ++ + RLGI KY +EALHGV V PG
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGR 87
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L + + V+S EARA +N G LT
Sbjct: 88 F--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +V GLQ G P+ LK+ + KH+
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQ---GDDPHYLKIVSTPKHFA 196
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YY F+
Sbjct: 197 ANNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLLQ 256
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 257 KVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLLN 316
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A K +V+++ ID A + M+LG FD + PY ++ P + ++ +Q +ALDAAR
Sbjct: 317 AYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAAR 375
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-CKYTTPLQGL 413
Q +VLLKN LPL+ +K++AV+G NA K G+Y G P + + LQG+
Sbjct: 376 QCVVLLKNQKNRLPLNADKLKSIAVVGINAG--KCEFGDYSGAPVVEPVSILQGI 428
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V VMG ++SIE E DR D+ LP Q+ + E+ KV +I+++++G
Sbjct: 596 EAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAG 654
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG+ GG A+A+V FG YNP+GRLP+T+Y
Sbjct: 655 SSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY 702
>gi|337749758|ref|YP_004643920.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
KNP414]
gi|336300947|gb|AEI44050.1| glycoside hydrolase family 3 domain protein [Paenibacillus
mucilaginosus KNP414]
Length = 937
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 150/463 (32%), Positives = 222/463 (47%), Gaps = 69/463 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F +T L ++ RV DLV RLTL+EKI + + RLGI KY+ +E HGV+++G
Sbjct: 9 FQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVAWLGE-- 66
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT FPQ + A ++N L + IG V+ EAR Y N + GLT W+P +++
Sbjct: 67 --------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIWAPTVDM 118
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + +T VKG+Q G P LK AA KH+ + + +
Sbjct: 119 ERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQ---GDHPFYLKTAATLKHFLGNNNEMDR 175
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 176 GECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDVNAIVKGEWGM 235
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++VSD V N Y + EA A S+ +G+D + + A++ GL+
Sbjct: 236 DGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDDAGITLRAVRDALEGGLLA 295
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ N F +RLG FDG P + PY + +C + L+L AAR+G VLLKN
Sbjct: 296 EEDLDRAVGNAFRVRIRLGEFDG-PEENPYAHVPEAKLCAPEHAALSLQAAREGFVLLKN 354
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIYQAG 424
G LPL+ A ++AVIGP A+V T Y GTP TPL+G+A +A I++ G
Sbjct: 355 E-GLLPLAKPA--SIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERMAPGTVIHRTG 409
Query: 425 CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
++ + + + + ++ G ++ EAE +R D
Sbjct: 410 GDRIRIKSLRTGRYVRIGGADGFSLTADGTEE--EAEVFERTD 450
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 6/111 (5%)
Query: 438 AKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
A +AA A+ ++ +G I E DR DL L Q+ L+ V +A P + ++ G
Sbjct: 547 AVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAQEALLQAV--LAANPRTVAVLVG 604
Query: 497 G-GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
F +++A+ + +IL+ + GQ G A ADV FG +P+GRL MTWY
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWY 653
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 149/436 (34%), Positives = 211/436 (48%), Gaps = 87/436 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-GHP-SKQPYGQLG-PKDVCTQANQD 352
T+ AVK GL++E I+ ++ LG + HP S PY + PK +++
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPYSVIDCPK------HKE 365
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LAL A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423
Query: 413 LAAVV---ATIYQAGC 425
+ A + IY+ C
Sbjct: 424 IRAKLPDAQIIYEPVC 439
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+AD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 NADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 149/436 (34%), Positives = 211/436 (48%), Gaps = 87/436 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-GHP-SKQPYGQLG-PKDVCTQANQD 352
T+ AVK GL++E I+ ++ LG + HP S PY + PK +++
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPYSVINCPK------HKE 365
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LAL A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423
Query: 413 LAAVV---ATIYQAGC 425
+ A + IY+ C
Sbjct: 424 IRAKLPDAQIIYEPVC 439
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+AD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 NADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFY 708
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 149/436 (34%), Positives = 211/436 (48%), Gaps = 87/436 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFD-GHP-SKQPYGQLG-PKDVCTQANQD 352
T+ AVK GL++E I+ ++ LG + HP S PY + PK +++
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPYSVINCPK------HKE 365
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LAL A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423
Query: 413 LAAVV---ATIYQAGC 425
+ A + IY+ C
Sbjct: 424 IRAKLPDAQIIYEPVC 439
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+AD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 NADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
Length = 1548
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 184/630 (29%), Positives = 268/630 (42%), Gaps = 150/630 (23%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG-SVSRLGIPKYEWWSEALHGVSYVGPG 111
+ +T R DLV R+TL EK+ L ++G ++ RLG+ +Y +WSE HGV+ +G
Sbjct: 64 YLDTHYSAQARAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGVNTLG-- 121
Query: 112 THFSNVVPG----ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG------- 160
+ N G +TSFP + S++ SL +S EAR + L G
Sbjct: 122 ANQDNGGNGGAVHSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQNNLG 181
Query: 161 --------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPN-- 210
LTFW+P +N+ RDPRWGR E GEDP L S A +V G + G +P
Sbjct: 182 PSAADYGSLTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYE---GNTPTGQ 238
Query: 211 ----RLKVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------- 251
LKVAA KHY D++ DR ++ V ++ YY +F
Sbjct: 239 SKTGTLKVAATAKHYALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSGL 294
Query: 252 -------------------------------YIVSDCDSVDVLYNS-------------- 266
Y+ SDC ++ Y S
Sbjct: 295 MTSYNAINGTPSVADTYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWTT 354
Query: 267 ------QHYTKT------PEEAA--AKSILAGLDLNCGSFLGKHTE--AAVKAGLVNESA 310
+T T P +A A ++ AG DLNC + + AA+ AG+++E
Sbjct: 355 DGKSSTGTWTNTATGATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEGV 414
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN--- 367
ID+A+ F + G FD S PY + + + A+Q LA A +VLLKN
Sbjct: 415 IDNALVKIFTVRVETGEFDPAGSN-PYTGITKAQIQSPAHQALATKVADNSLVLLKNQPP 473
Query: 368 -TAGS---------------------LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
+G+ LPLS A + ++G AN +GNY P
Sbjct: 474 AASGTSTTPPAASSAASSAAAAAKPLLPLSAAATAKIVIVGDMANAV--TLGNYSSDPAL 531
Query: 406 YTTPLQGLAAVVATIYQ-AGCSNVQCGTAQVDDAKKAA--------ASADATVLVMGADQ 456
+P+QG+ A V A + CGT+ A + A ADA ++ +G +Q
Sbjct: 532 KVSPVQGITAAVRKANPGASVTFDACGTSTTASAAASCSAQTLADVAGADAVIVFVGTNQ 591
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
I E DR + +PG LI++VA V ++L + SGG I + D SI++
Sbjct: 592 QIADEGKDRTSIAMPGNYDSLISQVAAVGNPRMVLAVQSGGPVRIDDVQKD--FASIVFS 649
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G+ G++ G A+ADV FG NP G L TWY
Sbjct: 650 GFNGESQGTALADVLFGAQNPDGHLDFTWY 679
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 144/435 (33%), Positives = 203/435 (46%), Gaps = 81/435 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ R+ +L+ LT +EKI ++D++ +V RL IP Y WW+EALHGV+ G
Sbjct: 46 FRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI-- 103
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT FPQ I AA+++ +++S EARA YN GLTF
Sbjct: 104 --------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGLTF 155
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PNINIFRDPRWGRGQET GEDP L S VKGLQ G P K AC KH+
Sbjct: 156 WTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQ---GNDPKFFKTHACAKHFAV 212
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
+ W +R+ +NA + +Y YL FK
Sbjct: 213 HSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPCCANNT 269
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEE--AAAKSILAGLDLNCGSFLGK 295
+VSDC ++ + +++ P+E AA ++ DL CG
Sbjct: 270 LLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECGDTYN- 328
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC-TQANQDLA 354
+ ++ +GL+ E ID ++ LG D P + P V ++ ++ A
Sbjct: 329 NLNKSLASGLITEKDIDESMRRILKGWFELGMLD--PKSSVHWNTIPYSVVDSEEHKKQA 386
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L A++ IVL+KN LPL+ IK +AV+GPNA+ +GNY GTP T L G+
Sbjct: 387 LKMAQKSIVLMKNEKNILPLNRN-IKKIAVVGPNADDGLMQLGNYNGTPSSIVTILDGIK 445
Query: 415 AV---VATIYQAGCS 426
IY+ G
Sbjct: 446 TKFPNAEIIYEKGSE 460
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 62/118 (52%), Gaps = 13/118 (11%)
Query: 439 KKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGP 488
++ +AD V G S+E E D+ + LP Q+ L+ E+ K K P
Sbjct: 617 REKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-P 675
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+ ++ +G + +++ ++L Y GQ+GG A+ADV G YNPSG+LP+T+Y
Sbjct: 676 VVFVLCTGSA--LGLEQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFY 731
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 142/395 (35%), Positives = 198/395 (50%), Gaps = 69/395 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ +SRL IPKY +EALHGV V PG T
Sbjct: 39 RIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGRF--------T 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I AA++N L + + V+S EARA +N G LTFWSP +N+ R
Sbjct: 89 VFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMAR 148
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ T +VKGLQ G LK+ + KH+ A + ++ +
Sbjct: 149 DPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GNDSRYLKIVSTPKHFAANNEEHNRFV 205
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 206 CNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRNDWGFK 265
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y+VSDC +L N+ Y KT E AA SI AGLDL CG + +A + +V ++
Sbjct: 266 GYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEPLLSAYRQYMVTDA 325
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M+LG FD K PY ++ P + ++ +Q++AL+AAR+ IVLLKN
Sbjct: 326 DIDSAAYRVLRARMQLGLFDSG-EKNPYTKISPAVIGSKEHQEVALNAARECIVLLKNQK 384
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
LPL+ IK++AV+G NA ++ G+Y G P
Sbjct: 385 KMLPLNAKKIKSIAVVGINAGSSE--FGDYSGLPV 417
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V V+G ++SIE E DR D+ LP Q+ + E+ KV P I++++
Sbjct: 595 EAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIVVVLVA 652
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D + +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 653 GS---SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 701
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 99/157 (63%), Positives = 119/157 (75%), Gaps = 1/157 (0%)
Query: 395 MIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQC-GTAQVDDAKKAAASADATVLVMG 453
MIGNY G C YTTPLQG+ T++ +GC++V C G Q + A+ AA ADAT+LVMG
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
DQSIEAE DR LLLPG QQ L++ VA+ ++GP IL++MSGG D+SFAKNDP+I +I
Sbjct: 61 LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LWVGYPGQAGGAAIADV FG NP G+LPMTWYP Y
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDY 157
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 145/405 (35%), Positives = 197/405 (48%), Gaps = 69/405 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N I R+ DL+ RLT++EKI+ L ++ +SRL IPKY +EALHGV V PG
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGR 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L + + V+S EARA +N G LT
Sbjct: 87 F--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G LK+ + KH+
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GNDSRYLKIVSTPKHFA 195
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YYL F+
Sbjct: 196 ANNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLT 255
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 256 KVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLLN 315
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A + +V ++ ID A M+LG FD PY ++ PK + ++ +Q +ALDAAR
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSG-ENNPYTKISPKVIGSKEHQKVALDAAR 374
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+ IVLLKN LPL IK++AV+G NA ++ G+Y G P
Sbjct: 375 ECIVLLKNQNKMLPLDAKKIKSIAVVGINAG--RSEFGDYSGLPV 417
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A + + V V+G +++IE E DR D+ LP Q+ + E+ KV +++++++G
Sbjct: 595 EAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVAG 653
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 654 SSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 701
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 146/435 (33%), Positives = 201/435 (46%), Gaps = 77/435 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + L + R DL RLTL+EK + + D + ++ RLGI + WWSEALHG + G
Sbjct: 25 YQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALHGYANNG--- 81
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
T FP+ I AASFN ++ + S EAR YN +A L+
Sbjct: 82 -------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKEDTRFHSLSV 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S VKGLQ + S K+ AC KH+
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPE--STKYRKLYACAKHFAV 192
Query: 224 YDLDNWKGTDRYHFNAMVI-----YTYYLIKFK--------------------------- 251
+ + R+ N I + YL FK
Sbjct: 193 HSGPEYT---RHTANLADISPRDLWETYLPAFKATVQQAGVREVMCAYQRLDDEPCCGNS 249
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGK 295
+VSDC ++ Y + H + AAAK LAG D+ CG +
Sbjct: 250 RLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVHAAAKGTLAGTDVECGFGYAYM 309
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AV+ GLV+E+ +D + LG D P + ++ PK V + A++ LAL
Sbjct: 310 KLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD-DPKLVSWTKISPKVVDSDAHRQLAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ ARQ + LL+N LPL+ + +AV+GPNA + GNY GTP + TT L+G+ A
Sbjct: 369 NMARQTMTLLQNRNNVLPLAKG--EKIAVVGPNAADGPMLWGNYNGTPSRTTTILEGIRA 426
Query: 416 VVATI--YQAGCSNV 428
Y GC V
Sbjct: 427 KAGKDIPYLQGCDLV 441
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 19/111 (17%)
Query: 449 VLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
V V G +E E DR + LP Q+ + + K A V+ + SG
Sbjct: 604 VFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKAL-KAAGKTVVFVNCSGSA 662
Query: 499 FDISFAKNDPKITS---ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++ P++ S IL Y G+ GG A+ADV +G YNP G+LP+T+Y
Sbjct: 663 IALT-----PEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFY 708
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 145/405 (35%), Positives = 197/405 (48%), Gaps = 69/405 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N I R+ DL+ RLT++EKI+ L ++ +SRL IPKY +EALHGV V PG
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGR 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LT 162
T FPQ I AA++N L + + V+S EARA +N G LT
Sbjct: 87 F--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
FWSP +N+ RDPRWGR ET GEDP L+ T +VKGLQ G LK+ + KH+
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GNDSRYLKIVSTPKHFA 195
Query: 223 AYDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------------------- 251
A + ++ + + + YYL F+
Sbjct: 196 ANNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLLT 255
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEA 299
Y+VSDC +L N+ Y KT E AA SI AGLDL CG
Sbjct: 256 KVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLLN 315
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A + +V ++ ID A M+LG FD PY ++ PK + ++ +Q +ALDAAR
Sbjct: 316 AYRQYMVTDADIDSAAYRVLRARMQLGLFDSG-ENNPYTKISPKVIGSKEHQKVALDAAR 374
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+ IVLLKN LPL IK++AV+G NA ++ G+Y G P
Sbjct: 375 ECIVLLKNQNKMLPLDAKKIKSIAVVGINAG--RSEFGDYSGLPV 417
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A + + V V+G +++IE E DR D+ LP Q+ + E+ KV +++++++G
Sbjct: 595 EAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVAG 653
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 654 SSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 701
>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 858
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/435 (34%), Positives = 206/435 (47%), Gaps = 79/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L +CN L R DL+ RLTL+EK ++D + ++ RLGI K+ WWSEALHG + +G
Sbjct: 22 LPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 80
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG---------L 161
G T FP+ + AASFN L + + S E RA YN + L
Sbjct: 81 ---------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEKFHSL 131
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+ W+PN+NIFRDPRWGRGQET GEDP L S T V+GLQ + + K+ AC KHY
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE--TAKYRKLWACAKHY 189
Query: 222 TAYDLDNWKGTDRYHFNAMV-------IYTYYLIKFK----------------------- 251
+ G + A V ++ YL FK
Sbjct: 190 AVH-----SGPEYTRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDPC 244
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-S 291
+VSDC +V +Y + + AAAK+ +AG D+ CG
Sbjct: 245 CSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGFG 304
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ K AV+ GL+ E+ +D + LG D P + ++ + ++A++
Sbjct: 305 YAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAHR 363
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALD ARQ +VLL+N G LPL + +AVIGPNA+ M GNY GTP + T L
Sbjct: 364 QLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMMWGNYNGTPNRTVTILD 422
Query: 412 GLAAVVATI-YQAGC 425
G+ A + Y GC
Sbjct: 423 GIKARHKRVTYLKGC 437
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 449 VLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
V V G ++E E DR ++ LP Q+ + + + K V+ + SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+ +IL Y GQ GG A++DV FG NPSG+LP+T+Y ++
Sbjct: 662 --IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRT 710
>gi|357489449|ref|XP_003615012.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
gi|355516347|gb|AES97970.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
Length = 190
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/181 (59%), Positives = 130/181 (71%), Gaps = 4/181 (2%)
Query: 13 VSVLFLFLT-YCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRL 71
+S +FLFLT Y L H+ S P ++CD +NP SL FCN +L I R D+V RL
Sbjct: 11 ISFIFLFLTRYHRLVHADSPTHVPP-YSCDT-TNPLTKSLPFCNLNLTITQRAKDIVSRL 68
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
TL EKI+ LV++A S+ RLGIP Y+WW EALHGV+ G G + V GATSFPQVILTA
Sbjct: 69 TLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTA 128
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLA 190
ASF++ L+ I KV+ TEAR +YN G A G+TFW+PNINIFRDPRWGRGQET GEDPL+
Sbjct: 129 ASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVN 188
Query: 191 S 191
S
Sbjct: 189 S 189
>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 865
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 151/414 (36%), Positives = 208/414 (50%), Gaps = 80/414 (19%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I+ RV DL++R+TL+EK + + A ++ RLGIP Y++W+EALHGV+ G
Sbjct: 21 IEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE--------- 71
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIF 171
AT FPQ I AA+++ + A G+ V+TE RA YN A GLTFWSPNINIF
Sbjct: 72 -ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTFWSPNINIF 130
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RDPRWGRGQET GEDP L A +V G+Q TD N LK A KH+ + G
Sbjct: 131 RDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDA---NYLKAIATPKHFAVH-----SG 182
Query: 232 TD--RYHFNA----MVIYTYYLIKFK---------------------------------- 251
+ R+ FN + YL F+
Sbjct: 183 PEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACANTMLLKDTL 242
Query: 252 --------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE--AAV 301
++ SDC ++D + H + T E AA ++ AG D C F + + AV
Sbjct: 243 RGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGC-DFKDEMLDLPRAV 301
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
KAG + E +D A+ F M+LG FD ++ P+ + + + A++ LAL AAR+
Sbjct: 302 KAGYLTEGDMDVALRRLFTARMKLGMFD-PAARVPFSTISIAENHSPAHRALALRAARES 360
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
IVLLKN G LPL+ A + +AV+GP A + GNY GTP P+ G+ A
Sbjct: 361 IVLLKND-GVLPLAAGA-RRIAVVGPTAASLIALEGNYNGTPVGAVLPVDGMTA 412
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q L+ + K P+++++ SG I+ + K ++L YPG+AG
Sbjct: 623 DRTAIALPAAQSQLLDALFATGK-PLVIVLQSGSA--IALGAQEAKARAVLEAWYPGEAG 679
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AIA+V G NPSGRLP+T+Y
Sbjct: 680 GQAIAEVLSGTVNPSGRLPVTFY 702
>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1278
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 137/416 (32%), Positives = 204/416 (49%), Gaps = 61/416 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NT+ R DLV R+TL+EK + L ++ + RLG+ KY+ W EALHGV VG
Sbjct: 39 YLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGV--VGRNN 96
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
+ + ATSFP + ++++ +L + VV+ EAR + + LT+WSP I R
Sbjct: 97 NSGMI---ATSFPNSVAVGSTWDPALIKRETSVVADEARGFNHDLIFTLTYWSPVIEPAR 153
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L S+ +G+V+GL G P LK C KHY A + + +
Sbjct: 154 DPRWGRTAETFGEDPFLVSQIGSGFVQGLM---GDDPTYLKTVPCGKHYFANNSEFNRHN 210
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + + +YL ++
Sbjct: 211 GSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKFLVDTIAKRTYGLD 270
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
Y+ DCD+V + NS Y K+ EAAA + G+D +CG A+K GL++E+
Sbjct: 271 GYVTGDCDAVADVVNSHRYAKSKAEAAAMGLKTGVDSDCGGIYQTSALEALKQGLISEAD 330
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT-- 368
+D A+ N + MRLG FD + PY + P + ++ DLAL+ A + VLLKN
Sbjct: 331 MDKALVNIYTIRMRLGEFDPQ-NIVPYAGIKPSIINDPSHNDLALEIATKSPVLLKNNLV 389
Query: 369 ----AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT--PCKYTTPLQGLAAVVA 418
+LPL+ IK +AV+GP A+ K +G+Y G P TPL+G+ +A
Sbjct: 390 GKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPKYKITPLEGIKNYIA 443
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 62/106 (58%), Gaps = 3/106 (2%)
Query: 442 AASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF-D 500
AASAD V+ +G DQ+ E DR + LPG Q LI +A V P ++++ G G +
Sbjct: 612 AASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIAAV--NPNTIVVIQGMGMVE 669
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ KN+P + I++ GY GQA G A+A V FG NP G+ +TWY
Sbjct: 670 VEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKTSLTWY 715
>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
Length = 858
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 205/435 (47%), Gaps = 79/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L +CN L R DL+ RLTL+EK ++D + ++ RLGI K+ WWSEALHG + +G
Sbjct: 22 LPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 80
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG---------L 161
G T FP+ + AASFN L + + S E RA YN + L
Sbjct: 81 ---------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEKFHSL 131
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+ W+PN+NIFRDPRWGRGQET GEDP L S T V+GLQ + + K+ AC KHY
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE--TAKYRKLWACAKHY 189
Query: 222 TAYDLDNWKGTDRYHFNAMV-------IYTYYLIKFK----------------------- 251
+ G + A V ++ YL FK
Sbjct: 190 AVH-----SGPEYTRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDPC 244
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-S 291
+VSDC +V +Y + + AAAK+ +AG D+ CG
Sbjct: 245 CSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGFG 304
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ K AV+ GL+ E+ +D + LG D P + ++ + ++A++
Sbjct: 305 YAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAHR 363
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALD ARQ +VLL+N G LPL + +AVIGPNA+ M GNY GTP + T L
Sbjct: 364 QLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMMWGNYNGTPNRTVTILN 422
Query: 412 GLAAVVATI-YQAGC 425
G+ + Y GC
Sbjct: 423 GIKVRHKRVTYLKGC 437
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 449 VLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
V V G ++E E DR ++ LP Q+ + + + K V+ + SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+ +IL Y GQ GG A++DV FG NPSG+LP+T+Y ++
Sbjct: 662 --IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRT 710
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 141/449 (31%), Positives = 209/449 (46%), Gaps = 69/449 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL R +LV LTL+EKI + ++ RL I Y +W+EA+HGV+ G
Sbjct: 42 FQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSGL-- 99
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
ATSFP +++++ L S EAR N GL +W P IN+ R
Sbjct: 100 --------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTINMSR 151
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR +E GEDP L K A Y+KG+Q G P K A KH+ A + + + +
Sbjct: 152 DPRWGRDEENYGEDPFLTGKIAVEYIKGMQ---GDDPKYYKTIATAKHFAANNYEKGRHS 208
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+A + YYL F+
Sbjct: 209 TSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHELLIDILRTEWGFN 268
Query: 252 -YIVSDCDSVDVLY--NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNE 308
++ SDC +VD +Y N H+ T EA+A SI+ G DLNCG+ + + A++ G + E
Sbjct: 269 GFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDYCKEAIEKGYMQE 328
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+ +D A+ F +G FD + S P+ + + + ++ LA AA++ IVLLKN
Sbjct: 329 ADLDTALVRVFEARFSVGEFD-NASNVPWRSISDDVLDCEEHRQLAYKAAQEAIVLLKND 387
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNV 428
LPL T K++AVIGP N +G Y G+P TTP G+A + + V
Sbjct: 388 NNILPLDKT--KSVAVIGPFGNT--ITLGGYSGSPTALTTPFGGIAEKIGYV-------V 436
Query: 429 QCGTAQVDDAKKAAASADATVLVMGADQS 457
GT Q +D ++ + +D+ L A+ S
Sbjct: 437 NDGTIQFEDCEEQSVPSDSKRLTHEANGS 465
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/123 (41%), Positives = 70/123 (56%), Gaps = 3/123 (2%)
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAK 483
GC+ ++ AK+ AA AD + G D ++ ESHDR +L LPG QQ L+ E
Sbjct: 592 GCAVTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLL-EAVY 650
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
A VIL++ + I++AK + +I+ Y GQA G AIADV +G YNPSG+L
Sbjct: 651 SANPNVILLLQTCSSVTINWAK--EHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLTS 708
Query: 544 TWY 546
TWY
Sbjct: 709 TWY 711
>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 858
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 205/435 (47%), Gaps = 79/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L +CN +L R DL+ RLTL+EK ++D + ++ RLGI K+ WWSEALHG + +G
Sbjct: 22 LPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 80
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG---------L 161
G T FP+ + AASFN L + + S E RA YN + L
Sbjct: 81 ---------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEKFHSL 131
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+ W+PN+NIFRDPRWGRGQET GEDP L S T V+GLQ + + K+ AC KHY
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE--TAKYRKLWACAKHY 189
Query: 222 TAYDLDNWKGTDRYHFNAMV-------IYTYYLIKFK----------------------- 251
+ G + A V ++ YL FK
Sbjct: 190 AVH-----SGPEYTRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQRLDDDPC 244
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-S 291
+VSDC +V +Y + + AAAK+ +AG D+ CG
Sbjct: 245 CSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTDVECGFG 304
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
+ K AV+ GL+ E+ +D + LG D P + ++ + ++A++
Sbjct: 305 YAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVMDSKAHR 363
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LALD ARQ +VLL+N G LPL + VIGPNA+ M GNY GTP + T L
Sbjct: 364 QLALDMARQSLVLLQNKGGVLPLKAGG-DPITVIGPNADDGPMMWGNYNGTPNRTVTILD 422
Query: 412 GLAAVVATI-YQAGC 425
G+ A + Y GC
Sbjct: 423 GIKARHTRVTYLKGC 437
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 449 VLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
V V G ++E E DR ++ LP Q+ + + + K V+ + SG
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK-TVVFVNCSGSA 661
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+ +IL Y GQ GG A++DV FG NPSG+LP+T+Y ++
Sbjct: 662 --IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYKRT 710
>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 996
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 174/554 (31%), Positives = 259/554 (46%), Gaps = 97/554 (17%)
Query: 68 VKRLTLQEKITFLVDS-------AGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
V RLT++E + + A +++RLGI Y+W +E L G + G
Sbjct: 455 VSRLTVKELVLQISRGGAGDNGPAPAITRLGIGPYQWNTECLRGYAMNG----------D 504
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFWSPNINIFR 172
AT FPQ I AA+F+ L + K ++ EARA +N GL+ +SP INI R
Sbjct: 505 ATCFPQPIGLAATFDQGLIYKLAKTIALEARAKHNNFTKNGNFGDHTGLSCFSPVINILR 564
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
P WGR QET GEDP+++S A YV GLQ + P A CKH+ AYD +
Sbjct: 565 HPLWGRNQETFGEDPVMSSLMARAYVTGLQGDEIYYP----ATANCKHFAAYDGPENIPS 620
Query: 233 DRYHFNAMV-----------------------------------------IYTYYLIKFK 251
R FNA V + T KF
Sbjct: 621 SRLSFNANVSIEDLGRTYFPAFRECVHSGAFGIVCSYNAINGEPACASSYLQTILRDKFN 680
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE-----AAVKA 303
Y+ SD +++ YTK+ +A + AG+DL S+ GK+ AV+
Sbjct: 681 FKGYVSSDESAIEFFDIYFKYTKSNLLSAVAAFDAGVDLELTSY-GKNNRYSLLNQAVEQ 739
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVC-TQANQDLALDAARQGI 362
GLV E+A+ + F T M LG FD P + + P DV + A++ A++ A +
Sbjct: 740 GLVTEAALRRSAKRLFRTRMALGEFD--PQEFNHWLNVPIDVVQSLAHRKQAVEVAAKSF 797
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE-GTPCKY-TTPL---QGLAAVV 417
VLLKN G LPL+ +N+A++GP N ++ + G+Y KY ++PL L++
Sbjct: 798 VLLKND-GVLPLN-HRYENVAIVGPFINNSEALTGDYHPNYNLKYFSSPLFAANSLSSSG 855
Query: 418 ATIYQAGC--SNVQ----CGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLP 471
+ GC +N Q C T K+ +D ++ +G IEAE DRL + LP
Sbjct: 856 VARFTTGCVGTNNQNLPICATYNSTHVKEVVTGSDIVLVTLGTGTGIEAEGRDRLSMELP 915
Query: 472 GQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVC 531
G+Q +I +V K A GPVI+++ + G D+S+ + +++ + Q G A+ +V
Sbjct: 916 GKQLDMIKDVVKYANGPVIVVLFNAGPLDVSWVMGN--TAAVIACHFSAQMTGEAMLEVL 973
Query: 532 FGRYNPSGRLPMTW 545
G NP+GRLP TW
Sbjct: 974 TGIVNPAGRLPNTW 987
>gi|379722647|ref|YP_005314778.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
3016]
gi|378571319|gb|AFC31629.1| glycoside hydrolase family 3 domain-containing protein
[Paenibacillus mucilaginosus 3016]
Length = 937
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 151/464 (32%), Positives = 222/464 (47%), Gaps = 71/464 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F +T L ++ RV DLV RLTL+EKI + + RLGI KY+ +E HGV+++G
Sbjct: 9 FQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGVAWLGE-- 66
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT FPQ + A ++N L + IG V+ EAR Y N + GLT W+P +++
Sbjct: 67 --------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTIWAPTVDM 118
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + +T VKG+Q G P LK AA KH+ + + +
Sbjct: 119 ERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQ---GDHPFYLKTAATLKHFLGNNNEMDR 175
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 176 GECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTPCNLNPDVNAIVKGEWGM 235
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL---NCGSFLGKHTEAAVKAGLV 306
++VSD V N Y + EA A S+ +G+D + G L + A++ GL+
Sbjct: 236 DGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDDVGITL-RAIRDALEGGLL 294
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D A+ N F +RLG FD P + PY + +C + L+L AAR+G VLLK
Sbjct: 295 AEEDLDRAVGNAFRVRIRLGEFD-SPEENPYAHVPEAKLCAPEHAALSLQAAREGFVLLK 353
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIYQA 423
N G LPL+ A ++AVIGP A+V T Y GTP TPL+G+A +A I++
Sbjct: 354 NE-GLLPLAKPA--SIAVIGPLADVVHTDW--YSGTPPYRITPLEGVAERMAPGTVIHRT 408
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
G ++ + + + + ++ G + EAE +R D
Sbjct: 409 GGDRIRIKSLRTGRYVRLGGADGLSLTADGTAE--EAEVFERTD 450
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 6/111 (5%)
Query: 438 AKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
A +AA A+ ++ +G I E DR DL L ++ L+ V +A P + ++ G
Sbjct: 547 AVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAKEALLQAV--LAANPRTVAVLVG 604
Query: 497 G-GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
F +++A+ + +IL+ + GQ G A ADV FG +P+GRL MTWY
Sbjct: 605 SYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSPAGRLNMTWY 653
>gi|329928264|ref|ZP_08282164.1| glycosyl hydrolase family 3 C-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328937943|gb|EGG34345.1| glycosyl hydrolase family 3 C-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 934
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 148/463 (31%), Positives = 215/463 (46%), Gaps = 66/463 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L ++ RV DLV RLTL EKI + + RLG+ KY+ +E HGV+++G
Sbjct: 5 FQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE-- 62
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT FPQ A ++N L + IG V++ EAR Y + + GLT W+P +++
Sbjct: 63 --------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTVDL 114
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + +TG VKG+Q G P K A KH+ + + +
Sbjct: 115 ERDPRWGRTEEAYGEDPHLTGQLSTGLVKGMQ---GDHPFYYKTVATLKHFYGNNNEADR 171
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+ + YYL F+
Sbjct: 172 GSASVSIDPRNKREYYLKAFEAPFREGKAGSMMTAYNGINGTPCNLNHEVNDIVKQEWGM 231
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++V D V HY + EA A S+ AG+D + + A++ GL+
Sbjct: 232 DGFVVGDAGDVLGTVMDHHYVASYAEAVADSVKAGIDSITDDQDISFRALRDALEQGLLE 291
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +DHA+ N F RLG FD + PY ++ +C + +L+L AAR+ IVLLKN
Sbjct: 292 EQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPESKLCAPEHAELSLRAARESIVLLKN 350
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAG 424
G LPL K+ AVIGP AN + Y GTP TPLQG+ A + + G
Sbjct: 351 D-GLLPLPRDPFKSAAVIGPLAN--EAFTDWYSGTPPYRITPLQGVQAKAGDRSVQFHTG 407
Query: 425 CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
V+ +A V A +S + +L+ G +S +A +R D
Sbjct: 408 LDQVRLRSA-VSGTYVALSSEEQGILLAGTSESADAAVFERND 449
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR D+ LP QQ LI V + V++I+ GG+ + + +IL+ + G
Sbjct: 565 ETIDRPDITLPPAQQALIQAVFEANPRTVVVIV---GGYPFAVNWEQEHLPAILFTSHAG 621
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
Q G A+ADV +G YNP GRL MTWY
Sbjct: 622 QELGHAVADVLYGDYNPGGRLNMTWY 647
>gi|330995911|ref|ZP_08319806.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329574250|gb|EGG55825.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 865
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 147/434 (33%), Positives = 204/434 (47%), Gaps = 78/434 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EK++ +++S+ +V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPRERAEDLLKRLTLKEKVSLMINSSPAVERLGIKPYNWWSEALHGVARSGI-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT +P + A+ F+ +A+ VS EARA Y+ GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDEAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ ++L AC KHY
Sbjct: 135 WTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPADAKYDKLH--ACAKHYAV 192
Query: 224 YDLDNWKGTDRYHFNA-----MVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTP----- 273
+ K R++F+ ++ YL FK +V + D +V+ Q + P
Sbjct: 193 HSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSK 249
Query: 274 --------EEAAAKSILA-------------------------------GLDLNCGSFLG 294
+E K ++ G DL CG
Sbjct: 250 RLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDY 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
EAAV GL+ E ID ++ LG D S P+ ++G V Q ++ +A
Sbjct: 310 MQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMA 368
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
LD AR+ +VLL N G LPL T ++ V+GPNA + GNYEGTP T L G+
Sbjct: 369 LDIARKSLVLLHND-GVLPLDKTR-GDVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIR 426
Query: 415 AVVA--TIYQAGCS 426
+ Y+ GC+
Sbjct: 427 ERLGRDVRYEKGCN 440
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 9/89 (10%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS---ILWVGYPG 520
DR + LP Q Q I + K A V+ + SG + P++ S IL YPG
Sbjct: 631 DRTSIELP-QVQRDILKALKAAGKKVVFVNCSGSAMALV-----PELESCDAILQAWYPG 684
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
QAGG A+ADV FG +NPSG+LP+T+Y +
Sbjct: 685 QAGGLAVADVLFGDFNPSGKLPVTFYKNT 713
>gi|147783207|emb|CAN77623.1| hypothetical protein VITISV_007589 [Vitis vinifera]
Length = 167
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 107/144 (74%), Positives = 117/144 (81%), Gaps = 3/144 (2%)
Query: 8 NRAPKVSVLFLFLTYCSLQHSSSS---AQSSPVFACDVVSNPSLASLGFCNTSLGIDLRV 64
N APKV+V FL+ S SS AQSSPVFACDV +NP+L GFCNTSL RV
Sbjct: 6 NSAPKVTVFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARV 65
Query: 65 GDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
DLVKRLTL+EK+ FLV+SA SVSRLGIPKYEWWSEALHGVSYVGPGTHF++VVPGATSF
Sbjct: 66 ADLVKRLTLEEKVGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSF 125
Query: 125 PQVILTAASFNASLFQAIGKVVST 148
PQVILTAASFNASLF+AIGKV T
Sbjct: 126 PQVILTAASFNASLFEAIGKVTLT 149
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/162 (63%), Positives = 125/162 (77%), Gaps = 1/162 (0%)
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV-DDAKKAAASADAT 448
+VT TMIGNY G C YTTPLQG+ TI+QAGC++V C Q+ A+ AA ADAT
Sbjct: 1 DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60
Query: 449 VLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDP 508
VLVMG DQSIEAE DR LLLPG QQ L++ VA+ ++GP IL++MSGG D++FAKNDP
Sbjct: 61 VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120
Query: 509 KITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+I++I+WVGYPGQAGG AIADV FG NP G+LPMTWYPQ+Y
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNY 162
>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
Length = 849
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 146/386 (37%), Positives = 192/386 (49%), Gaps = 71/386 (18%)
Query: 92 IPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR 151
+P Y++W+EALHG++ G AT FPQ I AA+++A L + IG VVSTEAR
Sbjct: 46 LPAYDYWNEALHGLARNGV----------ATVFPQAIGLAATWDAPLLERIGTVVSTEAR 95
Query: 152 AMYNV-------GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQT 204
A YN GLT WSPNINIFRDPRWGRGQET GEDP+L AT YV+GLQ
Sbjct: 96 AKYNALPGKDRRRYQGLTIWSPNINIFRDPRWGRGQETYGEDPVLTGTLATAYVRGLQGP 155
Query: 205 DGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF--NAMVIYTYYLIKFK----------- 251
D P +V A KH A+ G D ++ +A + YL F+
Sbjct: 156 DLDHP---RVIATPKHLVAHSGPE-AGRDSFNVQSSAYDMEATYLPAFRRALTEGKALSV 211
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
+VSDCD++ + + Q Y +T A+A +
Sbjct: 212 MCSYNSVHGVPVCGADWLLNQRVRKDWGFDGLVVSDCDAIGNINHYQRYRQTNAAASAAA 271
Query: 281 ILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL 340
I AG+DLNCG EA + GL +D A++ FA +LG D + P+ +
Sbjct: 272 INAGMDLNCGRTYAALPEALAQ-GLTTREVVDRALARTFAARRKLG--DAFGATSPWATI 328
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
V T ++ LAL+AAR+ +VLLKN G LPL A +AV+GPNA+ T+ NY
Sbjct: 329 PANTVDTAEHRALALEAARKSLVLLKNN-GVLPLRKGA--RIAVVGPNADSLDTLEANYH 385
Query: 401 GTPCKYTTPLQGLAAVVATIYQAGCS 426
GT + TPL G AA Y G S
Sbjct: 386 GTAAQPVTPLDGFAARYRMSYAQGAS 411
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 52/86 (60%), Gaps = 3/86 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP QQ L+ E K P++++++SG + + K + W YPGQ+G
Sbjct: 601 DRTDIALPRAQQDLL-ETLKATGKPLVVVLLSGSAVAMPWVKENADAVVAAW--YPGQSG 657
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G AIAD+ G NPSGRLP+T+Y ++
Sbjct: 658 GTAIADLVDGTLNPSGRLPVTFYART 683
>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 885
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 152/451 (33%), Positives = 210/451 (46%), Gaps = 77/451 (17%)
Query: 21 TYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFL 80
TY + + SAQ++ S+ + L + N +L R DL RLTL+EK +
Sbjct: 20 TYSLICAAEKSAQNT--HTTSRTSDKTSTLLPYQNPNLSAYERAIDLCHRLTLEEKALLM 77
Query: 81 VDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQ 140
D + ++ RLGI K+ WWSEALHG + +G T+FP+ I A+SFN +L +
Sbjct: 78 QDESPAIPRLGIKKFFWWSEALHGAANMG----------NVTNFPEPIAMASSFNPTLLK 127
Query: 141 AIGKVVSTEARAMY---------NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++ S E RA Y + L+ W+PN+NIFRDPRWGRGQET GEDP L S
Sbjct: 128 SVFSAASDEMRAQYHHRMDNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTS 187
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY----------DLDNWKGTDRYH----- 236
V+GLQ + S K+ AC KH+ + +L+N D Y
Sbjct: 188 VMGCAVVEGLQGPE--SSKYRKLWACAKHFAVHSGPESTRHTANLNNISPRDLYETYLPA 245
Query: 237 FNAM--------VIYTYYLIK--------------------FKY-IVSDCDSVDVLYNSQ 267
F + V+ Y + FKY +VSDC +V ++ S
Sbjct: 246 FQSTVQDGHVREVMCAYQRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSH 305
Query: 268 HYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
+ A+ ++ LAG D+ CG + AVK GL+ E ID + LG
Sbjct: 306 KTSSDAVHASRQATLAGTDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLG 365
Query: 327 FFDGHP----SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNL 382
D SK PY + K A+ LALD ARQ IVLL+N LPL + +
Sbjct: 366 EMDDSKLVEWSKIPYSIMSCK-----AHAQLALDMARQSIVLLQNKGNILPLQLKKNERI 420
Query: 383 AVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AVIGPNA+ M GNY GTP + L+G+
Sbjct: 421 AVIGPNADNKPMMWGNYNGTPNHTVSILEGI 451
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 13/111 (11%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D + V G S+E E DR D+ +P Q+ I +A+ K +IL+ S
Sbjct: 627 DKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEAGK-QIILVNCS 685
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G I+ + +I+ YPGQ GG A+AD+ G+ NP G+LP+T+Y
Sbjct: 686 GSA--IALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVTFY 734
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 196/395 (49%), Gaps = 69/395 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ +SRL IPKY +EALHGV V PG T
Sbjct: 38 RIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGRF--------T 87
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I AA++N L + V+S EARA +N G LTFWSP +N+ R
Sbjct: 88 VFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMAR 147
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ T +VKGLQ G LK+ + KH+ A + ++ +
Sbjct: 148 DPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GDDDRYLKIVSTPKHFAANNEEHNRFV 204
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 205 CNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFK 264
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y+VSDC +L N+ Y KT E AAA SI AGLDL CG + +A + +V ++
Sbjct: 265 GYVVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLLSAYRQYMVTDA 324
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M LG FD + PY ++ P + + +Q++AL+AAR+ IVLLKN
Sbjct: 325 DIDSAAYRVLRARMELGLFDSG-EQNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQK 383
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
LPL+ +K++AV+G NA ++ G+Y G P
Sbjct: 384 KMLPLNAKKVKSIAVVGINAGSSE--FGDYSGLPV 416
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V V+G ++SIE E DR D+ LP QQ + E+ KV +++++++G
Sbjct: 594 EAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLVAG 652
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 653 SSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 700
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 206/435 (47%), Gaps = 82/435 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +TSL + R DL+KRLTL+EK + +++ + ++ RL I Y WW+EALHG++ G
Sbjct: 29 YQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL-- 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------VGLAGLTFW 164
AT FPQ I ASF+ SL + VS EARA LT W
Sbjct: 87 --------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALTVW 138
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L S+ V GLQ D N+L AC KHY +
Sbjct: 139 TPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--ACAKHYAVH 196
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 197 SGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC +V + + + P A+A ++L G D+ CG+ K
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSY-K 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AVKAGL+ E+ ID ++ L++ F G + + + V + ++ LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKR----LLKARFELGEMDENVWTGISSDVVDSPKHRQLAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AR+ + LL+N LPLS A +A+IGPNAN + GNY G P T L+G+
Sbjct: 369 QMARETMTLLQNNNNILPLSKQA--KIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426
Query: 416 VVAT---IYQAGCSN 427
+ T IY+ C +
Sbjct: 427 YLPTSNLIYEPVCGH 441
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR ++ LP Q+ ++ E K A ++ + SG I+ +IL YPGQAG
Sbjct: 630 DRTEIELPAVQRRVV-EALKTAGKRIVFVNFSGAA--IALEPESQNCEAILQAWYPGQAG 686
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+A+V FG YNP+G+LP+T+Y
Sbjct: 687 GQAVAEVLFGDYNPAGKLPLTFY 709
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 206/435 (47%), Gaps = 82/435 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +TSL + R DL+KRLTL+EK + +++ + ++ RL I Y WW+EALHG++ G
Sbjct: 29 YQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL-- 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------VGLAGLTFW 164
AT FPQ I ASF+ SL + VS EARA LT W
Sbjct: 87 --------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQALTVW 138
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L S+ V GLQ D N+L AC KHY +
Sbjct: 139 TPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--ACAKHYAVH 196
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 197 SGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCGSNR 253
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
+VSDC +V + + + P A+A ++L G D+ CG+ K
Sbjct: 254 LLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNSY-K 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
AVKAGL+ E+ ID ++ L++ F G + + + V + ++ LAL
Sbjct: 313 SLPDAVKAGLITENQIDISVKR----LLKARFELGEMDENVWTGISSDVVDSPKHRQLAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AR+ + LL+N LPLS A +A+IGPNAN + GNY G P T L+G+
Sbjct: 369 QMARETMTLLQNNNNILPLSKQA--KIALIGPNANDSVMQWGNYNGLPSHTITLLEGMQR 426
Query: 416 VVAT---IYQAGCSN 427
+ T IY+ C +
Sbjct: 427 YLPTSNLIYEPVCGH 441
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR ++ LP Q+ ++ E K A ++ + SG I+ +IL YPGQAG
Sbjct: 630 DRTEIELPAVQRRVV-EALKTAGKRIVFVNFSGAA--IALEPESLNCEAILQAWYPGQAG 686
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+A+V FG YNP+G+LP+T+Y
Sbjct: 687 GQAVAEVLFGDYNPAGKLPLTFY 709
>gi|281422547|ref|ZP_06253546.1| beta-glucosidase [Prevotella copri DSM 18205]
gi|281403371|gb|EFB34051.1| beta-glucosidase [Prevotella copri DSM 18205]
Length = 869
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 146/430 (33%), Positives = 204/430 (47%), Gaps = 77/430 (17%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N S L + N +L R DL RLTL+EK + ++D + ++ RLGI +++WWSEALHG
Sbjct: 20 NASAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQWWSEALHG 79
Query: 105 VSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--------- 155
V+ +G T FP+ I AASFN + + S E RA +N
Sbjct: 80 VANMG----------DVTVFPEPIGMAASFNDRMVYRVFDATSDEMRAKWNELQQKGGDV 129
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
L+ W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ + +L
Sbjct: 130 TRFHALSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPEDTKYRKL--W 187
Query: 216 ACCKHYTAYDLDNW-KGTDRYH--------------FNAMV----------IYTYY---- 246
AC KHY + W + TD F ++V Y +
Sbjct: 188 ACAKHYAIHSGPEWARHTDNITDVTPRDLWETYMPAFKSLVQDAKVREVMCAYQRWDDEP 247
Query: 247 ------LIK--------FKY-IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG- 290
L++ FKY +VSDC +V + + + AAAK +LAG D+ CG
Sbjct: 248 CCGNTRLLQQILRDEWGFKYLVVSDCGAVTDFWENHKVSSNARNAAAKGVLAGTDVECGF 307
Query: 291 SFLGKHTEAAVKAG-LVNESAIDHAISNNFATLMRLGFFDGHP----SKQPYGQLGPKDV 345
+++ K AVK G L E +D + LG D + SK P L
Sbjct: 308 NYIYKSVPEAVKYGALTEEEEVDKHVIRLLEGRFDLGEMDDNKIVSWSKIPVSVL----- 362
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
C++A++ L+LD A Q + LL+N LPL +K +A IGPN + M GNY GTP +
Sbjct: 363 CSKAHRQLSLDMALQTMTLLQNKNEVLPLD-KKVKKIAFIGPNVDNEPMMWGNYNGTPRQ 421
Query: 406 YTTPLQGLAA 415
T L G+ +
Sbjct: 422 TITILDGIKS 431
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 55/120 (45%), Gaps = 13/120 (10%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
K + V V G +E E DR ++ LP Q+ + + + K V
Sbjct: 601 KQLKDCETVVFVGGISPQLEGEEMPIEISGFKGGDRTNIELPKVQRNFLKALKEAGK-KV 659
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ + SG I+ +IL YPGQ GG A+A V FG YNP+G+LP+T+Y S
Sbjct: 660 VFVNCSGSA--IALTPETESCDAILQAWYPGQEGGEAVARVLFGEYNPAGKLPITFYKNS 717
>gi|330996730|ref|ZP_08320605.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella
xylaniphila YIT 11841]
gi|329572575|gb|EGG54218.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 725
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 146/431 (33%), Positives = 208/431 (48%), Gaps = 73/431 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EKI+ + + + V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT +P + A+ F+ L + I VS E RA Y G GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT- 222
W+PN+NIFRDPRWGRGQET GEDP L ++ V+G+Q + K AC KHY
Sbjct: 135 WNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVQGMQGPADAKYD--KTHACAKHYAV 192
Query: 223 ---------AYDLDNWKGTDRYH-----FNAMV-------------------------IY 243
++D++N + D + F A+V +
Sbjct: 193 HSGPEAKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNRLL 252
Query: 244 TYYLIK---FKY-IVSDCDSV-DVLYNSQHYTKTPEEA--AAKSILAGLDLNCGSFLGKH 296
T L +K+ +VSDC ++ D Y +H T P+ A +A +++ G DL CG H
Sbjct: 253 TQILRDEWGYKHLVVSDCGAISDFFYQGRHETH-PDAATSSASAVINGTDLECGVEYA-H 310
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+ AV+ GL+ E ID ++ LG D + P+ ++ V ++ +ALD
Sbjct: 311 LDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCDMHRQMALD 369
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
R+ +VLL N G LPL + +AV+GPNA + GNY+G P T L+G+
Sbjct: 370 VTRKSMVLLHNN-GILPLDKGDVGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRME 428
Query: 417 VATI-YQAGCS 426
V + Y+ GC
Sbjct: 429 VGNVPYEKGCE 439
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 57/117 (48%), Gaps = 14/117 (11%)
Query: 444 SADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVILI 492
A+ + V G ++E E DR + LP Q Q I + K A V+ +
Sbjct: 599 DAETIIFVGGISPNLEGEDKYFVYCPGFAGGDRTSIELP-QVQRDILKALKAAGKKVVFV 657
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
SG ++ +IL YPGQAGG A+ADV FG +NPSG+LP+T+Y +
Sbjct: 658 NCSGSA--VALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKNT 712
>gi|375145746|ref|YP_005008187.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361059792|gb|AEV98783.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 866
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 147/453 (32%), Positives = 203/453 (44%), Gaps = 75/453 (16%)
Query: 42 VVSNPSLA-SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSE 100
+ S S A L + N L + R DL+ RLTL+EK + + D + ++ RLGI K+ WWSE
Sbjct: 14 IFSTSSFAQKLPYQNPDLSSEERAKDLITRLTLEEKASLMFDQSPAIPRLGIKKFNWWSE 73
Query: 101 ALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY------ 154
ALHG++ T FP+ + AASF+ +L I S E RA Y
Sbjct: 74 ALHGLANNDK----------VTVFPEPVGMAASFDDTLVYKIFDATSDEVRAKYHDAIRN 123
Query: 155 ---NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
N L+ W+PNINIFRDPRWGRGQET GEDP L S+ VKGLQ G + R
Sbjct: 124 GKENRRFLSLSVWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQ---GPADAR 180
Query: 212 L-KVAACCKHYTAYDLDNW--KGTDRYHFNAMVIYTYYLIKFK----------------- 251
K+ AC KH+ + W + Y +Y YL FK
Sbjct: 181 YRKLLACAKHFAVHSGPEWSRHTLNLYQVRPRDLYETYLPAFKSLVMEADVRQVMCAYQR 240
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
+VSDC ++ + S + A+AK LAG D
Sbjct: 241 LDDEPCCGNNRLLQTILRDNWGFKHVVVSDCGAITDFFTSHKVSSDAVHASAKGALAGTD 300
Query: 287 LNC--GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
+ C + K+ AV GL+ E ID + A LG D + P+ +
Sbjct: 301 VECVWEGYAFKNLPEAVSRGLITEKEIDKHLLRVLAGRFDLGEMDDD-AVVPWAAIPMSV 359
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
V +A++ LALD A++ + LL+N LPL A K +AVIGPNA+ + GNY G P
Sbjct: 360 VNNEAHRKLALDMAQESMTLLQNRNNILPLKRNA-KKIAVIGPNADNDPVLWGNYNGRPV 418
Query: 405 KYTTPLQGLAAVV---ATIYQAGCSNVQCGTAQ 434
+ + G+ A + +Y C V+ Q
Sbjct: 419 RTINIVSGIRAKMPPDGVLYDKACDLVENKVTQ 451
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 55/114 (48%), Gaps = 13/114 (11%)
Query: 446 DATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D V V G +E E DR D+ LP Q+ + + K A VI + S
Sbjct: 605 DEVVFVGGLSTLLEGEEMPVSYPGFKGGDRTDIQLPEVQRNCLKAL-KAAGKKVIFVNCS 663
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
G I F +IL Y G++GG A+ADV FG YNPSG+LP+T+Y +
Sbjct: 664 GSA--IGFVPETESCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKDT 715
>gi|261404262|ref|YP_003240503.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261280725|gb|ACX62696.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 934
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 149/463 (32%), Positives = 213/463 (46%), Gaps = 66/463 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L ++ RV DLV RLTL EKI + + RLG+ KY+ +E HGV+++G
Sbjct: 5 FQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE-- 62
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT FPQ A ++N L + IG V++ EAR Y + + GLT W+P +++
Sbjct: 63 --------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTVDL 114
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + +TG VKG+Q G P K A KH+ + + +
Sbjct: 115 ERDPRWGRTEEAYGEDPHLTGELSTGLVKGMQ---GDHPFYYKTVATLKHFYGNNNEADR 171
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+ + YYL F+
Sbjct: 172 GSASVSIDPRNKREYYLKAFEAPFREGKAGSMMTAYNGINGTPCNLNHEVNDIVKQEWGM 231
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++V D V HY + EA A S+ AG+D + + A++ GL+
Sbjct: 232 DGFVVGDAGDVLGTVMDHHYVASYAEAVAGSVKAGIDSITDDQDISFRALRDALEQGLLE 291
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +DHA+ N F RLG FD + PY ++ +C + +L+L AAR+ IVLLKN
Sbjct: 292 EQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPETKLCAPEHAELSLRAARESIVLLKN 350
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIY---QAG 424
G LPL K+ AVIGP AN + Y GTP TPLQG+ A Y G
Sbjct: 351 D-GLLPLPRDPFKSAAVIGPLAN--EAFTDWYSGTPPYRITPLQGVQAKAGDRYVQFHTG 407
Query: 425 CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
V+ +A V A +S + +L+ G S +A +R D
Sbjct: 408 LDQVRLRSA-VSGTYVALSSEEQGILLAGTSASADAAVFERND 449
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR D+ LP QQ LI V + V++I+ GG+ + + +IL+ + G
Sbjct: 565 ETIDRPDITLPPAQQALIQAVFEANPHTVVVIV---GGYPFAVNWEQEHVPAILFTSHAG 621
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
Q G A+ADV +G YNP GRL MTWY
Sbjct: 622 QELGHAVADVLYGDYNPGGRLNMTWY 647
>gi|409196584|ref|ZP_11225247.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 1272
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 199/411 (48%), Gaps = 61/411 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NT+ + R DLV R+T +EK + ++ ++ RLG+ Y +W EALHG+ +
Sbjct: 33 YLNTAYSFEERAADLVSRMTPEEKQSQFGNTMPAIPRLGVNGYNFWGEALHGIMGRNNNS 92
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFR 172
+ ATSFP + A+++ L + +V+S EAR + + LT+WSP I R
Sbjct: 93 GMT-----ATSFPNSVAAGATWDPELIKRETEVISDEARGFNHDLIFTLTYWSPVIEPAR 147
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L S+ G+++G+ G P LK C KHY A + + + +
Sbjct: 148 DPRWGRTAETFGEDPFLVSEIGKGFIQGMM---GDDPRYLKTVPCGKHYLANNTEFNRHS 204
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ +Y YYL+ ++
Sbjct: 205 GSSELDQRDMYEYYLLPYRTLIRDYNLPSIMTSYNAVNGVPVSASKFLVDTIARKLYGLD 264
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESA 310
Y+ SDC ++D + HY EAAA I +G+D++CG + A++AGL+ +
Sbjct: 265 GYVTSDCGAIDDIVRGHHYADNYTEAAAMGISSGVDIDCGGVYQNNALKALEAGLITQGE 324
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT-- 368
+D A+ N F T MR G FD PY + P V ++ DLA++ A + VLLKN
Sbjct: 325 LDKALINIFTTRMRTGEFDPK-GMVPYAGIKPHKVNDPSHHDLAIEIATKTPVLLKNNNV 383
Query: 369 ----AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TPCKYTTPLQGL 413
+LP+ + IK +AV+GP A++ + +G+Y G P TPL G+
Sbjct: 384 ANTEKKALPIDLSKIKKIAVLGPQADLVE--LGDYSGPVEPDLRITPLAGI 432
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 61/105 (58%), Gaps = 1/105 (0%)
Query: 442 AASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDI 501
AASAD ++ +G DQS E DR + LPG Q +I VA A I+++ + G ++
Sbjct: 605 AASADVAIVFVGTDQSTGREESDRFSITLPGNQNEVIKAVAD-ANPNTIVVMQTMGMVEV 663
Query: 502 SFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
KN P I I+W GY GQA G A+A + FG NP G+L +TW+
Sbjct: 664 EEFKNHPNIKGIIWTGYNGQAQGTAMAKILFGDVNPGGKLNVTWH 708
>gi|357046682|ref|ZP_09108302.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355530484|gb|EHG99896.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 677
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 147/434 (33%), Positives = 202/434 (46%), Gaps = 78/434 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EK + +++S+ +V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPRERAEDLLKRLTLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT +P + A+ F+ +A+ VS EARA Y+ GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ ++L AC KHY
Sbjct: 135 WTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPADAKYDKLH--ACAKHYAV 192
Query: 224 YDLDNWKGTDRYHFNA-----MVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTP----- 273
+ K R++F+ ++ YL FK +V + D +V+ Q + P
Sbjct: 193 HSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSE 249
Query: 274 --------EEAAAKSILA-------------------------------GLDLNCGSFLG 294
+E K ++ G DL CG
Sbjct: 250 RLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDY 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
EAAV GL+ E ID ++ LG D S P+ ++G V Q ++ +A
Sbjct: 310 MQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMA 368
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
LD AR+ +VLL N G LPL T ++ V+GPNA + GNYEGTP T L G+
Sbjct: 369 LDIARKSLVLLHND-GVLPLDKTR-GDVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIR 426
Query: 415 AVVA--TIYQAGCS 426
+ Y+ GC
Sbjct: 427 ERLGRDVRYEKGCD 440
>gi|332877556|ref|ZP_08445303.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332684662|gb|EGJ57512.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 676
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 147/434 (33%), Positives = 202/434 (46%), Gaps = 78/434 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EK + +++S+ +V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPRERAEDLLKRLTLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTF 163
AT +P + A+ F+ +A+ VS EARA Y+ GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ V+GLQ ++L AC KHY
Sbjct: 135 WTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPADAKYDKLH--ACAKHYAV 192
Query: 224 YDLDNWKGTDRYHFNA-----MVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTP----- 273
+ K R++F+ ++ YL FK +V + D +V+ Q + P
Sbjct: 193 HSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPCCGSE 249
Query: 274 --------EEAAAKSILA-------------------------------GLDLNCGSFLG 294
+E K ++ G DL CG
Sbjct: 250 RLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECGWGDY 309
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
EAAV GL+ E ID ++ LG D S P+ ++G V Q ++ +A
Sbjct: 310 MQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTHKQMA 368
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
LD AR+ +VLL N G LPL T ++ V+GPNA + GNYEGTP T L G+
Sbjct: 369 LDIARKSLVLLHND-GVLPLDKTR-GDVVVMGPNAVDSVMQWGNYEGTPSHTYTVLDGIR 426
Query: 415 AVVA--TIYQAGCS 426
+ Y+ GC
Sbjct: 427 ERLGRDVRYEKGCD 440
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 199/634 (31%), Positives = 287/634 (45%), Gaps = 132/634 (20%)
Query: 23 CSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGID-----------LRVGDLVKRL 71
C+L + S + +AC + + +A L + G+D R+ L +
Sbjct: 30 CALTNRSCGRCAPGAYACPLSATACVAGLAAYESCPGLDGTYLDASLPEADRLAWLADNV 89
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
L++ I LV++A +V + +P Y W ++ HGV GT + V P S
Sbjct: 90 PLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVK----GTAHATVYPMGASL------G 139
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLA--------------------GLTFWSPNINIF 171
AS++ L +G + E+RA +N GLA G+T ++PN+N+
Sbjct: 140 ASWSVDLAWRVGAAIGNESRATHN-GLADKSGNACGSTSTGEVVANGCGITLYAPNVNLV 198
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQ-----QTDGGSPNRLKVAACCKHYTAY-- 224
RDPRWGR +E GEDP L ++ A G V GLQ T G L ACCKH+ A+
Sbjct: 199 RDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGNAEGSTSGPGGGPLVTGACCKHFAAHFA 258
Query: 225 -----DLDNWKGTDRYHFNAMV----IYTYYLIKFK------------------------ 251
DL DR +A V ++ YL K
Sbjct: 259 VYQNEDLP----ADRMVLDANVSSRDLWETYLPVMKACVVRAKATHVNGKPTCAHPELLN 314
Query: 252 -----------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA- 299
++VSD D+ L + Y T EEAAA I AG+D G G ++
Sbjct: 315 DVLRESWGFDGFVVSDYDAWSNLVTTHKYVSTWEEAAAAGINAGMDQEGG--FGDYSPVD 372
Query: 300 ----AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV-C-TQANQDL 353
AV+ G V + + + +RLG FD S YG+ D C T A L
Sbjct: 373 ALPDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPASTAVYGEAYQCDYQCETAAKLAL 432
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG--NY---EGTPCKYTT 408
A +AAR+GIVL KN G+LPL+ A +A++GP + + ++G NY +G T
Sbjct: 433 AREAAREGIVLFKNAGGALPLAKGA--RIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVT 490
Query: 409 PLQGLAAVVATIYQAGCSNVQCGT-AQVDDAKKAAASADATVLVMG-------------- 453
+GL AV AGC +V C VD AK+ AA+ADATV+V+G
Sbjct: 491 IQKGLEAVANVSVAAGCDSVACAALVDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRG 550
Query: 454 -ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
D E+ESHDR + LPG+Q L+ + + ++ +++ GG + A +D
Sbjct: 551 TRDDGCESESHDRATIELPGEQVALVAALRAASS-RLVCVLVHGGAVALGAAADDCDAVL 609
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
LWV PGQ GGAA+ADV FG Y+P+GR P+T Y
Sbjct: 610 DLWV--PGQMGGAALADVLFGDYSPAGRSPITMY 641
>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 675
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 207/431 (48%), Gaps = 73/431 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EKI+ + + + V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT +P + A+ F+ L + I VS E RA Y G GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT- 222
W+PN+NIFRDPRWGRGQET GEDP L ++ V+G+Q + K AC KHY
Sbjct: 135 WNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYD--KTHACAKHYAV 192
Query: 223 ---------AYDLDNWKGTDRYH-----FNAMV-------------------------IY 243
++D++N + D + F A+V +
Sbjct: 193 HSGPEAKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNRLL 252
Query: 244 TYYLIK---FKY-IVSDCDSV-DVLYNSQHYTKTPEEA--AAKSILAGLDLNCGSFLGKH 296
T L +K+ +VSDC ++ D Y +H T P+ A +A +++ G DL CG H
Sbjct: 253 TQILRDEWGYKHLVVSDCGAISDFFYQGRHETH-PDAATSSASAVINGTDLECGVEYA-H 310
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+ AV+ GL+ E ID ++ LG D + P+ ++ V ++ +ALD
Sbjct: 311 LDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMALD 369
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
R+ +VLL N G LPL +AV+GPNA + GNY+G P T L+G+
Sbjct: 370 VTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRGA 428
Query: 417 VATI-YQAGCS 426
+ + Y+ GC
Sbjct: 429 IGNVPYEKGCE 439
>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 499
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 147/416 (35%), Positives = 202/416 (48%), Gaps = 77/416 (18%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P A + N + R+ DL+ RLT++EK++ L ++ +SRL IPKY +EALHGV
Sbjct: 21 PMQAQELYKNEDAPLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGV 80
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----- 160
V PG T FPQ I AA++N L + V+S EARA +N G
Sbjct: 81 --VRPGRF--------TVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKG 130
Query: 161 -----LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
LTFWSP +N+ RDPRWGR ET GEDP L+ T +V+GLQ G LKV
Sbjct: 131 QFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQ---GDDARYLKVV 187
Query: 216 ACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------- 251
+ KH+ A N + +R+ N + + YYL F+
Sbjct: 188 STPKHFAA----NNEEHNRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINN 243
Query: 252 ----------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
Y+VSDC +L N+ Y KT E AA SI AGLDL C
Sbjct: 244 VPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLEC 303
Query: 290 G-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
G + A K +V+++ ID + MRLG FD + PY ++ P + ++
Sbjct: 304 GDDVYYEPLLNAYKQYMVSDADIDSTAYHVLKARMRLGLFD-NGKNNPYTKISPSIIGSK 362
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
+Q +AL+AARQ IVLLKN LPL +K++AV+G NA + G+Y G+P
Sbjct: 363 LHQRVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVGINAGNCE--FGDYSGSPV 416
>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 676
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 207/431 (48%), Gaps = 73/431 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+KRLTL+EKI+ + + + V RLGI Y WWSEALHGV+ G
Sbjct: 25 YKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT +P + A+ F+ L + I VS E RA Y G GLTF
Sbjct: 83 --------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRYGRGNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT- 222
W+PN+NIFRDPRWGRGQET GEDP L ++ V+G+Q + K AC KHY
Sbjct: 135 WNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPADAKYD--KTHACAKHYAV 192
Query: 223 ---------AYDLDNWKGTDRYH-----FNAMV-------------------------IY 243
++D++N + D + F A+V +
Sbjct: 193 HSGPEAKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCGSNRLL 252
Query: 244 TYYLIK---FKY-IVSDCDSV-DVLYNSQHYTKTPEEA--AAKSILAGLDLNCGSFLGKH 296
T L +K+ +VSDC ++ D Y +H T P+ A +A +++ G DL CG H
Sbjct: 253 TQILRDEWGYKHLVVSDCGAISDFFYQGRHETH-PDAATSSASAVINGTDLECGVEYA-H 310
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
+ AV+ GL+ E ID ++ LG D + P+ ++ V ++ +ALD
Sbjct: 311 LDEAVERGLITEHRIDTSLRRLLEARFALGEMDDD-ALVPWSRISIDTVDCGTHRRMALD 369
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
R+ +VLL N G LPL +AV+GPNA + GNY+G P T L+G+
Sbjct: 370 VTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKGVPAHTYTILEGIRGA 428
Query: 417 VATI-YQAGCS 426
+ + Y+ GC
Sbjct: 429 IGNVPYEKGCE 439
>gi|29348418|ref|NP_811921.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29340322|gb|AAO78115.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 863
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 213/430 (49%), Gaps = 72/430 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L + R DL+ RLTL+EKI + +S+ +V RLG+ Y WWSEALHGV+ G
Sbjct: 25 YKNPKLTPEERAEDLLGRLTLKEKIGLMKNSSFAVERLGVAPYNWWSEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT FP + A++F+ + + VS E RA Y G GLTF
Sbjct: 83 --------ATVFPITMGMASTFDDEAIERVYVAVSDEGRAKFHDAHRSNRYGYGNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT- 222
W+PN+NIFRDPRWGRGQET GEDP L ++ VKG+Q + K AC KHY
Sbjct: 135 WNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVAVVKGMQGPADAEYD--KAHACVKHYAV 192
Query: 223 ---------AYDLDNWKGTDRYH-----FNAM--------VIYTYYLIK----------- 249
++D+++ D + F A+ V+ Y ++
Sbjct: 193 HSGPEAKRHSFDVEDLSPRDLWETYLPAFKALVQEADVKEVMCAYQRLEGEPCCDSNRLL 252
Query: 250 ---------FKY-IVSDCDSVDVLY-NSQHYT-KTPEEAAAKSILAGLDLNCGSFLGKHT 297
+K+ +VSDC ++D + +H T K +A+A +++ G DL CGS H
Sbjct: 253 TQILRDEWGYKHLVVSDCGAIDDFFVKGRHETHKDAADASASAVINGTDLECGSIY-SHL 311
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
E AVK GL+ E ID ++ LG D S P+ ++ V ++ +ALD
Sbjct: 312 EEAVKQGLITEERIDTSLRRLLKARFALGEMDPD-SIVPWSRISIDTVDCDLHKQMALDL 370
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
AR+ +VLL N G LPL+ T + +AV+GPNA + GNYEG P T L+G+ +
Sbjct: 371 ARKSMVLLCNN-GVLPLAKTGAR-IAVMGPNAVDSVMQWGNYEGVPSHTYTILEGIRCKI 428
Query: 418 ATI-YQAGCS 426
+ ++ GC
Sbjct: 429 GDVPFEKGCE 438
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 55/114 (48%), Gaps = 14/114 (12%)
Query: 444 SADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVILI 492
AD + G S+E E DR + LP Q Q I + K A V+ +
Sbjct: 598 EADIVIFAGGISPSLEGEEMYSVNSPGFAGGDRTSIELP-QVQRDILKALKKAGKKVVFV 656
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG ++ +IL YPGQ+GG A+ADV FG +NPSG+LP+T+Y
Sbjct: 657 NCSGSA--VALVPEMESCDAILQAWYPGQSGGLAVADVLFGDFNPSGKLPVTFY 708
>gi|383124608|ref|ZP_09945271.1| hypothetical protein BSIG_1643 [Bacteroides sp. 1_1_6]
gi|251841237|gb|EES69318.1| hypothetical protein BSIG_1643 [Bacteroides sp. 1_1_6]
Length = 863
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 213/430 (49%), Gaps = 72/430 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L + R DL+ RLTL+EKI + +S+ +V RLG+ Y WWSEALHGV+ G
Sbjct: 25 YKNPKLTPEERAEDLLGRLTLKEKIGLMKNSSFAVERLGVAPYNWWSEALHGVARNGL-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT FP + A++F+ + + VS E RA Y G GLTF
Sbjct: 83 --------ATVFPITMGMASTFDDEAIERVYVAVSDEGRAKFHDAHRSNRYGYGNEGLTF 134
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT- 222
W+PN+NIFRDPRWGRGQET GEDP L ++ VKG+Q + K AC KHY
Sbjct: 135 WNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVAVVKGMQGPADAEYD--KAHACVKHYAV 192
Query: 223 ---------AYDLDNWKGTDRYH-----FNAM--------VIYTYYLIK----------- 249
++D+++ D + F A+ V+ Y ++
Sbjct: 193 HSGPEAKRHSFDVEDLSPRDLWETYLPAFKALVQEADVKEVMCAYQRLEGEPCCDSNRLL 252
Query: 250 ---------FKY-IVSDCDSVDVLY-NSQHYT-KTPEEAAAKSILAGLDLNCGSFLGKHT 297
+K+ +VSDC ++D + +H T K +A+A +++ G DL CGS H
Sbjct: 253 TQILRDEWGYKHLVVSDCGAIDDFFVKGRHETHKDAADASASAVINGTDLECGSIY-SHL 311
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
E AVK GL+ E ID ++ LG D S P+ ++ V ++ +ALD
Sbjct: 312 EEAVKQGLITEERIDTSLRRLLKARFALGEMDPD-SIVPWSRISIDTVDCDLHKQMALDL 370
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
AR+ +VLL N G LPL+ T + +AV+GPNA + GNYEG P T L+G+ +
Sbjct: 371 ARKSMVLLCNN-GVLPLAKTGAR-IAVMGPNAVDSVMQWGNYEGVPSHTYTILEGIRCKI 428
Query: 418 ATI-YQAGCS 426
+ ++ GC
Sbjct: 429 GDVPFEKGCE 438
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query: 444 SADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVILI 492
AD + G S+E E DR + LP Q Q I + K A V+ +
Sbjct: 598 EADIVIFAGGISPSLEGEEMYSVNSPGFAGGDRTSIELP-QVQRDILKALKKAGKKVVFV 656
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
SG ++ +IL YPGQAGG A+ADV FG +NPSG+LP+T+Y +
Sbjct: 657 NCSGSA--VALVPEMESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKNT 711
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 193/395 (48%), Gaps = 69/395 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ + RL IPKY +EALHGV V PG T
Sbjct: 38 RIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGV--VRPGRF--------T 87
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I AA++N L + V+S EARA +N G LTFWSP +N+ R
Sbjct: 88 VFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMAR 147
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ T +VKGLQ G LK+ + KH+ A + ++ +
Sbjct: 148 DPRWGRTPETYGEDPYLSGVMGTAFVKGLQ---GDDDRYLKIVSTPKHFAANNEEHNRFV 204
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 205 CNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFK 264
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y+VSDC +L N+ Y KT E AA SI AGLDL CG + +A + +V +
Sbjct: 265 GYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEPLLSAYRQYMVTNA 324
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M+LG FD K PY ++ P V + +Q++AL+AAR+ IVLLKN
Sbjct: 325 DIDSAAYRVLRARMQLGLFDSG-EKNPYTKISPAVVGSAKHQEVALNAARECIVLLKNQK 383
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPC 404
LPL+ +K++AV+G NA G+Y G+P
Sbjct: 384 KMLPLNAKKVKSIAVVGINAG--NCEFGDYSGSPV 416
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V V+G ++SIE E DR D+ LP Q + E+ KV P I++++
Sbjct: 594 EAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKV--NPNIVVVLVA 651
Query: 497 GGFDISFAKN--DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G S A N D + +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 652 GS---SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 700
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 179/579 (30%), Positives = 260/579 (44%), Gaps = 109/579 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL-------VDSAGSVSRLGIPKYEWWSEALHGV 105
F N L R+ DLV RLTL+EK+ L + A +V RLGI + W SE + G+
Sbjct: 36 FRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECVTGL 95
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA------ 159
G H T+FPQ + AA+F+ +L + ++ E RA N
Sbjct: 96 GTDGNDPH-------GTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVVKF 148
Query: 160 --GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL+ WSP +NI R P WGR ET GE P+L+S A +V+G+Q G AA
Sbjct: 149 HHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQ---GNHTRYYAAAAA 205
Query: 218 CKHYTAYDLDNWKGTD--RYHFNAMV----IYTYYLIKFK-------------------- 251
CKH LD + G D RY F+A V + +L+ F+
Sbjct: 206 CKH-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAGVMGYMCSYNSIRGV 260
Query: 252 ---------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG 290
Y+VSD +V + S +YT A ++ AG D+
Sbjct: 261 PACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGCDME-D 319
Query: 291 SFLGKHT-----EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
S +H A+ L + + ID ++S F MRLG FD P P+ L V
Sbjct: 320 SDDAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLNMSIV 378
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLA--VIGPNANVTKTMIGNY--EG 401
+ A+ ++A D A IVLLKN +LPLS A KN + ++GP A+ M+G Y G
Sbjct: 379 SSPAHVEMARDVATASIVLLKNQNETLPLS-AAAKNASYCLLGPFADNADLMMGKYSPHG 437
Query: 402 TPCKYTTPLQGLAAVVATI-------YQAGCSNVQCGTAQVDDAKKAAASA------DAT 448
+ T GLAA + Y GC+ C D AA + D
Sbjct: 438 STNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFC-----DGLDTAAVTTFIQQGCDTV 492
Query: 449 VLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKV--AKGPVILIIMSGGGFDISFAKN 506
+L +G +E+ES DR ++ PG Q L+ V + K ++L++ + G D++ +
Sbjct: 493 LLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQ 552
Query: 507 DPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
D ++ +IL + Y GQ G A+AD+ G +PSGRLP +W
Sbjct: 553 DTRVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSW 591
>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 864
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 142/438 (32%), Positives = 201/438 (45%), Gaps = 78/438 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ + SL R DL+ RLTL+EK + D + ++ RLGI K+ WWSEALHG +
Sbjct: 24 YKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHGYA------ 77
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG---------LTF 163
+ NV T FP+ I AASF+ L + VS E RA YN + L+
Sbjct: 78 NNDNV----TVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNENKRFLSLSV 133
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L S+ VKGLQ K+ AC KH+
Sbjct: 134 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--ADAKYRKLLACAKHFAV 191
Query: 224 YDLDNWKGTDRYHFN-----AMVIYTYYLIKFK--------------------------- 251
+ W R+ N +Y YL FK
Sbjct: 192 HSGPEW---SRHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQRLDDEPCCSNT 248
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC--GSFLG 294
+VSDC +V Y + + AA+K++LAG D+ C +
Sbjct: 249 RLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGTDVECVWDKYPF 308
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLA 354
K AV+ L+ E I+ ++ LG D S P+ Q+ + ++ +Q LA
Sbjct: 309 KKLPEAVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPASVLNSKEHQQLA 367
Query: 355 LDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 414
L+ A++ + LL+N LPL+ + +AVIGPNA+ + GNY GTP K T +G+
Sbjct: 368 LEMAQKSMTLLQNKNNILPLNKN-VNKVAVIGPNADNEPMLWGNYNGTPNKTITIKKGIE 426
Query: 415 AVVA---TIYQAGCSNVQ 429
+A +Y C V+
Sbjct: 427 GKLAANKVLYDKSCDLVE 444
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR ++ LP Q+ + E+ K A VI + SG I+ +IL YPG++G
Sbjct: 631 DRTNIELPAVQRKCLKEL-KAAGKKVIFVNCSGSA--IALTPETESCDAILQAWYPGESG 687
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G A+ADV FG YNP+G+LP+++Y S
Sbjct: 688 GQAVADVLFGDYNPAGKLPISFYKNS 713
>gi|380692851|ref|ZP_09857710.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 854
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 203/430 (47%), Gaps = 72/430 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L R DL+ RLTL+EKI + +S+ +V RLGI Y WWSE LHGV+ G
Sbjct: 16 YKNHQLSPKERADDLLGRLTLKEKIGLMQNSSNAVERLGIAPYNWWSEGLHGVARNGL-- 73
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---------MYNVGLAGLTF 163
AT FP + A++F+ + I VS E RA Y G GLTF
Sbjct: 74 --------ATVFPITMGMASTFDDEAIEKIYTAVSDEGRAKFHDAHRRNRYGCGNEGLTF 125
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+PN+NIFRDPRWGRGQET GEDP L ++ VKG+Q + + K AC KHY
Sbjct: 126 WNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVSVVKGMQGPADAAYD--KTHACAKHYAV 183
Query: 224 YDLDNWK--GTDRYHFNAMVIYTYYLIKFK------------------------------ 251
+ K D H + ++ YL FK
Sbjct: 184 HSGPEAKRHSFDVEHLSPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPCCDSNRLL 243
Query: 252 ------------YIVSDCDSV-DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGSFLGKHT 297
+VSDC ++ D Y +H T K +A+A +++ G DL CG H
Sbjct: 244 TQILRDEWGYKHLVVSDCGAISDFFYKDRHGTHKDAADASASAVINGTDLECGVEYA-HL 302
Query: 298 EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDA 357
E AVK GL+ E I+ ++ LG D S + ++G V ++ +ALD
Sbjct: 303 EEAVKRGLITEERINTSLRRLLEARFALGEMDPD-SIVAWSKIGIDTVDCDLHRQMALDI 361
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
R+ +VLL N G LPL+ T+ + +AV+GPNA + GNY+G P T L+G+ V
Sbjct: 362 TRKSMVLLHNN-GILPLAKTSTR-IAVMGPNAVDSVMQWGNYKGVPSHTYTILEGIRNKV 419
Query: 418 ATI-YQAGCS 426
+ Y+ GC
Sbjct: 420 GNVPYEKGCE 429
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 56/116 (48%), Gaps = 14/116 (12%)
Query: 445 ADATVLVMGADQSIEAESH-----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
AD + + G +E E DR + LP Q Q I + K A V+ +
Sbjct: 590 ADIVIFLGGISPDLEGEEKHFVNCPGFSGGDRTSIELP-QVQRDILKALKKAGKKVVFVN 648
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
SG ++ +IL YPGQAGG A+ADV FG +NPSG+LP+T+Y +
Sbjct: 649 CSGSA--VALVPEMESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVTFYKST 702
>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 864
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 158/456 (34%), Positives = 212/456 (46%), Gaps = 81/456 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL + R G L+K LTL+EK++ ++DS+ V RLGI Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM--------YNVGLAGLTFW 164
AT FPQ I AASF+ + VS EARA GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+ +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAVH 189
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I Y YL F+ +V
Sbjct: 190 SGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDR 246
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++ Y + P E A+A ++L+G DL CGS
Sbjct: 247 LLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEA 306
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK G ++E A+D A+ LG D P K + + V + + LAL
Sbjct: 307 LVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLAL 364
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N +LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 365 DMARKSMTLLMNKDNTLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRK 423
Query: 416 VVAT----IYQAGCSNVQCGTAQVDDAKKAAASADA 447
+ T IY+ GC V+ AQ+ +AD
Sbjct: 424 ALGTDDRLIYEQGCGWVE--RAQIQSVFNRCKTADG 457
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G D +K+ AD V V G ++E E DR D+ LP
Sbjct: 579 NFDLGFKNEVDVRKSVERVKDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + K V+L+ SG I + +IL YPGQAGG A+A+V FG
Sbjct: 639 QRELIAALHHAGK-KVVLVNCSGS--PIGLEPETGRCGAILQAWYPGQAGGTAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP+T+Y
Sbjct: 696 DYNPAGRLPVTFY 708
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 142/409 (34%), Positives = 196/409 (47%), Gaps = 70/409 (17%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ L ++ + RL IPKY +E+LHGV V PG T
Sbjct: 40 RIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGV--VRPGRF--------T 89
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFR 172
FPQ I A+ +N L I +S EAR +N G LTFWSP +N+ R
Sbjct: 90 VFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTVNMAR 149
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ T +V+GLQ G P LK+ + KH+ A + ++ +
Sbjct: 150 DPRWGRTPETYGEDPYLSGILGTAFVRGLQ---GDDPRYLKIVSTPKHFAANNEEHNRFV 206
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YY F+
Sbjct: 207 CNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCTANPWLLTKVLRHDWGFN 266
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
Y+VSDC +L ++ Y KT E AA SI AGLDL CG + A +V+ +
Sbjct: 267 GYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDVYMQPLLNAYNQYMVSRA 326
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID A M LG FD P PY ++ P V + ++ LAL+AARQ IVLLKN
Sbjct: 327 DIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHKQLALEAARQSIVLLKNNN 385
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-TTPLQGLAAVV 417
+LPL+P +K++AV+G NA ++ G+Y G P + LQG+ V
Sbjct: 386 RTLPLNPKKVKSIAVVGINAGNSE--FGDYSGIPANAPVSILQGIKDKV 432
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + + V+G +++IE E DR D+ LP Q+ I E+ KV +++++++G
Sbjct: 596 EAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVLVAG 654
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D + +I+ YPG+ GG A+A+V FG YNP GRLP+T+Y
Sbjct: 655 SSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYY 702
>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 864
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 158/456 (34%), Positives = 212/456 (46%), Gaps = 81/456 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL + R G L+K LTL+EK++ ++DS+ V RLGI Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM--------YNVGLAGLTFW 164
AT FPQ I AASF+ + VS EARA GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+ +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAVH 189
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I Y YL F+ +V
Sbjct: 190 SGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDR 246
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++ Y + P E A+A ++L+G DL CGS
Sbjct: 247 LLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSSYEA 306
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK G ++E A+D A+ LG D P K + + V + + LAL
Sbjct: 307 LVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLAL 364
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N +LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 365 DMARKSMTLLMNKDNTLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRK 423
Query: 416 VVAT----IYQAGCSNVQCGTAQVDDAKKAAASADA 447
+ T IY+ GC V+ AQ+ +AD
Sbjct: 424 ALGTDDRLIYEQGCGWVE--RAQIQSVFNRCKTADG 457
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G D +K+ AD V V G ++E E DR D+ LP
Sbjct: 579 NFDLGFKNEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K V+L+ SG I + +IL YPGQAGG A+A+V FG
Sbjct: 639 QRELIAALHRAGK-KVVLVNCSGS--PIGLEPETGRCGAILQAWYPGQAGGTAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP+T+Y
Sbjct: 696 DYNPAGRLPVTFY 708
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 168/568 (29%), Positives = 262/568 (46%), Gaps = 98/568 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F NTSL + R+ +L+ +T+ EKI L + G V RLGI + SE LHG++ GPG
Sbjct: 26 FQNTSLSDEKRLDNLLSIMTIDEKINALSTNLG-VPRLGI-RNTGHSEGLHGMALGGPGN 83
Query: 113 ----------HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR-----AMYNVG 157
+V P T+FPQ +++ L + + + +TE R Y G
Sbjct: 84 WGGFKMVNYQRVPDVYP-TTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYTKG 142
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
GL +PN ++ RDPRWGR +E+ GEDP L S+ A ++KGLQ G +P K A+
Sbjct: 143 --GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQ---GENPRYWKSASL 197
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------- 251
KH+ A ++ + + +F+ + + YY F+
Sbjct: 198 MKHFLANSNEDGRDSTSSNFDNRLFHEYYSYPFRKGIEKGGSQAFMAAYNSWNEIPMTIH 257
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK-- 295
I +D ++D+L + T E +A + AG+ G FL
Sbjct: 258 PILKKIRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAAIVKAGV----GQFLDNFR 313
Query: 296 -HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT----QAN 350
+ A++ G++ E+ ID AI NF ++LG DG +K PY +G D + +
Sbjct: 314 PYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIGVTDTVSVWRNKEI 373
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
QD + +VLLKN LPL+ IK +AVIGP AN + ++ Y GTP + L
Sbjct: 374 QDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN--EVLLDWYSGTPPYTVSIL 431
Query: 411 QGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVL-----VMGADQSIE------ 459
QG+ V + + + ++D A AA AD ++ V G D +
Sbjct: 432 QGIKNAVGNNVEV----IYESSNEIDKAYLAAQKADIAIVCVGNHVYGTDPKWKYSPVPS 487
Query: 460 --AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
E+ DR L L +Q+ L+ V K A +++++S F I++++ + I +IL +
Sbjct: 488 DGREAVDRKALSL--EQEDLVKIVHK-ANPNTVMVLVSSFPFAINWSQEN--IPAILHIT 542
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTW 545
Q G +ADV FG YNP+GR TW
Sbjct: 543 NNSQELGNGLADVIFGNYNPAGRTNQTW 570
>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 865
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 157/456 (34%), Positives = 213/456 (46%), Gaps = 81/456 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL + R G L+K LTL+EK++ ++DS+ V RLGI Y WW+EALHGV+ G
Sbjct: 23 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM--------YNVGLAGLTFW 164
AT FPQ I AASF+ + VS EARA + GLT W
Sbjct: 81 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYTSQESHERYQGLTMW 132
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+ +
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAVH 190
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I Y YL F+ +V
Sbjct: 191 SGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDR 247
Query: 255 -----------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGK 295
SDC ++ Y + P E A+A ++L+G DL CGS
Sbjct: 248 LLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSSYEA 307
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK G ++E A+D A+ LG D P K + + V + + LAL
Sbjct: 308 LVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLAL 365
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N +LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 366 DMARKSMTLLMNKDNTLPLKRGGL-TIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRK 424
Query: 416 VVAT----IYQAGCSNVQCGTAQVDDAKKAAASADA 447
+ + IY+ GC V+ AQ+ +AD
Sbjct: 425 ALGSDDRLIYEQGCGWVE--RAQIQSVFNRCKTADG 458
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G D +K+ AD V V G ++E E DR D+ LP
Sbjct: 580 NFDLGFKNEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAV 639
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + K V+L+ SG I + +IL YPGQAGG A+A+V FG
Sbjct: 640 QRELIAALHHAGK-KVVLVNCSGS--PIGLEPETGRCGAILQAWYPGQAGGTAVAEVLFG 696
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP+T+Y
Sbjct: 697 DYNPAGRLPVTFY 709
>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
Length = 853
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 142/398 (35%), Positives = 194/398 (48%), Gaps = 71/398 (17%)
Query: 68 VKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQV 127
+ ++T+++K L +A + +G+P Y+WW+E LHG++ G AT FPQ
Sbjct: 20 IAKMTIEQKAAQLQSTAPADPVIGLPAYDWWNEGLHGLARDG----------YATVFPQA 69
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGRG 179
I AA+++ L +G V+TEARA +N GLT WSPNINIFRDPRWGRG
Sbjct: 70 IGLAATWDVPLLHKVGDTVATEARAKFNAKPVTADRKIYEGLTIWSPNINIFRDPRWGRG 129
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN- 238
QET GEDP L A G+++GLQ D P KV A KH + G D + +
Sbjct: 130 QETYGEDPFLTGHLAVGFIQGLQGPDPAHP---KVLATPKHLAVHSGPE-AGRDGFDVDP 185
Query: 239 -AMVIYTYYLIKFK------------------------------------------YIVS 255
+ + Y F+ IVS
Sbjct: 186 SPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASGGLLNDRLRKDWGFTGLIVS 245
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAI 315
DCD+V ++ HY EA+A +I AG+DLNCG+ E AVK GLV+E+ +D A+
Sbjct: 246 DCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAALPE-AVKRGLVSEAEVDVAL 304
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
LG G P+ ++ P + T A + LAL+AAR+ IVLLKN LPL+
Sbjct: 305 KRALDARRALGIAFG--GANPWSRIKPGERGTPAQRALALEAARKAIVLLKNDGDRLPLA 362
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
P +AVIG NA+ + GNY GT TPL G+
Sbjct: 363 PGG--RIAVIGANADDLGVLEGNYHGTARDPVTPLDGI 398
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 59/112 (52%), Gaps = 17/112 (15%)
Query: 446 DATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
D V V+G +E E+ DR D+ LP Q L+ + K K P++L++ S
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELLKALRKTGK-PLVLVLTS 638
Query: 496 GGGFDISFAKNDPKIT-SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G + DP + +IL YPG+ GG AIA+ G+ NPSGRLP+T+Y
Sbjct: 639 GSAVAV-----DPSLADAILEAWYPGEEGGTAIAETLAGKNNPSGRLPLTFY 685
>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 148/438 (33%), Positives = 206/438 (47%), Gaps = 79/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ NTSL + R LVK LTL+EK ++D++ SV RLGI Y WW+EALHGV+ G
Sbjct: 22 YKNTSLSPEERAELLVKELTLEEKARLMMDASQSVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFW 164
AT FPQ I AASF+ + + VS E RA + GLT W
Sbjct: 80 --------ATVFPQPIGMAASFDPEMVYEVFNAVSDEVRAKNTYYTSRDSRERYQGLTMW 131
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ G ++L AC KH+ +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLH--ACAKHFAVH 189
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I Y YL F+
Sbjct: 190 SGPEW---NRHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGSDR 246
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ YN + + P A+A ++++G DL+CGS
Sbjct: 247 LLMQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSSYKA 306
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E +VK GL+ E +D ++ LG D P K + ++ V + A+ LAL
Sbjct: 307 LVE-SVKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDSLAL 364
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ AR+ + LL N LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 365 EIARKSMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILDGVRN 423
Query: 416 VVAT----IYQAGCSNVQ 429
++ IY+ GC V+
Sbjct: 424 ILGAEDKLIYEQGCPWVE 441
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 65/137 (47%), Gaps = 16/137 (11%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +K+ AD + G S+E E DR D+ LP
Sbjct: 578 NFDLGFKKDVDIRKSVERVKDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAV 637
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K +I+++ G I K ++L YPGQ GG A+A+V FG
Sbjct: 638 QRELIDALHRAGKK---IILVNCSGSPIGLEPETKKCEALLQAWYPGQQGGTAVAEVLFG 694
Query: 534 RYNPSGRLPMTWYPQSY 550
YNP+GRLP+T+Y Y
Sbjct: 695 TYNPAGRLPVTFYRNIY 711
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 170/532 (31%), Positives = 251/532 (47%), Gaps = 103/532 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP E L G + GAT FPQ I A++F L + + V+
Sbjct: 86 TRLGIPAI-IHEECLSGF-----------MAKGATVFPQAIGMASTFEPELIRRVSDVIR 133
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
RA NV GL SP ++I RDPRWGR +ET GEDP L S+ A YVKGLQ D
Sbjct: 134 QHMRAA-NVH-QGL---SPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQGEDW- 187
Query: 208 SPNRLKVAACCKHYTAY-------DLDNWKGTDR-------YHFNAMV--------IYTY 245
R + A KH+TAY +L K +R + F + + Y
Sbjct: 188 ---REGIIATVKHFTAYGISEGARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAY 244
Query: 246 YLIK--------------------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
+ I FK Y+VSD ++ +L N K +EAA ++ AG
Sbjct: 245 HEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAVLALEAG 304
Query: 285 LDLNCGSF--LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+D+ S G+ AVK GL++E I+ ++ LG FDG K P
Sbjct: 305 IDIELPSVDCYGEPLIQAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDPKKVY-- 362
Query: 343 KDVCTQAN-QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY-- 399
D+ + ++L+ + AR+ IVLLKN G LPLS I+ +AVIGPNA+ + + G+Y
Sbjct: 363 -DIFDKPEFRELSREVARRSIVLLKND-GILPLSKN-IRTVAVIGPNADNPRNLHGDYSY 419
Query: 400 -----------------EGTPCKYTTPLQGL----AAVVATIYQAGCSNVQCGTAQVDDA 438
E + + L+G+ +A +Y GC + D+A
Sbjct: 420 TAHIPSVSETLEGVKIPEECAVRTVSILEGIKNKVSAETQVLYAKGCEILSDSKEGFDEA 479
Query: 439 KKAAASADATVLVMGAD-----QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+ A AD + VMG + + I E +DR L L G Q+ L+ E+ K+ K P++L++
Sbjct: 480 IEIAKRADVIIAVMGEESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVL 538
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
++G + + + + +IL YPG+ GG A+ADV FG YNPSG+LP+++
Sbjct: 539 VNGRPQALKWEHEN--LNAILEAWYPGEEGGDAVADVIFGDYNPSGKLPISF 588
>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 838
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 155/457 (33%), Positives = 212/457 (46%), Gaps = 84/457 (18%)
Query: 70 RLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVIL 129
+LT+++K LV+ A + L +P Y+WWSE LHG++ GP AT FPQ +
Sbjct: 24 KLTVEQKAAQLVNEAPAEPALDLPAYDWWSEGLHGIARNGP----------ATVFPQAMG 73
Query: 130 TAASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGRGQE 181
AA+++ +L + +G VV+ EARA +N GLT WSPNINIFRDPRWGRGQE
Sbjct: 74 MAATWDPALIEKVGDVVAVEARAKFNTKPINADRGLFEGLTIWSPNINIFRDPRWGRGQE 133
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
T GEDP L S+ +V+GLQ D P KV A KH + R F+ MV
Sbjct: 134 TYGEDPFLTSRIGIAFVRGLQGPDLKHP---KVIATVKHLAVHSGPE---GGRDSFDVMV 187
Query: 242 ----IYTYYLIKFK------------------------------------------YIVS 255
+ YL F+ VS
Sbjct: 188 SPRDLEATYLPAFRATVTEGKALSLMCAYNAIHGTPVCANPMLMTERLRTDWGFKGLTVS 247
Query: 256 DCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAI 315
DCD+V ++ H AAA ++ AG DLNCG+ E A+K GL+ E ID A+
Sbjct: 248 DCDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCGNTYRALPE-ALKRGLITEGEIDTAL 306
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+ A + R+ D P+ ++ P + T AN+ LAL+ AR+GI+LL N LPL
Sbjct: 307 ARALA-VRRMLTVD-----SPWNRIKPSQLGTPANRALALETARKGIILLDNPNDLLPLK 360
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV 435
K LAV G +A+ + GNY G+ TPL G+ A + +V A +
Sbjct: 361 G---KKLAVFGADADDLGVLEGNYHGSAIDPVTPLDGIRAKFGRDVRYAQGSVLAEGAFI 417
Query: 436 ---DDAKKAAASADATVLVMGAD-QSIEAESHDRLDL 468
+ A A V MGA+ Q I A R+D
Sbjct: 418 NVPETALSANGKPGLAVETMGANGQWIAAGQDRRIDF 454
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 62/120 (51%), Gaps = 15/120 (12%)
Query: 437 DAKKAAASADATVLVMGADQSIEAES----------HDRLDLLLPGQQQLLITEVAKVAK 486
+A K A +AD V ++G +E E+ DR +++LP Q+ L+ + K
Sbjct: 555 EALKVAGTADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAALQATGK 614
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
PV+ +I+SG + D K + L YPG GG A+A++ G NPSGRLP+T Y
Sbjct: 615 -PVVAVIVSGSAVSLG----DIKPAATLAAFYPGAEGGTALAEILSGDVNPSGRLPVTIY 669
>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 862
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 158/440 (35%), Positives = 209/440 (47%), Gaps = 83/440 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ TSL + R L+K LTL+EK+ ++D++ +V RLGI Y WW+EALHGV+ G
Sbjct: 22 YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
AT FPQ I AASF+ + VS EARA N A GLT
Sbjct: 80 --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTM 130
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAV 188
Query: 224 YDLDNWKGTDRYHFN-----AMVIYTYYLIKFK--------------------------- 251
+ W +R+ FN A +Y YL F+
Sbjct: 189 HSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSN 245
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLG 294
I+SDC ++ YN + P E A+A ++L G DL CGS
Sbjct: 246 RLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGSSY- 304
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-L 353
K AV+AG +NE ID A+ LG D P + ++ P V A D L
Sbjct: 305 KALVKAVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKI-PFSVVASAEHDSL 362
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
ALD AR+ + LL+NT LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 363 ALDMARKSMTLLQNTGNILPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGI 421
Query: 414 AAVVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 422 RKALGADDKLIYEQGCSWVE 441
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 6/123 (4%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---ESHDRLDLLLPGQQQLLITEVAK 483
++ A+V DA ++ + ++ G + + + DR D+ LP Q+ LI+ + +
Sbjct: 588 DINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHR 647
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K +I+++ G I+ +IL YPGQ GG A+A+V FG YNP GRLP+
Sbjct: 648 AGKK---IILVNCSGSPIALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPV 704
Query: 544 TWY 546
T+Y
Sbjct: 705 TFY 707
>gi|430749766|ref|YP_007212674.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
KWC4]
gi|430733731|gb|AGA57676.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
KWC4]
Length = 971
Score = 201 bits (511), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 199/415 (47%), Gaps = 62/415 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L ++ RV DL+ RLT +EKI + +V RLGI Y+ +EA HG++++GP
Sbjct: 9 FRDPDLPLETRVNDLIGRLTTEEKINLMAQYQDAVPRLGIAPYKHGTEAAHGIAWLGP-- 66
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT+FPQ I A +++A L + +G V+ EAR Y N + GLT W+P +++
Sbjct: 67 --------ATTFPQPIGLACTWDADLLRRVGSVIGDEARVWYRRNPAVNGLTLWAPTVDL 118
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L K A+ ++G+Q G P LK A KH+ + + +
Sbjct: 119 ERDPRWGRTEEAYGEDPFLVGKLASALIRGIQ---GDHPFYLKAVATLKHFLGNNNEAGR 175
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 176 GDKSVSIDPRNMREYYLKVFETAFKEGGALSMMTAYNAVNGVPANLLPMITSVVKQEWGM 235
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++VSD V Y + A A SI AG+D + + + A++ GL+
Sbjct: 236 NGFVVSDAFDVTGTVRDHGYMASLPAAVAASIRAGIDSITDDAKVVKQAIREALEMGLLT 295
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E+ +D A+ N F RLG FD + PY ++G + ++ L+L+AAR+ IVLLKN
Sbjct: 296 ENDLDVALRNTFRVRFRLGEFDPD-DRNPYAEIGESHMMRPEHEALSLEAARKSIVLLKN 354
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQ 422
G LPLS I+ +AVIGP A+V Y GT TPL G+ +A Q
Sbjct: 355 D-GVLPLSADRIRKIAVIGPLADVVYR--DWYSGTLPYAVTPLAGIVEALAGRVQ 406
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR D+ LPG Q+ LI EV V ++++I G + + +IL++ + G
Sbjct: 605 ETVDRPDIALPGPQERLIREVLAVNPNTIVVVI---GSYPFALGWTAEHAPAILYLAHAG 661
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
Q G A+A FGR+NP+GRL MTWY
Sbjct: 662 QELGRAVAGTLFGRFNPAGRLNMTWY 687
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 202/409 (49%), Gaps = 75/409 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EK++ LV+ A + RL I KY +EALHGV V PG T
Sbjct: 39 RILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGV--VRPGEF--------T 88
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------GLAG----LTFWSPNINIFR 172
FPQ I AA++N +L + +S EAR + +AG LTFWSP +N+ R
Sbjct: 89 VFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAGGSDLLTFWSPTVNMAR 148
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L+ + +VKGLQ G +P LK + KH+ A + ++ + +
Sbjct: 149 DPRWGRTPETYGEDPFLSGRIGCEFVKGLQ---GDNPRYLKTVSTPKHFAANNEEHNRSS 205
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + YYL F+
Sbjct: 206 CNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPCTVNIYLIKKVLRGDWNFN 265
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA---GLVN 307
YIVSDC + + + Y K E AA ++ AGLDL CG + +T +KA +V+
Sbjct: 266 GYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDRV--YTAPLLKAYNEYMVS 323
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E+ ID A + M LG FD PS+ PY ++ P + + +Q+LAL+ ARQ +VLLKN
Sbjct: 324 EAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIGCKEHQELALETARQSMVLLKN 382
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV 416
LPL+ I+++AV+G +A G+Y G P TP+ L +
Sbjct: 383 QKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNP--KNTPVSVLDGI 427
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 67/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
DA K A D TV V+G ++SIE E DR L LP QQ I E+ KV V++++ +G
Sbjct: 595 DAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVVVLV-AG 653
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D + +IL YPG+ GG A+A+V FG YNP GRLP+T+Y
Sbjct: 654 SSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY 701
>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
CL03T12C37]
gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
CL03T00C23]
Length = 864
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 158/456 (34%), Positives = 213/456 (46%), Gaps = 81/456 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N SL + R G L+K LTL+EK++ ++DS+ V RLGI Y WW+EALHGV+ G
Sbjct: 22 YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAM--------YNVGLAGLTFW 164
AT FPQ I AASF+ + VS EARA GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+ +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAVH 189
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------- 254
W +R+ FNA I Y YL F+ +V
Sbjct: 190 SGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSDR 246
Query: 255 -----------------SDCDSV-DVLYNSQHYTK-TPEEAAAKSILAGLDLNCGSFLGK 295
SDC ++ D + H T E A+A ++L+G DL CGS
Sbjct: 247 LLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSSYEA 306
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
E AVK G ++E A+D A+ LG D P K + + V + + LAL
Sbjct: 307 LVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDSLAL 364
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
D AR+ + LL N +LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 365 DMARKSMTLLMNKDNTLPLKRGGL-TIAVMGPNANDSVMQWGNYNGMPPHTVTILDGIRK 423
Query: 416 VVAT----IYQAGCSNVQCGTAQVDDAKKAAASADA 447
+ + IY+ GC V+ AQ+ +AD
Sbjct: 424 ALGSDDRLIYEQGCGWVE--RAQIQSVFNRCKTADG 457
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G D +K+ AD V V G ++E E DR D+ LP
Sbjct: 579 NFDLGFKNEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K V+L+ SG I + +IL YPGQAGG A+A+V FG
Sbjct: 639 QRELIAALHRAGK-KVVLVNCSGS--PIGLEPETGRCGAILQAWYPGQAGGTAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP+T+Y
Sbjct: 696 DYNPAGRLPVTFY 708
>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 862
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 157/440 (35%), Positives = 209/440 (47%), Gaps = 83/440 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ TSL + R L+K LTL+EK+ ++D++ +V RLGI Y WW+EALHGV+ G
Sbjct: 22 YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA---------GLTF 163
AT FPQ I AASF+ + VS EARA N A GLT
Sbjct: 80 --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTM 130
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLH--ACAKHFAV 188
Query: 224 YDLDNWKGTDRYHFN-----AMVIYTYYLIKFK--------------------------- 251
+ W +R+ FN A +Y YL F+
Sbjct: 189 HSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSN 245
Query: 252 ---------------YIVSDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLG 294
I+SDC ++ YN + P E A+A ++L G DL CGS
Sbjct: 246 RLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGSSY- 304
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-L 353
K AV+AG +NE ID A+ LG D P + ++ P V A D L
Sbjct: 305 KALVKAVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKI-PFSVVASAEHDSL 362
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AL+ AR+ + LL+NT LPL + +AV+GPNAN + GNY G P T L G+
Sbjct: 363 ALNMARKSMTLLQNTGNILPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGI 421
Query: 414 AAVVAT----IYQAGCSNVQ 429
+ IY+ GCS V+
Sbjct: 422 RKALGADDKLIYEQGCSWVE 441
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 6/123 (4%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---ESHDRLDLLLPGQQQLLITEVAK 483
++ A+V DA ++ + ++ G + + + DR D+ LP Q+ LI+ + +
Sbjct: 588 DINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHR 647
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K +I+++ G I+ +IL YPGQ GG A+A+V FG YNP GRLP+
Sbjct: 648 AGKK---IILVNCSGSPIALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPGGRLPV 704
Query: 544 TWY 546
T+Y
Sbjct: 705 TFY 707
>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 954
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 166/548 (30%), Positives = 270/548 (49%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY--- 225
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+ GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 226 GS-------GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLKVAA 216
+D RWGR +ET GEDP+L S+ ++KG Q G+P + +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 333
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------YIVS 255
+ L ++ R + + M+ Y+ Y+ K K +IVS
Sbjct: 334 SEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVS 393
Query: 256 DCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 394 DCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDN 453
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 454 VCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKENLL 512
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PLS T ++ +AV+GP A+ + G+Y + P + + L G+ + V +Y+ GC
Sbjct: 513 PLSKT-LRTIAVVGPGADDLQP--GDYTPKLLPGQLKSVLTGIKSAVGKQTKVLYEQGCD 569
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A K A+ +D ++V+G + EA E++D L+LPG+QQ L
Sbjct: 570 FTNPDATNIPKAVKTASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 629
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 630 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 686
Query: 538 SGRLPMTW 545
+GRLPMT+
Sbjct: 687 AGRLPMTF 694
>gi|301096878|ref|XP_002897535.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262106995|gb|EEY65047.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 537
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 166/539 (30%), Positives = 241/539 (44%), Gaps = 84/539 (15%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSA---GSVSRLGIPKYEWWSEALHGVSY 107
L FCNTSL RV DL+ RL L EK L A G+++ +G+P+Y W + +HGV
Sbjct: 22 LPFCNTSLSTTERVEDLLARLPLGEKAILLTARASPRGNMTSIGLPEYNWGANCVHGVQS 81
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
GT+ TSFP + A F+ + + +V+ E +
Sbjct: 82 TC-GTNC------PTSFPNPVNMGAIFDPEAVREMAQVIGWEEALTSD------------ 122
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
W G T LA + LQ+ L+ KHY AY +
Sbjct: 123 --------WTAGHPTLTSTETLAGGAIPRH--RLQEGKREDSRYLQAVVTLKHYIAYSYE 172
Query: 228 NWKGTDRYHFNAMVI-YTY---YLIKFKYIVSDCDSVDVL--YNS--------------- 266
N+ G +R F+A+V Y + Y F+ + D ++ V+ YNS
Sbjct: 173 NYGGGNRKEFDAIVSPYDFADTYFPAFRSSIVDGNATGVMCSYNSINGVPTCANNADSGA 232
Query: 267 -------QHYTKTPEEAAAKSILAGLDLNCGSFLG--KHTEAAVKAGLVNESAIDHAISN 317
HY T EAA +ILAG D+N G G K V++ + E A+D A+ +
Sbjct: 233 IEAISDRHHYVATRCEAARIAILAGTDVNSGRLFGYMKCLPELVRSNQLEEKALDDALRH 292
Query: 318 NFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
LG FD QPY + P DV T ++ LAL+ A + IVLL+N LPL
Sbjct: 293 TLKLRFELGLFD-PIDDQPYWHVKPSDVNTPKSRQLALNLATKSIVLLQNNNSVLPLRKG 351
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCK--------YTTPLQGLAAV---VATIYQAGCS 426
LAV+GP+A + ++GNY G C TP++ ++ + Y GC+
Sbjct: 352 T--KLAVLGPHAESKRGLLGNYLGQMCHGGYSEVGCIQTPMEAVSTTNGAATSTYALGCN 409
Query: 427 NVQCGTAQVDDAK-KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
T ++ K K ADA VL +G D SIE E +DR D+ LP Q L+ V V
Sbjct: 410 ISGNSTDGFEETKVKVTQEADAVVLFLGIDTSIEEEVNDRNDIKLPSIQMQLLQRVRAVG 469
Query: 486 KGPVILIIMSGG--GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLP 542
K P ++++++GG G + + D + + YPG G A+ADV FG NP+G+LP
Sbjct: 470 K-PTVVVLINGGVLGAEDLILQTDALVEAF----YPGFFGAQAMADVLFGDANPNGKLP 523
>gi|354583533|ref|ZP_09002432.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353198174|gb|EHB63648.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 931
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 143/463 (30%), Positives = 213/463 (46%), Gaps = 69/463 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV DLV R TL+EKI + + RLG+ KY+ +E HGV+++G
Sbjct: 5 FQNPDLPLEERVNDLVSRFTLEEKIELMCQYQAEIPRLGVQKYKHGTEGAHGVAWLGE-- 62
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT FPQ A ++N L + IG V++ EAR Y + + GLT W+P +++
Sbjct: 63 --------ATVFPQNTGLACTWNPELMRQIGSVIADEARVYYQRDKAVNGLTIWAPTVDM 114
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + +TG V+G+Q G P K A KH+ + + +
Sbjct: 115 ERDPRWGRTEEAYGEDPHLTGELSTGLVQGMQ---GDHPFYYKTVATLKHFYGNNNEVDR 171
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + YYL F+
Sbjct: 172 GNASVSIDPRNKREYYLKAFEPAFRKGGAKSMMTAYNGINGTPCNLNDEVNRIVKQEWGM 231
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++V D V HY + EA A+S+ AG+D + + A++ GL+
Sbjct: 232 DGFVVGDAGDVLGTVMDHHYVDSYAEAVARSVKAGIDSITDDQEISFRALRDALEQGLLT 291
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +DHA+ N F RLG FD + PY + +C + +L+L AAR+ +VLLKN
Sbjct: 292 EQDLDHALRNTFRVRFRLGEFDPE-DRNPYSHVPESKLCAPEHAELSLRAARESVVLLKN 350
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAG 424
G LPL+ + + AVIGP A+ + Y GTP TPLQG+ + + G
Sbjct: 351 D-GLLPLN-NNMPSAAVIGPLAD--EAFTDWYSGTPPYRITPLQGITEKMNGRKVNFHTG 406
Query: 425 CSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
V+ +A + + AD L+ G ++S EA +R D
Sbjct: 407 LDQVRLRSAA---SGQYVTLADDNRLLFGTNESSEAALFERND 446
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR D++LP QQ LI V V++I+ G + + + SI++ + G
Sbjct: 562 ETIDRPDIVLPPSQQQLIQAVHAANPNTVVVIV---GSYPFAVNWEQEHVPSIVFTSHAG 618
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
Q G A+ADV +G YNP GRL MTWY
Sbjct: 619 QELGRAVADVLYGDYNPGGRLNMTWY 644
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 169/531 (31%), Positives = 250/531 (47%), Gaps = 101/531 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP E L G + GAT FPQ I A++F L + + V+
Sbjct: 86 TRLGIPAI-IHEECLSGF-----------MAQGATVFPQAIGMASTFEPELIRRVSDVIR 133
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
+A NV GL SP ++I RDPRWGR +ET GEDP L S+ AT YVKGLQ D
Sbjct: 134 QHMKAA-NVH-QGL---SPVLDIPRDPRWGRTEETFGEDPYLVSRMATEYVKGLQGEDW- 187
Query: 208 SPNRLKVAACCKHYTAY-------DLDNWKGTDR-------YHFNAMV--------IYTY 245
R + A KH+TAY +L K +R + F + + Y
Sbjct: 188 ---REGIVATVKHFTAYGISEGARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAY 244
Query: 246 YLIK--------------------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
+ I FK Y+VSD +V +L N + +EAA ++ AG
Sbjct: 245 HEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAG 304
Query: 285 LDLNCGSF--LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+D+ S G+ AVK GL++E I+ ++ LG FD + K P
Sbjct: 305 IDIELPSVDCYGEPLIQAVKEGLISEEVINASVERVLRAKFMLGLFDDNLEKDPKKVYEV 364
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--- 399
D +DL+ + AR+ IVLLKN G+LPLS +K +AVIGPNA+ + + G+Y
Sbjct: 365 FD--KPEFRDLSREVARRSIVLLKND-GTLPLSKN-LKKVAVIGPNADNPRNLHGDYSYT 420
Query: 400 ----------------EGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCGTAQVDDAK 439
E + + L+G+ V+ +Y GC + +A
Sbjct: 421 AHIPSIAEGLEGVKVEEKCVVRTVSILEGIRNKVSPETEVLYAKGCDIISDSKDGFAEAI 480
Query: 440 KAAASADATVLVMGAD-----QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
+ A AD + VMG + + I E +DR L L G Q+ L+ E+ K+ K P++L+++
Sbjct: 481 EMAKEADVIIAVMGEESGLFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLI 539
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
+G + + + + +IL YPG+ GG A+ADV FG YNPSG+LP+++
Sbjct: 540 NGRPQALKWEHEN--LNAILEAWYPGEEGGNAVADVIFGDYNPSGKLPISF 588
>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
Length = 954
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 167/548 (30%), Positives = 269/548 (49%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY--- 225
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+ GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 226 GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLKVAA 216
+D RWGR +ET GEDP+L S+ ++KG Q G+P + +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 333
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------YIVS 255
+ L ++ R + + M+ Y+ Y+ K K +IVS
Sbjct: 334 SEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVS 393
Query: 256 DCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 394 DCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDN 453
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 454 VCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKDNLL 512
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PLS T ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 513 PLSKT-LRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGCD 569
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D ++V+G + EA E++D L+LPG+QQ L
Sbjct: 570 FTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 629
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 630 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 686
Query: 538 SGRLPMTW 545
+GRLPMT+
Sbjct: 687 AGRLPMTF 694
>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 950
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 167/548 (30%), Positives = 269/548 (49%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY--- 221
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+ GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 222 GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLKVAA 216
+D RWGR +ET GEDP+L S+ ++KG Q G+P + +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 329
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------YIVS 255
+ L ++ R + + M+ Y+ Y+ K K +IVS
Sbjct: 330 SEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVS 389
Query: 256 DCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 390 DCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDN 449
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 450 VCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKDNLL 508
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PLS T ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 509 PLSKT-LRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGCD 565
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D ++V+G + EA E++D L+LPG+QQ L
Sbjct: 566 FTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 625
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 626 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 682
Query: 538 SGRLPMTW 545
+GRLPMT+
Sbjct: 683 AGRLPMTF 690
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 167/512 (32%), Positives = 239/512 (46%), Gaps = 94/512 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLF----QAIG 143
SR+ IP EALHG + G+T FPQ I A+++N L AIG
Sbjct: 127 SRVKIPIL-IHDEALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIG 174
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
K T +R ++ V SP INI RDPR GR +ET GEDP LAS+ A Y+KG+Q+
Sbjct: 175 K--ETRSRGIHQV-------LSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQE 225
Query: 204 TDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY--HFNAMVIYTYYLIKFK---------- 251
V A KH+ A + + G D Y HF+ ++ Y F+
Sbjct: 226 QG--------VIATPKHFVANFVGD-GGRDSYPIHFSERLLREIYFPAFRASIEEAGALS 276
Query: 252 --------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
Y+VSD SV L ++ EAA
Sbjct: 277 LMAAYNSLDGIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKL 336
Query: 280 SILAGLDLNC-GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
S+ AGLD+ S + ++ +++ +D A+ +G FD +P P
Sbjct: 337 SLEAGLDMELPDSDCFEEIPGLIRESKLSQDTLDEAVRRVLRVKFWIGLFD-NPFVDPDY 395
Query: 339 QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGN 398
D C++ +++LAL AR+ IVLLKN G LPL+ I+++AVIGPNA V + +G
Sbjct: 396 AERIND-CSE-HRELALRVARESIVLLKNE-GILPLNKD-IRSIAVIGPNAAVPR--LGG 449
Query: 399 YEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA 454
Y G K TPL+G L V + GC + D+A K A +D +L MG
Sbjct: 450 YSGYGVKVVTPLEGIKNKLGDKVKVYFAEGCGLNDTSKSGFDEAIKIAQKSDVAILFMGN 509
Query: 455 D-QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
E E DR +L LPG Q+ LI E+ PVI+++++G I+ K+ ++
Sbjct: 510 SVPETEGEQRDRHNLNLPGVQEDLIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAV 566
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
+ YPG+ GG AIADV FG YNP G+LP+++
Sbjct: 567 IEAWYPGEEGGNAIADVLFGDYNPGGKLPISF 598
>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 863
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 158/436 (36%), Positives = 205/436 (47%), Gaps = 83/436 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ TSL + R L+K LTL+EK++ ++D A V RLGI Y WW+EALHGV+ G
Sbjct: 23 YKTTSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AASF+ + VS EARA + GLT W
Sbjct: 81 --------ATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGSYERYQGLTMW 132
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL-KVAACCKHYTA 223
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ GG+ R K+ AC KH+
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQ---GGNDGRYDKLHACAKHFAV 189
Query: 224 YDLDNWKGTDRYHFNAMVI-----YTYYLIKFKYIV------------------------ 254
+ W +R+ FNA I Y YL F+ +V
Sbjct: 190 HSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSN 246
Query: 255 ------------------SDCDSVDVLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLG 294
SDC ++ YN + + P E A+A ++L+G DL CGS
Sbjct: 247 RLLMQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYK 306
Query: 295 KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD-L 353
E AVK G + E A+D A+ LG D P K + ++ P V A D L
Sbjct: 307 ALVE-AVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTKI-PFSVVASAKHDSL 363
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG- 412
ALD AR+ + LL N LPL + +AV+GPNAN + GNY G P T L+G
Sbjct: 364 ALDIARKSMTLLMNKNDILPLKQGGL-TVAVMGPNANDSVMQWGNYNGMPSHTVTILEGV 422
Query: 413 ---LAAVVATIYQAGC 425
L A IY+ GC
Sbjct: 423 RKALGADDKLIYEQGC 438
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 65/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAASADAT---VLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D KK+ A + V G S+E E DR D+ LP
Sbjct: 579 NFDLGFRKEVDIKKSVAVVKDADVVIFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAV 638
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ +I + + K +I ++ G I + +IL YPGQAGG A+A+V FG
Sbjct: 639 QREMIAALHRSGKK---VIFVNCSGSPIGLEPETKECEAILQAWYPGQAGGTAVAEVLFG 695
Query: 534 RYNPSGRLPMTWY 546
+YNP+GRLP+T+Y
Sbjct: 696 KYNPAGRLPVTFY 708
>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
17393]
gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 935
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 162/560 (28%), Positives = 260/560 (46%), Gaps = 96/560 (17%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
SL + + +L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 148 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 206
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
GAT FPQ + A++N L + + V E + + WSP
Sbjct: 207 GS----------GATIFPQALAMGATWNKKLTEEVAMAVGDET-----LSAGTMQAWSPV 251
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
+++ +D RWGR +ET GEDP+L S+ ++KG Q + + KH+ +
Sbjct: 252 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------MGLYTTPKHFGGHGAP 303
Query: 228 NWKGTDRYH--FNAMVIYTYYLIKFKYI-------------------------------- 253
G D + + + +L+ F+++
Sbjct: 304 -LGGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSRELLHNIL 362
Query: 254 ----------VSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAV 301
VSDC ++ L +HYT K EAA +++ AG+ NCG ++ K A
Sbjct: 363 REEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAA 422
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQ 360
K G +N +D + R F+ P+K + ++ P + +++++A AAR+
Sbjct: 423 KDGRINMENLDEVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDSHKEMARQAARE 481
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA 418
IVLL+N LPLS ++ +AV+GP AN + G+Y + P + + L G+ V
Sbjct: 482 SIVLLENKDNILPLSKD-MRTIAVLGPGANDLQP--GDYTPKLQPGQLKSVLTGIKQAVG 538
Query: 419 ----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDR 465
IY+ GC G + A K A+ +D +LV+G + EA E+HD
Sbjct: 539 KQTKVIYEQGCDFTSLGENNIAKAVKVASQSDVVLLVLGDCSTSEATTDVYKTSGENHDY 598
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
L+LPG+QQ L+ V K PVILI+ +G +++S A K + W+ PGQ GG
Sbjct: 599 ATLILPGKQQELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNWL--PGQEGGP 655
Query: 526 AIADVCFGRYNPSGRLPMTW 545
A ADV FG YNP+GRLPMT+
Sbjct: 656 ATADVLFGDYNPAGRLPMTF 675
>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
Length = 563
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 187/395 (47%), Gaps = 71/395 (17%)
Query: 71 LTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILT 130
LT +K L SA + G+P Y+WW+E LHG++ G AT FPQ I
Sbjct: 20 LTTAQKAAQLQSSAPADPAAGLPAYDWWNEGLHGLARNG----------YATVFPQAIGM 69
Query: 131 AASFNASLFQAIGKVVSTEARAMYNVG--------LAGLTFWSPNINIFRDPRWGRGQET 182
AA+++A L IG+VV+TEARA +N GLT WSPNINIFRDPRWGRGQET
Sbjct: 70 AATWDAPLLNRIGEVVATEARAKFNAQPVAADRKIYQGLTIWSPNINIFRDPRWGRGQET 129
Query: 183 PGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFN--AM 240
GEDP L ++ GLQ D P KV A KH+ + G D + +
Sbjct: 130 YGEDPYLTGHLGVAFITGLQGPD---PKHPKVIATPKHFVVHSGPE-AGRDGFDVDPSPQ 185
Query: 241 VIYTYYLIKFK------------------------------------------YIVSDCD 258
+ YL F+ + VSDCD
Sbjct: 186 DLEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANAPLMVDRLRHDWGFTGFTVSDCD 245
Query: 259 SVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN 318
+V ++ HY AAA S+ G DLNCG+ E A++ GLV ID A
Sbjct: 246 AVANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAALPE-ALQRGLVTSQDIDTAFGRA 304
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
A LG G + P+ ++ P+D+ T + +AL+AAR+ IVLLKN LPL T
Sbjct: 305 MAARKALGIAYG--ATSPWARISPRDIGTPEHHAVALEAARKAIVLLKNEGDRLPLK-TG 361
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+ +A+IG +A+ + GNY GT TPL+GL
Sbjct: 362 TR-IALIGADADDIGVLQGNYHGTAIAPVTPLEGL 395
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 140/415 (33%), Positives = 206/415 (49%), Gaps = 86/415 (20%)
Query: 206 GGSPNRLKVAACCKHYTAYDLD--------------NWKGT-----------DRY---HF 237
GG+ R ++AA CKH AY L+ +W+GT +R+ H+
Sbjct: 2 GGAGLRPRIAATCKHLAAYSLETDRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHY 61
Query: 238 NAM-------------VIYTYYLIK---------------------FKYIVSDCDSVDVL 263
NA V+ +Y I +VSDC +VD +
Sbjct: 62 NASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNI 121
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLM 323
+++ + + EEA ++ +G+DL+CG+ A L++E ID A+S F LM
Sbjct: 122 HSNHRFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVLM 181
Query: 324 RLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN-----TAGSLPLSPTA 378
LG+FD + +P + ++ + LAL+AA Q IVLLKN G LPLS
Sbjct: 182 DLGYFD--ETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAK 236
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDA 438
K +A+ GP A+ ++GNY G P TPL GLA + G A V D
Sbjct: 237 HKEIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLA-------KMGVEVAFRQRASVCDF 289
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG---PVILIIMS 495
+A T+LV+G DQS+EAE DR LLLP +Q+ LI +++ +K PV+L+++S
Sbjct: 290 HGESA----TILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVS 345
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
GG D+S KN I +++ + YPGQ GG+A+A V +G YNPSG+L T YP+SY
Sbjct: 346 GGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESY 400
>gi|374320933|ref|YP_005074062.1| beta-glucosidase-related glycosidase [Paenibacillus terrae HPL-003]
gi|357199942|gb|AET57839.1| beta-glucosidase-related glycosidase [Paenibacillus terrae HPL-003]
Length = 976
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/412 (32%), Positives = 198/412 (48%), Gaps = 61/412 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L +D RV DLV RLTL+EKI ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 12 FQNPKLHLDERVRDLVSRLTLEEKIESMLQYQPAIERLGVAAYKHGTEAAHGIAWLGE-- 69
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--LAGLTFWSPNINI 170
ATSFPQ + A +++ L + +G V+S EAR Y L GLT W+P +++
Sbjct: 70 --------ATSFPQPVGLACTWDTELMRQVGSVISDEARVYYRRDPVLNGLTLWAPTVDM 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + A VKG+Q G P LK A KH+ + + +
Sbjct: 122 ERDPRWGRNEEAYGEDPYLTGELAKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+D + + YYL F+
Sbjct: 179 GSDSSSIDPRNLREYYLKAFEKPFTEGKAQSMMTSYNLINGTPATLYHGVNDIVRDEWGM 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD V + Y + A+SI AG+D + + L K A+ G++
Sbjct: 239 DGFVVSDAGDVMGIVKDHQYYDSHTPGVAESIRAGIDSITDDADLSKQAIRDALVQGMLQ 298
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E+ +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ IVLLKN
Sbjct: 299 ETDLDQALFHTFRVRFRLGEFDP-ATDNPYALIGEEALMTEQARELSLRAAREQIVLLKN 357
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT 419
LPL P +AVIG N + Y GT + TPL+ + A + T
Sbjct: 358 ERSLLPLDPAGCGKVAVIGALGN--EVYRDWYSGTFPYFVTPLEAIRAKLET 407
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG-GFDISFAKNDPKITSILWVGYP 519
E +DR L L QQ LI EV + P +++++G F I + + I +I++ +
Sbjct: 574 EENDRPGLELAASQQRLIEEVYRA--NPNTIVVLTGSYPFAIPWVQE--HIPAIVYTSHA 629
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQ G A+ADV FG Y P+GRL MTWY
Sbjct: 630 GQEHGTAVADVLFGDYAPAGRLNMTWY 656
>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 950
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 167/548 (30%), Positives = 268/548 (48%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY--- 221
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+ GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 222 GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLKVAA 216
+D RWGR +ET GEDP+L S+ ++KG Q G+P + +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 329
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------YIVS 255
+ L ++ R + + M+ Y+ Y+ K K +IVS
Sbjct: 330 SEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVS 389
Query: 256 DCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 390 DCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDN 449
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 450 VCRTMLGTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKENLL 508
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PLS T + +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 509 PLSKT-LCTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGCD 565
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D ++V+G + EA E++D L+LPG+QQ L
Sbjct: 566 FTNPDETNIPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 625
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 626 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 682
Query: 538 SGRLPMTW 545
+GRLPMT+
Sbjct: 683 AGRLPMTF 690
>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 954
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 164/551 (29%), Positives = 266/551 (48%), Gaps = 78/551 (14%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
SL + + +L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G+ GAT FPQ + A++N L + + V E A + WSP
Sbjct: 226 ---GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDETLAA-----GTMQAWSPV 270
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLK 213
+++ +D RWGR +ET GEDP+L S+ ++KG Q G+P +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKGLFTTPKHFGGHGAPLGGRDSHD 330
Query: 214 VAACCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------Y 252
+ + L ++ R + + M+ Y+ YL K + +
Sbjct: 331 IGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGF 390
Query: 253 IVSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
IVSDC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N
Sbjct: 391 IVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRINMEN 450
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
+D + R F+ P+K + ++ P + +++++A AAR+ IV+L+N
Sbjct: 451 LDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARESIVMLENKD 509
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQA 423
LPL+ ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+
Sbjct: 510 NILPLAKD-MRTIAVVGPGADDLQP--GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQ 566
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQ 474
GC + A KAA+ +D VLV+G + E+ E+HD L+LPG+Q
Sbjct: 567 GCDFTSSNGTDIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQ 626
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
Q L+ V K PVILI+ +G +++S A K + W+ PGQ GG A ADV FG
Sbjct: 627 QELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNWL--PGQEGGPATADVLFGD 683
Query: 535 YNPSGRLPMTW 545
YNP+GRLPMT+
Sbjct: 684 YNPAGRLPMTF 694
>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
Length = 955
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 166/549 (30%), Positives = 269/549 (48%), Gaps = 80/549 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 171 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY--- 226
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
G+ GAT FPQ + A++N L + + V+ E + N A WSP +++
Sbjct: 227 GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDET-VVANTKQA----WSPVLDV 274
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---------------GSPNRLKVA 215
+D RWGR +ET GEDP+L S+ ++KG Q+ G G + +
Sbjct: 275 AQDARWGRCEETFGEDPVLVSQIGGAWIKGY-QSRGLFTTPKHFGGHGAPLGGRDSHDIG 333
Query: 216 ACCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL--------------IKFK-----YIV 254
+ L ++ R + + M+ Y+ Y+ ++ + +IV
Sbjct: 334 LSEREMREVHLVPFRHVVRNYDCQSLMMAYSDYMGIPVAGSTELLQQILRQEWGFNGFIV 393
Query: 255 SDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAID 312
SDC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D
Sbjct: 394 SDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRINMVNLD 453
Query: 313 HAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
+ AT+ R F+ +P K + ++ P + ++++A AAR+ IV+L+N
Sbjct: 454 NVCRTMLATMFRNELFEKNPCKPLDWNKIYP-GWNSDRHREMARQAARESIVMLENKDNL 512
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGC 425
LPLS T +K +AV+GP A+ + G+Y + P + + L G+ A V +Y+ GC
Sbjct: 513 LPLSKT-LKTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLSGIKAAVGKQTKVLYEQGC 569
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQL 476
+ A KAA+ +D V+V+G + EA E++D L+LPG+QQ
Sbjct: 570 DFTTPDATNIPKAVKAASQSDVVVMVLGDCSTSEATNNVRKTCGENNDWATLILPGKQQE 629
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
L+ V K PV+LI+ +G +D+ A K + W+ PGQ GG A ADV FG YN
Sbjct: 630 LLEAVCATGK-PVVLILQAGRPYDLLKASEMCKAILVNWL--PGQEGGPATADVLFGDYN 686
Query: 537 PSGRLPMTW 545
P GRLPMT+
Sbjct: 687 PGGRLPMTF 695
>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
Length = 954
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 164/551 (29%), Positives = 266/551 (48%), Gaps = 78/551 (14%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
SL + + +L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G+ GAT FPQ + A++N L + + V E A + WSP
Sbjct: 226 ---GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDETLAA-----GTMQAWSPV 270
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD----------GGSP----NRLK 213
+++ +D RWGR +ET GEDP+L S+ ++KG Q G+P +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKGLFTTPKHFGGHGAPLGGRDSHD 330
Query: 214 VAACCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK---------------Y 252
+ + L ++ R + + M+ Y+ YL K + +
Sbjct: 331 IGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGF 390
Query: 253 IVSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
IVSDC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N
Sbjct: 391 IVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRINMEN 450
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
+D + R F+ P+K + ++ P + +++++A AAR+ IV+L+N
Sbjct: 451 LDEVCRTMLRMMFRNELFEKTPNKPLDWNKIYP-GWNSDSHKEMARQAARESIVMLENKD 509
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQA 423
LPL+ ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+
Sbjct: 510 NILPLAKD-MRTIAVVGPGADDLQP--GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQ 566
Query: 424 GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQ 474
GC + A KAA+ +D VLV+G + E+ E+HD L+LPG+Q
Sbjct: 567 GCDFTSSNGTNIPKAVKAASQSDVVVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQ 626
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
Q L+ V K PVILI+ +G +++S A K + W+ PGQ GG A ADV FG
Sbjct: 627 QELLEAVCATGK-PVILILQAGRPYNLSKASELCKAILVNWL--PGQEGGPATADVLFGD 683
Query: 535 YNPSGRLPMTW 545
YNP+GRLPMT+
Sbjct: 684 YNPAGRLPMTF 694
>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
DSM 14838]
Length = 808
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/386 (35%), Positives = 189/386 (48%), Gaps = 69/386 (17%)
Query: 72 TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
T++EKI+ L ++ +SRL IPKY +EALHGV V PG T FPQ I A
Sbjct: 1 TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGRF--------TVFPQAIGLA 50
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAG----------LTFWSPNINIFRDPRWGRGQE 181
A++N L + V+S EARA +N G LTFWSP +N+ RDPRWGR E
Sbjct: 51 ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMV 241
T GEDP L+ T +VKGLQ G LK+ + KH+ A + ++ + +
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQ---GDDDRYLKIVSTPKHFAANNEEHNRFVCNPQISEKQ 167
Query: 242 IYTYYLIKFK------------------------------------------YIVSDCDS 259
+ YYL F+ Y+VSDC
Sbjct: 168 LREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGG 227
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNN 318
+L N+ Y KT E AAA SI AGLDL CG + +A + +V ++ ID A
Sbjct: 228 PSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRV 287
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
M LG FD + PY ++ P + + +Q++AL+AAR+ IVLLKN LPL+
Sbjct: 288 LRARMELGLFDSG-EQNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKMLPLNARK 346
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPC 404
+K++AV+G NA ++ G+Y G P
Sbjct: 347 VKSIAVVGINAGSSE--FGDYSGLPV 370
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 69/110 (62%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KA + V V+G ++SIE E DR D+ LP QQ + E+ KV +++++++G
Sbjct: 548 EAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLVAG 606
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
I++ D I +I+ YPG++GG A+A+V FG YNP GRLP+T+Y
Sbjct: 607 SSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYY 654
>gi|307718113|ref|YP_003873645.1| glucan 1,4-beta-glucosidase [Spirochaeta thermophila DSM 6192]
gi|306531838|gb|ADN01372.1| glucan 1,4-beta-glucosidase [Spirochaeta thermophila DSM 6192]
Length = 952
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 210/430 (48%), Gaps = 66/430 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L ++ RV DL+ RLTL+EK++ + V RLGI Y EA HGV++VG
Sbjct: 22 FRDPDLPLEERVSDLISRLTLEEKLSLIPVRQAGVPRLGIDPYPIGGEAAHGVAWVGE-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--GLAGLTFWSPNINI 170
AT FPQ + + +++ L + +G+V+ EARA ++ GLT W+P +++
Sbjct: 80 --------ATVFPQPVGLSCTWDKDLLRRVGEVIGVEARAYRDILGTEFGLTLWAPTVDL 131
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP LAS+ A V+G+Q G P L++ A KH+ A + + +
Sbjct: 132 VRDPRWGRTEEAYGEDPCLASELAGSLVRGMQ---GDHPFYLRMGATLKHFFANNNETDR 188
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + +++ YYL F+
Sbjct: 189 GISSSEIHPWLMHNYYLEVFRRIIERARVTCIMTAYNAVNGTPCLLHPAVRELVKERWGL 248
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL---NCGSFLGKHTEAAVKAGLV 306
++V+D + Y + E+ A ++ AG+D + + + + A++ GL+
Sbjct: 249 PGFVVTDAADFGMTVGMHAYFEDHAESIAATLRAGVDALTEDDRTLVIESLRRALERGLL 308
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D A+ N F RLG FD P + PY + + +C ++ LA +AAR+ +VLLK
Sbjct: 309 EEKDLDAALRNIFRIRFRLGEFDP-PERNPYAGITREALCAPSHTRLAREAARKSVVLLK 367
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV---VATIYQA 423
N G LPLS ++++ +AV+GP A+ T Y GT TPLQ L V Y+
Sbjct: 368 NE-GILPLSASSLRRVAVVGPLADEVHT--DWYSGTLPYVVTPLQALKERLPGVEVWYEE 424
Query: 424 GCSNVQCGTA 433
G S V+ +A
Sbjct: 425 GSSRVRLRSA 434
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 61/123 (49%), Gaps = 7/123 (5%)
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAK 486
++ G QV + A +AD + MG + I E+ DR DL LP Q L+ +
Sbjct: 555 IRSGEEQV---RACAEAADVVLAFMGNNPFINGKETVDRPDLALPPLQHRLVRTAFEANP 611
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V+++I G + + D + ++L+ + GQ G A+ DV G +P+GRL +TWY
Sbjct: 612 RTVLVLI---GSYPFAVPWEDAHLPALLYSSHGGQEMGQALVDVLLGEVSPAGRLSLTWY 668
Query: 547 PQS 549
S
Sbjct: 669 RSS 671
>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
Length = 954
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 165/555 (29%), Positives = 266/555 (47%), Gaps = 81/555 (14%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
+L + + +L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 167 ALRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
GAT FPQ + A++N L + I V E A + WSP
Sbjct: 226 GS----------GATIFPQALAMGATWNKKLTEEIAMAVGDETLAA-----GTMQAWSPV 270
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---------------GSPNRL 212
+++ +D RWGR +ET GEDP+L S+ ++KG Q+ G G +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGY-QSKGLFTTPKHFGGHGAPLGGRDSH 329
Query: 213 KVAACCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL----IKFK--------------- 251
+ + L ++ R + + M+ Y+ +L K K
Sbjct: 330 DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPVAKSKELLHNILREEWGFDG 389
Query: 252 YIVSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
+IVSDC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N
Sbjct: 390 FIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRLNME 449
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+D+ + R F+ P+K + ++ P + ++++A AAR+ IV+L+N
Sbjct: 450 NLDNVCRTMLRMMFRNELFEKAPNKPLDWNKIYP-GWNSDNHKEMARQAARESIVMLENK 508
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQ 422
LPL I+++AV+GP A+ + G+Y + P + + L G+ V IY+
Sbjct: 509 ENILPLD-KGIRSIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKQAVGKQTKVIYE 565
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQ 473
GC + A KAA+ +D V+V+G + EA E+HD L+LPG+
Sbjct: 566 QGCDFTNLSETNIPKAVKAASQSDVVVMVLGDCSTSEATTDVYKTSGENHDYATLILPGK 625
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ L+ V K PVIL++ +G ++++ A K + W+ PGQ GG A ADV FG
Sbjct: 626 QQELLEAVCATGK-PVILVLQAGRPYNLTKASKLCKAIIVNWL--PGQEGGPATADVLFG 682
Query: 534 RYNPSGRLPMTWYPQ 548
YNP+GRLPMT +PQ
Sbjct: 683 DYNPAGRLPMT-FPQ 696
>gi|300785890|ref|YP_003766181.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|384149201|ref|YP_005532017.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|399537773|ref|YP_006550435.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|299795404|gb|ADJ45779.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|340527355|gb|AEK42560.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|398318543|gb|AFO77490.1| beta-glucosidase [Amycolatopsis mediterranei S699]
Length = 1218
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 168/599 (28%), Positives = 255/599 (42%), Gaps = 110/599 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFL-VDSAGSVSRLGIPKYEWWSEALHGVSYVGPG 111
+ +T R DLV R+TL EK+ L +SA ++ RLG+ +Y +WSE HG++ +G
Sbjct: 45 YRDTHYSFAERAADLVARMTLPEKVLQLRTNSAPAIPRLGVQQYTYWSEGQHGLNTLGAN 104
Query: 112 THFSNVVPG--ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG--------- 160
T+ V G ATSFP + + S++ L Q +S EAR M + L G
Sbjct: 105 TNDGTVTGGVHATSFPTNLASTMSWDPELIQQETTAISDEARGMLDKSLWGVAQNNIGPD 164
Query: 161 ------LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ-QTDGGSPNR-- 211
LT+W+P +N+ RDPRWGR E GEDP L +K A +V G Q QT G P
Sbjct: 165 KNNYGSLTYWAPTVNLDRDPRWGRTDEGFGEDPYLVAKMAGAFVNGYQGQTASGRPATPY 224
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVL--YNSQHY 269
LKVAA KHY +++N + D + YY +F+ ++ D ++ YN+ +
Sbjct: 225 LKVAATAKHYALNNVENDRHADSSDTTEANLRDYYTKQFRNLIQDAHVSGLMTSYNAING 284
Query: 270 TKTPEEAAAKSILA----GLD----LNCGSF-----LGKHTEA----------------- 299
T +P + + +A G D +CG+ G H A
Sbjct: 285 TPSPSDTYTANAIAQRTYGFDGYTTSDCGAVGDVYAPGSHNWAPPGWTTATSNGGTQWTN 344
Query: 300 ----------------AVKAGL-VNESAIDHAISN----NFATLMRLGFFDG-----HPS 333
A++AG +N + + ++N A ++ G D +
Sbjct: 345 TATGQQVAGAAGGQAYALRAGTQLNCTGTEATVANIQEAIKAGVLSEGVLDNALVHVFTT 404
Query: 334 KQPYGQLGPKD-----------VCTQANQDLALDAARQGIVLLKN------TAGSLPLSP 376
+ G+ P D + + +Q LA A +VLLKN A LP P
Sbjct: 405 RMQTGEFDPPDRVAYTKITKDVIQSPEHQALAAKVAAHSLVLLKNDPVPGTAAPLLPADP 464
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGC---------SN 427
+ + V+G A K +G Y G P +QG+ + V A +
Sbjct: 465 AKLGTVVVVGDLAG--KVTLGGYSGEPALQVNAVQGITSAVKAANPAATVTFDACGTSTA 522
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG 487
+ + A +AD V+ G D ++ E DR + +PG LI +V
Sbjct: 523 TTTAASCSAETLAALKTADLVVVFAGTDGNVATEGRDRTTIAMPGNYDSLIDQVKAAGNP 582
Query: 488 PVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
L + +GG + A P I++ GY G++ G A+ADV FG+ NPSG L TWY
Sbjct: 583 RTALAVQAGGAVSLGHAAGIP---GIVFSGYNGESQGTALADVLFGKQNPSGHLNFTWY 638
>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
Length = 852
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 136/396 (34%), Positives = 193/396 (48%), Gaps = 73/396 (18%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
R+ DL+ RLT++EKI+ LV+ A + RL I KY +EALHG+ V PG T
Sbjct: 40 RLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGI--VRPGEF--------T 89
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNV------GLAG----LTFWSPNINIFR 172
FPQ I AA++N L + +S EAR + +AG LTFWSP +N+ R
Sbjct: 90 VFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLTFWSPTVNMAR 149
Query: 173 DPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGT 232
DPRWGR ET GEDP L +VKGLQ G P LK + KH+ A + ++ + +
Sbjct: 150 DPRWGRTPETYGEDPFLTGVIGCEFVKGLQ---GDHPRYLKTVSTPKHFAANNEEHNRSS 206
Query: 233 DRYHFNAMVIYTYYLIKFK----------------------------------------- 251
+ + +YL F+
Sbjct: 207 CNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCTVNTYLIKNVLRGDWGFN 266
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA---GLVN 307
YIVSDC + + + Y + + AA +I AGLDL CG + +T +KA +V+
Sbjct: 267 GYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRV--YTAPLLKAYNESMVS 324
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
++ ID A M LG FD PS+ PY Q+ P + + +Q+LAL+ ARQ +VLLKN
Sbjct: 325 KADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQELALETARQSMVLLKN 383
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 403
LPL+ +K++AV+G NA + G+Y G P
Sbjct: 384 QKNFLPLNLKKVKSIAVVGINAGHCE--FGDYSGIP 417
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/121 (45%), Positives = 72/121 (59%), Gaps = 9/121 (7%)
Query: 432 TAQVD------DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
T QVD DA KAA D TV V+G ++SIE E DR L LP QQ I E+ KV
Sbjct: 585 TDQVDRLALFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVN 644
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
V++++ +G I++ D + +IL YPG+ GG AIA+V FG YNP GRLP+T+
Sbjct: 645 PNTVVVLV-AGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTY 701
Query: 546 Y 546
Y
Sbjct: 702 Y 702
>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 863
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 209/433 (48%), Gaps = 81/433 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ +T L + R DL++RLTL+EK+ + +++ ++ RLGI YEWW+EALHGV+ G
Sbjct: 28 YQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL-- 85
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA---MYNVG-----LAGLTFW 164
AT FPQ I AASFN L + VS EARA +N GLT W
Sbjct: 86 --------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLTMW 137
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+PN+NIFRDPRWGRGQET GEDP L+ + V+GLQ + ++L AC KH+ +
Sbjct: 138 TPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLH--ACAKHFAVH 195
Query: 225 DLDNWKGTDRYHFNAMVI-----YTYYLIKFK---------------------------- 251
W +R+ FNA I + YL FK
Sbjct: 196 SGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCGSNR 252
Query: 252 --------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFLGK 295
+V+DC ++ + + + P+ A+A ++L+G DL CG
Sbjct: 253 LLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGNFKS 312
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
T+ AVK GL++E I+ ++ LG + S P+ + + +++LAL
Sbjct: 313 ITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKELAL 368
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
A + +VLL+N LPL+ +AVIGPNAN + GNY G P T L+G+ A
Sbjct: 369 KMAHESLVLLQNNNNLLPLNRQM--KVAVIGPNANDSVMQWGNYNGFPSHTVTLLEGIRA 426
Query: 416 VV---ATIYQAGC 425
+ IY+ C
Sbjct: 427 KLPDAQIIYEPVC 439
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 13/113 (11%)
Query: 444 SADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPVILII 493
SAD + G +E ES DR ++ LP Q+ ++ + K K + +
Sbjct: 599 SADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFVN 657
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
SG I +IL YPGQAGG A+ADV FG YNP+GRLP+T+Y
Sbjct: 658 FSGSAMAI--VPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708
>gi|304406516|ref|ZP_07388172.1| Beta-glucosidase [Paenibacillus curdlanolyticus YK9]
gi|304344574|gb|EFM10412.1| Beta-glucosidase [Paenibacillus curdlanolyticus YK9]
Length = 939
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 135/411 (32%), Positives = 197/411 (47%), Gaps = 63/411 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N +L ++ RV DLV RLTL+EKI + ++ RLGI Y+ +E HG++++G
Sbjct: 9 FRNPALPLEERVSDLVSRLTLEEKIHNMPQYQSAIERLGIQAYKHGTEGAHGIAWLGE-- 66
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ I +++ L + IG V+ TEAR Y N + GLT W+P +++
Sbjct: 67 --------ATSFPQPIGLGCTWDQELMKEIGDVIGTEARVFYKRNPQINGLTLWAPTVDM 118
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L S+ A +G+Q G P LK A KH+ + + +
Sbjct: 119 ERDPRWGRTEEAYGEDPHLTSELAVALTQGMQ---GDHPKYLKAVATLKHFLGNNNEINR 175
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+ + + YY F+
Sbjct: 176 GSCSVSIDPRNMREYYFKAFEPVFVKGGAQSMMTAYNSVNGTPVLLHETVDSVVKEQWMD 235
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVNE 308
+IVSD + + N HY + +A A+SI AG+D + + A++ GL+ E
Sbjct: 236 GFIVSDAGDMIGIVNDHHYYDSLAQAVAESIKAGIDSITDEAKAVTDAIRDALEQGLLGE 295
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
+ +D A+SN F MRLG FD P + PY + + +Q L+L AAR+ IVLLKN
Sbjct: 296 ADLDKALSNTFRVRMRLGEFD--PQELVPYAAIDDSSLMKPEHQALSLKAARKNIVLLKN 353
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA 418
G+LPL + +AVIGP + Y GT TPLQ + +A
Sbjct: 354 D-GALPLDAEKLSKIAVIGPLGDTVYR--DWYSGTFPYKITPLQAIQQKLA 401
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 61/110 (55%), Gaps = 4/110 (3%)
Query: 438 AKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
A +AA ++ ++ +G + + E +DR L L Q+ L+ V + V+++
Sbjct: 546 AVEAAKQSEVAIVFVGNNPLVNGKEENDRPGLELAASQERLVKAVFEANPNTVVVVT--- 602
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
GG+ + + I ++++ + GQ GAA+ADV FG YNP+GRL MTWY
Sbjct: 603 GGYPFTLNWIEANIPAVIYSSHSGQEHGAALADVLFGDYNPAGRLNMTWY 652
>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
Length = 862
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 139/410 (33%), Positives = 188/410 (45%), Gaps = 79/410 (19%)
Query: 77 ITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNA 136
++ ++D + +V RLGI Y WW+EALHGV+ G AT FPQ I AASF+
Sbjct: 46 VSLMMDGSKAVERLGIKPYNWWNEALHGVARAGL----------ATVFPQPIGMAASFSP 95
Query: 137 SLFQAIGKVVSTEARAMYNV--------GLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 188
+ VS EARA GLT W+P +N++RDPRWGRG ET GEDP
Sbjct: 96 QTVYGVFDAVSDEARAKNTYYASQGSYERYQGLTMWTPTVNVYRDPRWGRGIETYGEDPY 155
Query: 189 LASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVI-----Y 243
L S+ VKGLQ + G ++L AC KH+ + W +R+ FN I Y
Sbjct: 156 LTSRMGVEVVKGLQGGNDGKYDKLH--ACAKHFAVHSGPEW---NRHEFNVENIKPRDLY 210
Query: 244 TYYLIKFKYIV------------------------------------------SDCDSVD 261
YL F+ +V SDC ++
Sbjct: 211 ETYLPPFEALVKEAKVKEVMCAYNRFEGDPCCGSDRLLMQILRNEWGFDGIVLSDCGAIA 270
Query: 262 VLYNSQHYTKTP--EEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNF 319
YN + + P E A+A ++L+G DL CGS E AVK G ++E +D A+
Sbjct: 271 DFYNDRGHKTHPDAESASAAAVLSGTDLECGSSYKALIE-AVKLGKIDEKDVDVAVKRLL 329
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
A LG D P K + ++ V ++ + LALD AR+ + LL N LPL +
Sbjct: 330 AARFALGEMD-EPDKVSWTKIPFSVVASEKHDSLALDIARKSMTLLMNKNDILPLKRGGL 388
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGC 425
+AV+GPNAN + GNY G P T L G L A IY+ GC
Sbjct: 389 -TIAVMGPNANDSVMQWGNYNGMPSHTVTVLDGVRKALDANDKLIYEQGC 437
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 66/133 (49%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAAS---ADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +K+ AD + V G S+E E DR D+ LP
Sbjct: 578 NFDLGFKKEVDIRKSVEKVKGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAV 637
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + K VIL+ SG I K +IL YPGQAGG A+A+V FG
Sbjct: 638 QRELIAALHHSGK-KVILVNCSGS--PIGLEPETKKCEAILQAWYPGQAGGTAVAEVLFG 694
Query: 534 RYNPSGRLPMTWY 546
+YNP+GRLP+T+Y
Sbjct: 695 KYNPAGRLPVTFY 707
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 142/442 (32%), Positives = 205/442 (46%), Gaps = 64/442 (14%)
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
SPNINI RDPRWGR QE P EDPLL ++ Y GLQQ G KV KH+ AY
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQ--GEDSRYTKVVVTLKHWDAY 68
Query: 225 DLDNWKGTDRYHFNAMV-----IYTYY--------------------------------- 246
L++ G R++F+A V + TY+
Sbjct: 69 SLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCTHPLL 128
Query: 247 ------LIKFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG-LDLNCGSFLGKHTE 298
+ KF Y+ SD +++ +Y HYT A A ++ G D++ G+
Sbjct: 129 TKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYHDALL 188
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
AV +G + +D A+ N LG FD QPY ++ + T QDL +
Sbjct: 189 DAVNSGECSMDDVDRALYNTLKLRFELGLFDP-IEDQPYWRINASSINTTYAQDLNMKIT 247
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK------YTTPLQG 412
+ ++LL+N +LP + +AVIGP+ N + ++GNY G C T+PL
Sbjct: 248 LESMILLQNHNNALPFKKG--RKVAVIGPHINAQEALVGNYLGQLCPDDSFDCITSPLAA 305
Query: 413 LAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
+ A+ T+ G + C A + +A A AD VL++G + +IEAES+DR +
Sbjct: 306 IEAINGMSNTVSAMGSGVLACTDASIQEAVNVAKDADYVVLLIGINDTIEAESNDRTSID 365
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LP Q L +A + K ++I +GG I K ++ +I+ GYPG GGAAIA
Sbjct: 366 LPQCQHKLTAAIAHLNKTTAAVLI-NGGMLAIEQEKK--QLPAIIEAGYPGFYGGAAIAK 422
Query: 530 VCFGRYNP-SGRLPMTWYPQSY 550
FG N G+LP T YP Y
Sbjct: 423 TIFGDNNHLGGKLPYTVYPADY 444
>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
17393]
gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 862
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 149/440 (33%), Positives = 202/440 (45%), Gaps = 79/440 (17%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
+ + NTSL + R LVK LTL+EK+ ++D + V RL I Y WW+EALHGV+ G
Sbjct: 20 VSYKNTSLSPEERADLLVKELTLEEKVQLMMDGSRPVERLDIKPYNWWNEALHGVARAGL 79
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLT 162
AT FPQ I AASF+ + VS EARA GLT
Sbjct: 80 ----------ATVFPQPIGMAASFSPETVYDVFTAVSDEARAKNTYYSSKDSRERYQGLT 129
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYT 222
W+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ G N K+ AC KH+
Sbjct: 130 MWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADG--NYDKLHACAKHFA 187
Query: 223 AYDLDNWKGTDRYHFNAMVI-----YTYYLIKFK-------------------------- 251
+ W +R+ FNA I Y YL F+
Sbjct: 188 VHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCGS 244
Query: 252 ----------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCGSFL 293
+VSDC ++ YN + + P+ A+A ++++G DL CGS
Sbjct: 245 NRLLMQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGSSY 304
Query: 294 GKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
E +VK GL++E ++ ++ LG D K + ++ V + A+ L
Sbjct: 305 KALVE-SVKKGLISEEKVNTSVRRLLKARFELGEMD-DLEKVSWAKIPFSVVASAAHDSL 362
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG- 412
AL AR+ + LL N LPL + +AV+GPNAN + GNY G P T L G
Sbjct: 363 ALKIARKSMTLLMNKDNFLPLRRGGL-TVAVMGPNANDSVMQWGNYNGMPPHTVTILDGV 421
Query: 413 ---LAAVVATIYQAGCSNVQ 429
L A IY+ GC V+
Sbjct: 422 RNLLGADDKLIYEQGCPWVE 441
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 427 NVQCGTAQVDDAKKAAA---SADATVLVMGADQSIEAESH----------DRLDLLLPGQ 473
N G + D +K+ AD + G S+E E DR D+ LP
Sbjct: 578 NFDLGFKREVDIQKSVEQVKEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAV 637
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
Q+ LI + + K +++++ G I K +IL YPGQ GG A+A+V FG
Sbjct: 638 QRELIDALYRAGKK---IVLVNCSGSPIGLEPETKKCEAILQAWYPGQQGGTAVAEVLFG 694
Query: 534 RYNPSGRLPMTWY 546
YNP+GRLP+T+Y
Sbjct: 695 DYNPAGRLPVTFY 707
>gi|392305174|emb|CCI71537.1| beta-glucosidase [Paenibacillus polymyxa M1]
Length = 973
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 135/447 (30%), Positives = 211/447 (47%), Gaps = 65/447 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +D RV DLV RLTL+EKI ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 12 FQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE-- 69
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ + A +++ L + IG V++ EAR Y + L GLT W+P +++
Sbjct: 70 --------ATSFPQPVGLACTWDTELMRQIGSVIADEARVYYRRDPKLNGLTLWAPTVDM 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + + VKG+Q G P LK A KH+ + + +
Sbjct: 122 ERDPRWGRNEEAYGEDPYLTGELSKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 179 GVGSSSIDPRNLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHGVNDIVRGEWGM 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD V + N Y + A+SI AG+D + + L K A++ GL+
Sbjct: 239 DGFVVSDAGDVMGIVNDHQYYDSHTPGVAESIRAGIDSITDDAELSKQAIREALEQGLLQ 298
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ IVLLKN
Sbjct: 299 EEDLDRALFHTFRVRFRLGEFDP-AADNPYASIGEEALMTEQARELSLRAAREQIVLLKN 357
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSN 427
LPL+P +AVIG + N + Y GT TPL+ + A + ++G +
Sbjct: 358 EQELLPLNPAQCGKVAVIGSHGN--EVFHDWYSGTLPYRVTPLEAIRAKL----ESGSAE 411
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGA 454
+ + D A+ D L + +
Sbjct: 412 ERVVYSDAKDRVTLTAATDGAKLTVNS 438
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG-GFDISFAKNDPKITSILWVGYP 519
E +DR L L QQ +I EV +V P +++++G F I + + I +I++ +
Sbjct: 581 EENDRPGLELAASQQRIIEEVYRV--NPNTIVVLTGSYPFAIPWLQE--HIPAIVYTSHA 636
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
GQ G AIADV FG Y P+GRL MTW
Sbjct: 637 GQEYGTAIADVLFGDYAPAGRLNMTW 662
>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 727
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 164/569 (28%), Positives = 255/569 (44%), Gaps = 91/569 (15%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S + F NT L + R+ +L+ +TL EK+ L + G V RLGI + SE LHG++
Sbjct: 21 SQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALSTNLG-VPRLGI-RNTGHSEGLHGMA 78
Query: 107 YVGPGTH------FSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR---AMYNVG 157
GPG + P T FPQ +++ L Q + + +TE R N+
Sbjct: 79 LGGPGNWGGSERGVAKTYP-TTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQ 137
Query: 158 LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAAC 217
G+ +PN ++ RDPRWGR +E+ GED L S+ +VKGLQ G P K A+
Sbjct: 138 KGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQ---GNDPKYWKSASL 194
Query: 218 CKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKF--------------------------- 250
KH+ A ++ + + +F+ + YY F
Sbjct: 195 MKHFLANSNEDGRDSTSSNFDERLFREYYSFPFYKGITEGGSRAFMASYNAWNGVPMTVN 254
Query: 251 --------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK- 295
I +D ++ +L N+ H T E AA + A + G FL
Sbjct: 255 PILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAAVVKA----SVGQFLDNF 310
Query: 296 --HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD- 352
+ A+K GL+ E ID+ I NF ++LG D SK PY +G D + N+
Sbjct: 311 RSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIGVTDTVSPWNKQD 370
Query: 353 ---LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP 409
+ +VLLKNTAG LPL+ + IK++AVIGP AN + ++ Y GTP +
Sbjct: 371 TKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EVLLDWYSGTPPYAVSI 428
Query: 410 LQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------Q 456
LQG+ V + + ++D A AA AD ++ +G
Sbjct: 429 LQGIKNAVGKDIEV----FYAPSDEMDKATLAARKADVAIVCVGNHPYGTDARWKISPVP 484
Query: 457 SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
S E+ DR + L +Q+ L+ V + A +++++S F I++++ + + +IL V
Sbjct: 485 SDGREAVDRKSITL--EQEDLVKLVMQ-ANPKTVMVLVSNFPFAINWSQEN--VPAILHV 539
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
Q G +ADV FG +P+GR TW
Sbjct: 540 TNNSQELGNGLADVIFGDVSPAGRTTQTW 568
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 166/566 (29%), Positives = 253/566 (44%), Gaps = 100/566 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N +L D R+ DL+ R+TLQEKI L D G V RLGIP E LHG + GP
Sbjct: 26 FQNPALSPDQRIDDLLSRMTLQEKIQALGDDPG-VPRLGIPG-ALTEEGLHGAAIGGPA- 82
Query: 113 HFSN----VVPGATSFPQVILTAASFNASLFQAIGKVVSTEAR-AMYNVGLAGLTFWSPN 167
H+ VVP T FPQ +++ +L Q V + E R A+ GL +PN
Sbjct: 83 HWEGRGRAVVP-TTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAPN 141
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
N+ RDPRWGR +E+ GEDP L A ++KGLQ G +P + AA KH+ AY +
Sbjct: 142 ANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQ---GNNPRYWETAALMKHFDAYSNE 198
Query: 228 NWKGTDRYHFNAMVIYTYYLIKFKY----------------------------------- 252
+ +F + Y YY + F+
Sbjct: 199 ANRDGSSSNFGKRLFYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMTANPVLKSVVMKK 258
Query: 253 ------IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH---TEAAVKA 303
I +D ++ + HY KT EAAA ++ AG++ FL ++ E A++
Sbjct: 259 WGFNGIICTDAGALSNMVTHFHYYKTMPEAAAGAVHAGIN----QFLDRYQQPVEEALQQ 314
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG----------PKDVCTQANQDL 353
L+ E ID + + ++RLG D S PY +G P D + +
Sbjct: 315 KLLTEQQIDQDLKGVYRVVLRLGLMDPS-SMSPYSMIGLTNDNPAKGDPWDWPSH----I 369
Query: 354 ALD--AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
ALD + IVLLKN +LPL + ++AVIGP AN+ + Y GTP TP++
Sbjct: 370 ALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDWYSGTPPFGVTPVE 427
Query: 412 GLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA----------- 460
G+ V + ++ + + A A +D ++++G + +A
Sbjct: 428 GIRQRVGPDVKVTFND----GSNLQAAAALAKQSDEAIVIIGNHPTCDAGWGKCALPSEG 483
Query: 461 -ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E+ DR L LP + I + A ++++ + + + + I +IL + +
Sbjct: 484 KEAFDRTALNLPDES---IAKAVYAANPHTVVVLQTSFPYTTDWTQ--AHIPAILEMAHN 538
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
+ G A+ADV FG Y+P+GRL TW
Sbjct: 539 SEEQGTALADVLFGDYDPAGRLAQTW 564
>gi|29829120|ref|NP_823754.1| sugar hydrolase [Streptomyces avermitilis MA-4680]
gi|29606226|dbj|BAC70289.1| putative sugar hydrolase [Streptomyces avermitilis MA-4680]
Length = 954
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 140/408 (34%), Positives = 191/408 (46%), Gaps = 69/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L RV DL+ RLTL E+I FL +V RLGI + EALHGV+++GP
Sbjct: 11 FRDPRLPFAKRVDDLLARLTLDERIAFLHQFTPAVERLGIAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + IG+ VS E RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNDDLVRRIGEAVSAETRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGRG+E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRGEEGYSEDPRLTSAIATAYTRGLR---GDHPTYWRTAPVLKHWLAHNNETNRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 178 TTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLREHLRTWTDQE 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGL 305
SD + L +S+HY T EEA A ++LAG+D S + + A+ GL
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAAAVLAGVDSFTDHGTDSSKIVERVRGALVQGL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
++E+ ID A+ A RLG FD P PY G D T A++ LA ++A Q +VLL
Sbjct: 298 LSEADIDEAVRRQLAIRFRLGEFD--PRLDPYA--GTLDFDTAAHRALARESAEQAVVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN G LPL P +A +G A+ K + Y GT +TPL+GL
Sbjct: 354 KNE-GLLPLGPEV--RIAAVGLLADECK--LDWYSGTLLHRSTPLEGL 396
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP Q+ L+ ++++ + +A + + + LW + G
Sbjct: 593 ETEDRTTLRLPAHQERLLRAARAANPATALVLVSA-----YPYAVDHADLPAALWTAHGG 647
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 648 QAAGTALARVLSGDVSPAGRLPQTWY 673
>gi|94495010|ref|ZP_01301591.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
gi|94425276|gb|EAT10296.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
Length = 872
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 137/399 (34%), Positives = 198/399 (49%), Gaps = 84/399 (21%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
+V++LTLQEK L+++A ++ RLGIP Y WW+E+LHG +GP VP T+FP+
Sbjct: 38 IVRQLTLQEKTEQLLNTAPAIPRLGIPAYNWWTESLHGA--IGP-------VP-TTNFPE 87
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
I AA+F+A L + V+STE +A++ +G GL WSPNINIFRDPRWG
Sbjct: 88 PIGLAATFDAPLIHTVASVISTEVQALHTLGRQTGHLGRIGTGLDTWSPNINIFRDPRWG 147
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-------LDNWK 230
RGQET GEDP L ++ +++G+Q G +P+ V A KH+ + DN
Sbjct: 148 RGQETYGEDPYLTARMGVAFIQGMQ---GDNPDLPDVVATPKHFAVHSGPEPSRHTDNIF 204
Query: 231 GTDR-------YHFNAMVI-------------------------YTYYLIK---FK-YIV 254
T R F A ++ T YL F+ Y+V
Sbjct: 205 ATKRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGEPACGSAMLLTDYLRNAWGFRGYVV 264
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF-------LGKHTEAAVKAGLVN 307
SDCD+V +Y Y +P + ++ G+D C + LG + A+ A ++
Sbjct: 265 SDCDAVVDIYAHHKYAPSPATGVSVALRRGVDSECNNATLSGQAGLGDRYKDALAANHIS 324
Query: 308 ESAIDHAISNNFATLMRLGFFDG----HPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
ID A+ F+ +R G G P+ P +G T A+ LALDAA + +V
Sbjct: 325 IGDIDSALVRLFSARLRNGDLPGLSARKPNSTPVSAIG-----TPAHGALALDAAVKSLV 379
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
LLKN G+LPL P + + GP A+ T+ + GNY T
Sbjct: 380 LLKNE-GALPLKPGV--RIVLAGPLADATRVLRGNYSST 415
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 63/120 (52%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
D AD V VMG +E E DR L LP Q + E AK
Sbjct: 593 DLAAGVRDADVIVAVMGLTSDLEGEEMPVKVEGFEGGDRTTLALPADQ-IAFLEKAKATG 651
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P+IL++M+G D+ +AK++ +IL YPGQ+GG AIA+V G+ +P GRLP+T+Y
Sbjct: 652 KPLILVMMNGSAIDLGWAKDN--AAAILEAWYPGQSGGLAIANVLSGKADPGGRLPLTFY 709
>gi|21224034|ref|NP_629813.1| sugar hydrolase [Streptomyces coelicolor A3(2)]
gi|7801257|emb|CAB91121.1| putative sugar hydrolase [Streptomyces coelicolor A3(2)]
Length = 960
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 165/517 (31%), Positives = 233/517 (45%), Gaps = 83/517 (16%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S A+ F + L RV DL+ RLTL EK FL A +V RLGI + EALHGV+
Sbjct: 2 SAATPPFRDPQLPFAKRVDDLMSRLTLDEKTAFLHQFAPAVERLGIAPFRTGQEALHGVA 61
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWS 165
++GP AT FPQ + A++N L + +G+ VS EARAM GL WS
Sbjct: 62 WMGP----------ATVFPQAVGLGATWNEELVRRVGEAVSKEARAMRARDDRVGLNVWS 111
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
P +N+ R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++
Sbjct: 112 PTVNLLRHPLWGRNEEGYSEDPRLTSAVATAYTRGLR---GDHPAYWRTAPVLKHWLAHN 168
Query: 226 LDNWKGTDRYHFNAMVIYTYYLIKFKYIV------------------------------- 254
+ + T V++ Y L F+ V
Sbjct: 169 NETDRDTSSSSVRPRVLHEYDLRAFRDTVEAGAVAGVMPAYNLVNGRPNHLSPHLGRHLR 228
Query: 255 ----------SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGK---HTEA 299
SD + L +S+HY T EEA A ++ AG+D + G+ K
Sbjct: 229 TWTQEDLLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSKIVARVRG 288
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK--DVCTQANQDLALDA 357
A++AGL+ E+ +D A+ + RLG FD P PY D T ++ LA DA
Sbjct: 289 ALEAGLLTEADVDAAVRRQLSVRFRLGEFD--PENDPYSGAADTGADFDTPEHRALARDA 346
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A Q +VLLKN G LPL+P +AV+G A+ K + Y GT +TPL+GL
Sbjct: 347 AEQAVVLLKND-GVLPLAPE--TRVAVVGLLADECK--LDWYSGTLIHRSTPLEGLYERF 401
Query: 418 A---TIYQAGCSNVQCGTA-----QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
+ G V+ TA V A A+A A T + D ++ A D L
Sbjct: 402 GADRVSFAEGVDRVRLRTADGRFLHVLPADDASAEAPGTEGAL--DPALLAGRTD----L 455
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
P + TE+A + G +L + + G +S A++
Sbjct: 456 PPLTTDAVGTELALIDWGEGVLTLRASDGRYLSVAED 492
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP QQ+ L+ A+ A +L ++S +A + ++LW + G
Sbjct: 598 ETEDRTTLRLPAQQERLL-RAARAANPATVLALVSA----YPYAVGVETLPAVLWTAHGG 652
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A + G +P+GRLP TWY
Sbjct: 653 QAAGTALARILAGDVSPAGRLPQTWY 678
>gi|290957046|ref|YP_003488228.1| glycosyl hydrolase [Streptomyces scabiei 87.22]
gi|260646572|emb|CBG69669.1| putative glycosyl hydrolase [Streptomyces scabiei 87.22]
Length = 950
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 136/408 (33%), Positives = 190/408 (46%), Gaps = 68/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + R+ DL++RLT E+I FL A +V RLG+ + EALHGV+++GP
Sbjct: 11 FRDPQLPVAKRIDDLLQRLTPDERIAFLHQFAPAVDRLGVAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS E RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNEELVRRVGEAVSGEVRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPRLTSAIATAYTRGLR---GDHPTYWRTAPVLKHWLAHNNETDRS 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V+ Y L F+ V
Sbjct: 178 TTSSSVRPRVLREYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPLLAEQLRTWTDEP 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGL 305
SD + L +S+HY T EEA A S+ AG+D GS + A+ L
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAASLRAGVDSFTDHGTDGSTITGRVRDALTRSL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ ID A+ + RLG FD P PY G +D T A++ LA +AA Q +VLL
Sbjct: 298 LTMDDIDEAVRRQLSVRFRLGEFD--PGSDPYA--GVEDFDTPAHRALAREAAEQAVVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
+N G LPL+ A +AV+GP A+ K + Y GT +TPL+GL
Sbjct: 354 RNEGGLLPLA--ADTRVAVVGPLADECK--LDWYSGTLLHRSTPLEGL 397
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 449 VLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
++V G D I E+ DR L LP Q+ L+ A+ A L+++S +A +
Sbjct: 575 LVVAGNDPHINGRETEDRTTLALPAHQERLL-RAARTANPNTALVLVSA----YPYAVDV 629
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ ++LW + GQA G A+A V G +P+GRLP TWY
Sbjct: 630 TGLPAVLWTAHGGQAAGTALARVVAGDVSPAGRLPQTWY 668
>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 954
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 164/549 (29%), Positives = 264/549 (48%), Gaps = 80/549 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL +D RV L+ +T +K+ + + G + L +P EA+HG SY
Sbjct: 170 YMDASLPVDERVESLLAAMTPADKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
GAT FPQ + A++N L + + + E + N A WSP +++
Sbjct: 228 ---------GATIFPQALAMGATWNRQLTEEVAMAIGDET-VIANTKQA----WSPVLDV 273
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---------------GSPNRLKVA 215
+D RWGR +ET GEDP+L S+ ++KG Q+ G G + +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGY-QSKGLFTTPKHFGGHGAPLGGRDSHDIG 332
Query: 216 ACCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL--------------IKFK-----YIV 254
+ L ++ R + + M+ Y+ Y+ ++ + +IV
Sbjct: 333 LSEREMREVHLVPFRHVIRNYDCQSLMMAYSDYMGIPIAKSTELLQRILRQEWGFNGFIV 392
Query: 255 SDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAID 312
SDC ++ L +HYT K EAA +++ AG+ NCG ++ K A K G +N +D
Sbjct: 393 SDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMENLD 452
Query: 313 HAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGS 371
+ AT+ R F+ +P K + ++ P + +++ +A AA + IV+L+N
Sbjct: 453 NVCRTMLATMFRNELFEKNPCKPLDWNKIYP-GWNSDSHKAMAHRAACESIVMLENKDNL 511
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGC 425
LPLS ++ +AV+GP A+ + G+Y + P + + L G+ A V+ +Y+ GC
Sbjct: 512 LPLS-KELRTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLTGIKAAVSKQTKVLYEKGC 568
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQL 476
+ G + A K A+ AD V+V+G EA E++D L+LPG+QQ
Sbjct: 569 DFTETGMTDIPKAVKTASQADVVVMVLGDCSISEATKDVRKTCGENNDLATLVLPGKQQE 628
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
L+ V K PVILI+ +G +D+ A K + W+ PGQ GG A ADV FG YN
Sbjct: 629 LLEAVCATGK-PVILILQAGRPYDLLKASEMCKAILVNWL--PGQEGGPATADVLFGDYN 685
Query: 537 PSGRLPMTW 545
P GRLPMT+
Sbjct: 686 PGGRLPMTF 694
>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
Length = 312
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 147/292 (50%), Gaps = 54/292 (18%)
Query: 44 SNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALH 103
+N L + C+TS R L+ TL+EKI ++A V RLG+P Y+WW EALH
Sbjct: 24 ANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALH 83
Query: 104 GVSYVGPGTHF--SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
GV+ PG F S ATSFPQ IL A+F+ +L + +VSTEARA N +G+
Sbjct: 84 GVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGI 142
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
FW+PNIN F+DPRWGRGQETPGEDP Y + GLQ G P ++ A CKH+
Sbjct: 143 DFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQ--GGLDPEYKRIVATCKHF 200
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK-------------------------- 251
AYDL+NW+G RY F+A+V + +Y F+
Sbjct: 201 AAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCAN 260
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YI SDCD++ +Y +YT T E A ++ AG
Sbjct: 261 SYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAG 312
>gi|253574725|ref|ZP_04852065.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845771|gb|EES73779.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 953
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 138/406 (33%), Positives = 190/406 (46%), Gaps = 63/406 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L + RV DLV R TL EKI + V RLG+ KY+ +E HGV+++G
Sbjct: 10 FQNPDLPLAERVDDLVSRFTLDEKIELMCQFQTEVPRLGVKKYKHGTEGAHGVAWLGE-- 67
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ A ++N L + IG V+ EAR + N + GLT W+P +++
Sbjct: 68 --------ATSFPQNTGLACTWNPELMRQIGSVIGDEARVYFQRNPEVNGLTIWAPTVDM 119
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L T VKGLQ G P K A KH+ A + + +
Sbjct: 120 ERDPRWGRTEEAYGEDPHLTGALTTELVKGLQ---GDHPFYYKAVATLKHFYANNNEIGR 176
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G++ + YYL F+
Sbjct: 177 GSESVSIDPRNKREYYLKAFERAFREGRAGSMMTAYNGINGTPCNLNSEVNEIVKGEWEM 236
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVN 307
++V D V Y ++ EA A SI AG+D + + A++ GL+
Sbjct: 237 DGFVVGDAGDVMGTVLDHKYVESYAEAVAGSIKAGIDSMTDEQPIMFHALRNALEQGLLA 296
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ES +D A+ N F RLG FD + PY ++ + +C + LAL A+R+ IVLLKN
Sbjct: 297 ESDLDRALRNAFRVRFRLGEFDPE-ERNPYSRVPEEKLCAPEHAALALKASREAIVLLKN 355
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
G LPL P + ++AV+GP AN T Y GTP TPLQG+
Sbjct: 356 -EGLLPL-PQTLGSVAVVGPLANEAYTDW--YSGTPPYRITPLQGI 397
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 69/120 (57%), Gaps = 4/120 (3%)
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAK 486
V+ + ++ A +AA SA+ V+ +G + I A E+ DR+D++LP QQ LI V V
Sbjct: 549 VEIVESGIEQAVQAAKSAETAVVFVGNNPFINAKETIDRVDIVLPPAQQALIQAVRAVNP 608
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
V++I+ G + + + + SIL+ + Q G A+ DV FG YNP+GRL MTWY
Sbjct: 609 NTVVVIV---GSYPFAVNWENAHVPSILFTSHSAQELGHAVTDVLFGDYNPAGRLNMTWY 665
>gi|114568800|ref|YP_755480.1| glycoside hydrolase family protein [Maricaulis maris MCS10]
gi|114339262|gb|ABI64542.1| glycoside hydrolase, family 3 domain protein [Maricaulis maris
MCS10]
Length = 750
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 171/549 (31%), Positives = 258/549 (46%), Gaps = 103/549 (18%)
Query: 58 LGIDLR---VGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF 114
LG D+R VG ++ ++ ++ + L A SRLGIP + +HG F
Sbjct: 57 LGDDIRAGQVGSIINQVD-RDTVLELQRIAREESRLGIPLL-VGRDVIHG---------F 105
Query: 115 SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDP 174
VVP P I AAS+N L +A ++ S EA VG+ TF +P I++ RDP
Sbjct: 106 KTVVP----LP--IGQAASWNPQLVEACARLASEEA---STVGV-NWTF-APMIDVCRDP 154
Query: 175 RWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD- 233
RWGR E GEDP+L S V+G Q P+ L AAC KH+ Y G D
Sbjct: 155 RWGRIAECLGEDPVLTSVLGAAMVRGFQGASLDDPSSL--AACAKHFAGYGASE-SGRDY 211
Query: 234 -----------RYHF----------------------------NAMVIYTYYLIKFKY-- 252
HF N+ ++ +++Y
Sbjct: 212 NTTNLPENELRNVHFPPFRAAVEAGVASLMTSFSDIDGVPATANSFLLRDVLREEWRYDG 271
Query: 253 -IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN--CGSFLGKHTEAAVKAGLVNES 309
+VSD D++ L T+T +EAA ++ AG+D++ G++L +H V +G +
Sbjct: 272 LVVSDWDAIQQLC-VHGLTETRDEAAFQAASAGVDMDMVAGAYL-QHLAGLVASGRIELE 329
Query: 310 AIDHAISNNFATLMRLGFFDGHP--SKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
+D ++N RLG FD P + +P ++ LA +AA Q VLLKN
Sbjct: 330 TVDRMVANVLRLKFRLGLFDSRPVLADEP---------ARMTSRSLAKEAALQSCVLLKN 380
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGN--YEGTPCKYTTPLQGLAAVVATIYQAGC 425
+LPL P + +LAVIGP AN +G ++G P + TPL + ++ A AG
Sbjct: 381 EGRALPLDPACLDHLAVIGPLANEPAEQLGTWVFDGDPERSVTPLAAIESLAA---DAGM 437
Query: 426 S----NVQCGTAQVD-----DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
S T +D +A+ A +AD V+ +G + + E+H R D+ LPG Q
Sbjct: 438 SVSHARAMPTTRSLDETAFAEAEAIARNADVVVVFLGEEAILSGEAHCRADIDLPGAQVS 497
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
L+ + V K PVI +I +G ++ +D + +IL+ +PG GGAAIAD+ FGR
Sbjct: 498 LVKRLKAVGK-PVIAVIQAGRPLTLTSVIDD--LDAILFAWHPGSLGGAAIADLLFGRAC 554
Query: 537 PSGRLPMTW 545
PSG+LP+++
Sbjct: 555 PSGKLPVSF 563
>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
Length = 855
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 143/418 (34%), Positives = 203/418 (48%), Gaps = 82/418 (19%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
I RV DL+ R+T++EK++ ++ +A + RL I KY +EALHG+ V PG
Sbjct: 39 IHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYYHGNEALHGI--VRPGKF------ 90
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN-VGLAG---------LTFWSPNIN 169
T FPQ I AAS+N L I +S EAR +N +GL L+FWSP +N
Sbjct: 91 --TVFPQAIGMAASWNPELIYKISTAISDEARGKWNALGLGKKQLDGSSDLLSFWSPTVN 148
Query: 170 IFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
+ RDPRWGR ET GEDP L +VKGLQ G P LK A KH+ A N
Sbjct: 149 MARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQ---GNHPKYLKAVATPKHFAA----NN 201
Query: 230 KGTDRYHFNAMV----IYTYYLIKFK---------------------------------- 251
+ +R H NA++ + YYL F+
Sbjct: 202 EEHNRAHCNAVISERDLREYYLPSFEKCIVEGKAQSIMTAYNAVNGIPCTVNTYLIKKVL 261
Query: 252 --------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA 303
Y+V+DC + + Y K E AA AG D+ C + +T+ + A
Sbjct: 262 REDWGFQGYVVTDCSAPAWMVTQHKYVKDYETAAVLMAKAGSDMECADNV--YTQPLLNA 319
Query: 304 GL---VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
V+++ ID + M LG FD P K PY ++ P+ V + +Q+LAL+ ARQ
Sbjct: 320 YYNYRVSDADIDSIAYHLLRGRMLLGLFD-DPEKNPYNKISPEKVGCKEHQELALETARQ 378
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT-TPLQGLAAVV 417
+VLLKN LP++P IK++AV+G NA+ + G+Y GTP + T L+G+ +V
Sbjct: 379 SLVLLKNENNFLPINPKKIKSIAVVGINAD--RCEFGDYSGTPVNESVTVLEGIKRLV 434
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
+A KAA D TV V+G D+SIE E DR L LP QQ I E+ K+ V++++
Sbjct: 598 EAGKAAKECDITVAVLGIDKSIEREGQDRYTLELPADQQEFIREIYKINPKTVVVLV--- 654
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G I+ D I +I+ YPG+ GG A+A+ FG+YNP GRLP+T+Y
Sbjct: 655 AGSSIAINWIDENIPAIIDAWYPGEQGGTAVAEALFGKYNPGGRLPLTFY 704
>gi|386724230|ref|YP_006190556.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
gi|384091355|gb|AFH62791.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
Length = 973
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 201/422 (47%), Gaps = 62/422 (14%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
++ RV DL+ +LTL EKI + ++ RLGI Y+ +EA HG++++G
Sbjct: 16 LEDRVNDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE--------- 66
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINIFRDPRWG 177
ATS+PQ I A ++++ L + IG + EAR Y N GLT W+P +++ RDPRWG
Sbjct: 67 -ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRWG 125
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
R +E GEDP+LA K A +G+Q G P LK A KH+ + + +G
Sbjct: 126 RTEEAYGEDPVLAGKLAAALTQGIQ---GDHPFYLKAVATLKHFIGNNNEAGRGDTSVSL 182
Query: 238 NAMVIYTYYL----IKFK-------------------------------------YIVSD 256
+ + YYL I FK ++VSD
Sbjct: 183 DPRNLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLSELVIDIVKGEWGMNGFVVSD 242
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSIL-AGLD--LNCGSFLGKHTEAAVKAGLVNESAIDH 313
V HY +T +EA A+SI G+D + + + + A+K GL+ E+ +D
Sbjct: 243 AFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDGLLTENDLDT 302
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N F RLG FD P + PY + + + LA +A+++ +VLLKN +LP
Sbjct: 303 ALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVLLKNDGKTLP 361
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
L + +AVIGP A+V Y G+ TPLQG+ +A + ++ GT
Sbjct: 362 LQADKLSKVAVIGPLADVVYQ--DWYSGSLPYAVTPLQGIRERLAAEGKDAVTSYAAGTD 419
Query: 434 QV 435
++
Sbjct: 420 RM 421
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 73/136 (53%), Gaps = 10/136 (7%)
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPG 472
++VV T QA ++C ++ +D A +A ADA ++ +G I E+ DR D+ L
Sbjct: 554 SSVVGT--QADTFEIECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAE 611
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN--DPKITSILWVGYPGQAGGAAIADV 530
Q+ L +A + K P +I++ G FA N K +I++ + GQ G A+A V
Sbjct: 612 SQEKLA--LAVIEKNPNTIIVVVGS---YPFALNALQEKAKAIIYTSHAGQELGRAVAAV 666
Query: 531 CFGRYNPSGRLPMTWY 546
FG NP+GRL MTWY
Sbjct: 667 LFGDVNPAGRLNMTWY 682
>gi|310644496|ref|YP_003949255.1| glycoside hydrolase family protein [Paenibacillus polymyxa SC2]
gi|309249447|gb|ADO59014.1| Glycoside hydrolase family 3 domain protein [Paenibacillus polymyxa
SC2]
Length = 976
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 196/408 (48%), Gaps = 61/408 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +D RV DLV RLTL+EKI ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 15 FQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE-- 72
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ + A +++ L + IG V++ EAR Y + L GLT W+P +++
Sbjct: 73 --------ATSFPQPVGLACTWDTELMRQIGSVIADEARVYYRRDPKLNGLTLWAPTVDM 124
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + + VKG+Q G P LK A KH+ + + +
Sbjct: 125 ERDPRWGRNEEAYGEDPYLTGELSKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 181
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 182 GVGSSSIDPRNLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHGVNDIVRGEWGM 241
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD V + N Y + A+SI AG+D + + L K A++ G +
Sbjct: 242 DGFVVSDAGDVMGIVNDHQYYDSHTPGVAESIRAGIDSITDDAELSKQAIREALEQGSLQ 301
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ IVLLKN
Sbjct: 302 EEDLDRALFHTFRVRFRLGEFDP-AADNPYASIGEEALMTEQARELSLRAAREQIVLLKN 360
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
LPL+P +AVIG + N + Y GT TPL+ + A
Sbjct: 361 EQEFLPLNPAQCGKVAVIGSHGN--EVFRDWYSGTLPYRVTPLEAIRA 406
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG-GFDISFAKNDPKITSILWVGYP 519
E +DR L L QQ LI EV +V P +++++G F I + + I +I++ +
Sbjct: 584 EENDRPGLELAASQQRLIEEVYRV--NPNTIVVLTGSYPFAIPWLQE--HIPAIIYTSHA 639
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
GQ G AIADV FG Y P+GRL MTW
Sbjct: 640 GQEHGTAIADVLFGDYAPAGRLNMTW 665
>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
43183]
gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 944
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 163/550 (29%), Positives = 245/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEIQRFFVED----TRLGIP-VDFTNEGIRGVESY------------KATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+GLQ +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGLQHNH-------QVAATAKHFAAYS--NNKGAREGMARVDPQMPPREVENI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HIYPFKRVIREAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRKEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y + K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 350 YTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G +V +AN+ +AL A+R+ IVLLKNT +LPL+
Sbjct: 409 LRVKFLIGLFDAPYQTDLAG--ADDEVEKEANEAVALQASRESIVLLKNTDNTLPLNIDK 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---- 430
IK +AV GPNA+ + +Y + TT L+G+ +Y GC V
Sbjct: 467 IKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDLVDAHWPE 526
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A A AD V+V+G Q E+ R L LPG Q L+
Sbjct: 527 SEIMEYPLTPDEQAEIDRAVANARQADVAVVVLGGGQRTCGENKSRTSLELPGHQLKLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PVILI+++G +++A D + +IL YPG GG +AD+ FG YNP G
Sbjct: 587 AVQATGK-PVILILINGRPLSVNWA--DKFVPAILEAWYPGSKGGTVVADILFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 644 KLTVT-FPKT 652
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 160/548 (29%), Positives = 248/548 (45%), Gaps = 107/548 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV +Y+ AT+FP +
Sbjct: 106 LNEVQRFFVEE----TRLGIPA-DFTNEGIRGVENYI------------ATNFPTQLALG 148
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N L + IG + EAR L G T ++P +++ RD RWGR +E GE P L
Sbjct: 149 HTWNRELIRQIGYITGREAR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 202
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD--------------NWKGTDRYH 236
++ KGLQ TD ++VA+ KH+ AY + +W+ + H
Sbjct: 203 AELGIAMGKGLQ-TD------MQVASTAKHFIAYSNNKGAREGFARVDPQMSWREVENIH 255
Query: 237 -------------FNAMVIY----------TYYLIKFK---------YIVSDCDSVDVLY 264
M Y +YY + + Y+VSD D+V+ LY
Sbjct: 256 AYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDSDAVEYLY 315
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAID-HAISNNFATLM 323
+ K +EA +S+ AGL++ C +F + L+ E + I N ++
Sbjct: 316 SKHKTAKDMKEAVRQSVEAGLNVRC-TFRSPESYVLPLRELIQEGGLSMETIDNRVRDIL 374
Query: 324 RLGFFDGHPSKQPYGQ---LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
R+ F G PY L K+V ++A+Q +AL A+R+G+VLLKN LPL + IK
Sbjct: 375 RVKFLTGL-FDTPYQTDLALADKEVNSEAHQQVALQASREGLVLLKNANNLLPLDKSQIK 433
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCG----- 431
+AV GPNA+ + +Y + TT L+G+ V Y GC V
Sbjct: 434 RIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVDANWPESE 493
Query: 432 ----------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
++ A +D V+V+G E+ R L LPG QQ L+ +
Sbjct: 494 IISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQQQLLEAI 553
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K PV+L++++G I++A D + +IL YPG GG AIA+ FG YNP G+L
Sbjct: 554 VATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSQGGTAIAEALFGDYNPGGKL 610
Query: 542 PMTWYPQS 549
+T +P++
Sbjct: 611 TVT-FPKT 617
>gi|63099951|gb|AAY32974.1| glycosyl hydrolase [Sorangium cellulosum]
Length = 1070
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 153/502 (30%), Positives = 229/502 (45%), Gaps = 59/502 (11%)
Query: 18 LFLTYCSLQHSSSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKI 77
L T+ + ++SA A + + L F + L I+LRV DL+ RLTL EK+
Sbjct: 76 LMSTWSRFRRIAASAPLVLALALHPSGSSASDMLPFQDPGLSIELRVRDLLGRLTLDEKL 135
Query: 78 TFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG-ATSFPQVILTAASFNA 136
+ L ++ RLGIP ++ +EALHGV++ + VV T FPQ I A ++N
Sbjct: 136 SLLHQFQPAIPRLGIPDFKAGTEALHGVAWSTDRDNGGAVVTATGTVFPQAIGLATTWNP 195
Query: 137 SLFQAIGKVVSTEARAMYNVG--LAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
L + +G+ V E R + + + GL W+P +N+ RDPRWGR +E EDPLL+ A
Sbjct: 196 DLVRQVGEAVGDEVRGYHALAPRIWGLQVWAPVVNLLRDPRWGRNEEGYSEDPLLSGVIA 255
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK--- 251
Y +GL+ G P LK A KHY A + + + T + V + Y + FK
Sbjct: 256 AAYGRGLE---GDDPLYLKTAPVIKHYLANNNEIHRDTTSSNLRPRVKHEYDELAFKMPI 312
Query: 252 --------------------------------------YIVSDCDSVDVLYNSQHYTKTP 273
Y VSD + L SQ Y T
Sbjct: 313 AADAVTGVMTSYNLVNGRPATVNPDVGDVVRSWTEKTLYNVSDAWAPYNLTGSQRYFATN 372
Query: 274 EEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFF 328
EEA A ++LAG+D N + + +A+ GL+ E ID ++ + + +RLG F
Sbjct: 373 EEAFAATLLAGVDSFTVDNNDSAPTIEILRSALAQGLLTEEDIDASVEHVLSVRLRLGDF 432
Query: 329 DGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPN 388
D P PY +GP+ + + A++ LA AA + +VLL+N LPL P+A + +AV+GP
Sbjct: 433 D--PDGGPYAGIGPEVIDSPAHRQLARRAAGEAMVLLENRRRLLPLDPSATRRIAVVGPL 490
Query: 389 ANVTKTMIGNYEGTPCKYTTPLQGLAAVV--ATIYQA-GCSNVQCGTAQVDDAKKAAASA 445
++ T Y G TPL G+ + AT+ + G + A A
Sbjct: 491 SDTLYT--DWYSGALPYRVTPLDGIRERLSGATVLSSEGVDRIVLRDVASGRYVTAGADE 548
Query: 446 DATVLVMGADQSIEAESHDRLD 467
D VL + A + E D D
Sbjct: 549 DGDVLRVSAVSAGPTEEFDVFD 570
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
VD+A + A ADA V+V+G+ I E HDR + L Q L+ V +A P +++
Sbjct: 672 VDEAVRVATGADAAVVVVGSMPFINGREDHDRTTMALAEGQSALVRAV--LAANPRTILV 729
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ + ++ + +ILW + GQ G AI+DV FG +NP+GRL TWY
Sbjct: 730 VETS-YPMTMPWEKLHVPAILWTTHAGQETGHAISDVLFGDHNPAGRLTQTWY 781
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 172/614 (28%), Positives = 262/614 (42%), Gaps = 147/614 (23%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEW----WS-------EALHGVS 106
+D RV DL+K++TL EK +V G V + +P EW W E L+G
Sbjct: 60 VDARVEDLLKQMTLDEKTCQMVTLYGYKRVLKDALPTPEWKQMLWKDGIGAIDEHLNGFQ 119
Query: 107 YVG-PGTHFSNVVPG--------------------------------------ATSFPQV 127
G P + NV P AT+FP
Sbjct: 120 QWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNFPTQ 179
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 187
+ ++N +L +G + EAR L ++P +++ RD RWGR +E GE P
Sbjct: 180 LGLGHTWNRALIHKVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESP 234
Query: 188 LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG-----------TDRYH 236
L ++ V+GLQQ VAA KH+ AY N KG T +
Sbjct: 235 YLVAELGIEMVRGLQQ---------HVAATGKHFAAYS--NNKGAREGMARVDPQTSPHE 283
Query: 237 FNAMVIY----------------------------TYYLIKFK---------YIVSDCDS 259
+ IY +YY + + Y+VSD D+
Sbjct: 284 VENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRGYVVSDSDA 343
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHA 314
V+ LY K +EA +S+ AGL++ C SF+ E VK G ++E ++
Sbjct: 344 VEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLDEETVNDR 402
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPL 374
+ + +G FD G K+V + N+ +AL A+R+ +VLLKN +LPL
Sbjct: 403 VRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEENEAVALQASRESVVLLKNENSTLPL 460
Query: 375 SPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQC 430
+ +K +AV GPNA+ + +Y + TT L+G+ V +Y GC V
Sbjct: 461 NINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCDLVDA 520
Query: 431 G---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQ 475
A+++ A + A AD V+V+G Q E+ R L LPG+Q
Sbjct: 521 NWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLPGRQL 580
Query: 476 LLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRY 535
L+ V K PV+LI+++G +++A D + +IL YPG GG A+AD+ FG Y
Sbjct: 581 QLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADILFGDY 637
Query: 536 NPSGRLPMTWYPQS 549
NP G+L +T +P++
Sbjct: 638 NPGGKLTVT-FPKT 650
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 157/533 (29%), Positives = 241/533 (45%), Gaps = 102/533 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
SRLGIP + E HG ++ GAT FP + +++N LF+++ + V+
Sbjct: 103 SRLGIPIL-FGEECSHG-----------HMAIGATVFPVPLTIGSTWNPELFRSMCRAVA 150
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G +SP +++ RDPRWGR +ET GEDP L +++A V+GLQ
Sbjct: 151 AETRSQ-----GGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGDRLD 205
Query: 208 SPNRLKVAACCKHYTAYDL-DNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
+ + L A KH+ Y + + H ++ L+ F+
Sbjct: 206 AEDSL--LATLKHFAGYGASEGGRNGAPVHMGLRELHEIDLLPFRKAVEAGAQSVMTAYN 263
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++++DC ++D+L + + + EEAAA+++ AG+
Sbjct: 264 EIDGVPCTSSRYLLHDVLREAWGFDGFVITDCGAIDMLKSGHNTAASGEEAAAQALTAGV 323
Query: 286 DLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
D+ GS + A++ G + E ++ A+ A RLG FD P P + K
Sbjct: 324 DMEMSGSMFRVYLRQALEQGHITEDDLNTAVGRVLAMKFRLGLFD-RPYTDP--ERAEKV 380
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT-- 402
+ + + +LA A +GIVLLKN LPL+P K +AVIGPNAN +G+Y
Sbjct: 381 IGCEEHIELARRVAAEGIVLLKNEGNVLPLNPKTGK-IAVIGPNANAPYNQLGDYTSPQP 439
Query: 403 PCKYTTPLQGLA------AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG--- 453
P + T L+G+ A +Y GC + A AA AD V+ +G
Sbjct: 440 PGQIITVLEGIRRHIGEDADTRVLYAPGCRIQGDSREGLSHALACAAEADVIVMAIGGSS 499
Query: 454 ---------------------ADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
A +E E DR L L G Q L+ E+ K+ K PV++
Sbjct: 500 ARDFGEGTIDLRTGASVVTGLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVV 558
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++G I+ D I +IL YPGQ GG+AIAD+ FG NPSGRL +T
Sbjct: 559 VYING--RPITEPWIDEHIPAILEAWYPGQEGGSAIADILFGDVNPSGRLTLT 609
>gi|83698596|emb|CAI43942.1| putative sugar hydrolase [Sorangium cellulosum]
Length = 963
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 148/469 (31%), Positives = 218/469 (46%), Gaps = 59/469 (12%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L F + L I+LRV DL+ RLTL EK++ L ++ RLGIP ++ +EALHGV++
Sbjct: 2 LPFQDPGLSIELRVRDLLGRLTLDEKLSLLHQFQPAIPRLGIPDFKAGTEALHGVAWSTD 61
Query: 111 GTHFSNVVPG-ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--LAGLTFWSPN 167
+ VV T FPQ I A ++N L + +G+ V E R + + + GL W+P
Sbjct: 62 RDNGGAVVTATGTVFPQAIGLATTWNPDLVRQVGEAVGDEVRGYHALAPRIWGLQVWAPV 121
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD 227
+N+ RDPRWGR +E EDPLL+ A Y +GL+ G P LK A KHY A + +
Sbjct: 122 VNLLRDPRWGRNEEGYSEDPLLSGVIAAAYGRGLE---GDDPLYLKTAPVIKHYLANNNE 178
Query: 228 NWKGTDRYHFNAMVIYTYYLIKFK------------------------------------ 251
+ T + V + Y + FK
Sbjct: 179 IHRDTTSSNLRPRVKHEYDELAFKMPIAADAVTGVMTSYNLVNGRPATVNPDVGDVVRSW 238
Query: 252 -----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAV 301
Y VSD + L SQ Y T EEA A ++LAG+D N + + +A+
Sbjct: 239 TEKTLYNVSDAWAPYNLTGSQRYFATNEEAFAATLLAGVDSFTVDNNDSAPTIEILRSAL 298
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
GL+ E ID ++ + + +RLG FD P PY +GP+ + + A++ LA AA +
Sbjct: 299 AQGLLTEEDIDASVEHVLSVRLRLGDFD--PDGGPYAGIGPEVIDSPAHRQLARRAAGEA 356
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV--AT 419
+VLL+N LPL P+A + +AV+GP ++ T Y G TPL G+ + AT
Sbjct: 357 MVLLENRRRLLPLDPSATRRIAVVGPLSDTLYT--DWYSGALPYRVTPLDGIRERLSGAT 414
Query: 420 IYQA-GCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
+ + G + A A D VL + A + E D D
Sbjct: 415 VLSSEGVDRIVLRDVASGRYVTAGADEDGDVLRVSAVSAGPTEEFDVFD 463
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
VD+A + A ADA V+V+G+ I E HDR + L Q L+ V +A P +++
Sbjct: 565 VDEAVRVATGADAAVVVVGSMPFINGREDHDRTTMALAEGQSALVRAV--LAANPRTILV 622
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ + ++ + +ILW + GQ G AI+DV FG +NP+GRL TWY
Sbjct: 623 VETS-YPMTMPWEKLHVPAILWTTHAGQETGHAISDVLFGDHNPAGRLTQTWY 674
>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 950
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 161/548 (29%), Positives = 266/548 (48%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYV-----KGLQQTDG---------GSPNRLKVAA 216
+D RWGR +ET GEDP+L S+ ++ +GL T G + +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 329
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL--------------IKFK-----YIVS 255
+ L ++ R + + M+ Y+ Y+ ++ + +IVS
Sbjct: 330 SEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVS 389
Query: 256 DCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT + EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 390 DCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDN 449
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 450 VCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKENLL 508
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PL+ ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 509 PLTKN-LRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGCD 565
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D V+V+G + EA E++D L+LPG+QQ L
Sbjct: 566 FTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 625
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 626 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 682
Query: 538 SGRLPMTW 545
GRLPMT+
Sbjct: 683 GGRLPMTF 690
>gi|456388529|gb|EMF53969.1| glycosyl hydrolase [Streptomyces bottropensis ATCC 25435]
Length = 950
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 135/408 (33%), Positives = 191/408 (46%), Gaps = 68/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + R+ DL++RLT E+I FL A +V RLG+ + EALHGV+++GP
Sbjct: 11 FRDPQLPLAKRIDDLLQRLTPDERIAFLHQFAPAVDRLGVAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VSTE RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNEDLVRRVGEAVSTEVRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPRLTSAIATAYTRGLR---GDHPTYWRTAPALKHWLAHNNETDRS 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 178 TTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPHLAEHLRTWTDEP 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGL 305
SD + L +S+HY T EEA A S+ AG+D GS + A+ L
Sbjct: 238 LLVCSDAGAPTNLVDSEHYFDTHEEATAASLRAGVDSFTDHGTDGSTMTDRVRGALARSL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ + ID A+ + RLG FD P + PY + KD T A++ LA +AA +VLL
Sbjct: 298 LTPADIDEAVRRQLSVRFRLGEFD--PGRDPYADV--KDFDTPAHRALAQEAAEGAVVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN LPL+ A +AV+G A+ K + Y GT +TPL+GL
Sbjct: 354 KNEGDLLPLA--ADTRIAVVGLLADECK--LDWYSGTLLHRSTPLEGL 397
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 6/108 (5%)
Query: 440 KAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
+ AA AD ++V G D I E+ DR L LP Q+ L+ A+ A L+++S
Sbjct: 566 RVAAEADVVLVVAGNDPHINGRETEDRTTLALPAHQERLL-RAARAANPNTALVLVSA-- 622
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+A + + +++W + GQA G A+A V G +P+GRLP TWY
Sbjct: 623 --YPYAVDPAGLPAVMWTAHGGQAAGTALARVLAGDVSPAGRLPQTWY 668
>gi|15614471|ref|NP_242774.1| glucan 1,4-beta-glucosidase [Bacillus halodurans C-125]
gi|10174526|dbj|BAB05627.1| glucan 1,4-beta-glucosidase [Bacillus halodurans C-125]
Length = 926
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 201/422 (47%), Gaps = 68/422 (16%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
+D RV LV LTL EKI + +V RLGI KY+ +E HGV+++G
Sbjct: 17 LDERVESLVSELTLDEKIRLMCQYQEAVPRLGIGKYKHGTEGAHGVAWLGK--------- 67
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINIFRDPRWG 177
AT FPQ I ++N L + IG V++ EAR Y N GLT W+P +++ RDPRWG
Sbjct: 68 -ATVFPQNIGLGCTWNEELMKKIGDVIAEEARVYYQENPEQNGLTIWAPTVDMERDPRWG 126
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
R +E GEDP L K T +KG+Q G P LK AA KH+ A + + +
Sbjct: 127 RTEEAYGEDPYLTGKLTTELIKGMQ---GDHPFYLKTAATLKHFLANNNEEARTECSVSI 183
Query: 238 NAMVIYTYYLIKFK-----------------------------------------YIVSD 256
+ + YYL F+ ++VSD
Sbjct: 184 DPRNLREYYLKAFEPQVKEGHVQCVMTAYNAVNGTLCNMNPDVNAILKEEWGLQGFVVSD 243
Query: 257 -CDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVNESAIDH 313
CD V + + Q + ++ EA A SI G+D + + + A++ GL++E +D
Sbjct: 244 ACDVVGSVDDHQ-FVESYAEAVALSIKNGIDNVTDDEKIIHRAIGDALEQGLLSEEDLDQ 302
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ F +RLG FD +K PY + + +C + +LAL AA++ IVLLKN LP
Sbjct: 303 ALKRVFRVRIRLGEFD-QENKNPYATVAKEKLCAPEHSELALQAAKEAIVLLKND-DLLP 360
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL--AAVVATIYQAGCSNVQCG 431
LSP K +AVIGP AN T Y GTP +PL+G+ A +Y G ++
Sbjct: 361 LSPE--KKVAVIGPLANEIYT--DWYSGTPPYTISPLEGIMQKANEPVLYADGSDRIRLR 416
Query: 432 TA 433
A
Sbjct: 417 EA 418
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 64/113 (56%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+D A +AA AD V+ +G I E DR DL LP QQ LI EV KV + V++I+
Sbjct: 534 IDQAVEAAKQADTAVVFVGNQPYINGKECVDRPDLALPPAQQRLIEEVVKVNRNTVVVIV 593
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G + + + +I++ + GQ G A++DV FG YNP+GRL MTWY
Sbjct: 594 ---GSYPYTIPWVKENVPAIVYTAHGGQEFGRAVSDVLFGDYNPAGRLNMTWY 643
>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
Length = 954
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 161/548 (29%), Positives = 266/548 (48%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYV-----KGLQQTDG---------GSPNRLKVAA 216
+D RWGR +ET GEDP+L S+ ++ +GL T G + +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 333
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL--------------IKFK-----YIVS 255
+ L ++ R + + M+ Y+ Y+ ++ + +IVS
Sbjct: 334 SEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVS 393
Query: 256 DCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT + EAA +++ AG+ NCG ++ K A K G +N +D+
Sbjct: 394 DCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDN 453
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 454 VCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKENLL 512
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PL+ ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 513 PLTKN-LRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGCD 569
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D V+V+G + EA E++D L+LPG+QQ L
Sbjct: 570 FTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 629
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 630 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 686
Query: 538 SGRLPMTW 545
GRLPMT+
Sbjct: 687 GGRLPMTF 694
>gi|337748463|ref|YP_004642625.1| beta-glucosidase [Paenibacillus mucilaginosus KNP414]
gi|336299652|gb|AEI42755.1| Beta-glucosidase [Paenibacillus mucilaginosus KNP414]
Length = 973
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 201/422 (47%), Gaps = 62/422 (14%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
++ RV DL+ +LTL EKI + ++ RLGI Y+ +EA HG++++G
Sbjct: 16 LEDRVIDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE--------- 66
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINIFRDPRWG 177
ATS+PQ I A ++++ L + IG + EAR Y N GLT W+P +++ RDPRWG
Sbjct: 67 -ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRWG 125
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
R +E GEDP+LA K A +G+Q G P LK A KH+ + + +G
Sbjct: 126 RTEEAYGEDPVLAGKLAAALTQGIQ---GDHPFYLKAVATLKHFIGNNNEAGRGDTSVSL 182
Query: 238 NAMVIYTYYL----IKFK-------------------------------------YIVSD 256
+ + YYL I FK ++VSD
Sbjct: 183 DPRNLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLSELVIDIVKGEWGMNGFVVSD 242
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSIL-AGLD--LNCGSFLGKHTEAAVKAGLVNESAIDH 313
V HY +T +EA A+SI G+D + + + + A+K GL+ E+ +D
Sbjct: 243 AFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDGLLTENDLDT 302
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
A+ N F RLG FD P + PY + + + LA +A+++ +VLLKN +LP
Sbjct: 303 ALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVLLKNDEKTLP 361
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTA 433
L + +AVIGP A+V Y G+ TPLQG+ +A + ++ GT
Sbjct: 362 LQADKLSKVAVIGPLADVVYQ--DWYSGSLPYAVTPLQGIRERLAAEGKDAVTSYAAGTD 419
Query: 434 QV 435
++
Sbjct: 420 RM 421
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 73/136 (53%), Gaps = 10/136 (7%)
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPG 472
++VV T QA ++C ++ +D A +A ADA ++ +G I E+ DR D+ L
Sbjct: 554 SSVVGT--QADTFEIECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAE 611
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN--DPKITSILWVGYPGQAGGAAIADV 530
Q+ L +A + K P +I++ G FA N K +I++ + GQ G A+A V
Sbjct: 612 SQEKLA--LAVIEKNPNTIIVVVGS---YPFALNALQEKAKAIIYTSHAGQELGRAVAAV 666
Query: 531 CFGRYNPSGRLPMTWY 546
FG NP+GRL MTWY
Sbjct: 667 LFGDVNPAGRLNMTWY 682
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 163/565 (28%), Positives = 250/565 (44%), Gaps = 121/565 (21%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK--YEWWSEALHGVSYVGPGTHFSNV 117
+DL+ ++VK + +K FLV++ +RL IP +E L G S
Sbjct: 58 LDLKPKEVVKLVNKVQK--FLVEN----TRLKIPAIIHEECLSGLMGYS----------- 100
Query: 118 VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRW 176
+T+FPQ I A+++N L + + ++ R L G+ SP +++ RDPRW
Sbjct: 101 ---STAFPQAIGLASTWNPELLTNVASTIRSQGR------LIGVNQCLSPVLDVCRDPRW 151
Query: 177 GRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRY 235
GR +ET GEDP L + Y+ GLQ ++ A KH+ A+ + + +
Sbjct: 152 GRCEETYGEDPYLVASMGLAYITGLQ-------GETQLVATAKHFAAHGFPEGGRNIAQV 204
Query: 236 HFNAMVIYTYYLIKFK------------------------------------------YI 253
H + +L F+ +
Sbjct: 205 HVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIV 264
Query: 254 VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--LGKHTEAAVKAGLVNESAI 311
VSD D + L EAA ++ +G+D+ + G+ A+K GLV+E+ I
Sbjct: 265 VSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTAIKEGLVSEAII 324
Query: 312 DHAISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTA 369
D A+ RLG D P+ P+ + + +++LAL AAR+ IVLLKN
Sbjct: 325 DRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNEN 379
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---T 419
LPLS I +AVIGPNAN + M+G+Y T + T LQG+A V
Sbjct: 380 NMLPLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKV 438
Query: 420 IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEA 460
+Y GC +A + A AD + VMG Q++
Sbjct: 439 LYAKGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTG 498
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E +DR L L G Q+ L+ E+ K K P+IL++++G +S N + +I+ +PG
Sbjct: 499 EGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPG 555
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTW 545
+ GG AIAD+ FG YNPSGRLP+T+
Sbjct: 556 EEGGNAIADIIFGDYNPSGRLPITF 580
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 153/536 (28%), Positives = 232/536 (43%), Gaps = 110/536 (20%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E +HG+++ AT P I +++NA L G +
Sbjct: 150 TRLGIP-VDFSNETIHGLNHTK-----------ATPLPAPIGIGSTWNAPLVYKAGSIAG 197
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EA+A L ++P +++ RDPRWGR E GEDP L + T VKG+Q+
Sbjct: 198 KEAKA-----LGYTNIYAPILDLARDPRWGRVLECYGEDPFLVATLGTQMVKGIQEQG-- 250
Query: 208 SPNRLKVAACCKHYTAYDL----------------------------------------- 226
VAA KH+ Y +
Sbjct: 251 ------VAATLKHFAVYSVPKGGRDGSVRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVM 304
Query: 227 ---DNWKG---TDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
++W G T Y+F ++ + Y+VSD D+V+ +YN H +T EEA
Sbjct: 305 SSYNDWDGVPVTASYYFLTQLLRQEFGFD-GYVVSDSDAVEYVYNKHHVAETYEEAVRMV 363
Query: 281 ILAGLDLNCGSFLGKH-----TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
+ AGL++ +F VK G ++ ID +++ RLG FD Q
Sbjct: 364 LEAGLNVRT-TFAAPDIFILPARKLVKEGRLSMKVIDERVADVLRVKFRLGLFD-----Q 417
Query: 336 PY---GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVT 392
P+ + K V N+D LD RQ +VLLKN LPL + + + GP A
Sbjct: 418 PFVADPKAADKIVGADKNKDFVLDIQRQSLVLLKNENNLLPLDKNKLSRILITGPLAKEE 477
Query: 393 KTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCG---------------TA 433
M+ Y + T +G L VA Y GC
Sbjct: 478 NYMVSRYGPQELENITVYEGIKNYLGNKVAVDYALGCKVKDAKWPESEIIHSPLTTEEQQ 537
Query: 434 QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
++ +A + A +D + V+G D+ ES R L LPG+QQ L+ + K PV+L++
Sbjct: 538 EIQNAVEKAKLSDIVIAVLGEDEESTGESKSRSGLDLPGRQQQLLEALYATGK-PVVLVL 596
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++G I++A D I +IL +PGQ GG AIA+ FG YNP G+LP+T +P++
Sbjct: 597 INGQPLTINWA--DRYIPAILEAWFPGQMGGTAIAETLFGDYNPGGKLPVT-FPKT 649
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 158/526 (30%), Positives = 243/526 (46%), Gaps = 92/526 (17%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + GV + AT FP +++N L IG+V +
Sbjct: 126 TRLGIP-VDFTNEGIRGVCH-----------EKATFFPAQCGQGSTWNKELIARIGEVEA 173
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ---- 203
EA V L +SP ++I +DPRWGR E GEDP L + ++ LQ+
Sbjct: 174 KEA-----VALGYTNIYSPILDIAQDPRWGRAVECYGEDPYLVGQLGKQMIQSLQKHKLV 228
Query: 204 ---------------TDGGS-------PNRLKVAACCKHYTAY-------------DLDN 228
DGG+ P ++ A+ D D
Sbjct: 229 ATPKHFAVYSIPVGGRDGGTRTDPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDG 288
Query: 229 WKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL- 287
T Y F ++ + K Y+VSD D+V+ + + EEA +S+ AGL++
Sbjct: 289 EPITGSYRFLTQILRQEWGFK-GYVVSDSDAVEFISSKHKVADNNEEAVVQSVNAGLNVR 347
Query: 288 -NCGSFLG--KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY---GQLG 341
N S G K +A+ G V+++ ID +S LG FD PY G+L
Sbjct: 348 TNFSSPAGFIKPLRSAIAKGKVSQATIDQRVSEILYVKFWLGLFD-----NPYRGDGKLA 402
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
K V + +Q +AL+AARQ IVLLKN LPL T +K++AVIGPNA+ K +I Y
Sbjct: 403 DKIVHCKEHQAVALEAARQSIVLLKNQDNLLPLQKT-LKSVAVIGPNADEQKELICRYGP 461
Query: 402 TPCKYTTPLQGLAAVV---ATIYQAGCSNVQCGTAQ---------------VDDAKKAAA 443
+ T +G+ + +Y+ GC V + +D+A +AA
Sbjct: 462 SNAPIKTVYKGIKEALPGAKVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAK 521
Query: 444 SADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF 503
SA+ ++V+G + E R L LPG+Q+ L+ V K+ K P IL+++ G I++
Sbjct: 522 SAEVVIMVLGGSEVTVREERSRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINY 580
Query: 504 AKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
AK + +IL +PG+ G A+A+ FG NP G+L +T +P+S
Sbjct: 581 AKK--YVPAILHAWFPGEFCGQAVAETIFGDNNPGGKLAVT-FPKS 623
>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
sp. S17]
Length = 886
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 194/397 (48%), Gaps = 86/397 (21%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T EK+ L+++A ++ RLG+P Y WW+E+LHG +GP VP T+FP+
Sbjct: 37 LVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGA--LGP-------VP-TTNFPE 86
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
+ AA+F+ +L + +S E R ++ + GL WSPNINIFRDPRWG
Sbjct: 87 PVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGTGLDSWSPNINIFRDPRWG 146
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RGQET GEDP L ++ +V+G+Q TD PNR+ V A KHY + + + R+
Sbjct: 147 RGQETYGEDPFLTARMGVAFVRGMQGTD---PNRIDVVATPKHYAVH---SGPESTRHEA 200
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
N V + YL F+
Sbjct: 201 NVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASDLLLKDYLRGRWGFRG 260
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA-------AVKAG 304
Y+VSDCD+V + + HY P A A ++ AG+D C + T+ A+ G
Sbjct: 261 YVVSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTDTDGLTNPYRDALARG 320
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQL--GPKDVCTQANQDLALDAARQGI 362
L++ + ID +++ FA R G P +P P D+ T A+Q LAL+ A + +
Sbjct: 321 LISVADIDRSLTRLFAARYRTGDL---PGVRPLSMQSSSPADIGTPAHQALALETAEKSL 377
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
VLLKN G LPL A +AVIGP + T+ + GNY
Sbjct: 378 VLLKNK-GILPLKAQA--RIAVIGPLGDATRVLRGNY 411
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 67/118 (56%), Gaps = 13/118 (11%)
Query: 439 KKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGP 488
+ AA +DA V V+G +EAE D+ L LP QQ L+ E AK P
Sbjct: 609 RTAAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALL-EQAKATGKP 667
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+I+++M+G ++++AK + W YPGQAGG A+ +V G+ NP+GRLP+T+Y
Sbjct: 668 LIVVLMNGSPVNLAWAKANADAIVEAW--YPGQAGGLAVGNVLSGKTNPAGRLPLTFY 723
>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 756
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 170/500 (34%), Positives = 235/500 (47%), Gaps = 93/500 (18%)
Query: 100 EALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLF----QAIGKVVSTEARAMYN 155
EALHG + G+T FPQ I A+++N L AIGK T +R ++
Sbjct: 138 EALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIGK--ETRSRGIHQ 184
Query: 156 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVA 215
V SP INI RDPR GR +ET GEDP LAS+ A Y+KG+Q+ V
Sbjct: 185 V-------LSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQEQG--------VI 229
Query: 216 ACCKHYTAYDLDNWKGTDRY--HFNAMVIYTYYLIKFK---------------------- 251
A KH+ A + + G D Y HF+ ++ Y FK
Sbjct: 230 ATPKHFAANFVGD-GGRDSYPIHFSERLLREVYFPAFKASIKEAGALSLMAAYNSLDGIP 288
Query: 252 --------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC-G 290
Y+VSD SV L ++ EAA ++ AGLD+
Sbjct: 289 CSSNKWLLTDVLRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAARLALEAGLDMELPD 348
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQAN 350
S + VK G ++E I+ A+ G FD +P P D C + +
Sbjct: 349 SDCFEEMINLVKGGKLSEETINEAVRRILGVKFWAGLFD-NPFVDPDYAERVND-CAE-H 405
Query: 351 QDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPL 410
++LAL AR+ IVLLKN G LPLS I ++AVIGPNA V + +G Y G K TPL
Sbjct: 406 RELALRVARESIVLLKNE-GILPLSKD-IGSIAVIGPNAAVPR--LGGYSGYGVKIVTPL 461
Query: 411 QGLAAVV---ATIYQA-GCSNVQCGTAQVDDAKKAAASADATVLVMGAD-QSIEAESHDR 465
+G+ + A IY A GC + D+A K A +D +L +G E E DR
Sbjct: 462 EGIKNKMENKAKIYFAEGCGLNDTSKSGFDEAIKIAQKSDVAILFVGNSVPETEGEQRDR 521
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
+L LPG Q+ LI E+ PVI+++++G I+ K+ +++ YPG+ GG
Sbjct: 522 HNLNLPGVQEELIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGN 578
Query: 526 AIADVCFGRYNPSGRLPMTW 545
AIADV FG YNP G+LP+T+
Sbjct: 579 AIADVLFGDYNPGGKLPITF 598
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 182/629 (28%), Positives = 263/629 (41%), Gaps = 172/629 (27%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKI-----------TFLVDSAGSVS------------- 88
+ N L D R DL+KR+TL+EK+ T D G+ S
Sbjct: 18 YKNPDLSPDERAEDLLKRMTLEEKVAQMLCIWEDKKTIFFDDEGNFSIDKVREHLKNGIG 77
Query: 89 --------------------------------RLGIPKYEWWSEALHGVSYVGPGTHFSN 116
RLGIP + E LHG++
Sbjct: 78 QIARLSDTNGGLDPYQMVEFANKLQKFFVEETRLGIPVI-FHEECLHGLA---------- 126
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRW 176
ATS+P I AA+FN L + I ++ +AR+ +P +++ RDPRW
Sbjct: 127 -AKDATSYPVPIGLAATFNPELIEKIFSAIAEDARSR-----GAHQALTPVVDVVRDPRW 180
Query: 177 GRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD--R 234
GR +ET GED L S+ VKGLQ DG N KV A KH+ A+ GT+
Sbjct: 181 GRVEETFGEDTYLVSQMGIASVKGLQ-GDGSLNNNNKVIATLKHFAAHGQPE-SGTNCAP 238
Query: 235 YHFNAMVIYTYYLIKFK------------------------------------------Y 252
+F+ + +L+ FK +
Sbjct: 239 ANFSERFLRDTFLMPFKEAIDKAGVISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGF 298
Query: 253 IVSDCDSVDVLYNSQHYTK-----TPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVK 302
+VSD ++ L++ + EAA ++ AG+++ +C L + VK
Sbjct: 299 VVSDYYAITELFHKEETVSHGVAANKVEAAKLALEAGVNIEFPNPDCYPNLTE----MVK 354
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD--LALDAARQ 360
GL +ES ID + LG FD PY + P + QD LAL AAR+
Sbjct: 355 GGLADESDIDALVLPMLKYKFELGLFD-----NPYVEAEPGQFENKLEQDRELALQAARE 409
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA-- 418
I LLKN LPL K +AVIGPNA+ +T++G Y GTP YT+ QG+ V
Sbjct: 410 TITLLKNEGNLLPLK--DFKKIAVIGPNAD--RTLLGGYHGTPKYYTSVYQGIKDKVGKN 465
Query: 419 --TIYQAGCSNVQCGT---------------AQVDDAKKAAASADATVLVMGADQSIEAE 461
Y GC G+ +++A A +D VLV+G ++ E
Sbjct: 466 GEVFYSEGCKITVGGSWNDDEVILPDPAEDEKLINEAVAVAQKSDVAVLVLGGNEQTSRE 525
Query: 462 S----H--DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILW 515
+ H DR L L G+Q L+ E+ K K PV++++ +G I F K++ + +IL
Sbjct: 526 AWNKKHLGDRPSLELVGRQNKLVEEILKTGK-PVVVLLFNGRPNSIGFIKDN--VPAILE 582
Query: 516 VGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
Y GQ G A+ADV FG YNPSG+LP++
Sbjct: 583 CWYLGQETGRAVADVLFGDYNPSGKLPVS 611
>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
Length = 874
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 188/392 (47%), Gaps = 76/392 (19%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
+V RLTL EK+ L++ A ++ RLGIP Y WW+E+LHG P T+FP+
Sbjct: 40 IVDRLTLDEKVAQLLNVAPAIPRLGIPAYNWWTESLHGAIGAVP----------TTNFPE 89
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
I AA+F+A L Q + + TE RA++ +G GL WSPNINIFRDPRWG
Sbjct: 90 PIGLAATFDAPLIQQVAAAIGTEVRALHTLGRRTGHLGRIGTGLDTWSPNINIFRDPRWG 149
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD-------LDNWK 230
RGQET GEDP L ++ +V+G+Q G P+ V A KH+ + DN
Sbjct: 150 RGQETYGEDPYLTARIGVAFVQGMQ---GNDPDLPLVVATPKHFAVHSGPEPSRHTDNIF 206
Query: 231 GTDR-------YHFNAMVIY--------------------TYYLIK--------FK-YIV 254
T R F A ++ ++ L+K FK Y+V
Sbjct: 207 ATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSHMLLKDYLRGAWGFKGYVV 266
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF-------LGKHTEAAVKAGLVN 307
SDCD+V +Y Y A ++ G+D C + LG + ++ G +
Sbjct: 267 SDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGGRPDLGDRYKESLARGHIA 326
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ID A+ F+ R G G S++P + P + T +Q LAL A + +VLLKN
Sbjct: 327 MGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPDHQALALSTAEKSLVLLKN 385
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
G LPL P A +A+ GP A+ T+ + GNY
Sbjct: 386 D-GILPLRPGA--RIALAGPLADATRVLRGNY 414
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LP Q+ + E A+ P++++ M+G D+S+AK + W YPGQ+G
Sbjct: 632 DRTTLDLPADQRAFL-EKARALGKPLVIVAMNGSAIDLSWAKENAAAIVEAW--YPGQSG 688
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ +V GR +P GRLP+T+Y
Sbjct: 689 GLAVGNVLSGRADPGGRLPVTFY 711
>gi|238589520|ref|XP_002392043.1| hypothetical protein MPER_08438 [Moniliophthora perniciosa FA553]
gi|215457551|gb|EEB92973.1| hypothetical protein MPER_08438 [Moniliophthora perniciosa FA553]
Length = 448
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/285 (38%), Positives = 164/285 (57%), Gaps = 5/285 (1%)
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLG 326
YT A+A ++ AG+D++CG+ + A+ L++E + A+ + +L+R G
Sbjct: 1 HQYTDDVVNASALALRAGVDIDCGTTYPTNLGKALNRSLISEDNLRKALVRQYHSLIRTG 60
Query: 327 FFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
+FD P +QPY QL DV T+ Q+LA AA +G+VLLKN G+LPL P+ I+ +A++G
Sbjct: 61 YFD-PPERQPYRQLNWSDVNTKQTQELAHRAAVEGMVLLKND-GTLPLKPS-IQKIALVG 117
Query: 387 PNANVTKTMIGNYEGTPCKYTTPLQGL-AAVVATIYQAGCSNVQCGTAQVDDAKKAAASA 445
P AN T+ M NY +PLQ A + G + T+ A +AA +
Sbjct: 118 PFANATQQMQSNYAQPAPFVISPLQAFREAAFDVAFANGTAINTTDTSGFAAAIEAAQKS 177
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
D + G D S+E E DR+++ PG Q LI E+A + K P I++ M GG D S+ K
Sbjct: 178 DVIIFAGGMDLSVEDEFRDRMEISWPGNQLDLIKELAVLEK-PFIVLSMGGGQVDCSWLK 236
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
+DP++ SI+W G P Q+GG A+ D+ G+ P+GRLP+T YP SY
Sbjct: 237 DDPRVNSIIWGGLPSQSGGPALLDIITGKKAPAGRLPITQYPASY 281
>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 944
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 160/548 (29%), Positives = 245/548 (44%), Gaps = 107/548 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFVED----TRLGIP-VDFTNEGIRGVESY------------KATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN--WKGTDR--------------- 234
+ V+GLQ +VAA KH+ AY + +G R
Sbjct: 239 ELGIEMVRGLQHNH-------QVAATAKHFAAYSNNKGAREGMSRVDPQMSPREVENIHI 291
Query: 235 YHFNAMVIYT--------------------YYLIKFK---------YIVSDCDSVDVLYN 265
Y F ++ T YY + + Y+VSD D+V+ LY
Sbjct: 292 YPFKRVIRETGLLGIMSSYNDYDGIPVQGSYYWLTTRLRQEMGFRGYVVSDSDAVEYLYT 351
Query: 266 SQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
+ K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 352 KHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDILR 410
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
+G FD G +V AN+ +AL A+R+ +VLLKN +LPL+ IK
Sbjct: 411 VKFLIGLFDSPYQTDLAG--ADNEVEKAANEAVALQASRESVVLLKNADNTLPLNIDKIK 468
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC------ 430
+AV GPNA+ + +Y + TT L+G+ +Y GC V
Sbjct: 469 KIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDLVDAHWPESE 528
Query: 431 ---------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
A++D A A AD V+V+G Q E+ R L LPG Q L+ V
Sbjct: 529 IIEYPLTPDEQAEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPGHQLKLLQAV 588
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K PV+L++++G +++A D + +IL YPG GG A+AD+ FG YNP G+L
Sbjct: 589 QATGK-PVVLVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILFGDYNPGGKL 645
Query: 542 PMTWYPQS 549
+T +P++
Sbjct: 646 TVT-FPKT 652
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 170/576 (29%), Positives = 266/576 (46%), Gaps = 100/576 (17%)
Query: 60 IDLRVGDLVKRLTLQEKIT---------FLVD---------------SAGSV--SRLGIP 93
+++RV DL+KR+TL+EKI F VD S SV SRL +
Sbjct: 35 VEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSVE 94
Query: 94 KYEWWSEALH---------GVSYVGPGTHFSNVV-PGATSFPQVILTAASFNASLFQAIG 143
+ + A++ G+ +G ++ GAT FPQ I +++FN + +
Sbjct: 95 QMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRVA 154
Query: 144 KVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ 203
V++ EA+A G+ + SP +++ R+ RWGR +ET GEDP L + YV +
Sbjct: 155 TVIAQEAKA---TGVDQVL--SPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAFNK 209
Query: 204 TD----------GGSP----NRLKVAACCKHYTAYDLDNWKGTDR--YHFNAMVIYTYYL 247
GSP N V C + + L ++ R ++ M Y Y
Sbjct: 210 EGVMTTLKHFLAHGSPTGGLNLASVTGCERDLRSLYLKPFQDVMREAMPYSVMNSYNSYE 269
Query: 248 ------------------IKFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
+ FK YI SD SV++L + H K +AA ++++AG+D+
Sbjct: 270 SVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLHHTAKDKADAACQAVIAGVDVE 329
Query: 289 CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
+ ++ V++G++ E ID +S +G FD +K+ L + V T
Sbjct: 330 VDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAMGLFDKDYTKR--ANLS-QTVHTP 386
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-- 406
+LAL AAR+ +L+KN LPL ++++AVIGPNA + G+Y T
Sbjct: 387 EAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGPNA--AQVQFGDYMWTNSNEYG 444
Query: 407 TTPLQGLAAV----VATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA-------- 454
TPLQG+ AV V Y GC + A AA ++D +L +GA
Sbjct: 445 ITPLQGIEAVTQGKVKINYAKGCEIHTQDRSGFSQAVTAARNSDVALLFVGAMSGSPGRP 504
Query: 455 -DQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
S+ ES D D+ LPG Q+ LI V K P I+++++G F I + K++ + +
Sbjct: 505 WPNSVSGESFDLSDISLPGCQEALIRAVKATGK-PTIVVLVAGKPFAIPWVKDNCEAVIV 563
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
W Y G+ G AIA++ FG NPSGRL ++ +PQS
Sbjct: 564 QW--YGGEQEGRAIAEILFGEVNPSGRLNVS-FPQS 596
>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
Length = 954
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 160/548 (29%), Positives = 266/548 (48%), Gaps = 78/548 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSYVGP 110
+ + SL ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINI 170
GAT FPQ + A++N L + + V+ E A N A WSP +++
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYV-----KGLQQTDG---------GSPNRLKVAA 216
+D RWGR +ET GEDP+L S+ ++ +GL T G + +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSRGLFTTPKHFGGHGAPLGGRDSHDIGL 333
Query: 217 CCKHYTAYDLDNWKGTDRYH--FNAMVIYTYYL--------------IKFK-----YIVS 255
+ L ++ R + + M+ Y+ Y+ ++ + +IVS
Sbjct: 334 SEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVS 393
Query: 256 DCDSVDVLYNSQHYTKTPE-EAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDH 313
DC ++ L +HYT + EAA +++ AG+ NCG ++ K A K G ++ +D+
Sbjct: 394 DCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKEVIQAAKDGRIDMENLDN 453
Query: 314 AISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+T+ R F+ +P K + ++ P + +++++A AAR+ IV+L+N L
Sbjct: 454 VCRTMLSTMFRNELFEKNPCKPLDWKKIYP-GWNSDSHKEMARQAARESIVMLENKENLL 512
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVA----TIYQAGCS 426
PL+ ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+ GC
Sbjct: 513 PLTKN-LRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGCD 569
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLL 477
+ A KAA+ +D V+V+G + EA E++D L+LPG+QQ L
Sbjct: 570 FTNPDETNIPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQEL 629
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PVILI+ +G +DI A K + W+ PGQ GG A+ADV FG YNP
Sbjct: 630 LEAVCATGK-PVILILQAGRPYDILKASEMCKAILVNWL--PGQEGGPAMADVLFGDYNP 686
Query: 538 SGRLPMTW 545
GRLPMT+
Sbjct: 687 GGRLPMTF 694
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 138/446 (30%), Positives = 209/446 (46%), Gaps = 74/446 (16%)
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
+T+WSPN+NIFRDPRWGRGQET GEDP L ++ YV+GLQ G P LK AAC KH
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQ---GNDPFFLKAAACAKH 57
Query: 221 YTAYDLDNWKGTD--RYHFNA----MVIYTYYLIKFK----------------------- 251
Y + G + R+ FNA ++ YL F+
Sbjct: 58 YAVH-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGESA 112
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF 292
++VSDC +VD +Y K EA+A ++ +GL+LNCG
Sbjct: 113 SGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCGGS 172
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
EA ++ L+ E +D+A+ T ++LG + PY + + + +
Sbjct: 173 FHALKEA-LERKLITEVDLDNALMPLMMTRLKLGNLTDD-DESPYKNISDSVIASYTHAM 230
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
+A + A++ +VLLKN +LPL +K + V GP A T M+GNY G + T LQG
Sbjct: 231 VAREVAQKSMVLLKNNNHTLPLKKD-VKTIFVTGPYAADTYVMMGNYYGVSPRSNTFLQG 289
Query: 413 LAAVVA----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG---ADQSIE----AE 461
+AA V+ Y+ G D +A+ ++V+G D+ E A
Sbjct: 290 IAAKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGEEGDAIAS 349
Query: 462 SH--DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
SH D+ +L LP Q + ++++ ++ +I G D+ + W YP
Sbjct: 350 SHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLEEVSELSDAVIMAW--YP 407
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
GQ GG A+ D+ FG + SGR+P+T+
Sbjct: 408 GQEGGMALGDLLFGDVSFSGRMPVTF 433
>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
Length = 869
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 137/411 (33%), Positives = 195/411 (47%), Gaps = 87/411 (21%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV R+TL EK+ L++ A ++ RLGIP+Y WW+E+LHG P T+FP+
Sbjct: 25 LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
I ASF+A L + + +STE R ++ + GL WSPN+NIFRDPRWG
Sbjct: 75 PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RGQET GEDP L ++ +V+G+Q G + +R +V A KH+ + N + R+
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQ---GPNADRPEVIATPKHFAVH---NGPESTRHEA 188
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
N V + YL F+
Sbjct: 189 NVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACANDLLLKHYLRGAWGFTG 248
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-------SFLGKHTEAAVKAG 304
Y+VSDCD+V + + HY A A ++ AG+D C + L A++
Sbjct: 249 YVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSDTAGLEDRYSEALQRE 308
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L++ + ID A+ FA R+G G P P + P + + AN LALDAA++ +VL
Sbjct: 309 LISVADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAANSQLALDAAQKTLVL 366
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY----EGTPCKYTTPLQ 411
LKN G LPL A +AVIGP + T+ + GNY GTP LQ
Sbjct: 367 LKND-GVLPLK--AGVKVAVIGPLGDATRVLRGNYSSPLSGTPISVLDGLQ 414
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 13/120 (10%)
Query: 437 DAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAK 486
D +AAA AD V V+G +EAE D+ L LP Q ++ + K
Sbjct: 590 DLDRAAAQADVLVAVVGLTSDLEAEETSVTVPGFAGGDKTTLDLPADQIAMLKQARATGK 649
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
P+I++ M+G D++++K + +IL YPGQ+GG AIA+V G+ +P GRLP+T+Y
Sbjct: 650 -PLIVVAMNGSPIDLAWSKQN--AAAILEAWYPGQSGGLAIANVLSGKTDPGGRLPLTFY 706
>gi|289768782|ref|ZP_06528160.1| sugar hydrolase [Streptomyces lividans TK24]
gi|289698981|gb|EFD66410.1| sugar hydrolase [Streptomyces lividans TK24]
Length = 960
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 163/517 (31%), Positives = 232/517 (44%), Gaps = 83/517 (16%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
S A+ F + L RV DL+ RLTL EK FL A +V RLGI + EALHGV+
Sbjct: 2 SAATPPFRDPQLPFAKRVDDLMSRLTLDEKTAFLHQFAPAVERLGIAPFRTGQEALHGVA 61
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWS 165
++GP AT FPQ + A++N L + +G+ VS EARAM GL WS
Sbjct: 62 WMGP----------ATVFPQAVGLGATWNEELVRRVGEAVSKEARAMRARDDRVGLNVWS 111
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
P +N+ R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++
Sbjct: 112 PTVNLLRHPLWGRNEEGYSEDPRLTSAVATAYTRGLR---GDHPAYWRTAPVLKHWLAHN 168
Query: 226 LDNWKGTDRYHFNAMVIYTYYLIKFKYIV------------------------------- 254
+ + T V++ Y L F+ V
Sbjct: 169 NETDRDTSSSSVRPRVLHEYDLRAFRDTVEAGAVAGVMPAYNLVNGRPNHLSPHLGRHLR 228
Query: 255 ----------SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGK---HTEA 299
SD + L +S+HY T EEA A ++ AG+D + G+ K
Sbjct: 229 TWTQEDLLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSKIVARVRG 288
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK--DVCTQANQDLALDA 357
A+ AGL+ E+ +D A+ + RLG FD P PY D T ++ LA +A
Sbjct: 289 ALDAGLLTEADVDAAVRRQLSVRFRLGEFD--PEHDPYAGAADTGADFDTPEHRALAREA 346
Query: 358 ARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV 417
A Q +VLLKN G LP++P +AV+G A+ K + Y GT +TPL+GL
Sbjct: 347 AEQAVVLLKND-GVLPMAPE--TRVAVVGLLADECK--LDWYSGTLIHRSTPLEGLYERF 401
Query: 418 A---TIYQAGCSNVQCGTA-----QVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
+ G V+ TA V A A+A A T + D ++ A D L
Sbjct: 402 GADRVSFAEGVDRVRLRTADGRFLHVLPADDASAEAPGTEGAL--DPALLAGRTD----L 455
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN 506
P + TE+A + G +L + + G +S A++
Sbjct: 456 PPLTTDAVGTELALIDWGEGVLTLRAPDGRYLSVAED 492
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP QQ+ L+ A+ A +L ++S +A + ++LW + G
Sbjct: 598 ETEDRTTLRLPAQQERLL-RAARAANPATVLALVSA----YPYAVGVETLPAVLWTAHGG 652
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A + G +P+GRLP TWY
Sbjct: 653 QAAGTALARILAGDVSPAGRLPQTWY 678
>gi|302542160|ref|ZP_07294502.1| beta-glucosidase [Streptomyces hygroscopicus ATCC 53653]
gi|302459778|gb|EFL22871.1| beta-glucosidase [Streptomyces himastatinicus ATCC 53653]
Length = 973
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 146/432 (33%), Positives = 208/432 (48%), Gaps = 70/432 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + RV DL+ RLT E+I L A SV RLG+ + EALHGV+++GP
Sbjct: 15 FRDPRLPLGTRVDDLLARLTPDERIAMLHQYAPSVERLGLAAFRTGQEALHGVAWMGP-- 72
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG-LAGLTFWSPNINIF 171
AT FPQ + A+++ L + IG+ V TEARAM L GL W+P +N+
Sbjct: 73 --------ATVFPQAVGLGATWSPDLVRRIGEAVGTEARAMRARDPLIGLNVWAPVVNLL 124
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA-------- 223
RDPRWGR +E EDP L S A Y +GL+ G P+R + A KH+ A
Sbjct: 125 RDPRWGRNEEGYAEDPYLTSALAAAYTRGLR---GDHPDRWRTAPVLKHWLAHNNETDRD 181
Query: 224 -------------YDLDNWKGTDRYHFNAMVIYTYYLIKFK------------------- 251
YDL ++G R A V+ Y L+ +
Sbjct: 182 TTSSSVRPRVLHEYDLAAFRGAVRAGAVAGVMPAYNLVNGRPNHVSPYLREHLRTWTDQP 241
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL------DLNCGSFLGKHTEAAVKAG 304
+ SD + L +S+HY T EE A ++LAG+ D + +G+ A+ G
Sbjct: 242 LVVCSDAGAPSNLIDSEHYFATHEEGVAAALLAGVDSFTDHDQDSSKTIGR-VRGALARG 300
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L++ S +D A+ LG FDG PY T A++ LAL+AA Q +VL
Sbjct: 301 LIDASTVDTAVRRLLTMRCALGEFDG--EDDPYADPRGAGFDTPAHRALALEAAEQAVVL 358
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIY 421
LKN G LPLSP+A +AV+GP A+ K + Y G+ + ++P +GLAA + ++
Sbjct: 359 LKND-GLLPLSPSATGTVAVVGPLADECK--LDWYSGSLLRRSSPREGLAARLGADRVVF 415
Query: 422 QAGCSNVQCGTA 433
G V+ TA
Sbjct: 416 VEGTDTVRLRTA 427
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 4/96 (4%)
Query: 452 MGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKI 510
+G D I E+ DR L LP Q L A A L++ S + + A + +
Sbjct: 588 LGNDPHINGRETEDRTTLALPAGQDRLW-RAAHAANPRTALVLTSSYPYAVGDAAD--AL 644
Query: 511 TSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++LW + GQA G +A V G +P+GRLP TWY
Sbjct: 645 PALLWTAHGGQAAGTGLARVLCGDVSPAGRLPQTWY 680
>gi|375311103|ref|ZP_09776360.1| beta-glucosidase-related glycosidase, partial [Paenibacillus sp.
Aloe-11]
gi|375076843|gb|EHS55094.1| beta-glucosidase-related glycosidase, partial [Paenibacillus sp.
Aloe-11]
Length = 566
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 128/408 (31%), Positives = 194/408 (47%), Gaps = 61/408 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +D RV DLV RLTL+EKI ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 12 FQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE-- 69
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ + A +++ L + IG V+ EAR Y + L GLT W+P +++
Sbjct: 70 --------ATSFPQPVGLACTWDTELMRQIGSVIGDEARVYYRRDPKLNGLTLWAPTVDM 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + + VKG+Q G P LK A KH+ + + +
Sbjct: 122 ERDPRWGRNEEAYGEDPYLTGELSKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+ + + YYL F+
Sbjct: 179 GSGSSSIDPRNLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHGVNDIVRGEWGM 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD V + N Y + A+SI G+D + + L K A+ G +
Sbjct: 239 DGFVVSDAGDVMGIANDHQYYDSHTPGVAESIRVGIDSITDDAELSKQAIREALAQGTLE 298
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E+ +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ +VLLKN
Sbjct: 299 EADLDRALFHTFRVRFRLGEFD-PAADNPYALIGEEHLMTEQARELSLRAAREQVVLLKN 357
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
G LPL P +AVIG N + Y GT TPL+ + A
Sbjct: 358 ERGLLPLDPAQCGKVAVIGTLGN--EVYRDWYSGTLPYRVTPLEAICA 403
>gi|390454819|ref|ZP_10240347.1| beta-glucosidase-related glycosidase [Paenibacillus peoriae KCTC
3763]
Length = 998
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 128/408 (31%), Positives = 195/408 (47%), Gaps = 61/408 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +D RV DLV RLTL+EKI ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 12 FQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE-- 69
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ + A +++ L + +G V+ EAR Y + L GLT W+P +++
Sbjct: 70 --------ATSFPQPVGLACTWDTELMRQVGSVIGDEARVYYRRDPKLNGLTLWAPTVDM 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + + VKG+Q G P LK A KH+ + + +
Sbjct: 122 ERDPRWGRNEEAYGEDPYLTGELSKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G+ + + YYL F+
Sbjct: 179 GSGSSSIDPRNLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHGVNDIVRGEWGM 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD V + N Y + A+SI AG+D + + L K A+ G +
Sbjct: 239 DGFVVSDAGDVMGIANDHQYYDSHTLGVAESIRAGIDSITDDAELSKQAIREALTQGTLE 298
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E+ +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ +VLLKN
Sbjct: 299 ETDLDRALFHTFRVRFRLGEFDP-AADNPYALIGEEALMTEQARELSLRAAREQVVLLKN 357
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
G LPL P +AVIG N + Y GT TPL+ + A
Sbjct: 358 ERGLLPLDPAQCGKVAVIGTLGN--EVYHDWYSGTLPYRVTPLEAIRA 403
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG-GFDISFAKNDPKITSILWVGYP 519
E +DR L L QQ LI EV + P +++++G F I + + I +I++ +
Sbjct: 592 EENDRPGLELAASQQRLIEEVYRA--NPNTIVVLTGSYPFAIPWVQE--HIPAIVYTSHA 647
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQ G A+ADV FG Y P+GRL MTWY
Sbjct: 648 GQEHGTAVADVLFGDYAPAGRLNMTWY 674
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 147/502 (29%), Positives = 224/502 (44%), Gaps = 99/502 (19%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L + + ++ R L G+ SP +++ RDPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGR------LIGVNQCLSPVLDVCRDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ-------GETQLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHA 314
D + L EAA ++ +G+D+ + G+ A+K GLV+E+ ID A
Sbjct: 268 YDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTAIKEGLVSEAIIDRA 327
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+ RLG D P+ P+ + + +++LAL AAR+ IVLLKN L
Sbjct: 328 VERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAARESIVLLKNENNML 382
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIYQ 422
PLS I +AVIGPNAN + M+G+Y T + T LQG+A V +Y
Sbjct: 383 PLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGEGKVLYA 441
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAESH 463
GC +A + A AD + VMG Q++ E +
Sbjct: 442 KGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGEGN 501
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L L G Q+ L+ E+ K K P+IL++++G +S N + +I+ +PG+ G
Sbjct: 502 DRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAIIEAWFPGEEG 558
Query: 524 GAAIADVCFGRYNPSGRLPMTW 545
G AIAD+ FG YNPSGRLP+T+
Sbjct: 559 GNAIADIIFGDYNPSGRLPITF 580
>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 903
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/397 (32%), Positives = 191/397 (48%), Gaps = 86/397 (21%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV +LT +EK+ L+++A ++ RLG+P Y WW+E+LHG P T+F+ +
Sbjct: 54 LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLPTTNFA----------E 103
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
I AA+F+ L + +G+V++TE R ++ + GL WSPNINIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RGQET GEDP LA++ YVKG+Q D P + A KHY + + + R+
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGED---PRWYDIIATPKHYAVH---SGPESTRHSA 217
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
N V + YL F+
Sbjct: 218 NVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACANDLLLKDYLRGAWDFKG 277
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCG-------SFLGKHTEAAVKAG 304
Y+VSDCD+V + HY A A ++ +G+D C + L + + A++ G
Sbjct: 278 YVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVDMAGLARPYQEALERG 337
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG--QLGPKDVCTQANQDLALDAARQGI 362
L++ + +D A+ FA R G P +P P DV + LAL+ A + +
Sbjct: 338 LISMADVDRALVRLFAARYRNGDL---PGLRPLSTETASPADVGKPEHAALALEVAEKSL 394
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
VLLKN+ G LPL P A LAV+GP A+ T+ + GNY
Sbjct: 395 VLLKNS-GVLPLRPQA--KLAVVGPLADATRVLRGNY 428
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 69/126 (54%), Gaps = 17/126 (13%)
Query: 435 VDDA----KKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITE 480
VDD + AAA +D V V+G +EAE D+ L L QQ L+ E
Sbjct: 618 VDDPIGQMRAAAAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALL-E 676
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
A+ P+++++M+G ++++A+ +I+ YPGQ+GG AI +V G NP+GR
Sbjct: 677 AARATGKPLVVVVMNGSPVNLAWARQ--HAAAIIEAWYPGQSGGLAIGNVLAGHANPAGR 734
Query: 541 LPMTWY 546
LP+T+Y
Sbjct: 735 LPLTFY 740
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 160/550 (29%), Positives = 242/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFIEE----TRLGIP-VDFTNEGIRGVESY------------RATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + IG + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQIGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+G+Q +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGMQHNH-------QVAATGKHFVAYS--NNKGAREGMARVDPQMSPREVEMI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 350 YTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G K+V N+ LAL A+R+ +VLLKN LPL
Sbjct: 409 LRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNVLPLDINN 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQCG--- 431
+K +AV GPNA+ + +Y + TT L+G+ +Y GC V
Sbjct: 467 VKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDLVDANWPE 526
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLELPGRQLKLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PV+L++++G I++A D + +IL YPG GG A+ADV FG YNP G
Sbjct: 587 AVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P+S
Sbjct: 644 KLTVT-FPKS 652
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 148/502 (29%), Positives = 221/502 (44%), Gaps = 99/502 (19%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L I V+ ++ R L G+ SP +++ +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGR------LVGVNQCLSPVLDVCKDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ + ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------QLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHA 314
D + L EAA ++ +G+D+ + G+ A+K GLV ES ID A
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNALKEGLVPESLIDRA 327
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+ RLG D P+ P+ + +++LAL AR+ IVLLKN L
Sbjct: 328 VERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNIL 382
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIYQ 422
PLS + +AVIGPNAN + M+G+Y T + T LQG+ V +Y
Sbjct: 383 PLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLYA 441
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAESH 463
GC +A + A AD + +MG Q++ E +
Sbjct: 442 KGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGEGN 501
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LPG Q+ L+ E+ K K P+IL++++G +S N K W +PG+ G
Sbjct: 502 DRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIINYVKAVIEAW--FPGEEG 558
Query: 524 GAAIADVCFGRYNPSGRLPMTW 545
G AIADV FG YNPSGRLP+T+
Sbjct: 559 GNAIADVIFGDYNPSGRLPITF 580
>gi|402299776|ref|ZP_10819350.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
27647]
gi|401725066|gb|EJS98379.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
27647]
Length = 937
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 196/417 (47%), Gaps = 71/417 (17%)
Query: 46 PSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV 105
P F NT L ++ RV DLV R TL EKI ++ + RLGI ++ +EA HG+
Sbjct: 2 PQAVKFPFQNTELLVEDRVKDLVNRFTLDEKIELMMQHQPEIKRLGIKAHKQGTEAAHGI 61
Query: 106 SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTF 163
+++G AT FPQ I ++++N L IG V+ EAR Y + GLT
Sbjct: 62 AWLGE----------ATVFPQNIGLSSTWNKDLMNKIGTVIGDEARVFYQKDPSKHGLTL 111
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+P +++ RDPRWGR +E GEDP L + T VKG+Q G P+ LK A KH+
Sbjct: 112 WAPTVDMERDPRWGRTEEAYGEDPELTGQLTTELVKGMQ---GNDPDYLKSVATLKHF-- 166
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFK---------------------------- 251
L N +R +A + ++ YYL FK
Sbjct: 167 --LGNNNEINRDKCSASIDPRNMHEYYLKAFKPAIVEGKAKSIMTAYNSINGTPALLHPY 224
Query: 252 -------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKH 296
+IVSD V + N Y +T EA A +I +G+D + +
Sbjct: 225 VEEVVKQEWGMDGFIVSDAGDVVGIVNDHKYYETYSEALADTIKSGIDSITDDAEISLQA 284
Query: 297 TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALD 356
A+ GL+ E+ +D A+ N F RLG FD + PY + + + ++ L+
Sbjct: 285 MRDALSKGLLVEADLDKALINTFKVRFRLGEFD--EERNPYAFVSESKLLAKEHKQLSYQ 342
Query: 357 AARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
AA++ +VLLKN G LPL+ +++AVIGP ++ ++ Y GTP T L+ +
Sbjct: 343 AAQEQVVLLKND-GLLPLNEGTTESIAVIGPLSD--DVLVDWYSGTPSYRITALEAI 396
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 72/121 (59%), Gaps = 4/121 (3%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVA 485
++ A +++A KAAA D V+V+G + I E DR D++LP +Q+ L+ V +V
Sbjct: 536 EIEVIEAGIEEAVKAAAEQDYAVVVLGNNPFINGKECIDREDIVLPPEQEKLLQAVKQVN 595
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
I +++S + +++A + ++ I++ + G G A+ADV FG YNP+GRL MTW
Sbjct: 596 PN-TIAVLVSSYPYALNWANEN--LSGIMYTSHAGPELGKAVADVLFGDYNPAGRLSMTW 652
Query: 546 Y 546
Y
Sbjct: 653 Y 653
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 148/502 (29%), Positives = 221/502 (44%), Gaps = 99/502 (19%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L I V+ ++ R L G+ SP +++ +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGR------LVGVNQCLSPVLDVCKDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ + ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------QLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHA 314
D + L EAA ++ +G+D+ + G+ A+K GLV ES ID A
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNALKEGLVPESLIDRA 327
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+ RLG D P+ P+ + +++LAL AR+ IVLLKN L
Sbjct: 328 VERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNIL 382
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIYQ 422
PLS + +AVIGPNAN + M+G+Y T + T LQG+ V +Y
Sbjct: 383 PLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLYA 441
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAESH 463
GC +A + A AD + +MG Q++ E +
Sbjct: 442 KGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFKKYQAVTGEGN 501
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LPG Q+ L+ E+ K K P+IL++++G +S N K W +PG+ G
Sbjct: 502 DRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIINYVKAVIEAW--FPGEEG 558
Query: 524 GAAIADVCFGRYNPSGRLPMTW 545
G AIADV FG YNPSGRLP+T+
Sbjct: 559 GNAIADVIFGDYNPSGRLPITF 580
>gi|374992733|ref|YP_004968228.1| sugar hydrolase [Streptomyces bingchenggensis BCW-1]
gi|297163385|gb|ADI13097.1| sugar hydrolase [Streptomyces bingchenggensis BCW-1]
Length = 946
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 136/409 (33%), Positives = 187/409 (45%), Gaps = 70/409 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L R+ DL+ RLTL EKI FL A +V RLG+ + EALHGV+++GP
Sbjct: 11 FRDRQLPFAKRIDDLLSRLTLDEKIGFLHQFAPAVERLGVAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS E RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNTDLVRRVGEAVSKEVRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E ED L S AT Y GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDAKLTSAIATAYTHGLR---GEHPTYWRTAPVLKHWLAHNNETGRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
V++ Y L F+ V
Sbjct: 178 VTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLREHLRTWTDED 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD------LNCGSFLGKHTEAAVKAG 304
SD + L +S+HY T EEA A ++ AG+D + G+ A++ G
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSQITGR-VRGALEQG 296
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ ES ID A+ + RLG FD P P+ D T A++ LA +AA Q +VL
Sbjct: 297 LLTESDIDAAVRRQLSVRFRLGEFD--PEHDPHALTA--DFDTPAHRTLAQEAAEQAVVL 352
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LKN G LPL+P LAV+G A+ K + Y GT +TPL GL
Sbjct: 353 LKNDGGLLPLAPD--TRLAVVGLLADECK--LDWYSGTLLHRSTPLAGL 397
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 62/112 (55%), Gaps = 6/112 (5%)
Query: 436 DDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
D + A+ AD ++V G D I E+ DR L LPG Q+ L+ A+ A +L ++
Sbjct: 557 DAVSQVASQADVVLVVAGNDPHINGRETEDRTTLALPGHQERLL-RAARTANPNTVLALV 615
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S +A + ++ ++LW + GQA G+A+A + G +P+GRLP TWY
Sbjct: 616 SA----YPYAVDHAELPALLWTAHGGQAAGSALARILAGDVSPAGRLPQTWY 663
>gi|308071263|ref|YP_003872868.1| beta-glucosidase-related glycosidase [Paenibacillus polymyxa E681]
gi|305860542|gb|ADM72330.1| Beta-glucosidase-related glycosidase [Paenibacillus polymyxa E681]
Length = 984
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 195/408 (47%), Gaps = 61/408 (14%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +D RV DLV RLTL+EK+ ++ ++ RLG+ Y+ +EA HG++++G
Sbjct: 12 FQDPKIHLDERVRDLVLRLTLEEKVESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE-- 69
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
ATSFPQ + A +++ L + IG V++ EAR Y + L GLT W+P +++
Sbjct: 70 --------ATSFPQPVGLACTWDTELMRLIGSVIADEARVYYRRDPKLNGLTLWAPTVDM 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L + + VKG+Q G P LK A KH+ + + +
Sbjct: 122 ERDPRWGRNEEAYGEDPYLTGELSKELVKGIQ---GDHPVYLKAVATLKHFLGNNNEVDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL F+
Sbjct: 179 GIGSSSIDPRNLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHGVNDIVRDEWGM 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHT-EAAVKAGLVN 307
++VSD + + N Y + A+SI AG+D + + L K A+ G +
Sbjct: 239 DGFVVSDAGDIMGIVNDHQYYASHTPGVAESIRAGIDSITDDAELSKQAIREALAQGTLQ 298
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
E +D A+ + F RLG FD + PY +G + + T+ ++L+L AAR+ IVLLKN
Sbjct: 299 EEDLDRALFHTFRVRFRLGEFDP-AADNPYASIGEEALMTEQARELSLRAAREQIVLLKN 357
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
LPL+P +AVIG + N + Y GT TPL+ + A
Sbjct: 358 KHELLPLNPARCGKVAVIGTHGN--EVYRDWYSGTLPYRVTPLEAIRA 403
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG-GFDISFAKNDPKITSILWVGYP 519
E +DR L L QQ LI EV +V P +++I++G F I + + I +I++ +
Sbjct: 592 EENDRRGLELAASQQRLIEEVYRV--NPNMIVILTGSYPFAIPWVQE--HIPAIVYTSHA 647
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTW 545
GQ G AIADV FG Y P+GRL MTW
Sbjct: 648 GQEHGTAIADVLFGDYAPAGRLNMTW 673
>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
Length = 1039
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 158/550 (28%), Positives = 246/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 233 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESY------------KATNFPTQLGLG 275
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + IG + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 276 HTWNRQLLRQIGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 330
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ VKG+Q +VAA KH+ AY N KG
Sbjct: 331 ELGIEMVKGMQHNH-------QVAATGKHFIAYS--NNKGAREGMARVDPQMSPREVEMI 381
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 382 HVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDMGFRGYVVSDSDAVEYL 441
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 442 YTKHGTAKDMKEAVRQSVEAGLNIRC-TFRSPDSYVLPLRELVKEGELSEEIINDRVRDI 500
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD HP + + ++V +N+++AL A+R+ IVLLKN LPL+ +
Sbjct: 501 LRVKFLVGLFD-HPYQTDL-KGADEEVEKASNEEIALQASRESIVLLKNDKNVLPLNAST 558
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCG--- 431
IK +AV GPNA+ + +Y + T+ L+G L +Y GC V
Sbjct: 559 IKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLYTKGCELVDANWPE 618
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
+++ A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 619 SELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRSSLALPGRQLDLLK 678
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PV+L++++G I++A D + +IL YPG GG A+ADV FG YNP G
Sbjct: 679 AVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGKAVADVLFGDYNPGG 735
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 736 KLTVT-FPKT 744
>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
Length = 799
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 156/533 (29%), Positives = 227/533 (42%), Gaps = 104/533 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E +HG+++ AT P I +++N L G +
Sbjct: 138 TRLGIP-VDFSNEGIHGLNHTK-----------ATPLPAPINIGSTWNRDLVHQAGDIAG 185
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EA+A+ YN ++P +++ RDPRWGR ET GEDP L + VKG+QQ
Sbjct: 186 KEAKALGYN------NVYAPILDVARDPRWGRVLETYGEDPYLVGELGIQMVKGIQQNG- 238
Query: 207 GSPNRLKVAACCKHYTAYDL---------------------------------------- 226
VA+ KH+ Y +
Sbjct: 239 -------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKAHPKGV 291
Query: 227 ----DNWKG---TDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
++W G T Y+F ++ Y K YIVSD ++V+ + H + EEA +
Sbjct: 292 MSSYNDWDGVPVTASYYFLTQLLRQEYGFK-GYIVSDSEAVEFVQTKHHVADSYEEAVRQ 350
Query: 280 SILAGLDLNCGSFLGKH----TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
+ AGL++ K VK G ++ ++D +++ LG FD K
Sbjct: 351 VVEAGLNVRTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFELGLFDSPYVKD 410
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
P + K V ++D LD +Q +VLLKN LPL K + + GP A T M
Sbjct: 411 P--KAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETNYM 468
Query: 396 IGNYEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCG--------TAQVDDAKKA-- 441
I Y T G L +Y GC T D+ KK
Sbjct: 469 ISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKGIA 528
Query: 442 -----AASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
AA D + V+G D+S ES R L LPG+QQ L+ + K PV+L++++G
Sbjct: 529 EAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEALHATGK-PVVLVLING 587
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I++A D I SIL +PGQ GG AIA FG YNP GRL +T +P+S
Sbjct: 588 QPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVT-FPRS 637
>gi|284029787|ref|YP_003379718.1| glycoside hydrolase family 3 domain-containing protein [Kribbella
flavida DSM 17836]
gi|283809080|gb|ADB30919.1| glycoside hydrolase family 3 domain protein [Kribbella flavida DSM
17836]
Length = 959
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 179/386 (46%), Gaps = 62/386 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + + R+GDL+ RLTL EK+ L +V RLG+ + +EALHGV+++GP
Sbjct: 25 FRDPARKLPERIGDLLDRLTLPEKLGLLHQHQAAVPRLGVASFRTGTEALHGVAWLGP-- 82
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ I AAS+N L QA+G + E RA + A GL W+P +N
Sbjct: 83 --------ATVFPQAIGLAASWNTELLQAVGSAIGDEVRAFHRRDPAQVGLNVWAPVVNP 134
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L A Y +GL G +P LK A KH+ AY+ + +
Sbjct: 135 LRDPRWGRNEEGYSEDPWLTGLLAVAYGQGLT---GTAPGPLKTAPTLKHFLAYNNETDR 191
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
T + V++ Y L F+
Sbjct: 192 CTTSSNLPPRVLHDYELPAFRAPIAAGAAVAVMPSYNLVNGRPAHLSPLINDALRSWTED 251
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAG 304
+VSD + LY Q Y A A ++ AG+D N +H AA+ G
Sbjct: 252 DLLVVSDAGAPANLYELQRYYADGPTAYAAALCAGVDSFTQDDNDAGPSTRHLTAALDRG 311
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ + ID A+ + A RLG FD P PY L + V ++ LA +AARQ IVL
Sbjct: 312 LLTQDDIDQAVRHALAVRFRLGEFDP-PELDPYAALDDEVVGCPEHRQLAQEAARQSIVL 370
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNAN 390
LK+ LPL P + +A++GP A+
Sbjct: 371 LKHECHVLPLDPATTRRVAIVGPLAD 396
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 4/102 (3%)
Query: 446 DATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
D ++V G + E+ DR L LP Q L+ V A +L++ S F + +A
Sbjct: 567 DVAIVVAGNHPLVNGRETEDREALDLPAGQDRLVRAV-HAANSRTVLVMSSSYPFSVEWA 625
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
D + ++LW + GQ G A+ADV FG +PSGRLP TWY
Sbjct: 626 --DEHLPALLWSAHGGQEYGRALADVLFGDTDPSGRLPQTWY 665
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 160/550 (29%), Positives = 244/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFVED----TRLGIP-VDFTNEGIRGVESYR------------ATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+GLQ +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGLQHNH-------QVAATGKHFAAYS--NNKGAREGMARVDPQMSPREVENI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 350 YTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G ++V + N+ +AL A+R+ IVLLKN LPL +
Sbjct: 409 LRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASRESIVLLKNAGELLPLDINS 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---- 430
K +AV GPNAN + +Y + TT L+G+ +Y GC V
Sbjct: 467 TKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDLVDAHWPE 526
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD ++V+G Q E+ R L LPG+Q L+
Sbjct: 527 SEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPGRQLQLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
+ K PV+LI+++G I++A D + +IL YPG GG A+AD+ FG YNP G
Sbjct: 587 AIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 644 KLTVT-FPKT 652
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 160/550 (29%), Positives = 245/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 189 LNEVQRFFVED----TRLGIP-VDFTNEGIRGVESY------------RATNFPTQLGLG 231
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 232 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 286
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+GLQ +VAA KH+ AY N KG
Sbjct: 287 ELGIEMVRGLQHNH-------QVAATGKHFAAYS--NNKGAREGMARVDPQMSPREVENI 337
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 338 HIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 397
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 398 YTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDI 456
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD P + ++V + N+ +AL A+R+ IVLLKN LPL +
Sbjct: 457 LRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGELLPLDINS 514
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---- 430
K +AV GPNAN + +Y + TT L+G+ +Y GC V
Sbjct: 515 TKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDLVDAHWPE 574
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD ++V+G Q E+ R L LPG+Q L+
Sbjct: 575 SEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPGRQLQLLQ 634
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
+ K PV+LI+++G I++A D + +IL YPG GG A+AD+ FG YNP G
Sbjct: 635 AIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILFGDYNPGG 691
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 692 KLTVT-FPKT 700
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 160/550 (29%), Positives = 245/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFVED----TRLGIP-VDFTNEGIRGVESYR------------ATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+GLQ +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGLQHNH-------QVAATGKHFAAYS--NNKGAREGMARVDPQMSPREVENI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 350 YTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD P + ++V + N+ +AL A+R+ IVLLKN LPL +
Sbjct: 409 LRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGELLPLDINS 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---- 430
K +AV GPNAN + +Y + TT L+G+ +Y GC V
Sbjct: 467 TKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDLVDAHWPE 526
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD ++V+G Q E+ R L LPG+Q L+
Sbjct: 527 SEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPGRQLQLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
+ K PV+LI+++G I++A D + +IL YPG GG A+AD+ FG YNP G
Sbjct: 587 AIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 644 KLTVT-FPKT 652
>gi|452973347|gb|EME73169.1| glycoside hydrolase [Bacillus sonorensis L12]
Length = 989
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 137/455 (30%), Positives = 206/455 (45%), Gaps = 79/455 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV DL+ RLTL+EK++ + ++ RLGIP ++ +EALHGV+++G
Sbjct: 50 FQNPGLSVEKRVSDLISRLTLEEKVSLMHQYQPAIPRLGIPAFKTGTEALHGVAWLGK-- 107
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ + A ++N +L + +G V E R +++ A G+ W+P +++
Sbjct: 108 --------ATVFPQAVGLAHTWNPALIKRVGSAVGDEVRGFHHLNPAANGVNVWAPVVDL 159
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L K + Y GL+ G P LK KH+ Y+ + +
Sbjct: 160 LRDPRWGRNEEGYSEDPFLTGKISAAYASGLK---GDHPFYLKTVPTLKHFLGYNNETDR 216
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YY+ F+
Sbjct: 217 GFSSSSIDPRNMQEYYIKPFETAITEKAAYGLMPAYNSINDKPAILSPLLHTAVKNKWAG 276
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKA 303
+IVSD + N Y EEA A ++ AG+D + A+K
Sbjct: 277 KHFFIVSDAFDPSGIVNDHRYYDNHEEAHAHALKAGIDSFTDQGENPALTKNAVTGALKK 336
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGI 362
GL+ E +D A++N F+ R G FD P K+ PY +L + + +Q LA A Q I
Sbjct: 337 GLIAEKDLDQALANVFSLRFRTGEFD--PDKRNPYSRLTENVINSPKHQKLAKKTAEQSI 394
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT--- 419
VLLKN+ LPL A + +AVIGP + Y GT TPL G+ +
Sbjct: 395 VLLKNSKQLLPLERRADEKIAVIGPFGDALYEDW--YSGTMPYQITPLAGITDKIGKDRV 452
Query: 420 -----IYQAGCSNVQCG---TAQVDDAKKAAASAD 446
I Q+G + G TAQ D+ + +A+AD
Sbjct: 453 SFAEGIEQSGFRSALTGKYVTAQ-DEKQPLSATAD 486
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 63/113 (55%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
V++A A SAD T++ +G + I E+ DR D+ LP Q+ LI V K A L++
Sbjct: 591 VEEAAALAKSADKTIVFVGNNPYINGRETQDRKDITLPPAQEELIKAVTK-ANPNTALVV 649
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S F +++A D + SIL+ + GQ G A+AD+ FG P GRL TW+
Sbjct: 650 TSSYPFALNWA--DRHVPSILYSAHGGQEAGRALADILFGDKAPGGRLTQTWH 700
>gi|386346119|ref|YP_006044368.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339411086|gb|AEJ60651.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 942
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 132/430 (30%), Positives = 204/430 (47%), Gaps = 66/430 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ N L ++ RV DLV RLTL+EKI+ + +V RLGI Y EA HGV++VG
Sbjct: 12 YRNPDLPLEERVNDLVSRLTLEEKISLIPVRQAAVPRLGIDPYPIGGEAAHGVAWVG--- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--GLAGLTFWSPNINI 170
AT FPQ + + +++ L IG+V+ EARA ++ GLT W+P +++
Sbjct: 69 -------KATVFPQPVGLSCTWDRDLLGRIGEVIGVEARAYRDILGTEFGLTLWAPTVDL 121
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP LAS+ A V+G+Q G P L++ A KH+ A + + +
Sbjct: 122 VRDPRWGRTEEAYGEDPCLASELAGSLVRGMQ---GDHPFYLRMGATLKHFFANNNETDR 178
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + +++ YYL F+
Sbjct: 179 GISSSEMHPWLMHNYYLEVFRRIIERARVTCIMTAYNAVNGIPCLIHPAVKRLVKEEWGL 238
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL---NCGSFLGKHTEAAVKAGLV 306
++V+D + Y + E+ A +I +G+D + + + A++ GL+
Sbjct: 239 PGFVVTDAADFGMTVGMHAYFEDHVESIAATIRSGVDAITEDDRTIVIDSLRHALERGLL 298
Query: 307 NESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLK 366
E +D A+ N F RLG FD P + PY + +C ++ +A +AA++ +VLLK
Sbjct: 299 KEEDLDTALRNIFRIRFRLGEFDP-PDRNPYAGITKDALCAPSHARIAREAAQKSVVLLK 357
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIYQA 423
N G LPL +K +AV+GP A+ T Y GT TPL L + Y+
Sbjct: 358 NR-GLLPLHAHTLKRVAVVGPLAHEVHT--DWYSGTLPYVVTPLDALKERLTGAEVWYEE 414
Query: 424 GCSNVQCGTA 433
G S V+ +A
Sbjct: 415 GSSRVRFRSA 424
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 59/116 (50%), Gaps = 4/116 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
++ ++ A AD + G + I E+ DR DL+LP QQ L+ + +++I
Sbjct: 549 LERVRELAGKADVVLAFCGNNPYINGKETVDRPDLILPPLQQALVRTAFEANPRTALILI 608
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
G + S A D + +IL+ + GQ G A+ DV G +P+GRL +TWY S
Sbjct: 609 ---GSYPFSIAWEDEHLPAILYSSHGGQEMGRALTDVLLGDVSPAGRLSLTWYRSS 661
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 159/550 (28%), Positives = 242/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFIEE----TRLGIP-VDFTNEGIRGVESY------------RATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+G+Q + +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGMQHSH-------QVAATGKHFVAYS--NNKGAREGMARVDPQMSPREVEMI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 350 YTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G K+V N+ LAL A+R+ +VLLKN LPL
Sbjct: 409 LRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNVLPLDINN 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQCG--- 431
+K +AV GPNA+ + +Y + TT L+G+ +Y GC V
Sbjct: 467 VKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDLVDANWPE 526
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPGRQLKLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PV+L++++G I++A D + IL YPG GG A+ADV FG YNP G
Sbjct: 587 AVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKGGTAVADVLFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P+S
Sbjct: 644 KLTVT-FPKS 652
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 147/502 (29%), Positives = 220/502 (43%), Gaps = 99/502 (19%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L I V+ ++ R L G+ SP +++ +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGR------LVGVNQCLSPVLDVCKDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ + ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------QLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHA 314
D + L EAA ++ +G+D+ + G+ A+K GLV ES ID A
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNALKEGLVPESLIDRA 327
Query: 315 ISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGSL 372
+ RLG D P+ P+ + +++LAL AR+ IVLLKN L
Sbjct: 328 VERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNIL 382
Query: 373 PLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIYQ 422
PLS + +AVIGPNAN + M+G+Y T + T LQG+ V +Y
Sbjct: 383 PLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGESKVLYA 441
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAESH 463
GC +A + A AD + +MG Q++ E +
Sbjct: 442 KGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKYQAVTGEGN 501
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR L LPG Q+ L+ E+ K K P+IL++++G +S N K W +PG+ G
Sbjct: 502 DRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIINYVKAVIEAW--FPGEEG 558
Query: 524 GAAIADVCFGRYNPSGRLPMTW 545
G AIADV FG YNP GRLP+T+
Sbjct: 559 GNAIADVIFGDYNPGGRLPITF 580
>gi|218130693|ref|ZP_03459497.1| hypothetical protein BACEGG_02282 [Bacteroides eggerthii DSM 20697]
gi|217987037|gb|EEC53368.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 954
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 156/552 (28%), Positives = 265/552 (48%), Gaps = 80/552 (14%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
+L + + L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 167 TLRYMDPRLPVNERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G+ GAT FPQ + A++N +L + + + E A + WSP
Sbjct: 226 ---GS-------GATIFPQALAMGATWNKNLTEKVAMAIGEETLAA-----GTMQAWSPV 270
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---------------GSPNRL 212
+++ +D RWGR +ET GEDP+L S+ ++KG Q+ G G +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGY-QSKGLFTTPKHFGGHGAPLGGRDSH 329
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHF--NAMVIYTYYL----IKFK--------------- 251
+ + L ++ R + + M+ Y+ +L K K
Sbjct: 330 DIGLSEREMREVHLVPFRHVIRNYSCQSLMMAYSDFLGVPVAKSKELLRNILREEWGFDG 389
Query: 252 YIVSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
++VSDC ++ L + +HYT K EAA +++ AG+ NCG ++ K A K G ++ +
Sbjct: 390 FVVSDCGAIGNLTSRKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRLDMA 449
Query: 310 AIDHAISNNFATLMRLGFFDG-HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+D+ + R F+ H + ++ P + +++++A +AR+ IV+L+N
Sbjct: 450 NLDNVCRTMLRMMFRNELFEKEHKEPLDWNKIYP-GWNSDSHKEIARQSARESIVMLENK 508
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVAT----IYQ 422
LPLS ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+
Sbjct: 509 DDVLPLS-KHVRTIAVLGPGADNLQP--GDYTPKLRPGQLKSVLTGIKQAVGNQTKILYE 565
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQ 473
GC + +A K AA +D VLV+G + EA E+HD L+LPG+
Sbjct: 566 QGCEFTGSNGENIPNAVKVAAQSDVVVLVLGDCSTSEATTDVYKTSGENHDYATLILPGR 625
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ L+ V K PV+L++ G ++++ K + W+ PGQ GG A ADV FG
Sbjct: 626 QQELLEAVCATGK-PVVLVLQIGRPYNLTKESELCKAIIVNWL--PGQEGGLATADVLFG 682
Query: 534 RYNPSGRLPMTW 545
YNP+GRLPMT+
Sbjct: 683 DYNPAGRLPMTF 694
>gi|418472201|ref|ZP_13041963.1| sugar hydrolase [Streptomyces coelicoflavus ZG0656]
gi|371547177|gb|EHN75575.1| sugar hydrolase [Streptomyces coelicoflavus ZG0656]
Length = 948
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 134/408 (32%), Positives = 187/408 (45%), Gaps = 65/408 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L RV DL+ RLTL EK FL A +V RLGI + EALHGV+++GP
Sbjct: 8 FRDPHLPFAKRVDDLMSRLTLDEKTAFLHQFAPAVERLGIAAFRTGQEALHGVAWMGP-- 65
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS EAR M G GL WSP +N+
Sbjct: 66 --------ATVFPQAVGLGATWNEDLVRRVGEAVSKEARVMRARDGRVGLNVWSPTVNLL 117
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 118 RHPLWGRNEEGYSEDPKLTSAVATAYTRGLR---GDHPAYWRTAPVLKHWLAHNNETARD 174
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V+ Y L F+ V
Sbjct: 175 TSSSSVRPRVLNEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHLSPHLGGHLRTWTEED 234
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGL 305
SD + L +S+HY T EEA A ++ AG+D S + A++ GL
Sbjct: 235 LLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSTIVARVRGALEQGL 294
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E+ +D A+ + RLG FD +D + ++ LA +AA Q +VLL
Sbjct: 295 LTEADVDAAVRRQLSVRFRLGEFDAEHDPYAVTADSERDFDSPEHRALAREAAEQAVVLL 354
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN G LPL+P A +AV+G A+ K + Y GT +TPL+GL
Sbjct: 355 KND-GVLPLAPGA--RVAVVGLLADECK--LDWYSGTLLHRSTPLEGL 397
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LPG Q+ L+ A+ A +L ++S +A + + ++LW + G
Sbjct: 586 ETEDRTTLRLPGHQERLL-RAARAANPATVLALVSA----YPYAVDAETLPAVLWTAHGG 640
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 641 QAAGTALARVLAGDVSPAGRLPQTWY 666
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 156/550 (28%), Positives = 242/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + G+ SY AT+FP +
Sbjct: 141 LNEVQRFFIEE----TRLGIP-VDFTNEGIRGIESY------------RATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARI-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+G+Q +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGMQHNH-------QVAATGKHFVAYS--NNKGAREGMARVDPQMSPREVEMI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 350 YTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G ++V N+ LAL A+R+ +VLLKN LPL
Sbjct: 409 LRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQASRESLVLLKNENNVLPLDINN 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQCG--- 431
+K +AV GPNA+ + +Y + TT L+G+ +Y GC V
Sbjct: 467 VKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEVLYTKGCDLVDANWPE 526
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPGRQLKLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PV+L++++G I++A D + +IL YPG GG A+ADV FG YNP G
Sbjct: 587 AVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
++ +T +P+S
Sbjct: 644 KMTVT-FPKS 652
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 150/503 (29%), Positives = 223/503 (44%), Gaps = 101/503 (20%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L I V+ ++AR L G+ SP +++ +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQAR------LVGVNQCLSPVLDVCKDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ + ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------QLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA---GLVNESAIDH 313
D + L EAA ++ +G+D+ + + ++E V A GLV ES ID
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEFPT-IDCYSEPLVNALTEGLVPESLIDR 326
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A+ RLG D P+ P+ + +++LAL AR+ IVLLKN
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIY 421
LPLS + +AVIGPNAN + M+G+Y T + T LQG+ V +Y
Sbjct: 382 LPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAES 462
GC +A + A AD + VMG Q++ E
Sbjct: 441 AKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGEG 500
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
+DR L LPG Q+ L+ E+ K K P+IL++++G +S N K W +PG+
Sbjct: 501 NDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINYVKAVIEAW--FPGEE 557
Query: 523 GGAAIADVCFGRYNPSGRLPMTW 545
GG AIADV FG YNPSGRLP+T+
Sbjct: 558 GGNAIADVIFGDYNPSGRLPITF 580
>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 823
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 238/530 (44%), Gaps = 100/530 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + GV AT+FP + +++ L +G +
Sbjct: 171 TRLGIP-VDFTNEGIRGVEAYQ-----------ATAFPTQLNMGMTWDKPLVNQMGNITG 218
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EARA L ++P +++ RD RWGR +E GEDP L ++ KG+QQ +
Sbjct: 219 MEARA-----LGYTNVYAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN-- 271
Query: 208 SPNRLKVAACCKHYTAYDLDN--WKGTDR---------------YHFNAMVIY------- 243
++AA KH+ Y + +G R Y F ++
Sbjct: 272 -----QIAATAKHFAVYSANKGGREGLARTDPQVAPREVENILLYPFKKVIKEAGLMGVM 326
Query: 244 --------------TYYLIK-------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
+Y+LI+ FK Y+VSD D+++ LYN H ++A ++
Sbjct: 327 SSYNDYDGIPISGSSYWLIQRLRQEFGFKGYVVSDSDALEYLYNKHHVAADLKDAVYQAF 386
Query: 282 LAGLDLNCG----SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
+AG+++ + + VK G + I+ + + +LG FD HP Q
Sbjct: 387 MAGMNVRTTFRTPDSIIIYARQLVKEGKLPIDTINSRVRDVLRVKFKLGLFD-HPYVQD- 444
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG 397
+ K V ANQ +AL A+++ IVLLKN LPLS + LAVIGPNA
Sbjct: 445 AEASAKLVNCAANQAVALQASKESIVLLKNKGAILPLSKQ--QTLAVIGPNALNDDYAHT 502
Query: 398 NYEGTPCKYTTPLQGLAAVVA---TIYQAGCSNV---------------QCGTAQVDDAK 439
+Y K L+G+ A V +Y GC+ V Q A++D A
Sbjct: 503 HYGPLASKSINILEGIQAKVGAGKVLYALGCNLVDKHWPESEILPQDPDQAEQAKIDSAV 562
Query: 440 KAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGF 499
A AD V+V+G + E+ R L LPG Q L+ V K PV+++++
Sbjct: 563 TIARHADVAVVVLGGNTQTAGENKSRTSLDLPGYQLRLVKAVKATGK-PVVVVLIGSQPM 621
Query: 500 DISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I++ D I I++ GYPG GG A+ADV FG YNP G+L +T +P+S
Sbjct: 622 TINWI--DQHIDGIIYAGYPGTQGGTAVADVLFGDYNPGGKLTLT-FPKS 668
>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
Length = 946
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 170/619 (27%), Positives = 270/619 (43%), Gaps = 141/619 (22%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEW----WSEALHGV- 105
+ + S +D R+ DL+K++TL+EK +V G V + +P EW W + + +
Sbjct: 53 YEDPSAPVDARIEDLLKQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAID 112
Query: 106 ----------------SYVGPG-----------------------THFSNV-VPG----- 120
YV P T F+N + G
Sbjct: 113 EHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYK 172
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
AT+FP + ++N L + +G + EAR L ++P +++ RD RWGR +
Sbjct: 173 ATNFPTQLGLGHTWNRELIRQVGVITGREARM-----LGYTNVYAPILDVGRDQRWGRYE 227
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN--WKGTDR---- 234
E GE P L ++ V+G+QQ +VAA KH+ AY + +G R
Sbjct: 228 EVYGESPYLVAELGIEMVRGMQQD-------YQVAATGKHFIAYSNNKGGREGMSRVDPQ 280
Query: 235 -----------YHFNAMVIY--------------------TYYLIKFK---------YIV 254
Y F ++ +YY + + Y+V
Sbjct: 281 MSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVV 340
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNES 309
SD D+V+ LY + K +EA +S+ AGL++ C +F + VK G ++E
Sbjct: 341 SDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEE 399
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I+ + + +G FD HP + + ++V N+++AL A+R+ IVLLKN
Sbjct: 400 VINDRVRDILRVKFLVGLFD-HPYQTDL-KGADEEVEKAENEEVALQASRESIVLLKNDQ 457
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA----AVVATIYQAGC 425
LPL + IK +AV GPNA+ +G+Y + T+ L+G+ V +Y GC
Sbjct: 458 DVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEVLYSKGC 517
Query: 426 SNVQCG---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
V ++D A A AD V+V+G Q E+ R L L
Sbjct: 518 ELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKSRSSLDL 577
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+Q L+ V K PV+L++++G I++A D + +IL YPG GG A+ADV
Sbjct: 578 PGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGAKGGKAVADV 634
Query: 531 CFGRYNPSGRLPMTWYPQS 549
FG YNP G+L +T +P++
Sbjct: 635 LFGDYNPGGKLTVT-FPKT 652
>gi|443726791|gb|ELU13850.1| hypothetical protein CAPTEDRAFT_222124 [Capitella teleta]
Length = 626
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 153/517 (29%), Positives = 232/517 (44%), Gaps = 103/517 (19%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDS-AGSVSRLGIPKYEWWSEALHGVSYVGPG 111
F + SL D+RV DLV RLT++E I + S + + RLGI + W +E L GV++
Sbjct: 21 FLDPSLPWDVRVDDLVSRLTIEEVIGQTMASLSPPIPRLGIKGFVWGTECLRGVAFTK-- 78
Query: 112 THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTF 163
+T+FPQ + AA+F++ L I + V+ EARA +N A GL+
Sbjct: 79 ---------STAFPQALGLAATFSSELVHNISEAVAIEARAHWNADRAQGRVVDHEGLSC 129
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
+SP INI R P WGR QET GEDP L+ A +V+GLQ TD P ++ +A CKH+
Sbjct: 130 FSPVINIMRHPLWGRNQETYGEDPFLSGSLAQAFVRGLQGTD---PRYVRASAGCKHFDV 186
Query: 224 YDLDNWKGTDRYHFNAMV----IYTYYLIKFKY-IVSDCDSVDVLYNSQHYTKTPEEAAA 278
+ + R+ F++ V +L +FK I + S+ YNS +
Sbjct: 187 HGGPENIPSSRFSFDSKVSVRDWRMTFLPQFKMCIEAGTHSLMCSYNSINGVPACANRKL 246
Query: 279 KSILAGLDLNCGSFLGKHTEAA----VKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK 334
+ LA + F+ A K VN+S+ +H
Sbjct: 247 LTTLARQEFGFKGFVVSDDWAIEFINTKHQYVNDSSPEH--------------------- 285
Query: 335 QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK--NLAVIGPNANVT 392
++L++ AA VLLKN LPL K +L V P+
Sbjct: 286 ----------------RELSIRAASMSFVLLKNI-NVLPLKKKVPKLASLKVFHPSE--- 325
Query: 393 KTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVM 452
LQ A +GC + CGT + A +D ++ +
Sbjct: 326 -----------------LQFPIA-------SGCDDTVCGTYNKSETLNAVRHSDLVIIAL 361
Query: 453 GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKG-PVILIIMSGGGFDISFAKNDPKIT 511
G Q++E E DR D LPG Q L+ + A+G PV+L++ + G ++++AK+ P +
Sbjct: 362 GTGQAVEKEGKDRRDAELPGHQLQLLKDAVTAAEGVPVVLLLFNAGPLNVTWAKSRPDVH 421
Query: 512 SILWVGYPGQAGGAAI---ADVCFGRYNPSGRLPMTW 545
+IL +P QA G A+ + G+ P+GRLP TW
Sbjct: 422 AILACFFPAQATGQALFRTLTMADGQSAPAGRLPATW 458
>gi|371940122|dbj|BAL45498.1| glycoside hydrolase [Bacillus licheniformis]
Length = 980
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 141/455 (30%), Positives = 209/455 (45%), Gaps = 78/455 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + RV DL+ RLTL+EK++ + ++ RLGIP ++ +EALHGV+++G
Sbjct: 40 FQDPSLPVQKRVSDLLSRLTLEEKVSLMHQYQPAIPRLGIPAFKTGTEALHGVAWLGE-- 97
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ + A +++ SL + IG V E R +++ A G+ W+P +++
Sbjct: 98 --------ATVFPQAVGLAHTWDRSLIKQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDL 149
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L + AT Y GL+ G P LK KH+ AY+ + +
Sbjct: 150 LRDPRWGRNEEGYSEDPFLTGEIATAYASGLR---GEHPFYLKTVPTLKHFLAYNNETDR 206
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + + YYL FK
Sbjct: 207 GFSSSSIDPRNMNEYYLKPFKTAISAKAAYSLMPAYNSVNDKPAILSPLLNSTVKGKWAG 266
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEA---AVKA 303
+IVSD + Y + E+A A ++ AG+D + G G A A+K
Sbjct: 267 DDFFIVSDAFDPSGIVTDHKYYDSHEKAHAHAVKAGIDNFTDQGENPGLTKTALTGALKN 326
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGI 362
GL++E +D A++N F+ R G FD P + PY +L + + ++ LA AA + I
Sbjct: 327 GLMSEKELDQALANTFSIRFRTGEFD--PDELNPYSRLTDGVINSPKHRQLAKKAAEKAI 384
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT--- 419
VLLKN LP + +N+AVIGP N Y GT TPL G+A +
Sbjct: 385 VLLKNDRNLLPFNTRKNENIAVIGPFGNALYEDW--YSGTMPYRKTPLDGIADKIGKDRV 442
Query: 420 -----IYQAGCSNVQCG---TAQVDDAKKAAASAD 446
+ Q+G + G TA D + ASAD
Sbjct: 443 SFAEGLEQSGFKSALTGKFVTAGRDGKQPLTASAD 477
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 67/115 (58%), Gaps = 4/115 (3%)
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
+ ++A + A +AD V+ +G + I E+ DR D+ LP Q+ L+ V+K A +L
Sbjct: 580 SSTEEAVRLAKTADKAVVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSK-ANPNTVL 638
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++ S F +++AK I +I++ + GQ G+A+AD+ FG NP GRL TW+
Sbjct: 639 VVTSSYPFALNWAK--VHIPAIIYSAHGGQEAGSALADILFGDENPGGRLTQTWH 691
>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 737
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 164/568 (28%), Positives = 253/568 (44%), Gaps = 93/568 (16%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVG 109
S F N L ++ RV DL+ R+TL+EK++ L + SV RLGI K E HGV+ G
Sbjct: 39 SYPFQNADLDMETRVDDLLSRMTLEEKVSAL-STDPSVPRLGI-KGAPHIEGYHGVAMGG 96
Query: 110 PGT---HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY---NVGLAGLTF 163
P VP T FPQ A++N L + G++ S EAR ++ + GL
Sbjct: 97 PANWAPKGDERVP-TTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEISKGGLVV 155
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
+PN ++ RDPRWGR +E GEDP L +T + KGLQ G + A+ KH+ A
Sbjct: 156 RAPNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQ---GDDEKYWRTASLLKHFLA 212
Query: 224 YDLDNWKGTDRYHFNAMVIYTYYLIKFKY------------------------------- 252
+N + + +F+ + Y YY F+
Sbjct: 213 NSNENTRDSSSSNFDTQLFYEYYGATFRRAILEGGSNAYMTAYNAVNGVPAHIHPMHKEI 272
Query: 253 ----------IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE---A 299
I +D +L + AA I AGL+ FL + E
Sbjct: 273 SMARWGVNGIICTDGGGYTLLVRAHKAYDDYYRAAEGVIKAGLN----QFLDNYREGVWG 328
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG----PKDVCTQANQDLAL 355
A+ G + E +D + + +++LG D K PY +G P + +Q+ AL
Sbjct: 329 ALAHGYLAEEDLDEVLKGVYRVMIKLGQLDPQ-DKVPYASIGRDGKPAPWTSPEHQEAAL 387
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
AR+ +VLLKN +LPL+ + +AVIG A+ ++ Y G P +TPL G+
Sbjct: 388 QMARESVVLLKNEKQTLPLAGDELGKVAVIGHLADT--ILLDWYSGMPPFMSTPLDGIKE 445
Query: 416 VVATIYQAGCSNVQCG-TAQVDDAKKAAASADATVLVMG-------------ADQSIEAE 461
+ G V + A +AA+ AD ++V+G D + E
Sbjct: 446 ------KMGADKVLFAPDNDYNAAVEAASQADVAIVVLGNHPYCDSERWGDCPDPGMGRE 499
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQ 521
+ DR L L + L V + A IL++ S + I++++ + + +I+ + + GQ
Sbjct: 500 AVDRKTLRL--TDEWLAQRVFE-ANPNTILVLQSSFPYGINWSQEN--LPAIVHITHNGQ 554
Query: 522 AGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G A+ADV FG YNP G+L TW P+S
Sbjct: 555 STGTALADVLFGDYNPGGKLTQTW-PKS 581
>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 781
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 158/534 (29%), Positives = 254/534 (47%), Gaps = 108/534 (20%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + E HG ++ GAT FP + A++++ SL Q +G+ ++
Sbjct: 126 TRLGIPVL-FAEECPHG-----------HMAIGATVFPTALSAASTWDESLMQQMGEAIA 173
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---Q 203
EAR N+G + P +++ R+PRW R +ET GEDP+L S +KG+Q Q
Sbjct: 174 LEARLQGANIG------YGPVLDVAREPRWSRMEETFGEDPVLTSVMGVALMKGMQGDVQ 227
Query: 204 TDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFNAMVIYTYYLIKFKY--------IV 254
DG + + KH+ AY + ++ R + +++ YL FK I+
Sbjct: 228 NDGKH-----LYSTLKHFAAYGVPESGHNGSRANSGMRQLFSEYLPPFKKAVEAGAGTIM 282
Query: 255 SDCDSVD-------------VLYNSQHY-------------------TKTPEEAAAKSIL 282
+ +S+D VL N + K +EAAAK++
Sbjct: 283 TSYNSIDGVPCTSNKFLLTEVLRNQWGFKGFVYSDLISIEGIVGMRAAKDNKEAAAKALR 342
Query: 283 AGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
AGLD++ G G++ + A + GL+ +D A+SN ++G F+ +P P +
Sbjct: 343 AGLDMDLGGDAFGRNLKQAYEEGLITMDDLDRAVSNVLRLKFQMGLFE-NPYVSP--EQA 399
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
K + ++ +++LA AR+G+VLLKN G LPL +K +AVIGPNA++ +G+Y
Sbjct: 400 GKHIRSREHKELARRVAREGVVLLKND-GVLPLD-KHLKRIAVIGPNADMMYNQLGDYTA 457
Query: 402 TPCK--YTTPLQGLAAVVA----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
+ T L G+ A V+ +Y GC+ + + A AA ADA +LV+G
Sbjct: 458 PQDRKEIVTVLDGVRAAVSKTTQVVYVKGCAVRDTTESDIPAAVAAAQRADAVILVVGGS 517
Query: 456 QSIE-------------------------AESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
+ + E DR L L G Q+ LI VA K P++
Sbjct: 518 SARDFKTKYISTGAATVSEDIKVLPDMDCGEGFDRSSLRLLGDQEKLINAVAATGK-PLV 576
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+I ++G +++ A + + W YPG+ GGA IAD+ FG YNP+GRLP++
Sbjct: 577 VIYIAGRAMNMNLAADKARALLAAW--YPGEQGGAGIADILFGDYNPAGRLPVS 628
>gi|440694104|ref|ZP_20876743.1| glycosyl hydrolase family 3 C-terminal domain protein [Streptomyces
turgidiscabies Car8]
gi|440283921|gb|ELP71119.1| glycosyl hydrolase family 3 C-terminal domain protein [Streptomyces
turgidiscabies Car8]
Length = 970
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 139/408 (34%), Positives = 195/408 (47%), Gaps = 68/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L R+ DL+ RLTL E++ FL A +V RLG+ + EALHGV+++GP
Sbjct: 11 FRDPRLPTAKRIDDLLSRLTLDERVAFLHQFAPAVERLGVAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L Q +G+VVS E RAM GL W+P +N+
Sbjct: 69 --------ATVFPQAVGLGATWNEELVQRVGEVVSREVRAMRARDDRVGLNVWAPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPKLTSAIATAYTRGLR---GDHPAYWRTAPVLKHWLAHNNETGRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
V++ Y L F+ V
Sbjct: 178 VTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPLLGEHLRTWTDED 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGK---HTEAAVKAGL 305
SD + L +S+HY T EEA A +++AG+D + G+ K + A+ GL
Sbjct: 238 LLVCSDAAAPSNLVDSEHYFDTHEEATAAALVAGVDSFTDHGTDSSKIVARLQGALSQGL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E ID A+ + RLG FD P PY + D T A++ LAL+AA Q IVLL
Sbjct: 298 LTEEDIDRAVRRQLSVRFRLGEFD--PQYDPYASV--PDFDTPAHRALALEAAEQAIVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN AG+LPLSP + +AV+G A+ K Y GT +TPL+ L
Sbjct: 354 KNDAGTLPLSPGS--RVAVVGLLADECKADW--YSGTLIHRSTPLEAL 397
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP QQ L+ + A+ A +L + S +A + ++ + LW + G
Sbjct: 605 ETEDRTTLRLPAHQQRLL-DAARAANPRTVLALTSA----YPYAVDVTELPAALWTAHGG 659
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 660 QAAGTALARVLAGDVSPAGRLPQTWY 685
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 153/534 (28%), Positives = 232/534 (43%), Gaps = 105/534 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
SRLGIP +G ++ GAT FP + +++N L++ + + V+
Sbjct: 119 SRLGIP------------ILIGEECSHGHMAIGATVFPVPLSLGSTWNVELYREMCRAVA 166
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ--QTD 205
E RA G +SP +++ RDPRWGR +E GED L S+ A V+GLQ D
Sbjct: 167 RETRAQ-----GGAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEMAVASVEGLQGESLD 221
Query: 206 GGSPNRLKVAACCKHYTAY-DLDNWKGTDRYHFNAMVIYTYYLIKFK------------- 251
G VAA KH+ Y + + H + L+ F+
Sbjct: 222 GED----SVAATLKHFVGYGSSEGGRNAGPVHMGRRELLEVDLLPFRKAVEAGAASIMPA 277
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
+++DC ++D+L + + +AA ++I A
Sbjct: 278 YNEIDGVPCTTNEELLDGVLRGEWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRA 337
Query: 284 GLDLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
G+D+ G GKH AV++G + E +D A+ RLG F+ P P +
Sbjct: 338 GIDMEMSGVMFGKHLVEAVRSGQLEEEVLDRAVRRVLTLKFRLGLFE-RPYADP--ERAE 394
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+ + + + +LA A +G+VLLKN G LPLS A +AVIGPNA+ +G+Y
Sbjct: 395 RVIGSAEHVELARQLASEGVVLLKNKDGVLPLSADA-GTIAVIGPNADAGYNQLGDYTSP 453
Query: 403 P--CKYTTPLQGLAAVVA-----TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD 455
K TT L G+ + +A +Y GC D A A AD V+V+G
Sbjct: 454 QPRSKVTTVLGGIRSKLAETPERVLYAPGCRINGNSREGFDVALSCAEKADTVVMVVGGS 513
Query: 456 QSIE-------------------------AESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
+ + E DR++L L G Q LI E+ K+ K P++
Sbjct: 514 SARDFGEGTIDLRTGASKVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLV 572
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ ++G I+ D +IL YPGQ GG AIAD+ FG NPSGRL ++
Sbjct: 573 VVYING--RPIAEPWIDEHADAILEAWYPGQEGGHAIADILFGDVNPSGRLTIS 624
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 159/550 (28%), Positives = 244/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFVED----TRLGIP-VDFTNEGIRGVESY------------RATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRELIRQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+GLQ +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGLQHNH-------QVAATGKHFAAYS--NNKGAREGMARVDPQMSPREVENI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C SF+ E VK G ++E I+ + +
Sbjct: 350 YTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLREL-VKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G ++V + N+ +AL A+ + +VLLKN LPL +
Sbjct: 409 LRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASHESVVLLKNADELLPLDINS 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---- 430
K +AV GPNAN + +Y + TT L+G+ + +Y GC V
Sbjct: 467 TKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGCDLVDAHWPE 526
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A++D A + A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDLPGRQLQLLQ 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
+ K PV+LI+++G I++A D + +IL YPG GG A+AD+ FG YNP G
Sbjct: 587 AIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 644 KLTVT-FPKT 652
>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
Length = 946
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 161/550 (29%), Positives = 242/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F V+ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFVEE----TRLGIP-VDFTNEGIRGVESY------------KATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L IG + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRKLIHQIGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+G+Q +VAA KH+ AY N KG
Sbjct: 239 ELGIEMVRGMQHNH-------QVAATGKHFIAYS--NNKGAREGMARVDPQMSPREVEMI 289
Query: 235 --YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C S++ E + GL +E I+ + +
Sbjct: 350 YTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVQEGGL-SEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G +V + N+ +AL A+R+ IVLLKN +LPL T+
Sbjct: 409 LRVKFLVGLFDAPYQTDLKG--ADDEVEKEENEAVALQASRESIVLLKNENNTLPLDITS 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQC---- 430
+K +AV GPNA + +Y + TT + GL + +Y GC V
Sbjct: 467 VKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEVLYTKGCDLVDAHWPE 526
Query: 431 -----------GTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
+++D A A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKSRSSLDLPGRQLDLLK 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PVIL++++G +++A D + +IL YPG GG AIADV FG YNP G
Sbjct: 587 AVQATGK-PVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAIADVLFGDYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P+S
Sbjct: 644 KLTVT-FPKS 652
>gi|317474222|ref|ZP_07933498.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909532|gb|EFV31210.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 954
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 156/552 (28%), Positives = 265/552 (48%), Gaps = 80/552 (14%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEWWSEALHGVSY 107
+L + + L ++ RV L+ +T ++K+ + + G + L +P EA+HG SY
Sbjct: 167 TLRYMDPRLPVNGRVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPN 167
G+ GAT FPQ + A++N +L + + + E A + WSP
Sbjct: 226 ---GS-------GATIFPQALAMGATWNKNLTEKVAMAIGEETLAA-----GTMQAWSPV 270
Query: 168 INIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG---------------GSPNRL 212
+++ +D RWGR +ET GEDP+L S+ ++KG Q+ G G +
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGY-QSKGLFTTPKHFGGHGAPLGGRDSH 329
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHF--NAMVIYTYYL----IKFK--------------- 251
+ + L ++ R + + M+ Y+ +L K K
Sbjct: 330 DIGLSEREMREIHLVPFRHVIRNYACQSLMMAYSDFLGVPVAKSKELLRNILREEWGFDG 389
Query: 252 YIVSDCDSVDVLYNSQHYT-KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNES 309
++VSDC ++ L + +HYT K EAA +++ AG+ NCG ++ K A K G ++ +
Sbjct: 390 FVVSDCGAIGNLTSRKHYTAKNKIEAANQALAAGIATNCGDTYNDKEVIQAAKDGRLDMA 449
Query: 310 AIDHAISNNFATLMRLGFFDG-HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
+D+ + R F+ H + ++ P + +++++A +AR+ IV+L+N
Sbjct: 450 NLDNVCRTMLRMMFRNELFEKEHKEPLDWNKIYP-GWNSDSHKEIARQSARESIVMLENK 508
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EGTPCKYTTPLQGLAAVVAT----IYQ 422
LPLS ++ +AV+GP A+ + G+Y + P + + L G+ V +Y+
Sbjct: 509 DDLLPLS-KHVRTIAVLGPGADNLQP--GDYTPKLRPGQLKSVLTGIKQAVGNQTKILYE 565
Query: 423 AGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQ 473
GC + +A K AA +D VLV+G + EA E+HD L+LPG+
Sbjct: 566 QGCEFTGSNGENIPNAVKVAAQSDVVVLVLGDCSTSEATTDVYKTSGENHDYATLILPGR 625
Query: 474 QQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFG 533
QQ L+ V K PV+L++ G ++++ K + W+ PGQ GG A ADV FG
Sbjct: 626 QQELLEAVCATGK-PVVLLLQIGRPYNLTKESELCKAIIVNWL--PGQEGGLATADVLFG 682
Query: 534 RYNPSGRLPMTW 545
YNP+GRLPMT+
Sbjct: 683 DYNPAGRLPMTF 694
>gi|86141717|ref|ZP_01060241.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
gi|85831280|gb|EAQ49736.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
Length = 758
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 160/531 (30%), Positives = 239/531 (45%), Gaps = 102/531 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP S+ +HG T+FP + ++S++ L + ++ +
Sbjct: 98 TRLGIPLL-IGSDIIHGYK---------------TTFPIPLGLSSSWDMELIEKTAQIAA 141
Query: 148 TEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EA A G+ + +SP ++I RDPRWGR E GEDP L S A V G QQ D
Sbjct: 142 KEATA------DGINWNFSPMVDIARDPRWGRISEGAGEDPYLGSAIAKAMVTGYQQEDL 195
Query: 207 GSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFNAMVIYTYYLIKFK-------------- 251
N + + KH+ Y + + + + + ++ YL +K
Sbjct: 196 TEENTM--ISTVKHFALYGAAEGGRDYNTTDMSRVKMFNEYLPPYKAAIDAGAESVMSSF 253
Query: 252 ---------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
++VSD SV+ + H + +A SI AG
Sbjct: 254 NDVDGVPASGNKWLLTHLLREQWGFEGFVVSDYTSVNEMI--AHGLGDLQAVSALSINAG 311
Query: 285 LDLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
LD++ G + +V G V+E+ I +A +LG FD P K + +
Sbjct: 312 LDMDMVGEGFLTTLKKSVDEGKVSEATITNACRRILEAKYKLGLFD-DPYKYSDSKRPER 370
Query: 344 DVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY--EG 401
D+ T AN+++A DAAR+ VLLKN +LPL TA +A+IGP AN M+G + G
Sbjct: 371 DILTAANKEIARDAARKSFVLLKNENKTLPLDKTA--KIALIGPLANNKNNMLGTWAPTG 428
Query: 402 TPCKYTTPLQGLAAVV--ATIYQAGCSNVQCGTAQ---------------------VDDA 438
P T +GL V A I +N+ TA +++A
Sbjct: 429 DPQLSTPIFEGLKNVAPNAEISYTKGANISNDTAYAKKINVFGPRIEISEATPETLLEEA 488
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
+ A +AD V V+G + ES R D+ +P Q+ LI E+ K+ K PV+L++MSG
Sbjct: 489 LQNAETADVVVAVVGEATEMSGESSSRTDITIPESQKTLIQELVKIGK-PVVLVLMSGRP 547
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
DI+ P SIL V +PG G A+ADV FG YNPSG+L +W PQ+
Sbjct: 548 LDITEELALP--VSILQVWHPGIQAGNAVADVLFGDYNPSGKLTASW-PQN 595
>gi|340347926|ref|ZP_08671025.1| beta-glucosidase [Prevotella dentalis DSM 3688]
gi|433652779|ref|YP_007296633.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
3688]
gi|339608382|gb|EGQ13286.1| beta-glucosidase [Prevotella dentalis DSM 3688]
gi|433303312|gb|AGB29127.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
3688]
Length = 849
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/416 (34%), Positives = 205/416 (49%), Gaps = 75/416 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
L ++LTL EKI+ L+D + ++ +L IP ++WW+EALHGV+ G AT FPQ
Sbjct: 32 LCRQLTLSEKISLLLDDSPAIPQLNIPAFQWWNEALHGVARNGV----------ATVFPQ 81
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGR 178
I AA+F+ L Q + VS EARA A GL+FW+PNINIFRDPRWGR
Sbjct: 82 TIGMAAAFDDELLQQVFTAVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFRDPRWGR 141
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY----------DLDN 228
GQET GEDP L + V+GLQ D S K AC KH+ + +++N
Sbjct: 142 GQETYGEDPYLTGRMGLAVVRGLQGPDTAS--YYKTLACAKHFAVHSGPESLRHEMNIEN 199
Query: 229 WKGTDRYH-----FNAMVI------------------------YTYYLIKFKY-----IV 254
D + F ++V ++++ ++ +V
Sbjct: 200 LSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQHILRDEWNFKGLVV 259
Query: 255 SDCDSV-DVLYNSQH-YTKTPEEAAAKSILAGLDLNCGSFLGKHT-EAAVKAGLVNESAI 311
SDC ++ D +H K EA+A++ AG D+ CGS H+ E AVKAG++ ES I
Sbjct: 260 SDCGAISDFWIPGRHGVAKDAVEASAQAQRAGTDVECGS--NYHSLEEAVKAGVLKESDI 317
Query: 312 DHAISNNFATLMRLGFFDGHPSKQ-PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
D ++ + LG D PS P+ + V A++ LAL AR+ +VLL+N
Sbjct: 318 DRSVIRVLEARLALG--DISPSADVPWKTIPYAVVDCPAHRQLALRMARESMVLLQNKHH 375
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGC 425
+LPLS + V+G NA + + NY GTP T L G+ + + + GC
Sbjct: 376 TLPLSTH--DKILVVGENAVDSMMLWANYNGTPSHTMTILDGIRQLAEHVEFMPGC 429
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 13/117 (11%)
Query: 440 KAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLITEVAKVAKGPV 489
+ A ADA V V G +E E DR + LP Q+ I ++ VA+ P+
Sbjct: 588 EKARQADAVVFVGGLSPRLEGEEMKVDLPGFSGGDRTSIELPVAQREAIQALS-VARKPI 646
Query: 490 ILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+++ SG I+ +IL YPG+ GG A+ADV FG+ NPSG+LP+T+Y
Sbjct: 647 VMVNCSGSA--IALEPETKNCDAILQAWYPGEEGGRAVADVLFGKVNPSGKLPVTFY 701
>gi|379721503|ref|YP_005313634.1| beta-glucosidase [Paenibacillus mucilaginosus 3016]
gi|378570175|gb|AFC30485.1| beta-glucosidase [Paenibacillus mucilaginosus 3016]
Length = 951
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 127/415 (30%), Positives = 196/415 (47%), Gaps = 62/415 (14%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
++ +LTL EKI + ++ RLGI Y+ +EA HG++++G ATS+PQ
Sbjct: 1 MIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE----------ATSYPQ 50
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
I A ++++ L + IG + EAR Y N GLT W+P +++ RDPRWGR +E G
Sbjct: 51 PIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRWGRTEEAYG 110
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYT 244
EDP+LA K A +G+Q G P LK A KH+ + + +G + +
Sbjct: 111 EDPVLAGKLAAALTQGIQ---GDHPFYLKAVATLKHFIGNNNEAGRGDTSVSLDPRNLRE 167
Query: 245 YYL----IKFK-------------------------------------YIVSDCDSVDVL 263
YYL I FK ++VSD V
Sbjct: 168 YYLKVFEITFKEGGAQSMMTAYNAVNGVPANLSELVIDIVKGEWGMNGFVVSDAFDVTGT 227
Query: 264 YNSQHYTKTPEEAAAKSIL-AGLD--LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFA 320
HY +T +EA A+SI G+D + + + + A+K GL+ E+ +D A+ N F
Sbjct: 228 MRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDGLLTENDLDTALRNMFR 287
Query: 321 TLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIK 380
RLG FD P + PY + + + LA +A+++ +VLLKN +LPL +
Sbjct: 288 VRFRLGEFD-PPERNPYAAIDESAILHPEHAKLAREASQKAVVLLKNDGKTLPLQADKLS 346
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQV 435
+AVIGP A+V Y G+ TPLQG+ +A + ++ GT ++
Sbjct: 347 KVAVIGPLADVVYQ--DWYSGSLPYAVTPLQGIRERLAAEGKDAVTSYAAGTDRM 399
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 74/136 (54%), Gaps = 10/136 (7%)
Query: 414 AAVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPG 472
++VV T QA ++C ++ +D A +A ADA ++ +G I E+ DR D+ L
Sbjct: 532 SSVVGT--QADTFEIECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAE 589
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKN--DPKITSILWVGYPGQAGGAAIADV 530
Q+ L +A + K P +I++ G FA N K +I++ + GQ G A+ADV
Sbjct: 590 SQEKLA--LAVIEKNPNTIIVVVGS---YPFALNALQEKAKAIIYTSHAGQELGRAVADV 644
Query: 531 CFGRYNPSGRLPMTWY 546
FG NP+GR+ MTWY
Sbjct: 645 LFGNVNPAGRVNMTWY 660
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 235/530 (44%), Gaps = 91/530 (17%)
Query: 74 QEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAAS 133
++ ++ + +A +R GIP E +HG+ V G T FPQ I A+
Sbjct: 791 RQAVSTIQQNAADSTRFGIPVIPV-IEGIHGI-----------VQDGCTIFPQAIAQGAT 838
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
FN L + + + TE RA + +P+++I R+ RWGR +ET GEDP L S+
Sbjct: 839 FNPQLVFRMAQHIGTEMRA-----IGARQVLAPDLDIAREQRWGRVEETFGEDPYLISRM 893
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAY-------DLDNWKGTDRYHFNAMVIYTYY 246
YVKG+Q + GG P KH+ A+ +L + KG R F+ V Y
Sbjct: 894 GYNYVKGIQ-SRGGIP-------TLKHFVAHGTPQGGLNLASVKGGQRELFDVYVKPFEY 945
Query: 247 LIK-----------------------------------FK-YIVSDCDSVDVLYNSQHYT 270
+I+ FK YI SD S+ +L H
Sbjct: 946 VIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLRDSLHFKGYIYSDWGSIPMLRYFHHTA 1005
Query: 271 KTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDG 330
+ EAA ++I AG+DL GS + + GL++++ ID A ++ T G FD
Sbjct: 1006 DSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQGLLDKARIDSAAAHVLYTKFEAGLFDE 1065
Query: 331 HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
S + + + + T +A A + +VLL+N LPL + ++AV+GPNA
Sbjct: 1066 LASDTLHWR---QQIHTPEAVAVAKQLADESLVLLENRNHFLPLDLNRLHSIAVVGPNA- 1121
Query: 391 VTKTMIGNYEGTPCKY--TTPLQGLAAVVA----TIYQAGCSNVQCGTAQVDDAKKAAAS 444
+ G+Y T TPL G+ V Y GC T +D+A A
Sbjct: 1122 -AQVQFGDYSWTADNRHGITPLAGIQQVAGMRTKVRYVKGCDYYSQNTDSIDEAVALAKQ 1180
Query: 445 ADATVLVMGADQ---------SIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+D TV+V+G S E +D DL+LPG QQ LI +A K P I+++++
Sbjct: 1181 SDVTVVVVGTQSMLLARPSQPSTSGEGYDLSDLILPGVQQQLIERIAATGK-PFIVVMVT 1239
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
G KN + W G G+ G ++A FG+ NPSGRLP+++
Sbjct: 1240 GRPLLTEAFKNKADALLVQWYG--GEQAGLSLAQALFGQLNPSGRLPISF 1287
>gi|374375578|ref|ZP_09633236.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373232418|gb|EHP52213.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 768
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 176/595 (29%), Positives = 266/595 (44%), Gaps = 135/595 (22%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAGSV-------------------------------- 87
+++RV DL+ R+TLQEK+ + D + S
Sbjct: 35 VEVRVKDLLSRMTLQEKLAQMQDLSASEIMVSDAIDPVKMDQKLKGKSFGVFEGMGLTVA 94
Query: 88 ----------------SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTA 131
+RLGIP SEALHG V GAT +PQ I
Sbjct: 95 QYAKVINTLQEYMVKKTRLGIPVLTS-SEALHGC-----------VHGGATVYPQAIALG 142
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++FN +L + + ++ E +A G+T SP++++ R+ RWGR +ET GEDP L
Sbjct: 143 STFNPALVNQMTQTITKELKAQ------GITQVLSPDLDLARELRWGRVEETYGEDPYLT 196
Query: 191 SKYATGYVKGLQQ--------------TDGGSPNRLKVAACCKHYTAYDLDNWKGT--DR 234
S+ + KG + T G N VA + + L ++ +
Sbjct: 197 SRMGVAFTKGFTENNIICTPKHFAAHGTPRGGLNLASVAGGERELRSIYLKPFEAVIKEA 256
Query: 235 YHFNAMVIYTYY------------------LIKFK-YIVSDCDSVDVLYNSQHYTKTPEE 275
+ M Y+ Y + F+ Y+ SD +V++LY+ Q K P E
Sbjct: 257 QPLSIMNAYSSYDGVPMAASHQVLTDILRKELGFRGYVYSDWGAVEMLYSFQRTAKGPAE 316
Query: 276 AAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
AA +++ AGLDL S + ++ VKAG + ID A+S + +G F+
Sbjct: 317 AALQAVKAGLDLEVWSDCFEKLDSLVKAGTLPVRYIDTAVSRILSAKFAIGLFE-----H 371
Query: 336 PYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTK 393
PY L D+ T + LALD AR+ IVLLKN LPL +I+++AVIGPNA+
Sbjct: 372 PYPNLKGLTSDIHTPQSVQLALDIARESIVLLKNEDHLLPLK-GSIRSIAVIGPNAD--H 428
Query: 394 TMIGNYEGTPCKY--TTPLQGLAAVVATIYQ----AGCSNVQCGTAQVDDAKKAAASADA 447
G+Y T TPLQG+ A+ Q GC A AA+ +D
Sbjct: 429 VQFGDYSWTNDNRHGITPLQGIQALAGNKIQINHAKGCDTWSQNREGFSAAVAAASKSDV 488
Query: 448 TVLVMGAD---------QSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
V+ +G+ + E +D DL LPG Q+ L+ + + K PV+++++SG
Sbjct: 489 AVVFVGSSSASPGYPHPDATSGEGYDLSDLKLPGVQEDLVKAIKQTGK-PVVVVLVSGKP 547
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR----YNPSGRLPMTWYPQS 549
F I + K + I +I+ YPG+ GG AIA+V FG NPSG+L ++ +PQS
Sbjct: 548 FAIPWIKEN--IPAIVGQWYPGEQGGTAIAEVLFGPANGGINPSGKLNVS-FPQS 599
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 153/527 (29%), Positives = 243/527 (46%), Gaps = 94/527 (17%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP A++N L IG+V +
Sbjct: 137 TRLGIP-VDFTNEGIRGLCH-----------DRATYFPAQCGQGATWNKKLIARIGEVEA 184
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD-- 205
EA V L +SP ++I +DPRWGR ET GEDP L + + LQ+ +
Sbjct: 185 KEA-----VALGYTNIYSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYNLV 239
Query: 206 -------------GG-----------SPNRLKVAACCKHYTAY-------------DLDN 228
GG +P ++ A+ D D
Sbjct: 240 ATPKHFAVYSIPIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDG 299
Query: 229 WKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
T YHF ++ + K Y+VSD ++V+ + N T E+ A+++ AGL++
Sbjct: 300 EPITGSYHFLTEILRQEWGFK-GYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIR 358
Query: 289 C-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY---GQL 340
F+ AV G +++ +D ++ RLG FD PY G+
Sbjct: 359 THFTPPADFILP-LRKAVDNGKISQETLDKRVAEILRIKFRLGLFD-----NPYRGNGKQ 412
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
+ V ++ +Q ++L+AARQ +VLLKN LPLS +I+++AVIGPNAN +I Y
Sbjct: 413 AEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS-KSIRSIAVIGPNANEQTQLICRYG 471
Query: 401 GTPCKYTTPLQGLAAVV---ATIYQAGCSNVQC-----------GTAQ----VDDAKKAA 442
T QG+ ++ IY+ GC + TA+ +++A +AA
Sbjct: 472 PANAPIKTVYQGIKELLPHTEVIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAA 531
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
A+ V+V+G ++ E R L LPG+Q+ L+ V K P+IL+++ G I+
Sbjct: 532 KQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEELLKAVCATGK-PIILVMLDGRASSIN 590
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+A I +IL +PG+ G A+A+ FG YNP GRL +T +P+S
Sbjct: 591 YAA--AHIPAILHAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKS 634
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 147/508 (28%), Positives = 231/508 (45%), Gaps = 103/508 (20%)
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPR 175
+V AT+FPQ I A++++ L + + + +A+ L G SP +++ RDPR
Sbjct: 97 MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK------LIGTNQCLSPVLDVCRDPR 150
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDR 234
WGR +ET GED L + YVKGLQ + ++ A KH+ A+ + +
Sbjct: 151 WGRCEETYGEDQYLVASIGLAYVKGLQGEN-------ELIATVKHFAAHGFPEGGRNIAP 203
Query: 235 YHFNAMVIYTYYLIKFK------------------------------------------Y 252
H + +L F+
Sbjct: 204 VHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGI 263
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-----LNCGSFLGKHTEAAVKAGLVN 307
+VSD D++ L + +EAA ++ AG+D ++C G+ AVK GL++
Sbjct: 264 VVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLIS 320
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ES ID A+ +LG F+ H + P+ + +++LALD AR+ IVLLKN
Sbjct: 321 ESIIDRAVERVLRIKEKLGLFNNHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKN 377
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA-- 418
LPL+ I +AVIGPNAN + ++G+Y T + T L+G+ V+
Sbjct: 378 D-NILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGIMRKVSNN 435
Query: 419 --TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QS 457
+Y GC +A + A D + VMG Q+
Sbjct: 436 TNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQA 495
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E +DR L LPG Q+ L+ E+ K K P+IL++++G +S N+ + +I+
Sbjct: 496 VTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAW 552
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTW 545
+PG+ GG AIADV FG YNPSGRLP+++
Sbjct: 553 FPGEEGGNAIADVIFGDYNPSGRLPISF 580
>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
Length = 946
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 156/550 (28%), Positives = 247/550 (44%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 141 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESY------------KATNFPTQLGLG 183
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 184 HTWNRRLIHQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 238
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG-------------------T 232
+ V+G+Q ++AA KH+ AY N KG T
Sbjct: 239 ELGIEMVRGMQHNH-------QIAATGKHFIAYS--NNKGAREGMARVDPQMSPREVEMT 289
Query: 233 DRYHFNAMVIY--------------------TYYLIKFK---------YIVSDCDSVDVL 263
Y F ++ +YY + + Y+VSD D+V+ L
Sbjct: 290 HVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDSDAVEYL 349
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 350 YTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDRVRDI 408
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD HP Q + ++V AN+++AL A+R+ IVLLKN LPL +
Sbjct: 409 LRVKFLVGLFD-HPY-QIDLKGADEEVEKAANEEIALQASRESIVLLKNDKNILPLDASG 466
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCG--- 431
I+ +AV GPNA+ + +Y + T+ L+G+ + +Y GC V
Sbjct: 467 IQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEVLYTKGCDLVDANWPE 526
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
+++ A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 527 SELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENKSRSSLDLPGRQLDLLK 586
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
VA K PV+L++++G I++A D + +I+ YPG GG A+ADV FG YNP G
Sbjct: 587 AVAATGK-PVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSKGGKAVADVLFGEYNPGG 643
Query: 540 RLPMTWYPQS 549
+L +T +P++
Sbjct: 644 KLTVT-FPKT 652
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 159/550 (28%), Positives = 241/550 (43%), Gaps = 111/550 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 138 LNEVQRFFIEE----TRLGIP-VDFTNEGIRGVESY------------RATNFPTQLGLG 180
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 181 HTWNRKLIHQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 235
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ V+G+Q +VAA KH+ AY N KG
Sbjct: 236 ELGIEMVRGMQHNH-------QVAATGKHFVAYS--NNKGAREGMARVDPQMSPREVEMI 286
Query: 235 --YHFNAMVIYT---------------------YYLIK-------FK-YIVSDCDSVDVL 263
Y F ++ Y+L K F+ Y+VSD D+V+ L
Sbjct: 287 HVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGEMGFRGYVVSDSDAVEYL 346
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNN 318
Y K +EA +S+ AGL++ C +F + VK G ++E I+ + +
Sbjct: 347 YTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEDIINDRVRDI 405
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTA 378
+G FD G K+V N+ +AL A+R+ ++LLKN LPL
Sbjct: 406 LRVKFLIGLFDAPYQTDLAG--ADKEVEKAENEAVALQASRESLILLKNENNVLPLDINN 463
Query: 379 IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQCG--- 431
IK +AV GPNAN + +Y + T L+G+ +Y GC V
Sbjct: 464 IKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCDLVDANWPE 523
Query: 432 ------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A+++ A + A AD V+V+G Q E+ R L LPG+Q L+
Sbjct: 524 SELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLPGRQLKLLQ 583
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
V K PV+L++++G I++A D + +IL YPG GG A+ADV FG YNP G
Sbjct: 584 AVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVLFGDYNPGG 640
Query: 540 RLPMTWYPQS 549
+L +T +P+S
Sbjct: 641 KLTVT-FPKS 649
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 147/508 (28%), Positives = 231/508 (45%), Gaps = 103/508 (20%)
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPR 175
+V AT+FPQ I A++++ L + + + +A+ L G SP +++ RDPR
Sbjct: 97 MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK------LIGTNQCLSPVLDVCRDPR 150
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDR 234
WGR +ET GED L + YVKGLQ + ++ A KH+ A+ + +
Sbjct: 151 WGRCEETYGEDQYLVASIGLAYVKGLQGEN-------ELIATVKHFAAHGFPEGGRNIAP 203
Query: 235 YHFNAMVIYTYYLIKFK------------------------------------------Y 252
H + +L F+
Sbjct: 204 VHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEWGFEGI 263
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-----LNCGSFLGKHTEAAVKAGLVN 307
+VSD D++ L + +EAA ++ AG+D ++C G+ AVK GL++
Sbjct: 264 VVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDC---FGEPLLEAVKEGLIS 320
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ES ID A+ +LG F+ H + P+ + +++LALD AR+ IVLLKN
Sbjct: 321 ESIIDRAVERVLRIKEKLGLFNDHYINE---NNVPEKLDNSKSRELALDVARKSIVLLKN 377
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA-- 418
LPL+ I +AVIGPNAN + ++G+Y T + T L+G+ V+
Sbjct: 378 D-NILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGIMRKVSNN 435
Query: 419 --TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QS 457
+Y GC +A + A D + VMG Q+
Sbjct: 436 TNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEKYQA 495
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E +DR L LPG Q+ L+ E+ K K P+IL++++G +S N+ + +I+
Sbjct: 496 VTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNE--VNAIIDAW 552
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTW 545
+PG+ GG AIADV FG YNPSGRLP+++
Sbjct: 553 FPGEEGGNAIADVIFGDYNPSGRLPISF 580
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 153/527 (29%), Positives = 243/527 (46%), Gaps = 94/527 (17%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP A++N L IG+V +
Sbjct: 137 TRLGIP-VDFTNEGIRGLCH-----------DRATYFPAQCGQGATWNKKLIARIGEVEA 184
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD-- 205
EA V L +SP ++I +DPRWGR ET GEDP L + + LQ+ +
Sbjct: 185 KEA-----VALGYTNIYSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYNLV 239
Query: 206 -------------GG-----------SPNRLKVAACCKHYTAY-------------DLDN 228
GG +P ++ A+ D D
Sbjct: 240 ATPKHFAVYSIPIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDG 299
Query: 229 WKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
T YHF ++ + K Y+VSD ++V+ + N T E+ A+++ AGL++
Sbjct: 300 EPITGSYHFLTEILRQEWGFK-GYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIR 358
Query: 289 C-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY---GQL 340
F+ AV G +++ +D ++ RLG FD PY G+
Sbjct: 359 THFTPPADFILP-LRKAVDNGKISQETLDKRVAEILRIKFRLGLFD-----NPYRGNGKQ 412
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
+ V ++ +Q ++L+AARQ +VLLKN LPLS +I+++AVIGPNAN +I Y
Sbjct: 413 AEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS-KSIRSIAVIGPNANEQTQLICRYG 471
Query: 401 GTPCKYTTPLQGLAAVV---ATIYQAGCSNVQC-----------GTAQ----VDDAKKAA 442
T QG+ ++ IY+ GC + TA+ +++A +AA
Sbjct: 472 PANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAA 531
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
A+ V+V+G ++ E R L LPG+Q+ L+ V K PVIL+++ G I+
Sbjct: 532 KQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSIN 590
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+A + +IL +PG+ G A+A+ FG YNP GRL +T +P+S
Sbjct: 591 YAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKS 634
>gi|373461705|ref|ZP_09553443.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
gi|371951597|gb|EHO69442.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
Length = 662
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 141/416 (33%), Positives = 204/416 (49%), Gaps = 75/416 (18%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
L ++LTL EK++ L+D + ++ RL IP ++WW+EALHGV+ G AT FPQ
Sbjct: 23 LCRQLTLSEKVSLLLDDSPAIPRLNIPAFQWWNEALHGVARNGV----------ATVFPQ 72
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGLA--------GLTFWSPNINIFRDPRWGR 178
I AA+F+ L Q + VVS EARA A GL+FW+PNINIFRDPRWGR
Sbjct: 73 TIGMAAAFDDELLQQVFTVVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFRDPRWGR 132
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY----------DLDN 228
GQET GEDP L + V+GLQ D + K AC KH+ + D++N
Sbjct: 133 GQETYGEDPYLTGRMGLAVVRGLQGPD--TACYYKTLACAKHFAVHSGPESLRHEMDIEN 190
Query: 229 WKGTDRYH-----FNAMVI------------------------YTYYLIKFKY-----IV 254
D + F ++V ++++ ++ +V
Sbjct: 191 LSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQHILRDEWNFKGLVV 250
Query: 255 SDCDSV-DVLYNSQH-YTKTPEEAAAKSILAGLDLNCGSFLGKHT-EAAVKAGLVNESAI 311
SDC ++ D +H K EA+A++ AG D+ CGS H+ E AVKAG++ ES I
Sbjct: 251 SDCGAISDFWIPGRHGVAKDAVEASAQAQSAGTDVECGS--NYHSLEEAVKAGVLKESDI 308
Query: 312 DHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAG 370
D ++ + LG D PS P+ + V A++ LAL A + +VLL+N
Sbjct: 309 DRSVIRVLEARLALG--DISPSAVVPWKTIPYAVVDCPAHRQLALRMAHESMVLLQNRHH 366
Query: 371 SLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-YQAGC 425
+LPLS + V+G NA + + NY G P T L G+ + + + GC
Sbjct: 367 TLPLSTH--DKILVVGENAVDSMMLWANYNGIPSHTVTVLDGIRQLAEHVEFMPGC 420
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 151/545 (27%), Positives = 233/545 (42%), Gaps = 101/545 (18%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
E + + A SRLGIP +G ++ G T FP + +++
Sbjct: 89 EAVNHIQRYAVEQSRLGIP------------ILIGEECSHGHMAIGGTVFPVPLSIGSTW 136
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
N L++ + + V+ E R+ G +SP +++ RDPRWGR +E GEDP L S+YA
Sbjct: 137 NVDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAY-DLDNWKGTDRYHFNAMVIYTYYLIKFK-- 251
V+GLQ SP+ VAA KH+ Y + + H + ++ FK
Sbjct: 192 VASVEGLQGESLDSPS--SVAATLKHFVGYGSSEGGRNAGPVHMGTRELMEVDMLPFKKA 249
Query: 252 ---------------------------------------YIVSDCDSVDVLYNSQHYTKT 272
+++DC ++D+L + +
Sbjct: 250 VEAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLASGHDTAED 309
Query: 273 PEEAAAKSILAGLDLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
+AA ++I AG+D+ G GKH + AV++ + S +D A+ +LG F+ +
Sbjct: 310 GMDAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKFKLGLFE-N 368
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
P P Q + ++ + LA A +GIVLLKN A +LPLS +AVIGPNA+
Sbjct: 369 PYVDP--QTAENVIGSEQHVGLARQLAAEGIVLLKNEAKALPLSKEG-GVIAVIGPNADQ 425
Query: 392 TKTMIGNYEGT--PCKYTTPLQGLAAVVA-----TIYQAGCSNVQCGTAQVDDAKKAAAS 444
+G+Y P TT L G+ A + +Y GC + A A
Sbjct: 426 GYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFEFALTCAEQ 485
Query: 445 ADATVLVMGADQSIE-------------------------AESHDRLDLLLPGQQQLLIT 479
AD V+V+G + + E DR+ L L G Q L+
Sbjct: 486 ADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLSGVQLELVQ 545
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
E+ K+ K +++ I G I+ D +IL YPGQ GG A+AD+ FG NPSG
Sbjct: 546 EIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAVADILFGDVNPSG 602
Query: 540 RLPMT 544
+L M+
Sbjct: 603 KLTMS 607
>gi|384566534|ref|ZP_10013638.1| beta-glucosidase-like glycosyl hydrolase [Saccharomonospora glauca
K62]
gi|384522388|gb|EIE99583.1| beta-glucosidase-like glycosyl hydrolase [Saccharomonospora glauca
K62]
Length = 1010
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 158/495 (31%), Positives = 231/495 (46%), Gaps = 74/495 (14%)
Query: 29 SSSAQSSPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVS 88
+S+A +S V A S P+ F + SL + RV DL+ RLTL EKI L ++
Sbjct: 34 ASTAHASAVTAPAPASRPAR----FRDPSLPLAARVEDLLSRLTLDEKIALLHQYQPAIP 89
Query: 89 RLGIPKYEWWSEALHGVSYVGPGTHFSN----VVPGATSFPQVILTAASFNASLFQAIGK 144
RLGI +++ +EALHGV++ T F N V AT FPQ + A+++N +L + +G+
Sbjct: 90 RLGIGRFKTGTEALHGVAW---STDFDNGGAVVTATATVFPQTVGLASTWNPALVRRVGE 146
Query: 145 VVSTEARAMYNV--GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ 202
VV EARA ++ L GL W+P +N+ RDPRWGR +E EDP L ++ A Y G+Q
Sbjct: 147 VVGVEARAFHSRRPELWGLQLWAPVVNLLRDPRWGRNEEGYSEDPYLTARIAIAYGHGIQ 206
Query: 203 QTDGGSPNRLKVAACCKHYTAYDLDNWKGT---------DRYHFNAM------------V 241
G P+RL+ A KHY AY+ + + T R ++ A V
Sbjct: 207 ---GDDPDRLRAAPVLKHYLAYNNETHRDTTSSMVPPRVKREYYEAAFRAALSADAATGV 263
Query: 242 IYTYYLIKFKYIVSDCDSVDV--------------------LYNSQHYTKTPEEAAAKSI 281
+ +Y L+ + + D DV L SQ Y T EA A +
Sbjct: 264 MSSYNLVNGRPATAHPDYNDVVRTWTERPLCNVTDAGGPNNLTGSQGYYATQAEADAAVL 323
Query: 282 LAGL------DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
GL D + G + TE A+ GL++ES +D A+ N + RLG FD P
Sbjct: 324 KCGLDSFTVDDTDAGPTVAAVTE-ALDRGLLSESDVDTAVRNILSLRFRLGEFD--PDGG 380
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
P+ +G + + + ++ + +AA + +VLLKNT +LPL I+ AV+GP AN T
Sbjct: 381 PHAGIGEEVIDSPEHRAINREAATEAVVLLKNTEATLPLDAERIRKAAVVGPLANTLYTD 440
Query: 396 IGNYEGTPCKYTTPLQGLA---AVVATIYQAGCSNVQCGTAQVDDAKKAAASADATVLVM 452
Y G T L GL A ++ G + TA D + A + L
Sbjct: 441 W--YSGALPYRVTVLDGLRDRLAAAEVSHEEGIDRIALRTA---DGRYVTAPTEPGALRA 495
Query: 453 GADQSIEAESHDRLD 467
D A D +D
Sbjct: 496 TTDPVERAAHFDVVD 510
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 5/87 (5%)
Query: 461 ESHDRLDL-LLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E DR DL L P QQ+LL E A +L++ + I++A++ + +I+W +
Sbjct: 637 EDDDRTDLGLAPRQQELL--EAVHGANPTTVLVLNNSYPTAITWAQD--HVPAIVWTTHA 692
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
G G A+ADV G NPSGRLP TWY
Sbjct: 693 GAETGNAVADVLVGAANPSGRLPQTWY 719
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 148/442 (33%), Positives = 200/442 (45%), Gaps = 84/442 (19%)
Query: 47 SLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVS 106
SL++ + + SL + R DLV RLTL+EK + + + V LGI Y WWSEALHGV+
Sbjct: 40 SLSAQPYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVA 99
Query: 107 YVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--------L 158
G AT FPQ I AASF+ L + VS EAR Y +
Sbjct: 100 RNG----------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQY 149
Query: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACC 218
G+TFW+PNINIFRDPRWGRG ET GEDP L + V+GLQ SP LK AC
Sbjct: 150 QGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGP-SDSPV-LKAHACA 207
Query: 219 KHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------------- 251
KHY + W +R+ ++A V + YL FK
Sbjct: 208 KHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEPC 264
Query: 252 -------------------YIVSDCDSVDVLYNSQHYTKTPE--EAAAKSILAGLDLNCG 290
I SDC +V+ Y + +P+ AAA ++ AG+D CG
Sbjct: 265 GASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTECG 324
Query: 291 SFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHP--SKQPYGQL-GPKDVCT 347
+H AV+ GL++E +D + F +LG D P L GP+ +
Sbjct: 325 QAY-RHIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMDDISLWDDLPASILEGPEHLA- 382
Query: 348 QANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 407
L+ A++ +VLL+N G LPL+P +A++GPN + + GNY P +
Sbjct: 383 -----LSRKMAQESMVLLQNKGGILPLAPDV--RVALVGPNGDDREMQWGNYNPVPGRTV 435
Query: 408 TPLQGLAAVVATI-YQAGCSNV 428
T L I Y GC V
Sbjct: 436 TLYDALKERFPGIKYVRGCGIV 457
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR D+ LP Q+ L+ + K VIL+ SG I +IL YPG+ G
Sbjct: 555 DRTDIQLPQVQRDLMKALHDAGKK-VILVNFSGCA--IGLVPETESCDAILQAWYPGEEG 611
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G AI DV FG NPSG+LP+T+Y
Sbjct: 612 GLAITDVLFGDVNPSGKLPVTFY 634
>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 736
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 158/527 (29%), Positives = 241/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 82 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSPELVKEVGQVIA 129
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 130 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 182
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 183 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDAGALSVMTSYN 242
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T E AA +S++AG+
Sbjct: 243 SIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAIQSVMAGV 302
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 303 DVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 359
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS T I +AVIGPNA+ M+G+Y
Sbjct: 360 RRKEHIELARKIAQSSITLLKNENSILPLSKT-INKVAVIGPNADNRYNMLGDYTAPQED 418
Query: 404 CKYTTPLQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A KAA ++
Sbjct: 419 SNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIKAARRSEVVIVVVGGSSARDF 478
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 479 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 537
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 538 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 582
>gi|167764323|ref|ZP_02436448.1| hypothetical protein BACSTE_02707 [Bacteroides stercoris ATCC
43183]
gi|167697728|gb|EDS14307.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 862
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 149/438 (34%), Positives = 200/438 (45%), Gaps = 79/438 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
+ TSL D R L+K LTL+EK+ ++D++ V RLGI Y WW+EALHGV+ G
Sbjct: 22 YRTTSLTPDKRAELLLKELTLEEKVALMMDTSQPVERLGIKPYNWWNEALHGVARAGL-- 79
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--------GLAGLTFW 164
AT FPQ I AASF+ + VS EARA GLT W
Sbjct: 80 --------ATVFPQPIGMAASFSPQTVYEVFNAVSDEARAKNTYYASQGSYERYQGLTMW 131
Query: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY 224
+P +NI+RDPRWGRG ET GEDP L S+ VKGLQ T+ G ++L AC KH+ +
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGRYDKLH--ACAKHFAVH 189
Query: 225 DLDNWKGTDRYHFNA-----MVIYTYYLIKFKYIV------------------------- 254
W +R+ FN +Y YL F+ +V
Sbjct: 190 SGPEW---NRHSFNVENLSTRDLYETYLPPFEALVKEAGVKEVMCAYNSFEGEPCCGSNR 246
Query: 255 -----------------SDCDSVDVLYNS--QHYTKTPEEAAAKSILAGLDLNCGSFLGK 295
SDC ++ YN + A+A ++L G DL CGS
Sbjct: 247 LLMQILRNDWGFDGIVLSDCGAIADFYNEYGHKAYSDAKSASAAAVLNGTDLECGSSYKA 306
Query: 296 HTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+AA + G ++E ID A+ LG D P + ++ V + A+ L L
Sbjct: 307 LVKAA-QEGKIDEKDIDKAVLRLLEARFALGEMDA-PEDVSWTKIPFSVVASAAHDSLVL 364
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG--- 412
D AR+ + LL+N LPL + +AV+GPNAN + GNY G P T L G
Sbjct: 365 DMARKSMTLLQNKDNILPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPSHTVTILDGIRN 423
Query: 413 -LAAVVATIYQAGCSNVQ 429
L A IY+ GCS V+
Sbjct: 424 ALGADDKLIYEQGCSWVE 441
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 62/123 (50%), Gaps = 6/123 (4%)
Query: 427 NVQCGTAQVDDAKKAAASADATVLVMGADQSIEA---ESHDRLDLLLPGQQQLLITEVAK 483
N+ A V DA + + L+ G + + DR D+ LP Q+ LI + +
Sbjct: 588 NIDASVACVKDADVVVFAGGISPLLEGEEMGVNLPGFRGGDRTDIELPAVQRELIHALHR 647
Query: 484 VAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPM 543
K +I+++ G I+ +IL YPGQAGG A+ADV FG YNP GRLP+
Sbjct: 648 AGKK---IILVNCSGSPIALEPETKNCEAILQAWYPGQAGGTAVADVLFGDYNPGGRLPV 704
Query: 544 TWY 546
T+Y
Sbjct: 705 TFY 707
>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
Length = 889
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 187/399 (46%), Gaps = 81/399 (20%)
Query: 66 DLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFP 125
DLV ++TL EK+ L+++A ++ RL IP Y WW+E+LHG P T+FP
Sbjct: 36 DLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGALGSLP----------TTNFP 85
Query: 126 QVILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRW 176
+ I AA+F+ASL + + +STE R ++ + GL WSPNINIFRDPRW
Sbjct: 86 EPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWSPNINIFRDPRW 145
Query: 177 GRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYH 236
GRGQET GEDP L ++ +V+G+Q D P+ V A KH+ + N + R+H
Sbjct: 146 GRGQETYGEDPYLTARMGVSFVEGMQGPD---PDLPDVIATPKHFAVH---NGPESTRHH 199
Query: 237 FNAMV----IYTYYLIKFK----------------------------------------- 251
N V + YL F+
Sbjct: 200 ANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQELLQEHLVDAWGFQ 259
Query: 252 -YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF-------LGKHTEAAVKA 303
Y+VSDCD+V + ++ Y A A ++ G+D C ++ L A++
Sbjct: 260 GYVVSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSDTDGLTDRYREALER 319
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
GL+ S +D + F+ +R G G + DV T A+ LAL AA + +V
Sbjct: 320 GLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSA-ADVGTPAHGALALKAAEESLV 378
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
LLKN G LP +K +AVIGP + T+ + GNY T
Sbjct: 379 LLKND-GILPFQTAGMK-VAVIGPFGDATRVLRGNYSST 415
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 72/128 (56%), Gaps = 13/128 (10%)
Query: 429 QCGTAQVDDAKKAAASADATVLVMGADQSIEAESH----------DRLDLLLPGQQQLLI 478
+ T D ++AA +AD V V+G +EAE D+ L +P QQ L+
Sbjct: 602 RISTDPAGDMRRAAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELL 661
Query: 479 TEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPS 538
E AK P+I++ M+G ++ +AK + +IL YPGQ+GG AIA+V G+ NP+
Sbjct: 662 -EQAKATGKPLIVVAMNGSPINLHWAKEN--ADAILEAWYPGQSGGLAIANVLTGKANPT 718
Query: 539 GRLPMTWY 546
G+LP+T+Y
Sbjct: 719 GKLPLTFY 726
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 148/503 (29%), Positives = 221/503 (43%), Gaps = 101/503 (20%)
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRG 179
+T+FPQ I A+++N L I V+ ++ R L G+ SP +++ +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGR------LVGVNQCLSPVLDVCKDPRWGRC 154
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
+ET GEDP L + Y+ GLQ + ++ A KH+ A+ + + + H
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------QLVATAKHFAAHGFPEGGRNIAQVHVG 207
Query: 239 AMVIYTYYLIKFK------------------------------------------YIVSD 256
+ +L F+ +VSD
Sbjct: 208 NRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQEWGFDGIVVSD 267
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKA---GLVNESAIDH 313
D + L EAA ++ +G+D+ + + ++E V A GLV ES ID
Sbjct: 268 YDGIRQLETIHRVASNKMEAAILALESGVDIEFPT-IDCYSEPLVNALTEGLVPESLIDR 326
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLG--PKDVCTQANQDLALDAARQGIVLLKNTAGS 371
A+ RLG D P+ P+ + +++LAL AR+ IVLLKN
Sbjct: 327 AVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTARESIVLLKNENNI 381
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP-------CKYTTPLQGLAAVVA---TIY 421
LPLS + +AVIGPNAN + M+G+Y T + T LQG+ V +Y
Sbjct: 382 LPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGESKVLY 440
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMGAD-------------------QSIEAES 462
GC +A + A AD + VMG Q++ E
Sbjct: 441 AKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKYQAVTGEG 500
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
+DR L LPG Q+ L+ E+ K K P+IL++++G +S N K W +PG+
Sbjct: 501 NDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINYVKAVIEAW--FPGEE 557
Query: 523 GGAAIADVCFGRYNPSGRLPMTW 545
GG AIADV FG YNP GRLP+T+
Sbjct: 558 GGNAIADVIFGDYNPGGRLPITF 580
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 156/530 (29%), Positives = 234/530 (44%), Gaps = 99/530 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
SRLGIP + E HG ++ GAT FP + +++N LF++I + V+
Sbjct: 103 SRLGIPIL-FGEECSHG-----------HMAIGATVFPVPLTIGSTWNTELFRSISRAVA 150
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E RA G +SP +++ RDPRWGR +ET GEDP L +++A V+GLQ
Sbjct: 151 AETRAQ-----GGSATYSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGERLD 205
Query: 208 SPNRLKVAACCKHYTAYDL-DNWKGTDRYHFNAMVIYTYYLIKFK--------------- 251
S L A KH+ Y + + H ++ L+ F+
Sbjct: 206 SHTSL--LATLKHFAGYGASEGGRNGAPVHMGLRELHEVDLLPFRKAVEAGALSVMTAYN 263
Query: 252 --------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
++++DC ++ +L + + EAAA+S+ AG+
Sbjct: 264 EIDGVPCTSSGYLLQDVLREAWGFDGFVITDCGAIHMLACGHNTAGSGVEAAAQSLKAGV 323
Query: 286 DLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD 344
D+ G+ H A++ GL+ E ++ A RLG FD P P +
Sbjct: 324 DMEMSGTMFRAHLHQALEQGLITEEDLNRAAGRVLELKFRLGLFD-RPYVDP--AWAEQV 380
Query: 345 VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT-- 402
+ + + LA AA +GIVLLKN LPL ++ +AVIGPNA+ +G+Y
Sbjct: 381 IGCKEHIALAYQAAAEGIVLLKNEGNLLPLDSSS-GTIAVIGPNAHAPYHQLGDYTSPQP 439
Query: 403 PCKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG------ 453
P + T L G+ + +Y GC A A AD V+V+G
Sbjct: 440 PGQIVTVLDGIRRRLGDSRVLYAPGCRIQGDSREGFPRALACAEQADVIVMVLGGSSARD 499
Query: 454 ------------------ADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
A+ +E E DR L L G Q L+ E+ K+ K PVI++ +
Sbjct: 500 FGEGTIDLRTGASVVTGHAESDMECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYI 558
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+G I+ D I SI+ YPGQ GG+AIAD+ FG NPSGRLP++
Sbjct: 559 NG--RPITEPWIDEHIPSIVEAWYPGQEGGSAIADMLFGDINPSGRLPLS 606
>gi|302554503|ref|ZP_07306845.1| sugar hydrolase [Streptomyces viridochromogenes DSM 40736]
gi|302472121|gb|EFL35214.1| sugar hydrolase [Streptomyces viridochromogenes DSM 40736]
Length = 945
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 194/409 (47%), Gaps = 71/409 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L RV D++ RLTL EKI+FL A +V RLGI + EALHGV+++GP
Sbjct: 11 FRDARLPFAKRVDDVLGRLTLDEKISFLHQFAPAVERLGIAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G VS E RAM GL W+P +N+
Sbjct: 69 --------ATVFPQAVGLGATWNPELVRRVGDAVSKEVRAMRAEDDRVGLNVWAPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y GL+ G P + AA KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPKLTSAIATAYTHGLR---GDHPTYWRTAAVLKHWLAHNNETNRS 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 178 TTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLREQLRTWTDEE 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD------LNCGSFLGKHTEAAVKAG 304
SD + L +S+HY T EEA A S++AG+D + G+ A++ G
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAASLVAGVDSFTDHGTDSSQMTGR-LRGALERG 296
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ ES +D A+ + RLG FD P P+G G D T A++ LA +AA Q +VL
Sbjct: 297 LLKESDLDTAVRRQLSVRFRLGEFD--PEYDPHGATGEFD--TPAHRALAQEAAEQAVVL 352
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LKN G LPL+P +AV+G A+ K + Y GT +TPL+GL
Sbjct: 353 LKND-GVLPLAPDT--RVAVVGLLADECK--LDWYSGTLIHRSTPLEGL 396
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP Q+ L+ A+ A +L ++S +A + + ++LW + G
Sbjct: 583 ETEDRTTLRLPEHQERLL-RAARAANPATVLALVSA----YPYAVDPADLPAVLWTAHGG 637
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 638 QAAGTALARVLAGDVSPAGRLPQTWY 663
>gi|222635353|gb|EEE65485.1| hypothetical protein OsJ_20901 [Oryza sativa Japonica Group]
Length = 481
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/189 (51%), Positives = 118/189 (62%), Gaps = 47/189 (24%)
Query: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRL 212
MYN G AGLTFWSPN+NIFRDPRWGRGQETPGEDP +A++YA YV+GLQQ S RL
Sbjct: 1 MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSS-GRL 59
Query: 213 KVAACCKHYTAYDLDNWKGTDRYHFNAMV----IYTYYLIKFK----------------- 251
K+AACCKH+TAYDLDNW GTDR+HFNA+V + + + F+
Sbjct: 60 KLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQ 119
Query: 252 -------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD 286
YIVSDCDSVDV Y+ QHYT+T E+A A ++ AGLD
Sbjct: 120 VNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLD 179
Query: 287 LNCGSFLGK 295
L+CG FL +
Sbjct: 180 LDCGPFLAQ 188
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 56/80 (70%), Positives = 65/80 (81%)
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+Q LI+ VAK +KGPVIL++MSGG DI FA+NDPKI ILW GYPGQAGG AIADV
Sbjct: 220 PGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADV 279
Query: 531 CFGRYNPSGRLPMTWYPQSY 550
FG +NP G+LP+TWYPQ Y
Sbjct: 280 IFGHHNPGGKLPVTWYPQDY 299
>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 944
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 169/619 (27%), Positives = 263/619 (42%), Gaps = 141/619 (22%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEW----WS------- 99
+ + + ID R+ DL+K+++L+EK +V G V + +P EW W
Sbjct: 53 YEDPTAAIDARIEDLLKQMSLEEKTCQMVTLYGYKRVLKDDLPTPEWKQMLWKDGIGAID 112
Query: 100 EALHGVSYVG-PGTHFSNVVPG-------------------------------------- 120
E L+G G P + NV P
Sbjct: 113 EHLNGFRQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYK 172
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQ 180
AT+FP + ++N L IG + EAR L ++P +++ RD RWGR +
Sbjct: 173 ATNFPTQLGLGHTWNRELIHKIGFITGREARM-----LGYTNVYAPILDVGRDQRWGRYE 227
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN--WKGTDR---- 234
E GE P L ++ V+G+Q +VAA KH+ AY + +G R
Sbjct: 228 EVYGESPYLVAELGIEMVRGMQYNH-------QVAATGKHFAAYSNNKGAREGMSRVDPQ 280
Query: 235 -----------YHFNAMVIYT---------------------YYL-------IKFK-YIV 254
Y F ++ Y+L I F+ Y+V
Sbjct: 281 ISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSHYWLTTRLRGEIGFRGYVV 340
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNES 309
SD D+V+ LY K +EA +S+ AGL++ C SF+ E VK G ++E
Sbjct: 341 SDSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRSPDSFVLPLREL-VKEGGLSEE 399
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
I+ + + G FD G ++V + N +AL A+R+ IVLLKN
Sbjct: 400 IINDRVRDILRVKFLTGLFDTPYQSDLAG--ADREVEKEENGSIALQASRESIVLLKNEN 457
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGC 425
LPL + +K +AV GPNA+ + +Y + T L+G+ V+ +Y GC
Sbjct: 458 NMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKAEVLYTKGC 517
Query: 426 SNVQCG---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLL 470
V A+++ A + A +D V+V+G Q E+ R L L
Sbjct: 518 DLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGENKSRSSLDL 577
Query: 471 PGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADV 530
PG+Q L+ + K PVIL++++G +++A D + +IL YPG GG A+ADV
Sbjct: 578 PGRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAKGGIALADV 634
Query: 531 CFGRYNPSGRLPMTWYPQS 549
FG YNP G+L +T +P++
Sbjct: 635 LFGDYNPGGKLTVT-FPKT 652
>gi|206901468|ref|YP_002249934.1| glucan 1,4-beta-glucosidase [Dictyoglomus thermophilum H-6-12]
gi|206740571|gb|ACI19629.1| glucan 1,4-beta-glucosidase [Dictyoglomus thermophilum H-6-12]
Length = 927
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 136/405 (33%), Positives = 189/405 (46%), Gaps = 70/405 (17%)
Query: 58 LGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV 117
+ I+ +V +L+ +LTL+EKI L + RL I ++ EA HGV+++G
Sbjct: 1 MNIEEKVKELISKLTLEEKIKLLPTRQAEIPRLNIREFYIGGEAAHGVAWLGK------- 53
Query: 118 VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--GLAGLTFWSPNINIFRDPR 175
AT FPQ I ++SF+ L + IG VS EARA Y + + GL W+P +++ RDPR
Sbjct: 54 ---ATVFPQPIGLSSSFDRELMKKIGNAVSQEARAYYYMRGKIGGLMLWAPTVDMLRDPR 110
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY 235
WGR +E GEDP LAS+ A Y++G+Q G P LK A KH+ A N DR
Sbjct: 111 WGRTEEGYGEDPFLASEMAGAYIQGMQ---GDDPVYLKTAMTPKHFFA----NNNEKDRD 163
Query: 236 HFNAMV----IYTYYL------------------------------------IKFKYIVS 255
F+A + +Y YYL IK K+ +
Sbjct: 164 KFSADIDPRNMYEYYLEVFRRVIEKYRAQCIMTAYNAVNGIPCIINPIVREVIKEKFGLE 223
Query: 256 DCDSVDVLYNSQHYTKTPE-----EAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVNE 308
C D SQ T E A ++ AG+D + + + A++ GL+ E
Sbjct: 224 GCVVTDAADFSQTVTSHKTFGNHYETLAYALKAGIDAFTDNPDLVIESAWQALEKGLITE 283
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
ID AISN+ RLG FD SK+ Y + P +C + LA A + +VLLKN
Sbjct: 284 EDIDRAISNSLKVRFRLGEFDEEISKRFY--VPPNQICNNEHSQLAYMAELKSVVLLKNE 341
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LPL I +AVIGP AN K Y GT + LQG+
Sbjct: 342 NKFLPLKKEKINKIAVIGPLAN--KNYNDWYSGTYPYKVSVLQGI 384
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 90/173 (52%), Gaps = 18/173 (10%)
Query: 381 NLAVIGPNANVTKTMIGNY------EGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
N+ IG + KT G Y EG K+ + L I++ V+ G
Sbjct: 483 NIDPIGDGTYLIKTWNGKYAYIDEKEGNILKFKDSFENLPEE-KFIFEK----VEKG--- 534
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
++ A + A ++D +L +G + + E DR+D++LP Q+ L+ E+ KV ++L++
Sbjct: 535 IEKACEIAQNSDIVILCVGNNPLVNGREDEDRIDIVLPEHQENLVREIFKVNPN-IVLLV 593
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+S + I + K+ I +ILW + GQ G AIAD+ G ++P+GRL MTWY
Sbjct: 594 ISSYPYAICWEKD--HIPAILWSSHGGQEMGNAIADILLGNFSPAGRLNMTWY 644
>gi|429198487|ref|ZP_19190311.1| glycosyl hydrolase family 3 C-terminal domain protein, partial
[Streptomyces ipomoeae 91-03]
gi|428665804|gb|EKX65003.1| glycosyl hydrolase family 3 C-terminal domain protein, partial
[Streptomyces ipomoeae 91-03]
Length = 688
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/409 (33%), Positives = 192/409 (46%), Gaps = 70/409 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + R+ DL++RLT EKI FL A +V RLGI + EALHGV+++GP
Sbjct: 11 FRDPQLPLAKRIDDLLQRLTPDEKIAFLHQFAPAVERLGIAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N +L + +G+ VS E RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNENLVRRVGEAVSAETRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPRLTSAIATAYTRGLR---GDHPMYWRTAPVLKHWLAHNNETGRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
V++ Y L F+ V
Sbjct: 178 VTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLAEHLRTWTDEP 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD------LNCGSFLGKHTEAAVKAG 304
SD + L +S+HY T EEA A ++ AG+D + + +G+ A+ G
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSTIIGR-IRGALDQG 296
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ ID A+ + RLG FD P PY KD T A++ LA +AA Q +VL
Sbjct: 297 LLTADDIDEAVRRQLSVRFRLGEFD--PGADPYADT--KDFDTPAHRALAQEAAEQAVVL 352
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LKN G LPL+ A +AV+G A+ K + Y GT +TPL+GL
Sbjct: 353 LKNEGGLLPLAADA--RIAVVGLLADECK--LDWYSGTLIHRSTPLEGL 397
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LP Q+ L+ A+ A +L+++S + + A + + LW + G
Sbjct: 585 ETEDRAGLALPAHQERLL-RAARAANPNTVLVLVSAYPYAVDHAD----LPASLWTAHGG 639
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 640 QAAGTALARVLAGDVSPAGRLPQTWY 665
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 149/532 (28%), Positives = 246/532 (46%), Gaps = 104/532 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP A++N L IG+ +
Sbjct: 150 TRLGIP-VDFTNEGIRGLCH-----------DRATYFPAQCGQGATWNKELIAQIGEAEA 197
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EA L +SP ++I +DPRWGR ET GEDP A + + LQ+
Sbjct: 198 REASV-----LGYTNIYSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKN--- 249
Query: 208 SPNRLKVAACCKHYTAY------------------------------------------- 224
K+ + KH+ Y
Sbjct: 250 -----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVAPREMRTLYLDPFRVAFHEAGALGVM 304
Query: 225 ----DLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
D D T YHF ++ + K Y+VSD ++V+ + E+A A++
Sbjct: 305 SSYNDYDGEPITGSYHFLTEILRQEWGFK-GYVVSDSEAVEFISTKHQVANGYEDAVAQA 363
Query: 281 ILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
+ AGL++ F+ +AVK G +++ ++ ++ LG FD +P +
Sbjct: 364 VNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRVAEILRVKFWLGLFD-NPYRG 421
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
+ G + V + +Q LAL+AARQ +VLLKN +LPLS +I+++AVIGPNA+ + +
Sbjct: 422 DEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS-KSIRSVAVIGPNADERQQL 479
Query: 396 IGNYEGTPCKYTTPLQGLAAVVA---TIYQAGCSNVQ--------------CGTAQV-DD 437
I Y TT +G+ ++ +Y+ GC + AQ+ ++
Sbjct: 480 ICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPHFPESEVLEFPKAAQEAQMMEE 539
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A +AA A+ TV+V+G ++ E R L LPG+Q+ L+ ++ ++ K PV+L+++ G
Sbjct: 540 AIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQEELLKKICQLGK-PVVLVMIDGR 598
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+FA + +I+ +PG+ GG AIA+ FG YNP GRL +T +P+S
Sbjct: 599 ASSINFAAT--HVPAIIHAWFPGEFGGQAIAEALFGDYNPGGRLAVT-FPKS 647
>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 947
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 149/537 (27%), Positives = 237/537 (44%), Gaps = 111/537 (20%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + GV AT+FP + ++N L +G +
Sbjct: 158 TRLGIP-VDFTNEGIRGVESFK-----------ATNFPTQLGLGTTWNRKLIHQVGYITG 205
Query: 148 TEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EAR L G T ++P +++ RD RWGR +E GE P L ++ +GLQ
Sbjct: 206 REAR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPFLVAELGIQMTRGLQ---- 255
Query: 207 GSPNRLKVAACCKHYTAY------------------------------------------ 224
+VA+ KH+ AY
Sbjct: 256 ---TNYQVASTGKHFAAYSNNKGAREGMARVDPQMSPREVQNIHLYPWGRVVREAGLLGA 312
Query: 225 -----DLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
D D +H+ V+ + K Y+VSD D+++ L++ +EA K
Sbjct: 313 MSSYNDYDGVPIQGSFHWLTEVLRQQFGFK-GYVVSDSDALEYLFSKHRTAANMKEAVYK 371
Query: 280 SILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK 334
+++AGL++ C SF+ E VK G + ID + + +G FD
Sbjct: 372 AVMAGLNVRCTFRSPDSFVLPLREL-VKEGRIPMKVIDERLRDILRVKFMVGIFD----- 425
Query: 335 QPYG---QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
+PY Q K+V +++Q +AL A+R+ IVLLKN +LPL+ +IK +AV GPNAN
Sbjct: 426 RPYQMNLQAADKEVDGKSHQQVALQASRESIVLLKNQNNTLPLNKASIKKIAVCGPNAND 485
Query: 392 TKTMIGNYEGTPCKYTTPLQGL----AAVVATIYQAGCSNVQC---------------GT 432
+ +Y + TT +G+ + V Y GC V
Sbjct: 486 AAYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKGCDLVDAHWPESELVDYPMTADEQ 545
Query: 433 AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILI 492
++D A + +D V+V+G + E+ R L LPG+Q L+ V K VIL+
Sbjct: 546 NEIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLELPGRQLQLLKAVQATGK-TVILV 604
Query: 493 IMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+++G +++A D + +I+ YPG GG A+ADV FG YNP G+L +T +P++
Sbjct: 605 LINGRPLSVNWA--DKFVPAIVEAWYPGSQGGTAVADVLFGDYNPGGKLTVT-FPKT 658
>gi|333377833|ref|ZP_08469566.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
gi|332883853|gb|EGK04133.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
Length = 780
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 160/530 (30%), Positives = 233/530 (43%), Gaps = 101/530 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + E HG ++ G T FP I AA++N +L Q + V+S
Sbjct: 124 TRLGIPIF-LAEECPHG-----------HMAIGTTVFPTAIGQAATWNPNLIQQMSAVIS 171
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EAR+ ++G + P +++ R+ RW R +ET GEDP+L SK +V G D
Sbjct: 172 KEARSQGSHIG------YGPVLDLAREARWSRVEETYGEDPVLISKMGEAFVTGFGSGDL 225
Query: 207 GSPNRLKVAACCKHYTAYDL------------------DNW-----KGTDRYHFNAMVIY 243
P L + KH+ AY + +N+ K + M Y
Sbjct: 226 SKPYSL--ISTLKHFVAYGIPDGGHNGNSNSVGMRDLKENYLPPFEKAVKAGALSVMTAY 283
Query: 244 T----------YYLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
YL+K FK + VSD S++ L S + T +EAA S+ +G
Sbjct: 284 NSVDGIPCTSNEYLLKDVLCKDWGFKGFTVSDLGSIEGLKGSHYVVSTIQEAAILSLTSG 343
Query: 285 LDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY--GQLGP 342
LD + G AVK G+V E+ ID A+ +G F+ PY
Sbjct: 344 LDCDLGGNAFFTLSDAVKKGMVGETQIDSAVYKILKLKFDMGLFE-----NPYVDENNAR 398
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG- 401
+ V TQ N LA AR+ IVLL+N LPL+ + IK +AVIGPNA+ +G+Y
Sbjct: 399 QVVRTQENIVLARQVARESIVLLENKNNVLPLNKSKIKKIAVIGPNADNVYNQLGDYTAP 458
Query: 402 -TPCKYTTPLQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASAD----------- 446
T L G+ + + Y GC+ +D A +AA +D
Sbjct: 459 QDDSNVKTVLDGIRSKLKQSQIEYVKGCAIRDTLNTDIDKAVQAALRSDVAVVVVGGSSA 518
Query: 447 --------ATVLVMGADQSI----EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
T + + SI E DR+ L L G+Q L+ + K PV+++ +
Sbjct: 519 RDFKTKYIETGAAVADEHSISDMESGEGFDRVSLDLMGKQLELLKAIKATGK-PVVVVYI 577
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
G ++++A + W YPGQ GG AIADV FG YNP+GRLPM+
Sbjct: 578 QGRPLNMNWASENADALLSAW--YPGQEGGNAIADVLFGEYNPAGRLPMS 625
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 149/532 (28%), Positives = 246/532 (46%), Gaps = 104/532 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP A++N L IG+ +
Sbjct: 147 TRLGIP-VDFTNEGIRGLCH-----------DRATYFPAQCGQGATWNKELIAQIGEAEA 194
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EA L +SP ++I +DPRWGR ET GEDP A + + LQ+
Sbjct: 195 REASV-----LGYTNIYSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKN--- 246
Query: 208 SPNRLKVAACCKHYTAY------------------------------------------- 224
K+ + KH+ Y
Sbjct: 247 -----KLVSTPKHFAVYSIPVGGRDGKTRTDPHVAPREMRTLYLDPFRVAFHEAGALGVM 301
Query: 225 ----DLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
D D T YHF ++ + K Y+VSD ++V+ + E+A A++
Sbjct: 302 SSYNDYDGEPITGSYHFLTEILRQEWGFK-GYVVSDSEAVEFISTKHQVANGYEDAVAQA 360
Query: 281 ILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQ 335
+ AGL++ F+ +AVK G +++ ++ ++ LG FD +P +
Sbjct: 361 VNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRVAEILRVKFWLGLFD-NPYRG 418
Query: 336 PYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
+ G + V + +Q LAL+AARQ +VLLKN +LPLS +I+++AVIGPNA+ + +
Sbjct: 419 DEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS-KSIRSVAVIGPNADERQQL 476
Query: 396 IGNYEGTPCKYTTPLQGLAAVVA---TIYQAGCSNVQ--------------CGTAQV-DD 437
I Y TT +G+ ++ +Y+ GC + AQ+ ++
Sbjct: 477 ICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPHFPESEVLEFPKAAQEAQMMEE 536
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A +AA A+ TV+V+G ++ E R L LPG+Q+ L+ ++ ++ K PV+L+++ G
Sbjct: 537 AIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQKELLKKICQLGK-PVVLVMIDGR 595
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I+FA + +I+ +PG+ GG AIA+ FG YNP GRL +T +P+S
Sbjct: 596 ASSINFAAT--HVPAIIHAWFPGEFGGQAIAEALFGDYNPGGRLAVT-FPKS 644
>gi|251795943|ref|YP_003010674.1| beta-glucosidase [Paenibacillus sp. JDR-2]
gi|247543569|gb|ACT00588.1| Beta-glucosidase [Paenibacillus sp. JDR-2]
Length = 940
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 130/420 (30%), Positives = 188/420 (44%), Gaps = 70/420 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F N L ++ RV DLV R TL+EKI + ++ LG+ Y+ +EA HG++++G
Sbjct: 11 FQNPDLPLEQRVNDLVNRFTLEEKINLMCQYQDEITHLGVKAYKHGTEAAHGMAWLGE-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNINI 170
AT+FPQ I A +++ L + IG V+ EAR Y N GLT W+P +++
Sbjct: 69 --------ATTFPQPIGLACTWDKDLMKKIGSVIGDEARGFYSQNPTHNGLTLWAPTVDM 120
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E GEDP L K + +G+Q G P +K A KH+ + + +
Sbjct: 121 ERDPRWGRTEEAYGEDPHLTGKLSAALTQGIQ---GDHPFYVKAVASLKHFIGNNNEINR 177
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + YYL F+
Sbjct: 178 GECSVSLDPRNKNEYYLKAFEIPFKEGGALSMMTAYNSVNGVPANINPDVNNIVKRDWGM 237
Query: 252 --YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKH------TEAAVKA 303
++VSD V N HY T +EA A +I AG+D S H A+
Sbjct: 238 NGFVVSDAGDVLGTVNDHHYVDTYKEAVALTIKAGVD----SITDDHPISKQAIRDALAE 293
Query: 304 GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIV 363
G++ E+ +D A+ N F RLG FD + PY + + + +LALDA R+ IV
Sbjct: 294 GMLTENDLDIALRNTFRVRFRLGEFDP-ADRNPYAAIDESVIMKPEHAELALDAVRKSIV 352
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQA 423
LLKN G LPLS + AVIGP A + Y G +P +G+ + +QA
Sbjct: 353 LLKND-GILPLSAEKLSKAAVIGPLAGIVYRDW--YSGLMPYSISPFEGIRKKMNGAHQA 409
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 80/182 (43%), Gaps = 4/182 (2%)
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIYQAGC 425
K TA S + K + + P + + P GL A A
Sbjct: 480 KITASSEHIWEWFTKEVFHVRPQNDGNVVTFTTWNNRPVTVNAETGGLEAQDGDAIIAEP 539
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKV 484
++ T + +A AA + + V+V+G I E+ DR DL L Q+ LI EV V
Sbjct: 540 FAIEKETDGLQEAIAAARNNEVAVVVVGNHPLINGKETIDRPDLTLAAYQEKLIREVYAV 599
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
V+ + G + + I +I+++ + GQ G A+ADV FG YNP+GR+ MT
Sbjct: 600 NPNTVV---VVVGSYPFAMPWVQENIPAIVYLSHAGQELGHALADVLFGDYNPAGRVNMT 656
Query: 545 WY 546
WY
Sbjct: 657 WY 658
>gi|423683845|ref|ZP_17658684.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
gi|383440619|gb|EID48394.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
Length = 981
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 208/457 (45%), Gaps = 82/457 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + RV DL+ RLTL+EK++ + +V RLGIP ++ +EALHGV+++G
Sbjct: 41 FQDPSLPVQKRVSDLLSRLTLEEKVSLMHQYQPAVPRLGIPAFKTGTEALHGVAWLGE-- 98
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ A +++ L + IG V E R +++ A G+ W+P +++
Sbjct: 99 --------ATVFPQAFGLAHTWDRYLIKQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDL 150
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L + AT Y GL+ G P LK KH+ AY+ + +
Sbjct: 151 LRDPRWGRNEEGYSEDPFLTGEIATAYASGLR---GDHPFYLKTVPTLKHFLAYNNETDR 207
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + ++ YYL F+
Sbjct: 208 GFSSSSIDPRNMHEYYLKPFETAISKKAAYGLMPAYNSVNDKPAILSPLLDSTVKRRWAG 267
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE-------AAV 301
+IVSD + N Y + E+A A ++ AG+D G++ E A+
Sbjct: 268 DDFFIVSDAFDPSGIVNDHKYYDSHEKAHAHAVKAGIDNFTDQ--GENPELTRNALTGAL 325
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQ 360
K GL++E +D A++N F+ R G FD P + PY +L + + +Q LA AA +
Sbjct: 326 KQGLISEKDLDQALANTFSIRFRTGEFD--PDELNPYSRLTDDVINSPKHQLLAKKAAEK 383
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT- 419
IVLLKN LP + +++AVIGP N Y GT TPL +A +
Sbjct: 384 AIVLLKNDQDLLPFNARKNESIAVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKD 441
Query: 420 -------IYQAGCSNVQCG---TAQVDDAKKAAASAD 446
+ Q+G + G TA D + A+AD
Sbjct: 442 RVSFAEGLEQSGFKSAPTGKFVTAGRDGKQPLTAAAD 478
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 70/120 (58%), Gaps = 7/120 (5%)
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAK 486
++ GT D+A + A SAD ++ +G + I E+ DR D+ LP Q+ L+ V+K A
Sbjct: 579 IKSGT---DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSK-AN 634
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+L++ S F +++AK I +I++ + GQ G+A+AD+ +G NP GRL TW+
Sbjct: 635 TNTVLVVTSSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWH 692
>gi|217966749|ref|YP_002352255.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217335848|gb|ACK41641.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 927
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 145/462 (31%), Positives = 215/462 (46%), Gaps = 74/462 (16%)
Query: 58 LGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNV 117
+ I+ +V +LV RLTL+EKI L + RL I ++ EA HGV+++G
Sbjct: 1 MNIEGKVKELVSRLTLEEKIKLLPTRQAEIPRLNIHEFYIGGEAAHGVAWLGK------- 53
Query: 118 VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV--GLAGLTFWSPNINIFRDPR 175
AT FPQ I ++SF+ L + IG+VVS EARA Y + + GL W+P +++ RDPR
Sbjct: 54 ---ATVFPQPIGLSSSFDRDLMRKIGEVVSQEARAYYYMRGKIGGLMLWAPTVDMGRDPR 110
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY 235
WGR +E GEDP LAS+ A Y++ +Q G P LK A KH+ A N DR
Sbjct: 111 WGRTEECYGEDPFLASEMAGAYIQSMQ---GEDPVYLKTAMTPKHFFA----NNNEKDRD 163
Query: 236 HFNAMV----IYTYYLIKFKYIVSD----------------------------------- 256
F+A + +Y YYL F+ ++
Sbjct: 164 KFSANIDPRNMYEYYLEVFRRVIEKYKAQCIMTAYNAVNGIPCIINPIVKEVVKEKFGLE 223
Query: 257 -CDSVDVLYNSQHYT--KTPE---EAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVNE 308
C D SQ T KT E E A ++ AG+D + + + + A++ GL+ E
Sbjct: 224 GCVVTDAADFSQTVTSHKTFENHYETLAYALKAGIDAFTDDPNLVIESAWQALEKGLITE 283
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
ID AISN+ RLG FD SK+ Y + P +C + + L+ A + +VLLKN
Sbjct: 284 EDIDKAISNSLKVRFRLGEFDEEISKKFY--VPPTKICDKEHSQLSYLAELKSVVLLKNE 341
Query: 369 AGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAGC 425
LP+ I +AVIGP A+ K Y GT + LQG+ + +Y
Sbjct: 342 NKFLPIKKDKINKIAVIGPLAH--KNYNDWYSGTYPYKVSILQGIINRLYDKEILYHDSY 399
Query: 426 SNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
V +A+ + + S + + + +D+ E ES +D
Sbjct: 400 DTVAIKSAKSNRYLRVIESEISPIWAI-SDKITERESFKYID 440
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 89/173 (51%), Gaps = 18/173 (10%)
Query: 381 NLAVIGPNANVTKTMIGNY------EGTPCKYTTPLQGLAAVVATIYQAGCSNVQCGTAQ 434
N+ +G + KT G Y +G K+ + L I ++ G
Sbjct: 483 NIDPLGDGTYLIKTWNGKYAYIDERKGNVLKFKDTFENLPEEKFII-----EKIEKG--- 534
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
++ + + A ++D +L +G + + E DR+D++LP Q+ L+ EV +V V+L+I
Sbjct: 535 IEKSCEIAQNSDLVILCVGNNPMVNGREDEDRIDIVLPEHQENLVKEVFRVNPN-VVLLI 593
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+S + IS+ K+ I +ILW + GQ G AIAD+ G ++P+GRL MTWY
Sbjct: 594 ISSYPYAISWEKD--HIPAILWSSHGGQEMGNAIADILLGNFSPAGRLSMTWY 644
>gi|428131069|gb|AFY97406.1| Xyl3A [Fervidobacterium gondwanense]
Length = 775
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 178/633 (28%), Positives = 267/633 (42%), Gaps = 174/633 (27%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKIT------------------------FLVDSAGSVS 88
+ ++S I+LRV DL+ R+TL+EK++ L D G +S
Sbjct: 4 YKDSSKPIELRVEDLLSRMTLEEKVSQLGSVWSYQLLDENGNFDEGKAFELLKDGIGQIS 63
Query: 89 R----------------------------LGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
R LGIP E L G Y+G
Sbjct: 64 RPGGATNFQPEEVAQFDNKVQKFLIENTRLGIPAL-MHEECLTG--YMGLN--------- 111
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEARAM-YNVGLAGLTFWSPNINIFRDPRWGRG 179
++FP I A+++ L + + KV+ E R M + GLA P +++ RDPRWGR
Sbjct: 112 GSNFPVPIAMASTWEPELVKEVTKVIRQEMRNMGIHQGLA------PVLDVARDPRWGRV 165
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLK--VAACCKHYTAYDLD----NWKGTD 233
+ET GE P L + YV+GLQ D LK V A KH+ Y NW T+
Sbjct: 166 EETFGESPYLVASMGCAYVEGLQGED------LKDGVIATTKHFVGYSASEGGRNWAPTN 219
Query: 234 ----------RYHFNAMV-------------------------IYTYYLIK---FK-YIV 254
+ F A V + T L K F +V
Sbjct: 220 IPPRELREIFMFPFEAAVKVAKVGSVMNSYSEIDGVPLAASRELLTDVLRKEWGFDGLVV 279
Query: 255 SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC-GSFLGKHTEAAVKAGLVNESAIDH 313
SD SV +++ + EAA ++ AG+D+ + H VK+G++ E +D
Sbjct: 280 SDYFSVKLIHEHHKLARDKAEAAKYALEAGIDVELPNTDCYAHVLDLVKSGVIPEKLLDQ 339
Query: 314 AISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLP 373
+ +LG FD +PY + P N +LAL+ AR+ IVLLKN G LP
Sbjct: 340 TVRRILKMKFKLGLFD-----KPYVE--PSKAKVVKNTELALEVARKSIVLLKND-GILP 391
Query: 374 LSPTAIKNLAVIGPNANVTKTMIGNY----------EGTPCKYTTPLQGLAAVVATI--- 420
L +A+IGPNA + M+G+Y E + P L++V ++
Sbjct: 392 LKKDM--KVALIGPNAADVRNMLGDYMYLAHIKIMLENVNLAFDAPKFNLSSVKKSVEES 449
Query: 421 -------------------YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIE-- 459
Y GC + ++A KA ++D ++V+G +
Sbjct: 450 MNKIKSIEMLLKEESVQFTYAKGCDVLGDSKEGFNEALKAVENSDVAIVVVGDRSGLTMD 509
Query: 460 ---AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWV 516
ES D +L LPG Q+ LI EV+KV K PV+L +++G + S + K+++I+
Sbjct: 510 CTTGESRDSANLKLPGVQEELIIEVSKVGK-PVVLALLNGRPY--SLTRVVDKVSAIVEA 566
Query: 517 GYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
PG+ G AI DV +G+ NPSG+LPMT +P+S
Sbjct: 567 WLPGEIGAKAIVDVLYGKVNPSGKLPMT-FPRS 598
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 153/545 (28%), Positives = 231/545 (42%), Gaps = 101/545 (18%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
E + + A SRLGIP +G ++ G T FP + +++
Sbjct: 89 EAVNHIQRYAIEQSRLGIP------------ILIGEECSHGHMAIGGTVFPVPLSIGSTW 136
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
N L++ + + V+ E R+ G +SP +++ RDPRWGR +E GEDP L S+YA
Sbjct: 137 NLDLYRDMCRAVALETRSQ-----GGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAY-DLDNWKGTDRYHFNAMVIYTYYLIKFK-- 251
V+GLQ SP+ VAA KH+ Y + + H + ++ FK
Sbjct: 192 VASVEGLQGESLDSPS--SVAATLKHFVGYGSSEGGRNAGPVHMGTRELMEVDMLPFKKA 249
Query: 252 ---------------------------------------YIVSDCDSVDVLYNSQHYTKT 272
+++DC ++D+L + +
Sbjct: 250 VEAGAASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLASGHDTAED 309
Query: 273 PEEAAAKSILAGLDLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
+AA ++I AG+DL G GKH + AV++ + S +D A+ +LG F+ +
Sbjct: 310 GMDAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKFKLGLFE-N 368
Query: 332 PSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANV 391
P P Q + + + LA A +GIVLLKN A +LPLS +AVIGPNA+
Sbjct: 369 PYVDP--QTAENVIGSGQHIGLARQLAAEGIVLLKNEAKALPLSKEG-GVIAVIGPNADQ 425
Query: 392 TKTMIGNYEGT--PCKYTTPLQGLAAVVA-----TIYQAGCSNVQCGTAQVDDAKKAAAS 444
+G+Y P TT L G+ A + +Y GC + A A
Sbjct: 426 GYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFEFALSCAEQ 485
Query: 445 ADATVLVMGADQSIE-------------------------AESHDRLDLLLPGQQQLLIT 479
AD V+V+G + + E DR+ L L G Q L
Sbjct: 486 ADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLSGVQLDLAQ 545
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
E+ K+ K +++ I G I+ D +IL YPGQ GG AIAD+ FG NPSG
Sbjct: 546 EIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAIADILFGDVNPSG 602
Query: 540 RLPMT 544
+L M+
Sbjct: 603 KLTMS 607
>gi|270294308|ref|ZP_06200510.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
gi|270275775|gb|EFA21635.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
Length = 862
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 170/599 (28%), Positives = 258/599 (43%), Gaps = 132/599 (22%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKIT-------------------FLVDSAGSVS----- 88
+ ++S+ ++ RV DL+ R+TL+EKI L D+AG +S
Sbjct: 26 YKDSSMSVERRVTDLLGRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDAAGDISYGCIE 85
Query: 89 ------------------------RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
RLGIP + +E+LHG +V G+T F
Sbjct: 86 GFNLTGENVRKAFHAIQKYMVEETRLGIPVFTV-TESLHG-----------SVHDGSTIF 133
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQ + ++FN L + K ++TE R+ + SP +++ RD RWGR +E+ G
Sbjct: 134 PQSVAVGSTFNLDLAYQMTKAIATELRSQ-----GVIQTLSPGLDVVRDLRWGRVEESFG 188
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLK----------------VAACCKHYTAYDLDN 228
EDP L + VKG DGG LK V + + +
Sbjct: 189 EDPWLVGQMGIAQVKGY--IDGGISPMLKPFGPGGAPLGGLNLASVESGERDIRNIHIKP 246
Query: 229 WKGTDRYHFNAMVIYTY------------YLIK--------FK-YIVSDCDSVDVLYNSQ 267
++ R V+ +Y YL+ FK Y+ SD +V +L + Q
Sbjct: 247 YEMAVRNTEVKAVMTSYNSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQ 306
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
H K EAA +++ AG+DL S E ++ G +E +D A+ LG
Sbjct: 307 HTAKDDSEAAIQALTAGVDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGL 366
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQ--DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
F+ PY V + + +L+ A + IVLLKN LPL+ IK+LAVI
Sbjct: 367 FE-----NPYQGADMPGVAMRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVI 421
Query: 386 GPNANVTKTMIGNYEGTPCKY--TTPLQGLAAVVATI----YQAGCSNVQCGTAQVDDAK 439
GPNAN + G+Y + TPL+GL V Y AGC + + D+A
Sbjct: 422 GPNAN--QVQFGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAGCDLITDNKSGFDEAV 479
Query: 440 KAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
A ++D V+ +G+ + A E D L L G Q+ L+ E+ + K PVI
Sbjct: 480 AAVKASDMAVVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQEELVEEIYAIGK-PVI 538
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+++++G F IS+ K + W Y G+ G AIAD+ G NPS +LP + +PQS
Sbjct: 539 VVLVTGKPFSISWIKEHIPAIVVQW--YGGEKAGDAIADMLLGNINPSAKLPFS-FPQS 594
>gi|160890986|ref|ZP_02071989.1| hypothetical protein BACUNI_03433 [Bacteroides uniformis ATCC 8492]
gi|156859207|gb|EDO52638.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 869
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 170/599 (28%), Positives = 258/599 (43%), Gaps = 132/599 (22%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKIT-------------------FLVDSAGSVS----- 88
+ ++S+ ++ RV DL+ R+TL+EKI L D+AG +S
Sbjct: 33 YKDSSMSVERRVTDLLGRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDAAGDISYGCIE 92
Query: 89 ------------------------RLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSF 124
RLGIP + +E+LHG +V G+T F
Sbjct: 93 GFNLTGENVRKAFHAIQKYMVEETRLGIPVFTV-TESLHG-----------SVHDGSTIF 140
Query: 125 PQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPG 184
PQ + ++FN L + K ++TE R+ + SP +++ RD RWGR +E+ G
Sbjct: 141 PQSVAVGSTFNLDLAYQMTKAIATELRSQ-----GVIQTLSPGLDVVRDLRWGRVEESFG 195
Query: 185 EDPLLASKYATGYVKGLQQTDGGSPNRLK----------------VAACCKHYTAYDLDN 228
EDP L + VKG DGG LK V + + +
Sbjct: 196 EDPWLVGQMGIAQVKGY--IDGGISPMLKPFGPGGAPLGGLNLASVESGERDIRNIHIKP 253
Query: 229 WKGTDRYHFNAMVIYTY------------YLIK--------FK-YIVSDCDSVDVLYNSQ 267
++ R V+ +Y YL+ FK Y+ SD +V +L + Q
Sbjct: 254 YEMAVRNTEVKAVMTSYNSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQ 313
Query: 268 HYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
H K EAA +++ AG+DL S E ++ G +E +D A+ LG
Sbjct: 314 HTAKDDSEAAIQALTAGVDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGL 373
Query: 328 FDGHPSKQPYGQLGPKDVCTQANQ--DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVI 385
F+ PY V + + +L+ A + IVLLKN LPL+ IK+LAVI
Sbjct: 374 FE-----NPYQGADMPGVAMRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVI 428
Query: 386 GPNANVTKTMIGNYEGTPCKY--TTPLQGLAAVVATI----YQAGCSNVQCGTAQVDDAK 439
GPNAN + G+Y + TPL+GL V Y AGC + + D+A
Sbjct: 429 GPNAN--QVQFGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAGCDLITDNKSGFDEAV 486
Query: 440 KAAASADATVLVMGADQSIEA---------ESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
A ++D V+ +G+ + A E D L L G Q+ L+ E+ + K PVI
Sbjct: 487 AAVKASDMAVVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQEELVEEIYAIGK-PVI 545
Query: 491 LIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+++++G F IS+ K + W Y G+ G AIAD+ G NPS +LP + +PQS
Sbjct: 546 VVLVTGKPFSISWIKEHIPAIVVQW--YGGEKAGDAIADMLLGNINPSAKLPFS-FPQS 601
>gi|62321294|dbj|BAD94522.1| beta-xylosidase - like protein [Arabidopsis thaliana]
Length = 287
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 80/105 (76%), Positives = 92/105 (87%)
Query: 446 DATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAK 505
D+ VLV+GADQSIE E HDR+DL LPG+QQ L+T VA A+GPV+L+IMSGGGFDI+FAK
Sbjct: 1 DSVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAK 60
Query: 506 NDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ND KITSI+WVGYPG+AGG AI DV FGR+NPSG LPMTWYPQSY
Sbjct: 61 NDKKITSIMWVGYPGEAGGLAIPDVIFGRHNPSGNLPMTWYPQSY 105
>gi|160890630|ref|ZP_02071633.1| hypothetical protein BACUNI_03075 [Bacteroides uniformis ATCC 8492]
gi|317479833|ref|ZP_07938953.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|156859629|gb|EDO53060.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
gi|316904039|gb|EFV25873.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 860
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 176/617 (28%), Positives = 267/617 (43%), Gaps = 143/617 (23%)
Query: 40 CDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKI-------------------TFL 80
C V + P L + + + +L + R DL++R+TL+EKI T L
Sbjct: 17 CGVATLP-LQAQSYKDPTLSPEERTTDLLRRMTLEEKIAQIRHIHSWNIFDEQDLNETKL 75
Query: 81 VDSAGSV-----------------------------SRLGIPKYEWWSEALHGVSYVGPG 111
+ G + +RLGIP + +EALHG
Sbjct: 76 QEFVGDLCWGFVEGFPLTGESCHRHMRRIQEYMVKHTRLGIPVFTV-AEALHG------- 127
Query: 112 THFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIF 171
+V G+T +PQ I A++FN L +S E ++ G+ + +P I++
Sbjct: 128 ----SVHEGSTIYPQNIALASTFNPELAYRRAAEISKE---LHYQGIRQIL--APCIDVV 178
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
RD RWGR +E+ GEDP L +A KG D G ++ KHY + + G
Sbjct: 179 RDLRWGRVEESYGEDPFLNGIFAYEEAKGY--LDNG------ISPMLKHYGPHG-NPLGG 229
Query: 232 TD--RYHFNAMVIYTYYLIKFK-------------------------------------- 251
+ H ++ YL FK
Sbjct: 230 LNLASVHCGVGELHDVYLQPFKRVVTSLPIHAVMSTYNSWNRVPNSSSHYLLTEVLRNRW 289
Query: 252 ----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVN 307
YI SD ++D+L+ Q EAA ++I+AGLD+ S H A VK V+
Sbjct: 290 GFQGYIYSDWGAIDMLHTFQRTASNQAEAAVQAIVAGLDVEASSECFPHLAALVKEKKVD 349
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYG-QLGPKDVCTQANQDLALDAARQGIVLLK 366
E ID A+S R+G F+ PYG + + +Q N +A A + VLLK
Sbjct: 350 EGIIDKAVSRVLLAKFRMGLFE-----DPYGDRFAGHSLHSQENIQVARQIADESTVLLK 404
Query: 367 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI-----Y 421
N LPL+ + +K++AVIGPNA+ + + T TPL+G+ V + Y
Sbjct: 405 NDKDLLPLNLSQLKSIAVIGPNADQVQFGDYTWSRTNQDGITPLEGIRKQVEPVGIKIRY 464
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMG-ADQSIEAESH--------DRLDLLLPG 472
GC+ + T Q+ A +AA +DA +L G A S+ + H D DL L G
Sbjct: 465 AKGCNMMSMDTTQIAAAVEAARQSDAAILFCGSASASLARDYHETNCGEGFDLTDLSLTG 524
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
Q LI V K PV+L++++G F I++ K I +IL Y G+ G++IAD+ F
Sbjct: 525 AQGKLIQAVHATGK-PVVLVLVTGKPFAIAWEKE--HIPAILVQWYAGEQEGSSIADILF 581
Query: 533 GRYNPSGRLPMTWYPQS 549
G+ NPSG L ++ +P+S
Sbjct: 582 GKTNPSGHLTVS-FPKS 597
>gi|383776168|ref|YP_005460734.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
gi|381369400|dbj|BAL86218.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
Length = 977
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 130/424 (30%), Positives = 196/424 (46%), Gaps = 66/424 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + + +RV DL+ RLTL EKI L +V RLG+ + +EALHGV+++G
Sbjct: 48 FRDPQHKVAVRVADLLDRLTLDEKIALLNQYQPAVPRLGVAAFRTGTEALHGVAWLG--- 104
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL--AGLTFWSPNINI 170
AT+FPQ + A+S+N L + +G+ V TE RAM++ GL W+P +N+
Sbjct: 105 -------TATAFPQAVGLASSWNPDLMRRVGEAVGTEVRAMHHRDPENVGLNVWAPVVNL 157
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L +T Y +GL+ G P L+ A KH+ Y+ + +
Sbjct: 158 LRDPRWGRNEEGYAEDPWLTGVLSTAYSRGLR---GDHPLWLRTAPTLKHFLGYNNETDR 214
Query: 231 GTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVL--YN----------------------- 265
+ V++ Y L F+ ++D +V V+ YN
Sbjct: 215 CLTSSNLPPRVLHEYELPAFRPALADGAAVAVMASYNLVNGRPTHLSPLIDETLRSWAED 274
Query: 266 ----------------SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE--AAVKAGLVN 307
QH+ A ++ AG+D AA GL+
Sbjct: 275 DVLVVGDAWAVHNLAGDQHWFDDHVAGFAAALRAGIDCVTEDVTATRERFTAAYDRGLIT 334
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
ES +D A+ + + +RLG FD + PY +GP+ + +++LA +AAR VLL N
Sbjct: 335 ESHVDAAVRHILSIRVRLGEFDA--ASDPYRHIGPEVIDCGEHRELAHEAARASAVLLSN 392
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV---ATIYQAG 424
G LPL P + +AVIGP A+ + Y GTP TT G+ + A ++ G
Sbjct: 393 D-GILPLDPQTVGRVAVIGPLAD--QVFEDWYSGTPPYRTTLRAGVTRRIGADAVLFAEG 449
Query: 425 CSNV 428
V
Sbjct: 450 VDRV 453
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR DL LPG Q L+ V + A +L + S + + +A + ++LW + G
Sbjct: 607 ETADRRDLDLPGTQDDLMRAV-RAANPRTVLALTSSYPYAVGWASQ--HLPALLWSAHGG 663
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
Q GAA+AD+ FG PSGRL TWY
Sbjct: 664 QEHGAALADILFGAAEPSGRLTQTWY 689
>gi|52081845|ref|YP_080636.1| glycoside hydrolase family 3 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|404490730|ref|YP_006714836.1| glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC 14580]
gi|52005056|gb|AAU24998.1| putative Glycoside Hydrolase Family 3 [Bacillus licheniformis DSM
13 = ATCC 14580]
gi|52349736|gb|AAU42370.1| putative glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC
14580]
Length = 981
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 208/457 (45%), Gaps = 82/457 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + RV DL+ RLTL+EK++ + +V RLGIP ++ +EALHGV+++G
Sbjct: 41 FQDPSLPVQKRVSDLLSRLTLEEKVSLMHQYQPAVPRLGIPAFKTGTEALHGVAWLGE-- 98
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ A +++ L + IG V E R +++ A G+ W+P +++
Sbjct: 99 --------ATVFPQAFGLAHTWDRYLIKQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDL 150
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L + AT Y GL+ G P LK KH+ AY+ + +
Sbjct: 151 LRDPRWGRNEEGYSEDPFLTGEIATAYASGLR---GDHPFYLKTVPTLKHFLAYNNETDR 207
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + ++ YYL F+
Sbjct: 208 GFSSSSIDPRNMHEYYLKPFETAISKKAAYGLMPAYNSVNDKPAILSPLLDSTVKRRWAG 267
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE-------AAV 301
+IVSD + N Y + E+A A ++ AG+D G++ E A+
Sbjct: 268 DDFFIVSDAFDPSGIVNDHKYYDSHEKAHAHAVKAGIDNFTDQ--GENPELTRNALTGAL 325
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQ 360
K GL++E +D A++N F+ R G FD P + PY +L + + +Q LA AA +
Sbjct: 326 KQGLISEKDLDQALANTFSIRFRTGEFD--PDELNPYSRLTDDVINSPKHQLLAKKAAEK 383
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT- 419
IVLLKN LP + +++AVIGP N Y GT TPL +A +
Sbjct: 384 AIVLLKNDQDLLPFNARKNESIAVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKD 441
Query: 420 -------IYQAGCSNVQCG---TAQVDDAKKAAASAD 446
+ Q+G + G TA D + A+AD
Sbjct: 442 RVSFAEGLEQSGFKSALTGKFVTAGRDGKQPLTAAAD 478
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 70/120 (58%), Gaps = 7/120 (5%)
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAK 486
++ GT D+A + A SAD ++ +G + I E+ DR D+ LP Q+ L+ V+K A
Sbjct: 579 IKSGT---DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSK-AN 634
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+L++ S F +++AK I +I++ + GQ G+A+AD+ +G NP GRL TW+
Sbjct: 635 TNTVLVVTSSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWH 692
>gi|319647763|ref|ZP_08001981.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
gi|317390104|gb|EFV70913.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
Length = 981
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 208/457 (45%), Gaps = 82/457 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + RV DL+ RLTL+EK++ + +V RLGIP ++ +EALHGV+++G
Sbjct: 41 FQDPSLPVQKRVSDLLSRLTLEEKVSLMHQYQPAVPRLGIPAFKTGTEALHGVAWLGE-- 98
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA--GLTFWSPNINI 170
AT FPQ A +++ L + IG V E R +++ A G+ W+P +++
Sbjct: 99 --------ATVFPQAFGLAHTWDRYLIKQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDL 150
Query: 171 FRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWK 230
RDPRWGR +E EDP L + AT Y GL+ G P LK KH+ AY+ + +
Sbjct: 151 LRDPRWGRNEEGYSEDPFLTGEIATAYASGLR---GDHPFYLKTVPTLKHFLAYNNETDR 207
Query: 231 GTDRYHFNAMVIYTYYLIKFK--------------------------------------- 251
G + ++ YYL F+
Sbjct: 208 GFSSSSIDPRNMHEYYLKPFETAISKKAAYGLMPAYNSVNDKPAILSPLLDSTVKRRWAG 267
Query: 252 ---YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE-------AAV 301
+IVSD + N Y + E+A A ++ AG+D G++ E A+
Sbjct: 268 DDFFIVSDAFDPSGIVNDHKYYDSHEKAHAHAVKAGIDNFTDQ--GENPELTRNALTGAL 325
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSK-QPYGQLGPKDVCTQANQDLALDAARQ 360
K GL++E +D A++N F+ R G FD P + PY +L + + +Q LA AA +
Sbjct: 326 KQGLISEKDLDQALANTFSIRFRTGEFD--PDELNPYSRLTDDVINSPKHQLLAKKAAEK 383
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT- 419
IVLLKN LP + +++AVIGP N Y GT TPL +A +
Sbjct: 384 AIVLLKNDQDLLPFNARKNESIAVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKD 441
Query: 420 -------IYQAGCSNVQCG---TAQVDDAKKAAASAD 446
+ Q+G + G TA D + A+AD
Sbjct: 442 RVSFAEGLEQSGFKSALTGKFVTAGRDGKQPLTAAAD 478
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 70/120 (58%), Gaps = 7/120 (5%)
Query: 428 VQCGTAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAK 486
++ GT D+A + A SAD ++ +G + I E+ DR D+ LP Q+ L+ V+K A
Sbjct: 579 IKSGT---DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSK-AN 634
Query: 487 GPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+L++ S F +++AK I +I++ + GQ G+A+AD+ +G NP GRL TW+
Sbjct: 635 TNTVLVVTSSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPGGRLTQTWH 692
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 157/527 (29%), Positives = 240/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 128 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSPELVKEVGQVIA 175
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 176 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 228
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 229 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDAGALSVMTSYN 288
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T E AA +S+ AG+
Sbjct: 289 SIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAIQSVTAGV 348
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 349 DVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 405
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS T I +AVIGPNA+ M+G+Y
Sbjct: 406 RRKEHIELARKIAQSSITLLKNENSILPLSKT-INKVAVIGPNADNRYNMLGDYTAPQED 464
Query: 404 CKYTTPLQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A +AA ++
Sbjct: 465 SNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVGGSSARDF 524
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 525 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 583
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 584 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 628
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 152/529 (28%), Positives = 239/529 (45%), Gaps = 102/529 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
SRLGIP + EA HG ++ GAT+FP A+++N L + +GKV++
Sbjct: 119 SRLGIPLF-LAEEAPHG-----------HMAIGATTFPTAPGQASTWNPELIERMGKVIA 166
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R G + P ++I RDPRW R +E+ GED L ++ YV+G T G
Sbjct: 167 AEIRLQ-----GGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRG---TGSG 218
Query: 208 SPNRLKVA-ACCKHYTAYDLD--------NWKGTDRYH------FNAMV------IYTYY 246
++ + A + KH+ AY N G F A V + T Y
Sbjct: 219 DLSQSRHALSTLKHFIAYGASEGGQNGGSNLLGERELRETYLPPFEAAVKAGARSVMTAY 278
Query: 247 -----------------LIKFK-----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
+++ + ++VSD S++ L+ + + EAA +++ AG
Sbjct: 279 NSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAVQALRAG 338
Query: 285 LD--LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY-GQLG 341
+D L G+F A +AG V E+ ID A+ A +G F+ PY +
Sbjct: 339 VDADLKGGAF--ASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-----NPYIDEAA 391
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
+V A+ +LAL+AARQ + LL+N +G+LPL P ++ +AVIGPNA+ +G+Y
Sbjct: 392 AAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTA 451
Query: 402 TPCKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
T GL ++ +Y GC+ +++ A AA DA V+V+G +
Sbjct: 452 QQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGSSAR 511
Query: 459 E-----------------------AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+ E DR L L G+Q+ L+ + K P+I++ ++
Sbjct: 512 DFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIA 570
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
G D+ A + W YPG GG A+A+ GR NP+GRLP+T
Sbjct: 571 GRPLDLRRASEQADALLMAW--YPGARGGDAVAETILGRNNPAGRLPIT 617
>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 336
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 84/158 (53%), Positives = 107/158 (67%), Gaps = 1/158 (0%)
Query: 394 TMIGNYEGTPCKYTTPLQGLAAVVA-TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVM 452
T++GNY G PCK TPLQGL + T Y GC+ V C A + +A + A D VLVM
Sbjct: 2 TLVGNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVACSKATIREAVQIAREVDQVVLVM 61
Query: 453 GADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITS 512
DQ+ EAE D L+L LPG QQ LI VA+ A PV+L+++ GG D+SFA +PKI S
Sbjct: 62 ELDQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKIGS 121
Query: 513 ILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
ILW GYPG+AGG A+A++ FG +NP G+LP+TWYPQ Y
Sbjct: 122 ILWAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQY 159
>gi|387790798|ref|YP_006255863.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
gi|379653631|gb|AFD06687.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
Length = 730
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 174/565 (30%), Positives = 246/565 (43%), Gaps = 101/565 (17%)
Query: 54 CNTSLGID-----LRVGDLVKRLTLQEKITFL-----VDSAGSVSRLGIPKYEWWSEALH 103
C S G D ++ L++++TL+EK+ + SAG + RLGIP+ S+ H
Sbjct: 26 CQQSKGADEERFEQKIEQLIEKMTLEEKVGMIHGNSSFTSAG-IERLGIPELVT-SDGPH 83
Query: 104 GVSYVGPGTHFS---NVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAG 160
GV V G ++ NV AT P AA++N L G V+ +EA + G
Sbjct: 84 GVR-VEHGRDWTVDTNVDDAATYLPTGNTLAATWNTDLGYQFGAVLGSEANYRGKDVILG 142
Query: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKH 220
P +NI R P GR E EDP L SK A GY+KG+Q D G V+AC KH
Sbjct: 143 -----PGVNIIRSPLCGRNFEYLSEDPYLISKMAVGYIKGVQ--DQG------VSACVKH 189
Query: 221 YTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSD--CDSVDVLYNS-QHYTKTPEEAA 277
Y A + + + T + + YL FK V D ++V YN + T E
Sbjct: 190 YAANNEEVDRNTVDVQMSERALREIYLPAFKAAVVDGGVNTVMGSYNKFRGQYATHNEYL 249
Query: 278 AKSILAG-------------------------LDLNCGS-------------FLGKHTEA 299
K IL G DL G+ F+ A
Sbjct: 250 VKKILKGEWGFKGVLMSDWGAVHNTMEAMQNGTDLEMGTDLGMLPNPNYNKFFMADTVLA 309
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
VK+G ++E ID + + + DG P T+ +Q +AL A
Sbjct: 310 LVKSGKLSEQLIDEKVRRILWVMFKTNMIDGKRQ--------PGSFNTKEHQKVALKVAE 361
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT-TPLQGLAAVVA 418
+GIVLLKN G LPL +K++AVIG NAN +M G KY T LQGL ++
Sbjct: 362 EGIVLLKNENGILPLQKNDLKSIAVIGENANRPNSMGGGSSQVKAKYEITLLQGLKNLLG 421
Query: 419 TI----YQAG---CSNVQCGTAQVDDAKKAAASADATVLVMG----------ADQSIEAE 461
+ Y G Q + +A AA+ A+ +LV+G D + +AE
Sbjct: 422 STVNIQYAQGYKIARGQQADAKLISEAVSAASKAEIAILVVGWTHGYDYSVWNDNAYDAE 481
Query: 462 SHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG-FDISFAKNDPKITSILWVGYPG 520
D+ D+ +P Q LI V K P +++++GGG D++ D K W Y G
Sbjct: 482 GVDKPDMDMPFGQNELIKAVLKA--NPHTVVVLTGGGPIDVTQWIGDAKGVLEGW--YAG 537
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTW 545
GG A+A + FG NPSG+LPMT+
Sbjct: 538 MEGGNALAKILFGEVNPSGKLPMTF 562
>gi|302872951|ref|YP_003841584.1| Beta-glucosidase [Clostridium cellulovorans 743B]
gi|307686534|ref|ZP_07628980.1| Beta-glucosidase [Clostridium cellulovorans 743B]
gi|302575808|gb|ADL49820.1| Beta-glucosidase [Clostridium cellulovorans 743B]
Length = 965
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 130/411 (31%), Positives = 189/411 (45%), Gaps = 66/411 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-----SY 107
F N ++ I+ R+ DL+ RLTL+EKI L + RLGI Y E HG+
Sbjct: 15 FRNPNINIEERLDDLISRLTLEEKIHVLPGYQAPIERLGIQAYYVGGEGAHGLVRRQGKL 74
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVG--LAGLTFWS 165
GP T FPQ I ++++N +L IGKV+S EARA Y +AGL W+
Sbjct: 75 TGP----------TTVFPQPIGLSSTWNPNLMSEIGKVISDEARAYYKKSDKIAGLNIWA 124
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
P +++ RDPRWGR +E GEDP L K + K + G P LK+ KH+ A +
Sbjct: 125 PTVDMERDPRWGRTEEAYGEDPYLTGKLSAALTKAER---GEDPFYLKMVPTLKHFYANN 181
Query: 226 LDNWKGTDRYHFNAMVIYTYYLIKFK---------------------------------- 251
+ + + + Y YYL FK
Sbjct: 182 YEKERISCSSSIDPRNKYEYYLKAFKPSIVEGGAQSIMTSYNAINGIPGMVNPEIQTILK 241
Query: 252 -------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVK 302
+ VSD +V + + Y + E A SI AGLD + + A++
Sbjct: 242 DQWGLESFTVSDGGAVCQVVDYHKYFVSHAETIASSIKAGLDSFTDNPDMVVAAARDAME 301
Query: 303 AGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGI 362
GL+ E +D A+ N F T +RLG D PY + + +C++ N +AL AAR+ +
Sbjct: 302 RGLMTEDDLDKAVRNIFRTRIRLGQLDPE-GYDPYDYISDEVICSEENSKVALKAAREAM 360
Query: 363 VLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
VLLKN LPL+ +K +AVIGP A+ + ++ Y G TPL+G+
Sbjct: 361 VLLKNENNILPLNKEKLKKVAVIGPLAD--ELLMDWYSGIHPYKVTPLEGI 409
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+++A AA AD ++V+G + I E DR D++LP QQ LI V + K +++I+
Sbjct: 561 IEEATSAAKDADVAIVVVGNNPVINGKEEVDRKDIILPPSQQELIKAVYETNKNTIVVIV 620
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
G + + I +I++ GQ G AIAD FG Y P+GRL MTWY
Sbjct: 621 ---GTYPFAINWEQENIPAIMFTASGGQELGTAIADSIFGDYAPAGRLSMTWY 670
>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
Length = 745
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 163/567 (28%), Positives = 258/567 (45%), Gaps = 93/567 (16%)
Query: 60 IDLRVGDLVKRLTLQEKI--TFLVDSAGSV--SRLGIPKY-EWWSEALHGVSYVGPGTHF 114
+D V DL++R+TL+EKI T L S V P Y E+ + + G + G +
Sbjct: 31 VDKFVDDLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADY 90
Query: 115 S---------------------NVVPGA-TSFPQVILTAASFNASLFQAIGKVVSTEARA 152
+ +V+ G T FP + + S++ + ++ ++EA A
Sbjct: 91 TRSLQKIAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDLEAMERSARIAASEATA 150
Query: 153 MYNVGLAGLTF-WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
G+ + ++P ++I RDPRWGR E GED L S A VKG Q + + N
Sbjct: 151 ------EGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQGDNLSAVN- 203
Query: 212 LKVAACCKHYTAYD-----------------------------LDNWKGTDRYHFNAM-- 240
V AC KHY AY LD GT FN +
Sbjct: 204 -TVVACVKHYAAYGATMAGRDYNTVDMSLNELWNTYLPPFKAALDAGCGTIMTSFNDLNG 262
Query: 241 --VIYTYYLIK--------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC 289
YL+K F ++V+D S++ + Y + +A ++ AG+D++
Sbjct: 263 IPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMI-PHGYANDEKHSAEIAMNAGVDMDM 321
Query: 290 -GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ 348
G H + ++ G V+E + A +LG F+ P + D+ T
Sbjct: 322 QGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFE-DPYRYCDANREKTDILTP 380
Query: 349 ANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTT 408
AN++ A D AR+ +VLLKN +LPL K +A+IGP ++G + + T
Sbjct: 381 ANKEAARDMARKSMVLLKNDKQTLPLKEN--KRVALIGPLVKDKYEILGCWSAMGNRDTI 438
Query: 409 PLQGLAAVVATI------YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIEAES 462
P+ +V I Y GC T +A + A+++D V+VMG ++ E+
Sbjct: 439 PVSVYDGLVEAIGKDKISYAKGCDIQSEDTKGFAEAVRVASASDVVVMVMGEFHNMSGEN 498
Query: 463 HDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQA 522
+ R +L LPG Q L+ + K K PV+L++M+G I++ K++ + +IL +PG
Sbjct: 499 NSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILEAWFPGTM 555
Query: 523 GGAAIADVCFGRYNPSGRLPMTWYPQS 549
GGAAIADV G+YNPSG+L MT +PQ+
Sbjct: 556 GGAAIADVLTGKYNPSGKLTMT-FPQN 581
>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
Length = 782
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 158/527 (29%), Positives = 239/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 128 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSLELVKEVGQVIA 175
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 176 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 228
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 229 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDSGALSVMTSYN 288
Query: 245 ----------YYLI--------KF-KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF ++VSD S++ ++ S T E AA +S+ AG+
Sbjct: 289 SIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESHFVALTKENAAIQSVTAGV 348
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 349 DVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 405
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS T I +AVIGPNA+ M+G+Y
Sbjct: 406 RRKEHIELARKIAQSSITLLKNENSILPLSKT-INKVAVIGPNADNRYNMLGDYTAPQED 464
Query: 404 CKYTTPLQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A KAA ++
Sbjct: 465 SNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIKAARRSEVVIVVVGGSSARDF 524
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 525 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 583
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 584 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 628
>gi|340616356|ref|YP_004734809.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731153|emb|CAZ94417.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 738
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 152/574 (26%), Positives = 250/574 (43%), Gaps = 112/574 (19%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVS-RLGIPKYEWWSEALHGVSYVGPGTHFSNVVP 119
D ++ L+ +++L+EK+ L + + RLGIP +E LHG+ +
Sbjct: 32 DKKIEKLISKMSLEEKVHQLATQYPNANMRLGIPNLSA-NECLHGIK-----------MD 79
Query: 120 GATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
AT FPQ I A++++ L + +G V+ E+RA G+ ++P + + RD RWGR
Sbjct: 80 SATVFPQAIAMASTWDTELIERMGHTVAKESRAF---GIH--QCYTPMLAVVRDVRWGRT 134
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA------------YDLD 227
+E+ GEDP L K + Y++GLQ + + A KH+ A +D+
Sbjct: 135 EESYGEDPYLVGKIGSSYIRGLQGMGAERFDENHIMATAKHFVADGEPMAGDNGAAHDIS 194
Query: 228 NWKGTDRY----------------------------HFNAMVIYTYYLIKFKY---IVSD 256
+ + + H N V+ T ++ + +VSD
Sbjct: 195 EYTLQNVHLYPFRMAIEEAEVGAIMPAHHLLNGIPCHANKHVMQTVLRDEWGWDGLVVSD 254
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF--------LGKHTEAAVKAGLVNE 308
+ L +Y E AA K + AG+ F G + +AV +V
Sbjct: 255 NGDMRSLKRVFNYVPDYEHAAKKGLEAGIHQELALFQGWSDHRMFGDYLISAVNKKIVPV 314
Query: 309 SAIDHAISNNFATLMRLGFFD-----------------GHPSKQP-----------YGQL 340
+ +D A+ + LG FD G P K Y +
Sbjct: 315 ALVDDAVKHVLQAKFDLGLFDTDIKNDERFDVLKNPDNGEPDKVSQHDAEMFKKALYVGI 374
Query: 341 GPKD----VCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMI 396
KD V Q++ DLAL+ A++ IVLLKN LPL K ++V+GPN + +
Sbjct: 375 PKKDWKKTVFDQSHNDLALEVAQKSIVLLKNEGDLLPLKKEKYKKISVVGPNGKAMR--L 432
Query: 397 GNYE-GTPCKYTTPLQGLAAVVAT----IYQAGCSNVQCGTAQVDDAKKAAASADATVLV 451
G Y P Y ++G+ + + ++ GC + TA + A A S+D T++
Sbjct: 433 GGYSPDNPKYYINIVEGIQNYLGSDREVAFEEGC-DFTDSTANIPKAVALAESSDITIVA 491
Query: 452 MGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKIT 511
+G + E+ DR DL LPG QQ L+ + K P ++++++G I + + +
Sbjct: 492 IGGSEETCRENEDRDDLSLPGPQQKLVEAIHATGK-PYVVVLLNGRPLSIEWIAENSQAI 550
Query: 512 SILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
W Y GQ G AIA++ FG+ NPSG+LP+T+
Sbjct: 551 VEGW--YLGQETGKAIANILFGKVNPSGKLPITF 582
>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
Length = 955
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 156/549 (28%), Positives = 243/549 (44%), Gaps = 109/549 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY+ AT+FP +
Sbjct: 149 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESYI------------ATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N +L +G + E R L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 HTWNRNLVHKVGYITGREGR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN---------------------- 228
++ KG+Q TD +VAA KHY AY +
Sbjct: 246 AELGIEMAKGMQ-TDH------QVAATSKHYIAYSNNKGGREGMARVDPQMSPREVEMIH 298
Query: 229 ---WKG--------------TDRYHFNAMVIYTYYLIKFK-------YIVSDCDSVDVLY 264
WK D F Y + + + Y+VSD D+V+ L+
Sbjct: 299 VYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDSDAVEYLF 358
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNNF 319
+ +E+ +S+LAGL++ C +F + + G + S ID + +
Sbjct: 359 SKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPMSTIDDRVRDIL 417
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
+G FD HP + + K+V NQ +AL A+++ +VLLKN LPL I
Sbjct: 418 RVKFLVGLFD-HPYQIDLKETD-KEVNCAENQQVALQASKESLVLLKNQDAVLPLDVNKI 475
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCG---- 431
+AV GPNA+ + +Y + TT L+G+ V ++ GC V
Sbjct: 476 SKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTNVLFTKGCDLVDANWPES 535
Query: 432 -----------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
+++D A + A +D TV+V+G E+ R L LPG+Q L+
Sbjct: 536 ELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGENKSRSSLDLPGRQLDLLQA 595
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
V K PV+LI+++G I++A D + +IL YPG GG AIAD FG YNP G+
Sbjct: 596 VVATGK-PVVLILINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGDYNPGGK 652
Query: 541 LPMTWYPQS 549
L +T +P++
Sbjct: 653 LTVT-FPKT 660
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 156/527 (29%), Positives = 240/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 128 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSPELVKEVGQVIA 175
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 176 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGL--GGGN 228
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 229 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDAGALSVMTSYN 288
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T E AA +S++AG+
Sbjct: 289 SIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAIQSVMAGV 348
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 349 DVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 405
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS I +AVIGPNA+ M+G+Y
Sbjct: 406 RRKEHIELARKIAQSSITLLKNENSILPLS-KMINKVAVIGPNADNRYNMLGDYTAPQED 464
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A +AA ++
Sbjct: 465 SNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVGGSSARDF 524
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 525 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 583
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 584 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 628
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 157/527 (29%), Positives = 240/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 128 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSPELVKEVGQVIA 175
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 176 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 228
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 229 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDSGALSVMTSYN 288
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T E AA +S+ AG+
Sbjct: 289 SIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVALTKENAAIQSVTAGV 348
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 349 DVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 405
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS T I +AVIGPNA+ M+G+Y
Sbjct: 406 RRKEHIELARKIAQSSITLLKNENSILPLSKT-INKVAVIGPNADNRYNMLGDYTAPQED 464
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A +AA ++
Sbjct: 465 SNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVGGSSARDF 524
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 525 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 583
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 584 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 628
>gi|323452325|gb|EGB08199.1| hypothetical protein AURANDRAFT_4612, partial [Aureococcus
anophagefferens]
Length = 262
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 104/243 (42%), Positives = 135/243 (55%), Gaps = 12/243 (4%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY- 107
+SL FC L RV DLV R+ E LV+ A + RL + + WWSEALHGV
Sbjct: 1 SSLPFCRGDLPRAKRVADLVDRVEPDEVQKLLVNGAAGLPRLWLQPHNWWSEALHGVQAG 60
Query: 108 VGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLA-GLTFWSP 166
S+ P T FP I TAASFNA+LF+A+G V+ EARA+ N G+ G TFWSP
Sbjct: 61 CAQADDGSSRCP--TGFPAAISTAASFNATLFRAVGSVIGAEARALANEGVTNGFTFWSP 118
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL 226
N+NI RDPRWGRGQETPGEDPLL KY +V G + DG S + + +AC KH+ AY+L
Sbjct: 119 NLNILRDPRWGRGQETPGEDPLLNGKYGENFVLGFEHPDGSSGDAIAASACPKHFFAYNL 178
Query: 227 DNW---KGTDRYHFNAM-----VIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAA 278
+N K R+ F+ + YL F+ +S + ++ + TP A A
Sbjct: 179 ENCFKVKDNCRHTFDMANLSQGELEATYLPPFEQAISSGKASGLMCSYNAVNGTPSCANA 238
Query: 279 KSI 281
I
Sbjct: 239 WGI 241
>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
Length = 925
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 151/549 (27%), Positives = 240/549 (43%), Gaps = 116/549 (21%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAA 132
+ E F V+ +RLGIP + +E + GV AT+FP +
Sbjct: 148 INETQRFFVEE----TRLGIP-VDMTNEGIRGVEAYR-----------ATNFPSQLGMGH 191
Query: 133 SFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L + G++V EAR L G T ++P +++ RD RWGR +E GE P L +
Sbjct: 192 TWNRELLRKTGRIVGREAR------LLGYTNIYAPVLDVGRDQRWGRYEEVFGESPYLVA 245
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN--WKGTDRY--HFNAMVIYTYYL 247
+ G+Q TD +VA+ KH+ AY + +G R + +L
Sbjct: 246 ELGVAMASGMQ-TD------YQVASTAKHFAAYSNNKGAREGMSRVDPQMPPREVENIHL 298
Query: 248 IKFK------------------------------------------YIVSDCDSVDVLYN 265
+ F+ Y+VSD SV+ L+N
Sbjct: 299 MPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEMGFRGYVVSDSGSVEYLHN 358
Query: 266 SQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAV-------KAGLVNESAIDHAISNN 318
H +A +SI AGL++ C + H E V + GL+ E +D + +
Sbjct: 359 KHHTAVNQLDAVRQSIEAGLNVRCNFW---HPETYVMPLRQLLREGLITEELLDSRVRDV 415
Query: 319 FATLMRLGFFDGHPSKQPYGQ---LGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
+G FD +PY ++V + ++AL A+R+ IVLLKN +LPL
Sbjct: 416 LRVKFLVGLFD-----RPYQTDLAAADREVDGPEHNEVALQASRESIVLLKNENSTLPLD 470
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG----LAAVVATIYQAGCSNVQCG 431
I+ +AV+GPNA+ +G+Y + T+ L G L A +Y+ GC V
Sbjct: 471 ARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEIVYEKGCELVDAA 530
Query: 432 ---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
A + A +AA+ +D V+V+G E+ R L LPG+Q+
Sbjct: 531 WPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENCSRSSLDLPGRQEE 590
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
L+ V K P +L++++G I++A D + +I+ YPG GG A+ +V FG YN
Sbjct: 591 LLRAVEATGK-PTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHGGQAVYEVLFGEYN 647
Query: 537 PSGRLPMTW 545
P G+L +T+
Sbjct: 648 PGGKLTVTF 656
>gi|90399218|emb|CAJ86028.1| B0414F07.1 [Oryza sativa Indica Group]
Length = 320
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 82/103 (79%), Positives = 90/103 (87%), Gaps = 1/103 (0%)
Query: 138 LFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGY 197
++ I VVSTEARAM+NVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA GY
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263
Query: 198 VKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAM 240
V GLQ GGS + LKVAACCKHYTAYD+DNWKG +RY F+A+
Sbjct: 264 VTGLQDAGGGS-DALKVAACCKHYTAYDVDNWKGVERYTFDAV 305
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/110 (65%), Positives = 86/110 (78%), Gaps = 1/110 (0%)
Query: 35 SPVFACDVVSNPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPK 94
+PVFACD SN +++ GFC+ + R DL+ RLTL EK+ FLV+ ++ RLGIP
Sbjct: 28 TPVFACDA-SNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPA 86
Query: 95 YEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGK 144
YEWWSEALHGVSYVGPGT FS +VPGATSFPQ ILTAASFNASLF+AIG+
Sbjct: 87 YEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
Length = 805
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 169/612 (27%), Positives = 260/612 (42%), Gaps = 149/612 (24%)
Query: 60 IDLRVGDLVKRLTLQEKITFL--VDSAGSVSRLGIPKYEW----WSEALHGVS------- 106
ID RV DL+K++T++EK L + G+V + +P EW W + + +
Sbjct: 68 IDKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNIDEHLNGEW 127
Query: 107 --------YVGPGTHFSNV-----------VPG--------------ATSFPQVILTAAS 133
Y + V +P +T FP I +
Sbjct: 128 KRTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGCT 187
Query: 134 FNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 193
++ L IG++ EA+A L +SP +++ RDPRWGR E+ GED LA +
Sbjct: 188 WDKELIYEIGRITGEEAKA-----LGYTNIYSPILDLSRDPRWGRTVESYGEDSYLAGEL 242
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-----DNWKGTDRYHFNAMVIYTYYLI 248
G QQ G NR V + KH+ Y + D + TD H + ++ +L
Sbjct: 243 ------GRQQVLGIQSNR--VVSTPKHFAIYGIPGGGRDCYSRTDP-HASPQEVHELHLE 293
Query: 249 KFK------------------------------------------YIVSDCDSVDVLYNS 266
F+ Y+VSD ++D
Sbjct: 294 PFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAIDKNVKF 353
Query: 267 QHYTKTPEEAAAKSILAGLDLNCGSFLGKHT------EAAVKAGLVNESAIDHAISNNFA 320
H T EEA A + AGL N +F + A++ GLV ES +D +
Sbjct: 354 YHIVDTEEEAVASELNAGL--NVRTFFEQSEVFIEALRRALQKGLVEESTLDQRVREVLY 411
Query: 321 TLMRLGFFDGHPSKQPY---GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPT 377
LG FD PY +L K V + N++++L AAR+ IVLLKN +LPLS T
Sbjct: 412 VKFWLGLFD-----DPYVKDTKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPLSKT 466
Query: 378 AIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQAGCSNVQCGT- 432
+KN+AVIGP A+ K++ Y T LQGL + V +Y GC NV+
Sbjct: 467 -LKNIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGC-NVRDKNF 524
Query: 433 ---------------AQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
++D+A + A A+ ++ +G D ES R++L L G+Q+ L
Sbjct: 525 PQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQKEL 584
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V PV+L++ +G +++ D + +I+ YPG+ G A+A+V FG YNP
Sbjct: 585 VRAVQATGT-PVVLVLFNGRPVTLNW--EDANLPAIVEAWYPGEFSGQAVAEVLFGDYNP 641
Query: 538 SGRLPMTWYPQS 549
G+L T +P+S
Sbjct: 642 GGKLSTT-FPKS 652
>gi|300727409|ref|ZP_07060818.1| beta-xylosidase B [Prevotella bryantii B14]
gi|299775289|gb|EFI71888.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 841
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 205/435 (47%), Gaps = 82/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N SL + R DL RLTL+EK + + + ++ RL IP +EWWSEALHG + G
Sbjct: 8 LPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARNG- 66
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASL----FQAIG---KVVSTEARAMYNVG-LAGLT 162
AT FP AAS+N L F AIG ++ +T AR N+ GL+
Sbjct: 67 ---------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQGLS 117
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAACCKH 220
W+PNINIFRDPRWGRGQET GEDP L K V+GLQ G N K+ AC KH
Sbjct: 118 IWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQ----GPKNSKYYKLLACAKH 173
Query: 221 YTAYDLDNWKGTDRYHFN-----AMVIYTYYLIKFK------------------------ 251
+ + + R+ FN A ++ YL FK
Sbjct: 174 FAVHSGPEYL---RHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGLPCC 230
Query: 252 ------------------YIVSDCDSV-DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGS 291
+VSDC ++ D +H + +A+A+++LAG D+ CG+
Sbjct: 231 GSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVECGA 290
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
K E AVK G ++E I+ ++ +LG FD + + QL + ++
Sbjct: 291 NYDKLPE-AVKRGEISEEKINVSVMRLLKARFKLGDFDSDNMVE-WTQLPESLIACSKHK 348
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LA A++ + LLKN G LPL A +AV+G NAN + + GNY G P K + L+
Sbjct: 349 QLAYQMAQESMTLLKNN-GILPLQKNA--RIAVMGANANDSIMLWGNYNGYPTKTISILE 405
Query: 412 GLAAVVATI-YQAGC 425
GL I Y GC
Sbjct: 406 GLQNKSKHISYIPGC 420
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ +I + K VI + SGG I+ +IL Y G+ G
Sbjct: 613 DRTTIELPQSQRDMIALLHNSGK-KVIFVNCSGGA--IALEPESRNADAILQAWYGGEMG 669
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV FG YNP+G+LP+T+Y
Sbjct: 670 GQAVADVLFGDYNPNGKLPVTFY 692
>gi|443289222|ref|ZP_21028316.1| Glycoside hydrolase family 3 domain protein [Micromonospora lupini
str. Lupac 08]
gi|385887900|emb|CCH16390.1| Glycoside hydrolase family 3 domain protein [Micromonospora lupini
str. Lupac 08]
Length = 994
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 138/415 (33%), Positives = 198/415 (47%), Gaps = 66/415 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + +RV DL+ RLTL EKI++L +V RLGI ++ +EALHGV++
Sbjct: 35 FRDPHLPVQVRVDDLLGRLTLDEKISWLHQYQPAVPRLGIGLFKTGTEALHGVAWSTDID 94
Query: 113 HFSNVVPG-ATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSPNIN 169
+ VV TSFPQ + A++++ L Q +G V EAR + N + GL W+P +N
Sbjct: 95 NGGAVVKARGTSFPQPVGMASTWDTDLIQRVGSAVGDEARGYHAQNPRVWGLNLWAPVVN 154
Query: 170 IFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
+ RDPRWGR +E EDPLL +K +T Y G+ GG P+ LK A KHY AY+ +
Sbjct: 155 LLRDPRWGRNEEGYSEDPLLTAKISTAYGSGMT---GGDPDHLKSAPTLKHYLAYNNEVR 211
Query: 230 KGT-------------DRYHFNAM--------VIYTYYLIKFKYIVSDCDSVDV------ 262
+ DR F V+ +Y L+ + D D V
Sbjct: 212 RDVTSSNVPPRVLNEYDRAAFKPAIAADAATGVMASYNLVNGRPATVDPDLATVVRGWTD 271
Query: 263 --------------LYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA--------- 299
L SQ Y T EA A + AG++ SF+ T A
Sbjct: 272 QPLMNVTDAWAPNNLVASQGYYATQPEANAAIVKAGVN----SFITDDTNAQPTVTAIKQ 327
Query: 300 AVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAAR 359
A+ GL+ E ID IS RLG FD P +G + P + + +Q LA AA
Sbjct: 328 ALATGLLTEGDIDARISETLNVRFRLGEFD--PGGGRHGGITPDVINSAEHQRLARQAAG 385
Query: 360 QGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-TTPLQGL 413
+ +VLLKN+ +LPL+P + +AV+GP A+ T+ ++ G Y T L G+
Sbjct: 386 EAMVLLKNSRQALPLNPARTRKVAVLGPLAD---TLYTDWYGGDLPYEVTALDGI 437
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 60/113 (53%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
VD A AA +AD VLV+G++ I E+HDR L Q+ L+ V K V+++
Sbjct: 595 VDRAVAAAKAADTAVLVVGSNPFINGREAHDRTSTALSAGQEALVKAVTKANPHTVLVLQ 654
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S + ++ + +I+W + G G A++DV FG NPSG+L TWY
Sbjct: 655 TS---YPVTIGWEQEHVPAIVWTTHAGAETGHAVSDVLFGDRNPSGKLTQTWY 704
>gi|297202710|ref|ZP_06920107.1| sugar hydrolase [Streptomyces sviceus ATCC 29083]
gi|197713289|gb|EDY57323.1| sugar hydrolase [Streptomyces sviceus ATCC 29083]
Length = 908
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 191/412 (46%), Gaps = 77/412 (18%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L R+ DL+ RLTL EKI FL +V RLGI + EALHGV+++GP
Sbjct: 30 FRDPHLSFAKRIDDLLSRLTLDEKIGFLHQFVPAVERLGIAAFRTGQEALHGVAWMGP-- 87
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS E RAM GL W+P +N+
Sbjct: 88 --------ATVFPQAVGLGATWNTDLVRRVGEAVSKEVRAMRARDDRVGLNIWAPTVNLL 139
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y GL+ G P + A KH+ A++ + +
Sbjct: 140 RHPLWGRNEEGYSEDPKLTSAIATAYTHGLR---GDHPTYWRTAPVLKHWLAHNNETDRA 196
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 197 TSSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLREHLRAWTQEE 256
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEA---------AV 301
SD + L + +HY T EEA A S+ AG+D SF T++ A+
Sbjct: 257 LLVCSDAGAPSNLVDHEHYFDTHEEATAASLRAGVD----SFTDHGTDSSQIIARIQGAL 312
Query: 302 KAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
GL+ E+ ID A+ + RLG FD P P+ + +D T A++ LA +AA Q
Sbjct: 313 DQGLLTEAEIDTAVRRQLSVRFRLGEFD--PEHDPHADV--RDFDTPAHRALAQEAAEQA 368
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
IVLL+N G LPL+P LAV+G A+ K + Y GT +TPL+GL
Sbjct: 369 IVLLRND-GVLPLAPDT--RLAVVGLLADECK--LDWYSGTLIHRSTPLEGL 415
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 6/112 (5%)
Query: 436 DDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIM 494
D + + AD ++V G D I E+ DR L LP Q+ L+ A+ A +L ++
Sbjct: 586 DAVTRVTSEADVVLVVAGNDPHINGRETEDRASLRLPAHQERLL-RAARAANPRTVLALV 644
Query: 495 SGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S +A + + + LW + GQA G A+A V G +P+GRLP TWY
Sbjct: 645 SA----YPYAVDTSDLAAALWTAHGGQAAGTALARVLAGDVSPAGRLPQTWY 692
>gi|289577460|ref|YP_003476087.1| glycoside hydrolase [Thermoanaerobacter italicus Ab9]
gi|289527173|gb|ADD01525.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
italicus Ab9]
Length = 787
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 157/548 (28%), Positives = 235/548 (42%), Gaps = 127/548 (23%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP +H S G + GAT FPQ I A+++N L + + V+
Sbjct: 93 TRLGIPAL------IHEESCSG------YMAKGATIFPQTIGVASTWNPKLVEKMASVIR 140
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
+ +A + +P +++ RDPRWGR +ET GEDP L Y++GLQ +
Sbjct: 141 EQMKA-----VGARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTEN-- 193
Query: 208 SPNRLKVAACCKHYTAYDLD----NWKGTDRYHFNAMVIYTYYLIKFK------------ 251
+ V A KH+ Y NW H +Y +L F+
Sbjct: 194 --LKEGVIATGKHFVGYGNSEGGMNWAPA---HIPMRELYEIFLYPFEAAVKEAKLGSIM 248
Query: 252 ------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
+VSD +++ LY +EAA ++
Sbjct: 249 PGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDYFAINQLYEYHRLASNKKEAAKLAL 308
Query: 282 LAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQ 339
AG+D+ S G + ++ G ++ ++ A+ LG F+ PY
Sbjct: 309 EAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAVRRILKAKFLLGLFE-----NPY-- 361
Query: 340 LGPKDVC----TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 395
+ K V TQ + LA A++ IVLLKN + LPL ++++AVIGPNA+ + M
Sbjct: 362 VDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLKKD-LQSIAVIGPNADNIRNM 420
Query: 396 IGNYEGTPCK-------------YTTPL-----------------QGLAAVVA----TIY 421
IG+Y PC + TPL QG+ V+ IY
Sbjct: 421 IGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVPIVSVLQGIKEKVSPKTKVIY 479
Query: 422 QAGCSNVQCGTAQVDDAKKAAASADATVLVMG-----ADQSIEAESHDRLDLLLPGQQQL 476
GC + TA + A + A AD ++V+G D ES DR DL LPG Q+
Sbjct: 480 AKGCDVISDDTAGFNKAVEVAKQADVAIVVVGDRAGLTDGCTSGESRDRADLNLPGVQEE 539
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
LI V + PVI+++++G IS+ KI +I+ PG+ GG AIADV FG YN
Sbjct: 540 LIKAVYETGT-PVIVVLINGRPMSISWIAE--KIPAIIEAWLPGEEGGRAIADVIFGDYN 596
Query: 537 PSGRLPMT 544
P G+LP++
Sbjct: 597 PGGKLPIS 604
>gi|300786232|ref|YP_003766523.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|384149550|ref|YP_005532366.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|399538115|ref|YP_006550777.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|299795746|gb|ADJ46121.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|340527704|gb|AEK42909.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|398318885|gb|AFO77832.1| beta-glucosidase [Amycolatopsis mediterranei S699]
Length = 975
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 131/414 (31%), Positives = 197/414 (47%), Gaps = 63/414 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + R+ DL+ RLT EKI+ L ++ RLGI ++ +EALHGV++ T
Sbjct: 33 FRDPSLPLATRIDDLLSRLTADEKISLLHQYEPAIPRLGIGVFKTGTEALHGVAW---ST 89
Query: 113 HFSN----VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARA--MYNVGLAGLTFWSP 166
+ N V T FPQ I A+++N +L + +G V EAR + N L GL W+P
Sbjct: 90 DYDNKGAVVKADGTVFPQAIGLASTWNPALVKQVGAAVGQEARGFNVRNPTLWGLNLWAP 149
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL 226
+N+ RDPRWGR +E EDP L ++AT Y +G+Q G P L+ A KH+ AY+
Sbjct: 150 VVNLLRDPRWGRNEEGYSEDPYLTGEFATAYGRGMQ---GDDPRYLQAAPTLKHFLAYNN 206
Query: 227 DNWKGTDRYHFNAMVIYTYYLIKFKY---------------------------------- 252
+ + T +++ Y FK
Sbjct: 207 EVNRDTSNSSVPPKILHDYDEQAFKIPLQNGAANAVMPSYNLVNGRPNHVSPDLDGKLRK 266
Query: 253 -------IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD-LNCGSFLGKHTEAAVKA- 303
+VSD + L NS+ Y T E A +I AGLD G T AAVK
Sbjct: 267 WAPQDIAVVSDAGAPSNLVNSEKYYATKAEGDAAAIKAGLDSFTDNDTDGSITVAAVKEA 326
Query: 304 ---GLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQ 360
GL+ + +++A + + RLG FD P PY ++ P + + +Q LA A +
Sbjct: 327 LSKGLLTMADVENADRHLLSLRFRLGEFD-PPGGNPYAKITPAVIGSPEHQALARKTADE 385
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-TTPLQGL 413
IV+L+N +LPL+ K +AV+GP +N T+ ++ +Y TP+QG+
Sbjct: 386 QIVMLRNNGNALPLNAARNKKIAVVGPLSN---TLYEDWYSGAMQYKVTPVQGI 436
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 59/118 (50%), Gaps = 9/118 (7%)
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVI 490
T+ V A AA AD V+V+G+ I E++DR L Q+ LI V K V+
Sbjct: 589 TSGVGSAVAAAKDADTAVVVVGSMPFINGREANDRTRTELAPAQRALIEAVQKANPHTVV 648
Query: 491 LIIMS--GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
++ S G+D + ILW + GQ G A+ADV FG +PSGRL TWY
Sbjct: 649 VVENSYPTTGWDTL------SVPGILWTSHAGQETGHAVADVLFGDQDPSGRLTQTWY 700
>gi|336430645|ref|ZP_08610589.1| hypothetical protein HMPREF0994_06595 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336017554|gb|EGN47314.1| hypothetical protein HMPREF0994_06595 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 962
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 116/382 (30%), Positives = 184/382 (48%), Gaps = 50/382 (13%)
Query: 55 NTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHF 114
N L ++ R+ L+ +T +EKI L +SRLGI EA HG+
Sbjct: 11 NNRLPVEERLDYLIGEMTTEEKIACLTTGCPDISRLGIRASYMGGEAAHGIEARHDQAFN 70
Query: 115 SNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-GLAGLTFWSPNINIFRD 173
TSF Q I +ASF+ L + G+ V EARA++ G GL W+P +++ RD
Sbjct: 71 KGEPEPTTSFTQPIGMSASFDRELIRECGRCVGEEARALFTRNGSGGLCRWAPTVDMERD 130
Query: 174 PRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTD 233
PRWGR +E GEDP L + A+ Y++G++ G P ++ A KH+ A +++ + +
Sbjct: 131 PRWGRTEEAYGEDPYLTGEMASSYIQGMK---GDDPFYIRCGATLKHFYANNVEKDRISI 187
Query: 234 RYHFNAMVIYTYYLIKFKYIVSD------------------------------------- 256
+ Y YYL F+ + +
Sbjct: 188 SSSLDRRNKYEYYLEPFRKAIVEGGAEAVMTSYNEINGIPAIVNEEVRTILKEAWGLPGH 247
Query: 257 --CDSVDV---LYNSQHYTKTPEEAAAKSILAGLDL--NCGSFLGKHTEAAVKAGLVNES 309
CD D+ +Y+ +++ KT EE A + AG+D + S + + A++ G++ E
Sbjct: 248 VVCDGGDMQQTVYDHKYF-KTHEETVAYGLKAGVDCFTDDKSVVMEAARKALEKGMITEE 306
Query: 310 AIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTA 369
ID +I N+F T +RLGFFDG + PY +G + V Q ++D+ + A + +VLLKN
Sbjct: 307 DIDRSIRNSFRTRIRLGFFDGD-GECPYTGMGEEYVNNQEHRDICVKMAEESVVLLKNEK 365
Query: 370 GSLPLSPTAIKNLAVIGPNANV 391
LP P ++LA+IGP A+V
Sbjct: 366 NILPFLPEKTESLAIIGPLADV 387
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 59/113 (52%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+ +A AA +A VLV+G + I + E DR L LP QQ L V K A I+++
Sbjct: 554 IKEAVHAAENAQRAVLVLGCNPVINSKEEIDRSTLALPPFQQNLADAVRK-ANPETIVVL 612
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+S + I+ + + + I+W Q G +A + G+ +P+GRL MTWY
Sbjct: 613 LSNYPYSINRLQEE--MPGIIWSASGSQELGTGVASILSGKVSPAGRLNMTWY 663
>gi|291292288|gb|ADD92016.1| Xyl3C [Prevotella bryantii B14]
Length = 857
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 148/435 (34%), Positives = 205/435 (47%), Gaps = 82/435 (18%)
Query: 51 LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGP 110
L + N SL + R DL RLTL+EK + + + ++ RL IP +EWWSEALHG + G
Sbjct: 24 LPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARNG- 82
Query: 111 GTHFSNVVPGATSFPQVILTAASFNASL----FQAIG---KVVSTEARAMYNVG-LAGLT 162
AT FP AAS+N L F AIG ++ +T AR N+ GL+
Sbjct: 83 ---------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQGLS 133
Query: 163 FWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR--LKVAACCKH 220
W+PNINIFRDPRWGRGQET GEDP L K V+GLQ G N K+ AC KH
Sbjct: 134 IWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQ----GPKNSKYYKLLACAKH 189
Query: 221 YTAYDLDNWKGTDRYHFN-----AMVIYTYYLIKFK------------------------ 251
+ + + R+ FN A ++ YL FK
Sbjct: 190 FAVHSGPEYL---RHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGLPCC 246
Query: 252 ------------------YIVSDCDSV-DVLYNSQHYT-KTPEEAAAKSILAGLDLNCGS 291
+VSDC ++ D +H + +A+A+++LAG D+ CG+
Sbjct: 247 GSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVECGA 306
Query: 292 FLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQ 351
K E AVK G ++E I+ ++ +LG FD + + QL + ++
Sbjct: 307 NYDKLPE-AVKRGEISEEKINVSVMRLLKARFKLGDFDSDNMVE-WTQLPESLIACSKHK 364
Query: 352 DLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQ 411
LA A++ + LLKN G LPL A +AV+G NAN + + GNY G P K + L+
Sbjct: 365 QLAYQMAQESMTLLKNN-GILPLQKNA--RIAVMGANANDSIMLWGNYNGYPTKTISILE 421
Query: 412 GLAAVVATI-YQAGC 425
GL I Y GC
Sbjct: 422 GLQNKSKHISYIPGC 436
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
DR + LP Q+ +I + K VI + SGG I+ +IL Y G+ G
Sbjct: 629 DRTTIELPQSQRDMIALLHNSGK-KVIFVNCSGGA--IALEPESRNADAILQAWYGGEMG 685
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV FG YNP+G+LP+T+Y
Sbjct: 686 GQAVADVLFGDYNPNGKLPVTFY 708
>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
gi|238014360|gb|ACR38215.1| unknown [Zea mays]
Length = 344
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 86/157 (54%), Positives = 108/157 (68%), Gaps = 1/157 (0%)
Query: 395 MIGNYEGTPCKYTTPLQGLAAVVATI-YQAGCSNVQCGTAQVDDAKKAAASADATVLVMG 453
+I NY G PC+ TTPL+GL + V + + AGC++ C A D A A S D L MG
Sbjct: 3 LIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSEDYVFLFMG 62
Query: 454 ADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSI 513
Q E+E DR LLLPG QQ LIT VA +K PVIL+++SGG DI+FA+++PKI +I
Sbjct: 63 LSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAI 122
Query: 514 LWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQSY 550
LW GYPGQAGG AIA V FG +NPSGRLP+TWYP+ +
Sbjct: 123 LWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEF 159
>gi|374992674|ref|YP_004968169.1| glycoside hydrolase family protein [Streptomyces bingchenggensis
BCW-1]
gi|297163326|gb|ADI13038.1| glycoside hydrolase family 3 domain protein [Streptomyces
bingchenggensis BCW-1]
Length = 956
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 125/409 (30%), Positives = 196/409 (47%), Gaps = 78/409 (19%)
Query: 43 VSNPSLAS----LGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWW 98
+ +PSL++ L F + +L + RVGDL+ RLTL E++ L +V RLGI +
Sbjct: 1 MGSPSLSAAAVQLPFRDPALPLGERVGDLLDRLTLDERVAMLHQYVPAVPRLGIASFRTG 60
Query: 99 SEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN--- 155
+EALHGV+++G AT FPQ + A+++ L +A+ + VS E RA +
Sbjct: 61 TEALHGVAWLGV----------ATVFPQAVGLGAAWDEDLVRAVAEAVSVEVRAFHRHRP 110
Query: 156 ----VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNR 211
G GL W+P +N+ RDPRWGR +E EDPL ++ AT Y +GL G P+
Sbjct: 111 TSTGSGTNGLQTWAPVVNLLRDPRWGRNEEGYSEDPLHTARLATAYCQGLA---GDHPDF 167
Query: 212 LKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------- 251
L+VA KH+ AY+ + + V++ Y L F+
Sbjct: 168 LRVAPVLKHFLAYNNETDRCVTSSGLRPRVLHEYDLAAFRPVVASGAATGAMAAYNLVNG 227
Query: 252 ------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD- 286
++VSD + L +HY + + A ++ AG+D
Sbjct: 228 RPCHVSPLIETELRTWARTTGHELFVVSDEQAPSNLVEMEHYFEDHAASHAAALKAGIDS 287
Query: 287 -----LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
+ + +G+ E A++ GL+ E ++ A+ RLG FD P PY +G
Sbjct: 288 FTDHREDSATVIGRLRE-AMERGLIEEEDVNRAVRRQLQLRFRLGEFD--PDLDPYAGIG 344
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
P+ + + ++ LAL AA + +VLLKN G LPL+ +AVIGP+A+
Sbjct: 345 PEVIDSPEHRALALRAATESVVLLKND-GLLPLNANRAPRIAVIGPHAD 392
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 55/102 (53%), Gaps = 4/102 (3%)
Query: 446 DATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
D ++V+G I E+ DR LP Q+ L+ VA V + L++MS + + +A
Sbjct: 560 DVAIVVLGNHPMINGRETEDRTGTGLPQTQETLLRAVAAV-RPETALVVMSSYPYAVDWA 618
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
D + ++LW + GQ G A+ADV G P+GRLP TWY
Sbjct: 619 --DTHLPAVLWTSHGGQETGRALADVLLGEAEPAGRLPQTWY 658
>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 753
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 155/522 (29%), Positives = 238/522 (45%), Gaps = 97/522 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP E L G + GAT FPQ I A++++A +A+ V+
Sbjct: 88 TRLGIPAI-VHEECLSGF-----------MADGATVFPQAIGLASTWDAEAIEAMAGVIR 135
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
+ +A + GL SP +++ RDPRWGR +ET GEDP L + A YV+GLQ
Sbjct: 136 QQMKAAGAHQGL------SPVLDVARDPRWGRVEETFGEDPYLVASMAVSYVRGLQ---- 185
Query: 207 GSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFNAMVIYTYYLIKFK-------------- 251
G + A KH+ + + + H ++ +L F+
Sbjct: 186 GQDLTKGIFATLKHFAGHSFSEGGRNCAPVHVGERELWDIFLFPFEAAVREANAKSVMNA 245
Query: 252 ----------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
+VSD D++D L + +EAA +++ A
Sbjct: 246 YHDIDGVPCAASRELLTDILRGHFGFDGIVVSDYDAIDRLRKAHFTAGNKKEAAVQALEA 305
Query: 284 GLDLNCGSF--LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLG 341
G+D+ G+ AVK G+++E+ I+ ++ LG FDG Y +
Sbjct: 306 GIDIELPKMDCYGQPLMDAVKEGMISEATINESVERVLTAKFELGLFDG-----VYVDVD 360
Query: 342 --PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
P T ++++ D AR+ IVLLKN LPLS IK++AVIGPNA+ + M+G+Y
Sbjct: 361 SVPGLFETPEQREMSRDIARKSIVLLKND-NVLPLSKD-IKSIAVIGPNADNARNMLGDY 418
Query: 400 --------EGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCGTAQVDDAKKAAASADA 447
+ T T L+G+ V Y GC + T +A AA +ADA
Sbjct: 419 AFMAHRSYDKTSVHIVTVLEGIKNKVLDSCRITYAKGCDIIDPSTDGFVEAVNAARAADA 478
Query: 448 TVLVMGADQSI-----EAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
++V+G + I E+ DR D+ LPG Q L+ + K PVI+++++G F
Sbjct: 479 AIVVVGDNSGIFGKGTSGENDDRTDITLPGVQMQLVKAIKDTGK-PVIVVLINGRAFAAK 537
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ W YPG+ GG A+ADV FG YNP+GRLP++
Sbjct: 538 ELADNASALMEAW--YPGEEGGNAVADVLFGDYNPAGRLPIS 577
>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
Length = 955
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 155/552 (28%), Positives = 242/552 (43%), Gaps = 115/552 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY+ AT+FP +
Sbjct: 149 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESYI------------ATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N L +G + E R L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 HTWNRDLVHKVGYITGREGR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN---------------------- 228
++ KG+Q TD +VAA KHY AY +
Sbjct: 246 AELGVEMAKGMQ-TD------YQVAATSKHYIAYSNNKGGREGMARVDPQMSPREVEMLH 298
Query: 229 ---WKG--------------TDRYHFNAMVIYTYYLIKFK-------YIVSDCDSVDVLY 264
WK D F Y + + + Y+VSD D+V+ L+
Sbjct: 299 VYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDSDAVEYLF 358
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNNF 319
+ +E+ +S+LAGL++ C +F + + G + S ID + +
Sbjct: 359 SKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPMSTIDDRVRDIL 417
Query: 320 ATLMRLGFFDGHPSKQPYG---QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+G FD QPY + K+V + NQ +AL A+++ +VLLKN LPL
Sbjct: 418 RVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLLKNQDAVLPLDV 472
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCG- 431
I +AV GPNA+ + +Y + TT L+G+ V ++ GC V
Sbjct: 473 NKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVDANW 532
Query: 432 --------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
+++D A + A +D V+V+G E+ R L LPG+Q L
Sbjct: 533 PESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQLDL 592
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PV+L++++G I++A D + +IL YPG GG AIAD FG YNP
Sbjct: 593 LQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGDYNP 649
Query: 538 SGRLPMTWYPQS 549
G+L +T +P++
Sbjct: 650 GGKLTVT-FPKT 660
>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
Length = 955
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 155/549 (28%), Positives = 243/549 (44%), Gaps = 109/549 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY+ AT+FP +
Sbjct: 149 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESYI------------ATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N +L +G + E R L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 HTWNRNLVHKVGYITGREGR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN---------------------- 228
++ KG+Q TD +VAA KHY AY +
Sbjct: 246 AELGVEMAKGMQ-TD------YQVAATSKHYIAYSNNKGGREGMARVDPQMSPREVEMIH 298
Query: 229 ---WKG--------------TDRYHFNAMVIYTYYLIKFK-------YIVSDCDSVDVLY 264
WK D F Y + + + Y+VSD D+V+ L+
Sbjct: 299 VYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDSDAVEYLF 358
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNNF 319
+ +E+ +S+LAGL++ C +F + + G + S ID + +
Sbjct: 359 SKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPMSTIDDRVRDIL 417
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
+G FD HP + + K+V NQ +AL A+++ +VLLKN LPL I
Sbjct: 418 RVKFLVGLFD-HPYQIDLKETD-KEVNCAENQLVALQASKESLVLLKNQDAVLPLDVNKI 475
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCG---- 431
+AV GPNA+ + +Y + TT L+G+ V ++ GC V
Sbjct: 476 SKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVDANWPES 535
Query: 432 -----------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
+++D A + A +D TV+V+G E+ R L LPG+Q L+
Sbjct: 536 ELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQLDLLQA 595
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
V K PV+L++++G I++A D + +IL YPG GG AIAD FG YNP G+
Sbjct: 596 VVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGDYNPGGK 652
Query: 541 LPMTWYPQS 549
L +T +P++
Sbjct: 653 LTVT-FPKT 660
>gi|326329187|ref|ZP_08195515.1| beta-glucosidase [Nocardioidaceae bacterium Broad-1]
gi|325953074|gb|EGD45086.1| beta-glucosidase [Nocardioidaceae bacterium Broad-1]
Length = 979
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 133/421 (31%), Positives = 194/421 (46%), Gaps = 67/421 (15%)
Query: 48 LASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSY 107
L + F N L ++ RV DL+ RLTL EK++ L S +V RL IP ++ +EALHGV++
Sbjct: 18 LDDMPFRNPELPVEQRVDDLLGRLTLDEKLSLLHQSQVAVPRLDIPYFKAGTEALHGVAW 77
Query: 108 VGPGTHFSNVV--PGATSFPQVILTAASFNASLFQAIGKVVSTEARA--MYNVGLAGLTF 163
+ N V AT FPQ + A+++N L + +G V EARA N + GL
Sbjct: 78 SNDIDNGWNQVLADRATVFPQAVGLASTWNPGLMRRVGSAVGDEARAYNTMNPEVWGLQL 137
Query: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTA 223
W+P +N+ RDPRWGR +E EDP L S+ AT Y KGL G P LK A KH+ A
Sbjct: 138 WAPVVNLLRDPRWGRNEEGYSEDPTLTSEMATAYGKGLS---GDDPTYLKSAPVLKHFYA 194
Query: 224 YDLDNWKGTDRYHFNAMVIYTYYLIKFK-------------------------------- 251
Y+ + + + A + + Y FK
Sbjct: 195 YNNETNRSASSSNVRAQLRHEYEYAAFKPAIEADAATGVMASYNEVNGRPTHVDRALNED 254
Query: 252 ---------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTE---- 298
Y VSD + +QH+ A A + AGLD SF+ +++
Sbjct: 255 VRSWTDETLYNVSDAWGPHAVTQAQHFYDDETVAYAHVLKAGLD----SFVVDNSDNKPM 310
Query: 299 -----AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDL 353
A+ L+ E+ +D A++ RLG FD P PY P + + A++ L
Sbjct: 311 IATLKDALARDLITEADVDQAVTRVLTIRCRLGHFD--PDGGPYAGTSPDVLNSPAHRRL 368
Query: 354 ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-TTPLQG 412
AA + +VLL+N LPL+ K +AV+GP A+ T+ ++ G Y TPL G
Sbjct: 369 NRQAAEEAVVLLRNDDDLLPLNAKRAKKVAVVGPLAD---TLYADWYGGTMPYEVTPLDG 425
Query: 413 L 413
+
Sbjct: 426 I 426
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 42/86 (48%), Gaps = 6/86 (6%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E HDR + L G QQ LI V + P +++ + ++ DP + LW + G
Sbjct: 609 EVHDRASMALGGSQQDLIDAVQEA--NPRTAVVLESS-YPVTM-DEDPG--AFLWTTHAG 662
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
G A+A FG NPSGRL TWY
Sbjct: 663 SETGNAVAATLFGDSNPSGRLTQTWY 688
>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 736
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 155/527 (29%), Positives = 239/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ G T FP I AA+++ L + +G+V++
Sbjct: 82 TRLGIPMF-LAEEAPHG-----------HMAIGTTVFPTGIGMAATWSPELVKEVGQVIA 129
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 130 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 182
Query: 208 SPNRLKVAACCKHYTAY-----------------DLDN------WKGTDRYHFNAMVIYT 244
+ A KH+ AY DL K D + M Y
Sbjct: 183 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDAGALSVMTSYN 242
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T E AA +S++AG+
Sbjct: 243 SIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAIQSVMAGV 302
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 303 DVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 359
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS I +AVIGPNA+ M+G+Y
Sbjct: 360 RRKEHIELARKIAQSSITLLKNENSILPLS-KMINKVAVIGPNADNRYNMLGDYTAPQED 418
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A +AA ++
Sbjct: 419 SNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVGGSSARDF 478
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 479 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 537
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 538 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 582
>gi|298374091|ref|ZP_06984049.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
gi|298268459|gb|EFI10114.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
Length = 732
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 167/584 (28%), Positives = 252/584 (43%), Gaps = 106/584 (18%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTIDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VL--YNSQHYTKTPE--------------------------EAAAKSILAGLDLNCGSFL 293
V+ YN E + S+ AGLDL G+ +
Sbjct: 226 VMAAYNKFRGEFCAENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPSMEAGLDLEMGTLI 285
Query: 294 GKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
K+ + AVK+G V S +D + + +++ D P + GP +
Sbjct: 286 DKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGPGSM 339
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA + G
Sbjct: 340 NTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAV 399
Query: 406 Y-TTPLQGLAA---------------VVATIYQAGCSNVQCGT---------AQVDDAKK 440
Y TPL+ L A ++T + + GT A + +A +
Sbjct: 400 YEVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVE 459
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A ++D +LV G + + ES DRL++ +P Q LI EV K P ++IM G
Sbjct: 460 VARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVIMIAGS-P 516
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 517 LNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
43184]
gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
Length = 955
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 155/549 (28%), Positives = 243/549 (44%), Gaps = 109/549 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY+ AT+FP +
Sbjct: 149 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESYI------------ATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N +L +G + E R L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 HTWNRNLVHKVGYITGREGR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN---------------------- 228
++ KG+Q TD +VAA KHY AY +
Sbjct: 246 AELGIEMAKGMQ-TDH------QVAATSKHYIAYSNNKGGREGMARVDPQMSPREVEMIH 298
Query: 229 ---WKG--------------TDRYHFNAMVIYTYYLIKFK-------YIVSDCDSVDVLY 264
WK D F Y + + + Y+VSD D+V+ L+
Sbjct: 299 VYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDSDAVEYLF 358
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNNF 319
+ +E+ +S+LAGL++ C +F + + G + S ID + +
Sbjct: 359 SKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPMSTIDDRVRDIL 417
Query: 320 ATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAI 379
+G FD HP + + K+V NQ +AL A+++ +VLLKN LPL I
Sbjct: 418 RVKFLVGLFD-HPYQIDLKETD-KEVNCAENQLVALQASKESLVLLKNQDAVLPLDVNKI 475
Query: 380 KNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCG---- 431
+AV GPNA+ + +Y + TT L+G+ V ++ GC V
Sbjct: 476 SKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVDANWPES 535
Query: 432 -----------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITE 480
+++D A + A +D TV+V+G E+ R L LPG+Q L+
Sbjct: 536 ELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQLDLLQA 595
Query: 481 VAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGR 540
V K PV+L++++G I++A D + +IL YPG GG AIAD FG YNP G+
Sbjct: 596 VVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGDYNPGGK 652
Query: 541 LPMTWYPQS 549
L +T +P++
Sbjct: 653 LTVT-FPKT 660
>gi|383649199|ref|ZP_09959605.1| sugar hydrolase [Streptomyces chartreusis NRRL 12338]
Length = 945
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 138/409 (33%), Positives = 193/409 (47%), Gaps = 71/409 (17%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L R+ DL+ RLTL E+I+FL A +V RLGI + EALHGV+++GP
Sbjct: 11 FRDPQLPFADRIDDLLSRLTLDERISFLHQFAPAVDRLGIAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G VS E RAM GL W+P +N+
Sbjct: 69 --------ATVFPQAVGLGATWNPELVRRVGDAVSKEIRAMRAKDERVGLNVWAPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPKLTSAIATAYTHGLR---GDHPTYWRTAPVLKHWLAHNNETDRS 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 178 TTSSSVRPRVLHEYDLKAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLSTHLRAWADED 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD------LNCGSFLGKHTEAAVKAG 304
SD + L +S+HY T EEA A S++AG+D + +G+ A++
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAASLVAGVDSFTDHGTDSSQMIGR-LRGALERR 296
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVL 364
L+ E+ ID A+ + RLG FD P P+ G D T A++ LA +AA Q IVL
Sbjct: 297 LLTEADIDTAVRRQLSVRFRLGEFD--PEDDPHDATGEFD--TPAHRALAQEAAEQAIVL 352
Query: 365 LKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
LKN G LPL+P A +AV+G A+ K + Y GT +TPL+GL
Sbjct: 353 LKND-GVLPLAPEA--RVAVVGLLADECK--LDWYSGTLIHRSTPLEGL 396
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+ DR L LPG Q+ L+ A+ A +L ++S +A + + ++LW + G
Sbjct: 583 ETEDRTTLRLPGHQERLL-RAARAANPATVLALVSA----YPYAVDPADLPAVLWTAHGG 637
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
QA G A+A V G +P+GRLP TWY
Sbjct: 638 QAAGTALARVLAGDVSPAGRLPQTWY 663
>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 779
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 153/527 (29%), Positives = 236/527 (44%), Gaps = 97/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP + AA+++ + + G +++
Sbjct: 127 TRLGIPLF-LAEEAPHG-----------HMAIGATVFPTGLGMAATWSTDVIEQAGVIIA 174
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R G + P +++ +PRW R +ET GEDP+L+ A VKGL D
Sbjct: 175 KEIRLQ-----GGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVKGLGAGDIT 229
Query: 208 SPNRLKVAACCKHYTAYD------------------LDNW-----KGTDRYHFNAMVIYT 244
P A KH+ AY LDN+ + D + M Y
Sbjct: 230 KP--FATIATLKHFIAYGIPESGQNGAPSIIGTRDLLDNFLPPFRRAIDAGALSVMTSYN 287
Query: 245 -----------YYLIK-------FK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+ L + FK ++VSD S+D +Y + H + +EA +++ AG+
Sbjct: 288 SMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLYSIDGIYGTHHTVSSLQEAGIEALRAGV 347
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G+ AV+ G V+E+AID A+ + +G F+ HP P + V
Sbjct: 348 DVDLGANAFALLCDAVRQGRVSEAAIDEAVLRILRMKIEMGLFE-HPYVNP--KTAKTGV 404
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
T N +A A + I LLKN+ LPLS +AVIGPNA+ M+G+Y
Sbjct: 405 RTAENIQVAKRVAEESITLLKNSNKLLPLSKNI--KIAVIGPNADNRYNMLGDYTAPQQD 462
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSIE- 459
T L G+ + ++ Y GCS ++ +A +AA AD V+ +G + +
Sbjct: 463 SNVKTILDGIRSKLSPSQITYVKGCSIRDTVFNEIGEAVRAAREADVIVVAVGGSSARDF 522
Query: 460 ----------------------AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
E DR L L G Q L+ + + K P+++I + G
Sbjct: 523 KTSYQETGAAITSSKVVSDMESGEGFDRASLSLMGIQSRLLQSLKETGK-PMVVIYIEGR 581
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
D ++A + ++L YPGQ GG AIA+V FG YNP+GRLP+T
Sbjct: 582 PLDKTWASE--QADALLTAYYPGQEGGNAIANVLFGDYNPAGRLPIT 626
>gi|365877135|ref|ZP_09416640.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442587941|ref|ZP_21006755.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
gi|365754995|gb|EHM96929.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442562440|gb|ELR79661.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
Length = 827
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 141/533 (26%), Positives = 233/533 (43%), Gaps = 103/533 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E +HG+++ AT FP I ++++ L IG +
Sbjct: 164 TRLGIP-VDFTNEGIHGLTH-----------DRATPFPAPINIGSTWDKDLVGKIGNTIG 211
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EA Y +G + ++P +++ RDPRWGR ET GEDP + +Y VKG+QQ
Sbjct: 212 KEA---YYLGYTNV--YAPILDVSRDPRWGRVVETYGEDPFMIGEYGKRMVKGIQQNG-- 264
Query: 208 SPNRLKVAACCKHYTAYDL-----DNWKGTDRYHFNAMVIYTYYLIKFK----------- 251
VA+ KHY Y + D TD H ++T YL FK
Sbjct: 265 ------VASTLKHYAVYSVPKGGRDGLARTDP-HVAPKEMHTMYLYPFKEVIRKEHPLGV 317
Query: 252 -------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAKS 280
Y+VSD D+++ L+ H K EE K+
Sbjct: 318 MASYNDYDGVPVISSKYFLTDLLRKEYGFDGYVVSDSDALEFLHGKHHVAKDYEEGIQKA 377
Query: 281 ILAGLDLNCGSFLGKH----TEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQP 336
+ AGLD+ K A+K+G + E ++ + + T RLG FD P +
Sbjct: 378 LEAGLDVRTNFTQPKEYLTALMDALKSGKIKEEVLNERVRSVLKTKFRLGLFD-EPIRN- 435
Query: 337 YGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMI 396
+ + + V T+ ++ L++D R+ +VLLKN +LPL +KN+ + GP A+
Sbjct: 436 FIKEADRKVHTKEDEALSVDVNRRSVVLLKNEKQTLPLDTGKLKNILITGPLADAVNYTT 495
Query: 397 GNYEGTPCKYTTPLQGLAAV-----VATIYQAGCSNVQCG---------------TAQVD 436
Y + TT +G+ + T Y G + G +++
Sbjct: 496 SRYGPSNNPVTTIRKGIEDYASLHHINTSYTKGVDVIDEGWPETEIIPVEPTEKEKSEIS 555
Query: 437 DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
A +D + VMG + ES R L LPG+Q + ++ K K P++L++++G
Sbjct: 556 KTISMAEKSDVIIAVMGESEKEVGESRSRSSLNLPGKQTYFLQQLYKTRK-PIVLVLVNG 614
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I++ + + +IL + G G +A+ FG NP G+LP++ +P+S
Sbjct: 615 RPLTINWE--NKYLPAILETWFLGPQSGNIVAETLFGENNPGGKLPIS-FPKS 664
>gi|393719789|ref|ZP_10339716.1| glycosyl hydrolase family protein [Sphingomonas echinoides ATCC
14820]
Length = 896
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 132/401 (32%), Positives = 192/401 (47%), Gaps = 89/401 (22%)
Query: 67 LVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQ 126
LV ++T +EK+ L++ A ++ RL IP Y WW+E+LHG +GP +P T+FP+
Sbjct: 49 LVAQMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGA--LGP-------IP-TTNFPE 98
Query: 127 VILTAASFNASLFQAIGKVVSTEARAMYNVGL---------AGLTFWSPNINIFRDPRWG 177
I AASF+A L + +STE RA++ G GL WSPNINIFRDPRWG
Sbjct: 99 PIGLAASFDAPLVHQVADAISTEVRALHTQGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 158
Query: 178 RGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHF 237
RGQET GEDP LA++ +V G+Q G +P+ +V + KH+ A+ + + R+
Sbjct: 159 RGQETYGEDPYLAARMGVAFVTGMQ---GPNPDLPRVISTPKHFAAH---SGPESTRHAA 212
Query: 238 NAMV----IYTYYLIKFK------------------------------------------ 251
N V + YL F+
Sbjct: 213 NVYVSPHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIEGQPACASDLLLKDHLRGAWGFTG 272
Query: 252 YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSF-------LGKHTEAAVKAG 304
Y+VSDCD+V + ++ Y A A ++ AG+D C + L A+K
Sbjct: 273 YVVSDCDAVKDIADNHKYAPDQATAVAAAMKAGVDNECNTQTIGDIGGLPDRFGDALKRN 332
Query: 305 LVNESAIDHAISNNFATLMRLGFFD---GHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
L+++ ID + F+ +R G G P+ P + T + LALDAA +
Sbjct: 333 LISQGDIDRTLVRLFSARLRNGDLPGVAGAPAVVPVSA-----ILTPDHIALALDAAEKS 387
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+VLLKN G LPL P A +AV+GP + T+ + GNY T
Sbjct: 388 LVLLKND-GVLPLRPGA--RIAVVGPLGDATRVLRGNYSST 425
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 55/83 (66%), Gaps = 3/83 (3%)
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
D+ L LP +QQ L+ E AK P+I++ M+G ++++AK++ W YPGQ+G
Sbjct: 654 DKTSLDLPAEQQALL-EHAKATGKPLIVVAMNGSPINLAWAKDNAAAIVEAW--YPGQSG 710
Query: 524 GAAIADVCFGRYNPSGRLPMTWY 546
G A+A+V G+ NP+GRLP+T+Y
Sbjct: 711 GLAVANVLTGKTNPAGRLPLTFY 733
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 123/219 (56%), Gaps = 12/219 (5%)
Query: 45 NPSLASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHG 104
N LA C+++ R LV++LT+ EK+ LVD + RLG+P Y WWSE LHG
Sbjct: 28 NGVLAKTKACDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHG 87
Query: 105 VSYVGPGTHFSNV---VPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGL 161
V+ PG F+ ATSF VI A+ + L +G +STEARA G GL
Sbjct: 88 VA-GSPGVVFNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGL 146
Query: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHY 221
+W+PNIN ++DPRWGRG ETPGEDPL Y V GL+ G+ KV A CKH+
Sbjct: 147 DYWTPNINPYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLE----GNGTVRKVIATCKHF 202
Query: 222 TAYDLDNWKGTDRYHFNAMV----IYTYYLIKFKYIVSD 256
AYDL+ W+G RY F+A+V + YYL F+ D
Sbjct: 203 AAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARD 241
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 153/311 (49%), Gaps = 44/311 (14%)
Query: 252 YIVSDCDSV-DVLYNSQHYTKTPEEAAAKSILAGLDLNCG-SFLGKHTE--AAVKAGLVN 307
YI SDC+++ D L ++ ++++TP EAAA + +AG D C S +T+ A L++
Sbjct: 314 YITSDCNAIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVGAYNQSLLS 373
Query: 308 ESAIDHAISNNFATLMRLGFFD------GHPSKQPYGQLGPKDVCTQANQDLALDAARQG 361
ES ID A+ + L+R G+ D P K P+
Sbjct: 374 ESVIDTALRRLYEGLIRAGYLDHGRPASSSPDKAPF------------------------ 409
Query: 362 IVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATIY 421
++ LPL T K +A+IG AN T+T+ G Y G P Y P+ + + + Y
Sbjct: 410 -----SSPDFLPLDLTG-KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFY 463
Query: 422 QAGCSNVQCGTAQ--VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLIT 479
A V A A AA SAD + G D ++ +E DR + P Q LI
Sbjct: 464 YANGPVVNSTDADTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIE 523
Query: 480 EVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSG 539
++A+V K P+++I + G D + N+ I+SILWVGYPGQ+GG A+ DV G+ +G
Sbjct: 524 KLAQVGK-PMVVIQL-GDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAG 581
Query: 540 RLPMTWYPQSY 550
RLP+T YP Y
Sbjct: 582 RLPVTQYPAGY 592
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 151/529 (28%), Positives = 238/529 (44%), Gaps = 102/529 (19%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
SRLGIP + EA HG ++ GAT+FP A+++N L + +GKV++
Sbjct: 119 SRLGIPLF-LAEEAPHG-----------HMAIGATTFPTAPGQASTWNPELIERMGKVIA 166
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R G + P ++I RDPRW R +E+ GED L ++ YV+G T G
Sbjct: 167 AEIRLQ-----GGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRG---TGSG 218
Query: 208 SPNRLKVA-ACCKHYTAYDLD--------NWKGTDRYH------FNAMV------IYTYY 246
++ + A + KH+ AY N G F A V + T Y
Sbjct: 219 DLSQSRHALSTLKHFIAYGASEGGQNGGSNLLGERELRETYLPPFEAAVKAGARSVMTAY 278
Query: 247 -----------------LIKFK-----YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG 284
+++ + ++VSD S++ L+ + + EAA +++ AG
Sbjct: 279 NSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAVQALRAG 338
Query: 285 LD--LNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY-GQLG 341
+D L G+F A +AG V E+ ID A+ A +G F+ PY +
Sbjct: 339 VDADLKGGAF--ASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-----NPYIDEAA 391
Query: 342 PKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG 401
+V A+ +LAL+AARQ + LL+N +G+LPL P ++ +AVIGPNA+ +G+Y
Sbjct: 392 AAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTA 451
Query: 402 TPCKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGADQSI 458
T GL ++ +Y GC+ +++ A AA DA V+V+G +
Sbjct: 452 QQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGSSAR 511
Query: 459 E-----------------------AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMS 495
+ E DR L L G+Q+ L+ + K P+I++ ++
Sbjct: 512 DFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIA 570
Query: 496 GGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
G D+ A + W YPG GG A+A+ G NP+GRLP+T
Sbjct: 571 GRPLDLRRASEQADALLMAW--YPGARGGDAVAETILGHNNPAGRLPIT 617
>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
Length = 958
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 156/548 (28%), Positives = 239/548 (43%), Gaps = 107/548 (19%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY AT+FP +
Sbjct: 153 LNEVQRFFIEE----TRLGIP-VDFTNEGIRGVESY------------KATNFPTQLGLG 195
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLAS 191
++N L +G + EAR L ++P +++ RD RWGR +E GE P L +
Sbjct: 196 HTWNRKLIHQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVA 250
Query: 192 KYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR----------------- 234
+ VKG+Q +VAA KH+ AY N KG
Sbjct: 251 ELGIEMVKGMQHN-------YQVAATGKHFIAYS--NNKGAREGMARVDPQMSPREVEMI 301
Query: 235 --YHFNAMV----------------------IYTYYLIKFK-------YIVSDCDSVDVL 263
Y F ++ Y + + + + Y+VSD D+V+ L
Sbjct: 302 HVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRGQMGFRGYVVSDSDAVEYL 361
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNN-FATL 322
Y K +EA +S+ AGL++ C +F + LV E + I N+ +
Sbjct: 362 YTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVQEGGLSEEIINDRVRDI 420
Query: 323 MRLGFFDGHPSKQPYGQLGPKDVCTQANQD--LALDAARQGIVLLKNTAGSLPLSPTAIK 380
+R+ F G L D + ++ +AL A+R+ IVLLKN +LPL +I+
Sbjct: 421 LRVKFLVGLFDTPYQTDLKGADEEVEKEENEIVALQASRESIVLLKNDKNALPLDVASIR 480
Query: 381 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCG----- 431
+AV GPNA+ T + +Y TT L G+ V +Y GC V
Sbjct: 481 KIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKAEVLYTKGCELVDANWPESE 540
Query: 432 ----------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEV 481
++D A A AD V+V+G Q E+ R L LPG+Q L+ V
Sbjct: 541 IIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGENKSRSSLDLPGRQLDLLKAV 600
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
K PV+L++++G +++A D + +I+ YPG GG A+ADV FG YNP G+L
Sbjct: 601 QATGK-PVVLVLINGRPLSVNWA--DKFVPAIIEAWYPGSKGGTAVADVLFGDYNPGGKL 657
Query: 542 PMTWYPQS 549
+T +P+S
Sbjct: 658 TVT-FPKS 664
>gi|103486503|ref|YP_616064.1| glycoside hydrolase [Sphingopyxis alaskensis RB2256]
gi|98976580|gb|ABF52731.1| glycoside hydrolase, family 3-like protein [Sphingopyxis alaskensis
RB2256]
Length = 772
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 152/484 (31%), Positives = 225/484 (46%), Gaps = 73/484 (15%)
Query: 122 TSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQ 180
T FP + AAS++ + ++ + EA A G+ + +SP ++I RDPRWGR
Sbjct: 141 TIFPISLGEAASWDLQAIEKAARISAIEASA------EGIHWTFSPMVDIARDPRWGRIS 194
Query: 181 ETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYH---F 237
E GED L S A V+G Q D P+ + A KH+ AY G D YH
Sbjct: 195 EGAGEDVYLGSLIAKARVRGYQGGDLSRPD--TILATAKHFAAYGAAQ-AGRD-YHTVDI 250
Query: 238 NAMVIYTYYLIKFK-----------------------------------------YIVSD 256
+ + YL FK ++V+D
Sbjct: 251 SERTMRDVYLPPFKAAADAGAATFMTAFNEYDGVPASGSHYLLTDVLRKKWGFKGFVVTD 310
Query: 257 CDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNC-GSFLGKHTEAAVKAGLVNESAIDHAI 315
S++ + Y K ++A +++ AG+D++ G+ ++ +V G V+ + ID A+
Sbjct: 311 YTSINEMV-PHGYAKDLKQAGEQAMRAGVDMDMQGAVFMENLAKSVAEGKVDTARIDAAV 369
Query: 316 SNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
RLG FD P + + A + A D AR+ IVLLKN LPL+
Sbjct: 370 KAILEMKYRLGLFD-DPYRYADAAREKATIYKPAFLEAARDVARKSIVLLKNKDNVLPLA 428
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTP---LQGLAAVV---ATI-YQAGCSNV 428
+A K++AVIGP N + MIG++ + T P L+GL A TI Y G S
Sbjct: 429 ASA-KSIAVIGPLGNSKEDMIGSWSAAGDRRTRPVTLLEGLQAGAPKGTTIAYAKGASYH 487
Query: 429 QCGTAQVD---DAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVA 485
+ D +A A +D + MG ++ E+ R L LPG QQ L+ + K
Sbjct: 488 FDDVGKTDGFAEALALAEKSDVIIAAMGEHWNMTGEAASRTSLDLPGNQQALLEALEKTG 547
Query: 486 KGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
K PVIL++MSG I +A D + +IL YPG GG AIAD+ +GRYNPSG+LP+T
Sbjct: 548 K-PVILVLMSGRPNSIEWA--DANVDAILEAWYPGTMGGHAIADILYGRYNPSGKLPVT- 603
Query: 546 YPQS 549
+P++
Sbjct: 604 FPRT 607
>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
Length = 791
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 150/525 (28%), Positives = 233/525 (44%), Gaps = 90/525 (17%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP +++N L IG+V +
Sbjct: 141 TRLGIP-VDFTNEGIRGLCH-----------DRATFFPSQSGQGSTWNKELIARIGEVEA 188
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQ---- 203
EA + L +SP ++I +DPRWGR E GEDP L + ++ LQ+
Sbjct: 189 KEA-----IALGYTNIYSPILDICQDPRWGRSVECYGEDPYLVGQLGKQMIQSLQKHRLV 243
Query: 204 ---------------TDGG-------SPNRLKVAACCKHYTAY-------------DLDN 228
DG SP ++ A+ D D
Sbjct: 244 STVKHFAVYSIPVGGRDGKTRTDPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDG 303
Query: 229 WKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
T +HF ++ Y K Y+VSD ++V+ + H E A+++ AGL N
Sbjct: 304 EPITSSHHFLTEILRQEYGFK-GYVVSDSEAVEFITTKHHVVSNEVEGVAQAVNAGL--N 360
Query: 289 CGSFLGKHTE------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+ K + A+K G V+ I+ +++ LG FD +P + Q
Sbjct: 361 IRTHFTKPEDFVLPLRQAIKEGKVSPETINSRVADILRIKFWLGLFD-NPYRGDEKQ-EE 418
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
K V + +Q +AL+AARQ +VLLKN LPL T +K++AVIGPNAN +I Y
Sbjct: 419 KIVHCKEHQQVALEAARQSLVLLKNENQLLPLKKT-VKSVAVIGPNANEQTQLICRYGPA 477
Query: 403 PCKYTTPLQGLAAVV---ATIYQAGCSNVQCGTAQ---------------VDDAKKAAAS 444
T QG+ ++ +Y+ GC + + +D+A AA +
Sbjct: 478 NAPIKTVYQGIKELLPETEVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARN 537
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
A+ VLV+G + E R L LPG QQ L+ + K P +L+++ G I++A
Sbjct: 538 AEVVVLVLGGSELTVREDRSRTSLDLPGHQQELMQAIHATGK-PTVLVLLDGRAATINYA 596
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I +IL +PG+ G A+A+ FG YNP GRL +T +P+S
Sbjct: 597 NQ--YIPAILHAWFPGEFAGTAVAEALFGDYNPGGRLAVT-FPKS 638
>gi|169602325|ref|XP_001794584.1| hypothetical protein SNOG_04159 [Phaeosphaeria nodorum SN15]
gi|160706143|gb|EAT87919.2| hypothetical protein SNOG_04159 [Phaeosphaeria nodorum SN15]
Length = 868
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/261 (40%), Positives = 146/261 (55%), Gaps = 6/261 (2%)
Query: 293 LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQD 352
+ H A GL NE+ +D A+ +A+L+R+G+FD S QPY QLG V T A+Q
Sbjct: 1 MQSHLPGAFSQGLTNETVLDQALIRQYASLVRVGWFDS-ASDQPYRQLGWNTVATNASQQ 59
Query: 353 LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQG 412
LA AA +GIVLLKN G LP+S + + + G AN T ++GNY G +PL
Sbjct: 60 LARRAATEGIVLLKND-GVLPISIDSSMKVGLFGEWANATTQLLGNYAGVSTYLHSPLYA 118
Query: 413 LAAVVATI-YQAGCSNVQCG--TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
L + ATI Y G Q T + + K A +D V V G D +E E DR L
Sbjct: 119 LQQINATINYAGGLPGGQGDPTTERWLNLKPAIDGSDVLVYVGGIDNGVEEEGMDRNSLQ 178
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
G Q +I ++A K P I+++M GG D + KN+P +++ILW GYPGQ GG+AI D
Sbjct: 179 WTGAQLDVIGQLADTGK-PTIVVVMGGGQIDSTPIKNNPNVSAILWGGYPGQDGGSAIVD 237
Query: 530 VCFGRYNPSGRLPMTWYPQSY 550
+ G+ P+GRLP T YP ++
Sbjct: 238 ILTGKVAPAGRLPQTQYPSNF 258
>gi|262383006|ref|ZP_06076143.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262295884|gb|EEY83815.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 732
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 165/584 (28%), Positives = 252/584 (43%), Gaps = 106/584 (18%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTVDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VL--YNSQHYTKTPE--------------------------EAAAKSILAGLDLNCGSFL 293
V+ YN E + S+ AGLDL G+ +
Sbjct: 226 VMAAYNKFRGEFCAENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPSMEAGLDLEMGTLI 285
Query: 294 GKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
K+ + AVK+G + S +D + + +++ D P + GP +
Sbjct: 286 DKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGPGSM 339
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA + G
Sbjct: 340 NTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAV 399
Query: 406 Y-TTPLQGLAA---------------VVATIYQAGCSNVQCGT---------AQVDDAKK 440
Y TPL+ L A ++T + + GT A + +A +
Sbjct: 400 YEVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVE 459
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A ++D +LV G + + ES DRL++ +P Q LI EV K P +++M G
Sbjct: 460 VARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-P 516
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 517 LNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|301307693|ref|ZP_07213650.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
gi|423337298|ref|ZP_17315042.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
CL09T03C24]
gi|300834367|gb|EFK64980.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
gi|409237758|gb|EKN30554.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
CL09T03C24]
Length = 732
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 165/584 (28%), Positives = 252/584 (43%), Gaps = 106/584 (18%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTVDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VL--YNSQHYTKTPE--------------------------EAAAKSILAGLDLNCGSFL 293
V+ YN E + S+ AGLDL G+ +
Sbjct: 226 VMAAYNKFRGEFCAENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPSMEAGLDLEMGTLI 285
Query: 294 GKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
K+ + AVK+G + S +D + + +++ D P + GP +
Sbjct: 286 DKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGPGSM 339
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA + G
Sbjct: 340 NTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAV 399
Query: 406 Y-TTPLQGLAA---------------VVATIYQAGCSNVQCGT---------AQVDDAKK 440
Y TPL+ L A ++T + + GT A + +A +
Sbjct: 400 YEVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVE 459
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A ++D +LV G + + ES DRL++ +P Q LI EV K P +++M G
Sbjct: 460 VARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-P 516
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 517 LNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|430736195|gb|AGA60127.1| glycoside hydrolase [Aminobacter sp. Gsoil204]
Length = 772
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 167/581 (28%), Positives = 257/581 (44%), Gaps = 104/581 (17%)
Query: 50 SLGFCNTSLGIDLRVGDLVKRLTLQEKITFLV------DSAGSVSRLG------------ 91
S G+ +D + +L+ ++TL+EKI L DS G R G
Sbjct: 43 STGWMKPDPAMDKFIAELMAKMTLEEKIGQLSLLTSDWDSTGPTMRQGYQEDIRKGRIGS 102
Query: 92 -----IPKY--EWWSEALHGVSYVGPGTHFSNVVPG-ATSFPQVILTAASFNASLFQAIG 143
KY + A+ P +V+ G T FP + AAS++ +
Sbjct: 103 IFNAFTAKYTRDLQRVAVEETRLKIPLLFGYDVIHGHRTIFPISLGEAASWDLKAIEKAA 162
Query: 144 KVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ 202
++ +TEA A G+ + ++P +++ RDPRWGR E GED L S+ A V+G Q
Sbjct: 163 RISATEASA------EGIHWTFAPMVDVARDPRWGRISEGAGEDVYLGSRIAEARVRGFQ 216
Query: 203 QTDGGSPNRLKVAACCKHYTAY-------------------------------------- 224
D + + V A KH+ AY
Sbjct: 217 GNDLKAVD--TVLATAKHFAAYGAAQAGRDYGTVDISERTLRDVYLPPFKAAADAGAATF 274
Query: 225 -----DLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
D+D + +H V+ + K ++V+D S++ + + Y+K ++A +
Sbjct: 275 MTSFNDVDGIPASGNHHLLTDVLRDKWGFK-GFVVTDYTSINEMV-AHGYSKDLQQAGEQ 332
Query: 280 SILAGLDLNC-GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYG 338
+I AG+D++ G+ +H +V G V+ + ID A+ RLG F+ PY
Sbjct: 333 AINAGVDMDLQGAVFMEHLAKSVAEGKVDVARIDAAVKAILEMKYRLGLFE-----DPYR 387
Query: 339 QLGP-KDVCTQANQDL---ALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKT 394
++ T D A D AR+ +VLLKN +LPL+ +A K++AVIGP +
Sbjct: 388 YSDEAREKATVYRPDFLEAARDVARKSMVLLKNANNALPLAASA-KSIAVIGPLGDSKAD 446
Query: 395 MIGNYEGTPCKYTTP---LQGLAAVV----ATIYQAGCSNVQCGTAQVD---DAKKAAAS 444
MIG++ + T P L+G+ A + Y G S + D +A A
Sbjct: 447 MIGSWSAAGDRKTRPVTLLEGMQARAPKGQSVAYVRGASYAFEDAGKTDGFAEAIALAQK 506
Query: 445 ADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
+D V MG + E+ R L LPG QQ L+ E+ K K P+IL++MSG I +A
Sbjct: 507 SDVIVAAMGERWDMTGEAASRTSLDLPGNQQALLQELKKTGK-PIILVLMSGRPNSIEWA 565
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
D + +IL YPG GG AIADV +G YNPSG+LP T+
Sbjct: 566 --DANVDAILEAWYPGTMGGHAIADVLYGDYNPSGKLPATF 604
>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 793
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 158/560 (28%), Positives = 243/560 (43%), Gaps = 107/560 (19%)
Query: 60 IDLRVGDLVKRL-TLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVV 118
IDL V D+ K + + E F ++ +RLGIP ++ +E + GV
Sbjct: 117 IDL-VKDMEKHVWAMNETQRFFIEQ----TRLGIPA-DFTNEGIRGVEAYE--------- 161
Query: 119 PGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGR 178
AT FP + ++N L G + EARA L ++P +++ RD RWGR
Sbjct: 162 --ATGFPTELNMGMTWNKELVHQEGIITGREARA-----LGYTNVYAPIMDVARDQRWGR 214
Query: 179 GQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN--WKGTDR-- 234
+E+ GEDP L + KG+QQ DG KVA+ KH+ Y + +G R
Sbjct: 215 LEESYGEDPYLVASMGIALAKGIQQ-DG------KVASTAKHFAVYSANKGAREGQARTD 267
Query: 235 -------------YHFNAMVIY---------------------TYYLIK--------FKY 252
Y F ++ Y+LI+ Y
Sbjct: 268 PQVAPREVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEMGFTGY 327
Query: 253 IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK-----HTEAAVKAGLVN 307
+VSD D+++ L H +EA ++ +AG+++ +F + VK G +
Sbjct: 328 VVSDSDALEYLATKHHVAANLKEAVFQAFMAGMNVRT-TFKAPDSIIIYLRQLVKEGRIP 386
Query: 308 ESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKN 367
I+H +++ RLG FD HP + + K V + A+Q +AL A+R+ +VLLKN
Sbjct: 387 MDTINHRVADVLRVKFRLGLFD-HPYVESAAET-RKVVNSDASQQIALQASRESVVLLKN 444
Query: 368 TAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA---TIYQAG 424
LPL ++ +AV+GPNA +Y LQG+ A + +Y G
Sbjct: 445 NNNILPLV-KSLDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVLYAKG 503
Query: 425 CSNVQCG---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLL 469
V A +D A A ++V+G + ES R DL
Sbjct: 504 VDLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSRTDLD 563
Query: 470 LPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIAD 529
LPG Q L+ + K PV+++++ I++ D I I++ GYPG GG A+AD
Sbjct: 564 LPGHQLELVKAIKATGK-PVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKGGIAVAD 620
Query: 530 VCFGRYNPSGRLPMTWYPQS 549
V FG YNP G+L +TW P+S
Sbjct: 621 VLFGDYNPGGKLTLTW-PKS 639
>gi|254514842|ref|ZP_05126903.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
gi|219677085|gb|EED33450.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
Length = 740
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 157/537 (29%), Positives = 238/537 (44%), Gaps = 97/537 (18%)
Query: 61 DLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPG 120
D+ + + ++RL ++E SRLGIP + +HG
Sbjct: 68 DVNIVNELQRLAVEE------------SRLGIPLL-IGRDVIHGFK-------------- 100
Query: 121 ATSFPQVILTAASFNASLFQAIGKVVSTEA-RAMYNVGLAGLTFWSPNINIFRDPRWGRG 179
T FP + AAS+N S+ +A +V + EA RA N TF +P I+I RDPRWGR
Sbjct: 101 -TIFPIPLGQAASWNPSVVEAGARVSAEEAVRAGIN-----WTF-APMIDITRDPRWGRI 153
Query: 180 QETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL-DNWKGTDRYHFN 238
E+ GEDP L SK V+G Q D +P+ +AAC KH+ Y + + + +
Sbjct: 154 AESLGEDPYLCSKLGAAMVRGFQSDDLSAPD--AIAACAKHFAGYGAAEGGRDYNTANIP 211
Query: 239 AMVIYTYYLIKFK-----------------------------------------YIVSDC 257
+ YL FK +VSD
Sbjct: 212 ENEMRNVYLRPFKAAAEAGVATFMSAFCDLNGVPATGNRWLMDEILRQEWSYQGMVVSDW 271
Query: 258 DSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGK-HTEAAVKAGLVNESAIDHAIS 316
+SV V + +T E+AA ++ +AG+D+ S + H E V + ID ++
Sbjct: 272 ESV-VEMSVHGFTHDDEQAAYEAAMAGIDMEMASSSYRDHLEGLVGENKITLEQIDRMVA 330
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPK-DVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
LG F+ QPY ++ +AN A AA Q VLLKN +LPL
Sbjct: 331 RVLRLKFELGLFE-----QPYTDPAQHPELLNKANLKAAKQAATQSCVLLKNAHQTLPLV 385
Query: 376 PTAIKNLAVIGPNANVTKTMIGN--YEGTPCKYTTPLQGLAAV----VATIYQAGCSNVQ 429
P + ++A+IGP A+ +G ++G T Q L + V Y+ +
Sbjct: 386 PAKLDSIALIGPLADDGYEQMGTWVFDGDAAHSVTCRQALDELLGRTVEIHYEKALETTR 445
Query: 430 CGTAQ-VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGP 488
+ AK AA +DA ++V+G + + E+H R ++ LPG QQ LI VA K P
Sbjct: 446 AASPDNFAAAKNAAQQSDAAIIVVGEEAFMSGEAHSRANIDLPGHQQALIEAVASAGK-P 504
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTW 545
+I++IM+G I ++L+ +PG GG AIAD+ G +PSG+LP+T+
Sbjct: 505 IIVVIMAGRPLTIEPVLE--HADAVLYAWHPGTMGGPAIADLLLGLESPSGKLPVTF 559
>gi|373459262|ref|ZP_09551029.1| glycoside hydrolase family 3 domain protein [Caldithrix abyssi DSM
13497]
gi|371720926|gb|EHO42697.1| glycoside hydrolase family 3 domain protein [Caldithrix abyssi DSM
13497]
Length = 749
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 161/536 (30%), Positives = 230/536 (42%), Gaps = 96/536 (17%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
E IT + A +RLGIP L+G+ + H SN + AT FPQ I AA+F
Sbjct: 102 EMITIMQKMATQETRLGIP-------ILYGIDAI----HGSNYIKEATLFPQSIAMAATF 150
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQETPGEDPLLASKY 193
N L + G++ + E RA+ G+ + ++P + + R+P W R ET GED L S+
Sbjct: 151 NRQLSRREGEITALETRAV------GIPWNFNPVLGLGRNPLWPRFWETYGEDVYLTSEM 204
Query: 194 ATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRY--HFNAMVIYTYYLIKFK 251
Y+ GLQ DG +VAAC KHY Y G DR ++ +L FK
Sbjct: 205 GRAYILGLQGEDGDISRADRVAACMKHYAGYSFP-LSGHDRTPAWIPERLMREMFLTPFK 263
Query: 252 -----------------------------------------YIVSDCDSVDVLYNSQHYT 270
VSD + V L++ H
Sbjct: 264 SAVDAGVYTVMINSGEVNGVPAHSSAFLLTRVLREEWGFKGLAVSDWEDVKRLHDRDHVA 323
Query: 271 KTPEEAAAKSILAGLDLNCGSF---LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGF 327
+PEEA +++AGLD++ F ++ VK G V E+ ID A++N + G
Sbjct: 324 ASPEEAVKMAVMAGLDMSMVPFDFSFAEYLYQLVKKGEVPETRIDDAVANILRVKFQAGL 383
Query: 328 FDG-HPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIG 386
F+ P Q +G + ++L AAR+ I LLKN LPL + V G
Sbjct: 384 FENPFPDPQRLQLIGKPEFA-----QVSLSAAREAITLLKNENNILPLQKDV--KILVTG 436
Query: 387 PNANVTKTMIGNYEGT---------PCKYTTPLQGLAAVVATIYQAGCSNVQCG--TAQV 435
P AN + G + T P Y T LQ A+ Q VQ Q
Sbjct: 437 PTANSRAYLNGGWTYTWQGDDERYYPAHYKTILQ---AITEKAGQRNVVYVQGADIETQK 493
Query: 436 DDAKKAAASADATVLVMGADQSIEAESHDRL-DLLLPGQQQLLITEVAKVAKGPVILIIM 494
D + + D V+V ++ E+ + DL LP Q+ LI ++A+ K P++L+++
Sbjct: 494 DMGEAVQKARDVDVIVACLGEATYCETPGNINDLHLPAVQRELIHQLARTGK-PIVLVLV 552
Query: 495 SGGGFDISFAKND--PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQ 548
G I ND PK IL PG G AIA+ FG NPSG+LP T YP+
Sbjct: 553 EGRPRVI----NDIVPKTKGILMAYLPGPYGSEAIAEALFGEVNPSGKLPFT-YPK 603
>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
Length = 948
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 156/553 (28%), Positives = 246/553 (44%), Gaps = 117/553 (21%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
+ E F V+ +RLGIP ++ +E + G+ SY AT+FP +
Sbjct: 149 INEVQRFFVEE----TRLGIP-VDFTNEGIRGIESY------------KATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N L + +G + EAR L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 TTWNRQLIRQVGYITGREAR------LLGYTNVYAPILDVGRDQRWGRYEEIYGESPFLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAY-------------------------- 224
++ +GLQ TD +VA+ KH+ AY
Sbjct: 246 AELGIQMTRGLQ-TD------FQVASTAKHFAAYSNNKGGREGMSRVDPQMPPREVENIH 298
Query: 225 ---------------------DLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVL 263
D D YH+ V+ + + YIVSD D+++ L
Sbjct: 299 LYPWERVVQEAGLLGAMSSYNDYDGIPIQGSYHWLTEVLRHRFGFR-GYIVSDSDALEYL 357
Query: 264 YNSQHYTKTPEEAAAKSILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNN 318
++ H +EA ++++AGL++ C SF+ E ++ G + S ID + +
Sbjct: 358 FSKHHTAADMKEAVYQAVMAGLNVRCTFRSPDSFVLPLREL-IREGRIPMSVIDRLVGDI 416
Query: 319 FATLMRLGFFDGHPSKQPYG---QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLS 375
G FD PY + ++V ++ NQ +AL A+RQ IVLLKN LPL
Sbjct: 417 LRVKFITGIFD-----NPYQMNLKAADQEVNSERNQAVALQASRQSIVLLKNQDRLLPLD 471
Query: 376 PTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVAT----IYQAGCSNVQCG 431
+ ++ + V GPNA+ + +Y TT L+G+ V Y GC V
Sbjct: 472 RSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGIRDKVENNIEVSYAKGCDVVDPH 531
Query: 432 -----------TAQ----VDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQL 476
T+Q +D A A +D ++V+G + E+ R L LPG+Q
Sbjct: 532 WPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCGENKSRSSLDLPGRQLD 591
Query: 477 LITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYN 536
L+ V K PV+L++++G +++A D I +I+ YPG GG A+ADV FG YN
Sbjct: 592 LLKAVQATGK-PVVLVLINGRPLSVNWA--DRFIPAIVEAWYPGSQGGTAVADVLFGDYN 648
Query: 537 PSGRLPMTWYPQS 549
P G+L +T +P+S
Sbjct: 649 PGGKLTVT-FPKS 660
>gi|260598593|ref|YP_003211164.1| beta-D-glucoside glucohydrolase [Cronobacter turicensis z3032]
gi|260217770|emb|CBA32207.1| Periplasmic beta-glucosidase [Cronobacter turicensis z3032]
Length = 765
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 156/566 (27%), Positives = 253/566 (44%), Gaps = 118/566 (20%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+VG + +T Q+ I + D +SRL IP + + + LHG T
Sbjct: 76 QVGAIFNTVTRQD-IRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------------T 118
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQE 181
FP + A+SFN + +G++ + Y GL W+P +++ RDPRWGRG E
Sbjct: 119 IFPNSLGLASSFNLDAVKTVGRI------SAYEAADDGLNMTWAPMVDVTRDPRWGRGSE 172
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTDRYHFNA 239
GED L S V+ +Q G SP +R V KH+ AY ++ K + ++
Sbjct: 173 GFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSS 229
Query: 240 MVIYTYYLIKFK-----------------------------------------YIVSDCD 258
++ Y+ +K +SD
Sbjct: 230 QRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGITISDHG 289
Query: 259 SVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAIS 316
++ L +H T + PE+A +I +G+D++ + K+ +K+G V+ +D A
Sbjct: 290 AIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEELDDATR 347
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIVLLKNTA 369
+ +G F+ PY LGPKD ++ ++D A + AR+ +VLLKN
Sbjct: 348 HVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLVLLKNRL 402
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---ATIYQAG 424
+LPL + +AV+GP A+ + M+G++ G + T LQG+ V A I A
Sbjct: 403 ETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKAKILFAK 460
Query: 425 CSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
+NV + +D+A A +D V V+G Q + E+
Sbjct: 461 GANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQGMAHEAS 520
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
R DL LP Q+ LI+ + K P++L++M+G ++ K D + +IL + G G
Sbjct: 521 SRTDLTLPQSQRDLISALKATGK-PLVLVLMNGR--PLALVKEDQQADAILETWFAGTEG 577
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G AIADV FG YNPSG+LP++ +P+S
Sbjct: 578 GNAIADVLFGDYNPSGKLPIS-FPRS 602
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 155/527 (29%), Positives = 239/527 (45%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ GAT FP I AA+++ L + +G+V++
Sbjct: 128 TRLGIPMF-LAEEAPHG-----------HMAIGATVFPTGIGMAATWSPELVKEVGQVIA 175
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ G + P +++ RDPRW R +ET GEDP+L+ V GL G
Sbjct: 176 KEIRSQ-----GGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL--GGGN 228
Query: 208 SPNRLKVAACCKHYTAYDLDNWKGTDRY--------HFNAM-------------VIYTYY 246
+ A KH+ AY + Y H N + V+ +Y
Sbjct: 229 LSQKYATIATLKHFLAYAVPEGGQNGNYASVGIRDLHQNFLPPFRKAIDAGALSVMTSYN 288
Query: 247 LI--------------------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
I KF+ ++VSD S++ ++ S T E AA +S+ AG+
Sbjct: 289 SIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAIQSVTAGV 348
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++++ ID A+ +G F+ HP P ++ K V
Sbjct: 349 DVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP--KIAAKTV 405
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + +LA A+ I LLKN LPLS T I +AVIGPNA+ M+G+Y
Sbjct: 406 RRKEHIELARKIAQSSITLLKNENSILPLSKT-INKVAVIGPNADNRYNMLGDYTAPQED 464
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASAD-------------- 446
T L G+ ++ Y GC+ +++ A +AA ++
Sbjct: 465 SNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVGGSSARDF 524
Query: 447 --------ATVLVMGADQSIE-AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A V G+ +E E DR L L G+QQ L+ + K K P+I++ + G
Sbjct: 525 KTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTGK-PLIVVYIEGR 583
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
+ ++A ++L YPGQ GG AIADV FG YNPSGRLP++
Sbjct: 584 PLEKNWASE--YADALLTAYYPGQEGGNAIADVLFGDYNPSGRLPIS 628
>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
Length = 787
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 161/536 (30%), Positives = 238/536 (44%), Gaps = 113/536 (21%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + E HG ++ GAT FP + A+++N SL + +G+V+
Sbjct: 124 TRLGIPVL-FAEECPHG-----------HMAIGATVFPTSMGQASTWNESLIRQMGEVIG 171
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDG 206
EAR N+G + P ++I R+PRW R +ET GEDP L T +V+G+Q D
Sbjct: 172 LEARLQGANIG------YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKD- 224
Query: 207 GSPNRLKVAACCKHYTAY------------DLDNWKGTDRY--HFN--------AMVIYT 244
+ V + KH AY D+ D Y F A V+ +
Sbjct: 225 -FKDGRHVYSTLKHLAAYGVPRGGHNGGPADMGLRALLDEYLPGFQRAVEVGKAATVMTS 283
Query: 245 YYLI--------KF-------------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILA 283
Y I KF ++ SD S+D + + H E+AA +++ A
Sbjct: 284 YNSIDGVPCTSNKFLIDSLLRKRWGFDGFVYSDLASIDGIAGA-HVAANLEDAAIQAVEA 342
Query: 284 GLDLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPK 343
G D++ G+ + AV+ G V ESAI+ A+SN R+G F+ QPY + P+
Sbjct: 343 GTDMDLGANAYRRLVKAVQTGKVKESAINRAVSNVLRLKFRMGLFE-----QPY--VSPE 395
Query: 344 DVCTQANQD----LALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNY 399
+ N + LA AR+G VLLKN G LPL +K +AVIGPNA+V +G+Y
Sbjct: 396 EAARLVNCEDHRMLARKIAREGTVLLKNN-GILPLG--KVKRIAVIGPNADVMYNYLGDY 452
Query: 400 EG--TPCKYTTPLQGLAAVVATI---YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA 454
K T L L + + Y GC+ + + +A +AA AD +L +G
Sbjct: 453 TAPQERSKVVTLLDALRNRMPDVRIDYVKGCAIRDTTQSNIKEAVEAARKADLVILAVGG 512
Query: 455 DQSIE--------------------------AESHDRLDLLLPGQQQLLITEVAKVAKGP 488
+ + E DR L L G Q+ LI +A K P
Sbjct: 513 SSARDFKTKYINTGAATVDSENSGILSDMECGEGFDRATLDLLGDQEKLIRAIAATEK-P 571
Query: 489 VILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ + ++G +++ A W YPG+ GG I DV G YNPSGRLPM+
Sbjct: 572 LVTVYIAGRPLNMNLASEVSDALLTAW--YPGEQGGNGIVDVLTGEYNPSGRLPMS 625
>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
Length = 955
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 154/552 (27%), Positives = 242/552 (43%), Gaps = 115/552 (20%)
Query: 73 LQEKITFLVDSAGSVSRLGIPKYEWWSEALHGV-SYVGPGTHFSNVVPGATSFPQVILTA 131
L E F ++ +RLGIP ++ +E + GV SY+ AT+FP +
Sbjct: 149 LNEVQRFFIEE----TRLGIPT-DFTNEGIRGVESYI------------ATNFPTQLGLG 191
Query: 132 ASFNASLFQAIGKVVSTEARAMYNVGLAGLT-FWSPNINIFRDPRWGRGQETPGEDPLLA 190
++N L +G + E R L G T ++P +++ RD RWGR +E GE P L
Sbjct: 192 HTWNRDLVHKVGYITGREGR------LLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLV 245
Query: 191 SKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN---------------------- 228
++ KG+Q TD +VAA KHY AY +
Sbjct: 246 AELGVEMAKGMQ-TD------YQVAATSKHYIAYSNNKGGREGMARVDPQMSPREVEMLH 298
Query: 229 ---WKG--------------TDRYHFNAMVIYTYYLIKFK-------YIVSDCDSVDVLY 264
WK D F Y + + + Y+VSD D+V+ L+
Sbjct: 299 VYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDSDAVEYLF 358
Query: 265 NSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHAISNNF 319
+ +E+ +S+LAGL++ C +F + + G + S ID + +
Sbjct: 359 SKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPMSTIDDRVRDIL 417
Query: 320 ATLMRLGFFDGHPSKQPYG---QLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSP 376
+G FD QPY + K+V + NQ +AL A+++ +VLLKN LPL
Sbjct: 418 RVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLLKNQDAVLPLDV 472
Query: 377 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVV----ATIYQAGCSNVQCG- 431
I +AV GPNA+ + +Y + TT L+G+ V ++ GC V
Sbjct: 473 NKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVDANW 532
Query: 432 --------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLL 477
++++ A + A +D V+V+G E+ R L LPG+Q L
Sbjct: 533 PESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQLDL 592
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ V K PV+L++++G I++A D + +IL YPG GG AIAD FG YNP
Sbjct: 593 LQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGDYNP 649
Query: 538 SGRLPMTWYPQS 549
G+L +T +P++
Sbjct: 650 GGKLTVT-FPKT 660
>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
Length = 946
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 165/617 (26%), Positives = 258/617 (41%), Gaps = 151/617 (24%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEW----WSEALHGV-------- 105
ID R+ DL+ ++TL+EK +V G V + +P EW W + + +
Sbjct: 60 IDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLNGFQ 119
Query: 106 ---------SYVGPG-----------------------THFSNV-VPG-----ATSFPQV 127
YV P T F+N + G AT+FP
Sbjct: 120 QWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQ 179
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 187
+ ++N L +G + EAR L ++P +++ RD RWGR +E GE P
Sbjct: 180 LGLGHTWNRQLIHQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESP 234
Query: 188 LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR------------- 234
L ++ V+G+Q +VAA KH+ AY N KG
Sbjct: 235 YLVAELGIEMVRGMQHNH-------QVAATGKHFIAYS--NNKGAREGMARVDPQMSPRE 285
Query: 235 ------YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDS 259
Y F ++ +YY + + Y+VSD D+
Sbjct: 286 VEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDSDA 345
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHA 314
V+ LY K +EA +S+ AGL++ C +F + VK G ++E I+
Sbjct: 346 VEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDR 404
Query: 315 ISNNFATLMRLGFFDGHPSKQPYG-QLGPKDVCTQANQDL--ALDAARQGIVLLKNTAGS 371
+ + +G FD PY L D + ++ AL A+R+ IVLLKN
Sbjct: 405 VRDILRVKFLVGLFD-----TPYQTDLKGADEEVEKKENEEVALQASRESIVLLKNEKNV 459
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSN 427
LPL P+ I+ +AV GPNA+ + +Y + T+ L+G+ + +Y GC
Sbjct: 460 LPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDL 519
Query: 428 VQCG---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
V ++D A A AD ++V+G Q E+ R L LPG
Sbjct: 520 VDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPG 579
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
+Q L+ V K PV+L++++G I++A D + +IL YPG GG A+AD+ F
Sbjct: 580 RQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILF 636
Query: 533 GRYNPSGRLPMTWYPQS 549
G YNP G+L +T +P++
Sbjct: 637 GDYNPGGKLTVT-FPKT 652
>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
Length = 946
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 165/617 (26%), Positives = 258/617 (41%), Gaps = 151/617 (24%)
Query: 60 IDLRVGDLVKRLTLQEKITFLVDSAG--SVSRLGIPKYEW----WSEALHGV-------- 105
ID R+ DL+ ++TL+EK +V G V + +P EW W + + +
Sbjct: 60 IDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLNGFQ 119
Query: 106 ---------SYVGPG-----------------------THFSNV-VPG-----ATSFPQV 127
YV P T F+N + G AT+FP
Sbjct: 120 QWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQ 179
Query: 128 ILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 187
+ ++N L +G + EAR L ++P +++ RD RWGR +E GE P
Sbjct: 180 LGLGHTWNRQLIHQVGLITGREARM-----LGYTNVYAPILDVGRDQRWGRYEEVYGESP 234
Query: 188 LLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDR------------- 234
L ++ V+G+Q +VAA KH+ AY N KG
Sbjct: 235 YLVAELGIEMVRGMQHNH-------QVAATGKHFIAYS--NNKGAREGMARVDPQMSPRE 285
Query: 235 ------YHFNAMVIY--------------------TYYLIKFK---------YIVSDCDS 259
Y F ++ +YY + + Y+VSD D+
Sbjct: 286 VEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDSDA 345
Query: 260 VDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT-----EAAVKAGLVNESAIDHA 314
V+ LY K +EA +S+ AGL++ C +F + VK G ++E I+
Sbjct: 346 VEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVINDR 404
Query: 315 ISNNFATLMRLGFFDGHPSKQPYG-QLGPKDVCTQANQDL--ALDAARQGIVLLKNTAGS 371
+ + +G FD PY L D + ++ AL A+R+ IVLLKN
Sbjct: 405 VRDILRVKFLVGLFD-----TPYQTDLKGADEEVEKKENEEVALQASRESIVLLKNEKNV 459
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSN 427
LPL P+ I+ +AV GPNA+ + +Y + T+ L+G+ + +Y GC
Sbjct: 460 LPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDL 519
Query: 428 VQCG---------------TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPG 472
V ++D A A AD ++V+G Q E+ R L LPG
Sbjct: 520 VDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPG 579
Query: 473 QQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCF 532
+Q L+ V K PV+L++++G I++A D + +IL YPG GG A+AD+ F
Sbjct: 580 RQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILF 636
Query: 533 GRYNPSGRLPMTWYPQS 549
G YNP G+L +T +P++
Sbjct: 637 GDYNPGGKLTVT-FPKT 652
>gi|156933295|ref|YP_001437211.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
gi|156531549|gb|ABU76375.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
Length = 757
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 169/632 (26%), Positives = 267/632 (42%), Gaps = 158/632 (25%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLR---VGDLVKRLTLQEKITFLV------------- 81
A + P+LAS F L + R V DL+K++T+ EKI L
Sbjct: 1 MAVSLALQPALASEPFGEHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAI 60
Query: 82 -------------------------DSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
D +SRL IP + + + LHG
Sbjct: 61 RDMIKEGQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------- 109
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPR 175
T FP + A+SFN + +G+V + Y GL W+P +++ RDPR
Sbjct: 110 -----TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPR 158
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTD 233
WGRG E GED L S V+ +Q G SP +R V KH+ AY ++ K +
Sbjct: 159 WGRGSEGFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYN 215
Query: 234 RYHFNAMVIYTYYLIKFK-----------------------------------------Y 252
++ ++ Y+ +K
Sbjct: 216 TVDMSSQRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 275
Query: 253 IVSDCDSVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+SD ++ L +H T + PE+A +I +G+D++ + K+ +K+G V +
Sbjct: 276 TISDHGAIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 333
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIV 363
+D A + +G F+ PY LGPKD ++ ++D A AR+ +V
Sbjct: 334 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 388
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---A 418
LLKN +LPL + +AV+GP A+ + M+G++ G + T LQG+ V A
Sbjct: 389 LLKNRLETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGHKA 446
Query: 419 TIYQAGCSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQS 457
I A +NV + A +D+A A +D V V+G Q
Sbjct: 447 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGEAQG 506
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E+ R DL +P Q+ LI+ + K P++L++M+G ++ K D + +IL
Sbjct: 507 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGR--PLALVKEDQQADAILETW 563
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G GG AIADV FG YNPSG+LP++ +P+S
Sbjct: 564 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS 594
>gi|387888690|ref|YP_006318988.1| periplasmic beta-glucosidase BglX [Escherichia blattae DSM 4481]
gi|414592757|ref|ZP_11442406.1| beta-glucosidase BglX [Escherichia blattae NBRC 105725]
gi|386923523|gb|AFJ46477.1| periplasmic beta-glucosidase BglX [Escherichia blattae DSM 4481]
gi|403196238|dbj|GAB80058.1| beta-glucosidase BglX [Escherichia blattae NBRC 105725]
Length = 766
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 158/564 (28%), Positives = 255/564 (45%), Gaps = 114/564 (20%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+VG + +T Q+ I + D ++SRL IP + + + +HG T
Sbjct: 77 QVGAIFNTVTRQD-IRAMQDQVMALSRLKIPLFFAY-DVVHGQR---------------T 119
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQE 181
FP + A++FN +G+V + Y GL W+P +++ RDPRWGR E
Sbjct: 120 IFPISLGLASTFNLDTASIVGRV------SAYEAADDGLNMTWAPMVDVSRDPRWGRVSE 173
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTDRYHFNA 239
GED L+S V +Q G SP R V KH+ AY ++ + + +
Sbjct: 174 GFGEDTFLSSAMGQALVTSMQ---GKSPAERYSVMTSVKHFAAYGAVEGGREYNTVDMSP 230
Query: 240 MVIYTYYLIKFKYIV------------------SDCDS---VDVLYNSQHYT-------- 270
++ Y+ +K + S DS D+L + +T
Sbjct: 231 QRLFNDYMPPYKAALDAGSGGVMVALNSVNGTPSTADSWLLKDLLRDQWGFTGITISDHG 290
Query: 271 -----------KTPEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAISNN 318
+ PE+A ++ AG++++ + K+ +K+G V++ A+D A +
Sbjct: 291 AIKELIKHGVARDPEDAVRVALNAGINMSMSDEYYSKYLPGLLKSGAVSQQALDDATRHV 350
Query: 319 FATLMRLGFFDGHPSKQPYGQLGPKD---VCTQANQDLALDAARQ----GIVLLKNTAGS 371
+G F+ PY LGP++ T A L +AARQ +VLLKN +
Sbjct: 351 LNVKYDMGLFN-----DPYSHLGPRESDPAETNAESRLHREAARQVARESLVLLKNRLNT 405
Query: 372 LPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---ATIYQAGCS 426
LPL + +AVIGP A+ + M+G++ G + T LQG+ + AT+ A +
Sbjct: 406 LPLKKSG--TIAVIGPLADSKRDMMGSWSAAGVAEQSVTVLQGIKNALGEQATVRYARGA 463
Query: 427 NV---------------------QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDR 465
NV + A +D+A K A+ +D V V+G Q + E+ R
Sbjct: 464 NVTDDQGIVAFLNQYEPAVTIDKRTPQAMLDEAVKTASQSDVIVAVVGEAQGMAHEASSR 523
Query: 466 LDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGA 525
D+ LP QQ LI + K K P++L++M+G ++ K D + ++L + G GG
Sbjct: 524 TDISLPASQQALIAALKKTGK-PLVLVLMNGR--PLTLVKEDQQADALLETWFAGTEGGN 580
Query: 526 AIADVCFGRYNPSGRLPMTWYPQS 549
AIADV FG YNPSG+LPM+ +P+S
Sbjct: 581 AIADVLFGDYNPSGKLPMS-FPRS 603
>gi|296270377|ref|YP_003653009.1| family 3 glycoside hydrolase domain-containing protein
[Thermobispora bispora DSM 43833]
gi|296093164|gb|ADG89116.1| glycoside hydrolase family 3 domain protein [Thermobispora bispora
DSM 43833]
Length = 988
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 136/425 (32%), Positives = 197/425 (46%), Gaps = 66/425 (15%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A L F + +L + R+ DL+ RLTL EKI+ L + RLGI ++ +EALHGV++
Sbjct: 37 ADLPFRDPTLPLAQRIDDLLGRLTLDEKISLLHQYQPPIERLGIKSFKTGTEALHGVAWS 96
Query: 109 GPGTHFSNVVPG-ATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWS 165
T VV T FPQ + A++++ L + +G V EAR + N + GL W+
Sbjct: 97 TDVTDNGAVVTANGTVFPQAVGLASTWDPELIKRVGTAVGEEARGFHAQNPVVWGLNLWA 156
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
P +N+ RDPRWGR +E EDPLL A Y G+Q G P+ L+ A KHY A +
Sbjct: 157 PVVNLLRDPRWGRNEEGYSEDPLLTGAIAIAYGSGIQ---GDDPDHLRAAPTLKHYLANN 213
Query: 226 LDNWKGTDRYHFNAMVIYTYYLIKFKYI-------------------------------- 253
+ + T + V + YY F+
Sbjct: 214 NEIRRDTTSSNLPPRVKHEYYEAPFRAAITAGAATGVMTAYNLVNGRPATVNPDLNDTVR 273
Query: 254 ---------VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT------- 297
V+D + + L SQ Y T EA A ++ AG+D SF T
Sbjct: 274 TWTDRDLLNVTDAGAPNNLVGSQAYFATLAEADAAALKAGID----SFTTDETNSAPTIT 329
Query: 298 --EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ A+ GL+ E ID A+ + RLG FD P PY ++ P + + A++ LA
Sbjct: 330 AIKTALSQGLLTEQDIDTAVRHILGIRFRLGEFD--PDGGPYAKITPDVIDSPAHRRLAR 387
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ A + +VLLKN G+LPL P +AV+GP A+V T Y G P TPL G+
Sbjct: 388 ETAAKAMVLLKNERGTLPLDPGM--KVAVVGPLADVLYTDW--YSGRPTYQVTPLDGIRE 443
Query: 416 VVATI 420
A++
Sbjct: 444 RAASV 448
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+DDA A AD V+V+G+ I E HDR D+ L Q+ L+ V V+++
Sbjct: 595 IDDAVAKAKEADVAVVVVGSMPFINGREDHDRTDMNLAEGQEALVKAVFNANPRTVVVLE 654
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S + + D + +ILW + G G A+ADV +G NP+GRL TWY
Sbjct: 655 NS---YPTTINWIDEHVPAILWTTHAGAETGNALADVLYGDVNPAGRLTQTWY 704
>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 804
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 146/538 (27%), Positives = 227/538 (42%), Gaps = 115/538 (21%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP E+ +E +HG+++ AT P I +++N +L G++
Sbjct: 144 TRLGIP-VEFTNEGIHGLNH-----------SRATPLPAPIAIGSTWNRALVHRAGEIAG 191
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EAR L ++P +++ RDPRWGR E GEDP L ++ V+G+Q
Sbjct: 192 HEARV-----LGYKNVYAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQSQG-- 244
Query: 208 SPNRLKVAACCKHYTAYDL----------------------------------------- 226
VA+ KHY AY +
Sbjct: 245 ------VASTLKHYAAYSVPKGGRDGNCRTDPHIAPRELHQMYLYPFRRVIRESGPMGVM 298
Query: 227 ---DNWKGT----DRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
++W G RY ++ + Y Y+VSD ++V+ ++ +T E+A +
Sbjct: 299 SSYNDWDGVPVTASRYFLTDLLRHEYGFDG--YVVSDSEAVEYVHTKHAVAETYEDAVRQ 356
Query: 280 SILAGLDLNC-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSK 334
+ AGL++ F+ V+ G ++ +D + RLG FD
Sbjct: 357 VLEAGLNVRTNFSPPARFI-LPVRKLVREGRLSMEVVDQRVREVLRVKFRLGLFD----- 410
Query: 335 QPYGQLGPKDVCTQA----NQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN 390
PY P++ +A ++D LD RQ +VLLKN +LPL + V GP A+
Sbjct: 411 NPYND--PREAVAEAGADKHRDFVLDIQRQSLVLLKNEDKTLPLDKKKTARVLVAGPLAD 468
Query: 391 VTKTMIGNYEGTPCKYTTPLQGLAAVVA----TIYQAGCSNVQCG--------------- 431
MI Y T L G+ + Y GC V G
Sbjct: 469 EDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDVVDAGFPDSELTATPLTAAE 528
Query: 432 TAQVDDAKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVIL 491
A +++A K AA D V V+G D ESH R L LPG+QQ L+ E PV+L
Sbjct: 529 RAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPGRQQQLL-EALHATGVPVVL 587
Query: 492 IIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
++++G +++A + + +IL +P GG AIA+ FG YNP G+L +T +P+S
Sbjct: 588 VLINGQPLTVNWAAQN--VPAILEGWFPSVEGGTAIAETLFGDYNPGGKLTIT-FPRS 642
>gi|423333878|ref|ZP_17311659.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
CL03T12C09]
gi|409226713|gb|EKN19619.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
CL03T12C09]
Length = 732
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 166/587 (28%), Positives = 253/587 (43%), Gaps = 112/587 (19%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTVDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VLYNSQHYTKTPEEAAAK-------------------------------SILAGLDLNCG 290
V+ Y K E A+ S+ AGLDL G
Sbjct: 226 VM---AAYNKFRGEFCAENNYLVCKILRNEWGFDGVYVTDWGAAHSTIPSMEAGLDLEMG 282
Query: 291 SFLGKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGP 342
+ + K+ + AVK+G + S +D + + +++ D P + GP
Sbjct: 283 TLIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGP 336
Query: 343 KDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 402
+ T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA + G
Sbjct: 337 GSMNTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEI 396
Query: 403 PCKY-TTPLQGLAA---------------VVATIYQAGCSNVQCGT---------AQVDD 437
Y TPL+ L A ++T + + GT A + +
Sbjct: 397 KAVYEVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKE 456
Query: 438 AKKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
A + A ++D +LV G + + ES DRL++ +P Q LI EV K P +++M G
Sbjct: 457 AVEVARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAG 514
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 515 S-PLNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 162/562 (28%), Positives = 239/562 (42%), Gaps = 88/562 (15%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L + R+ DL+ +TL+EK+ + G + RL + KY SE HGV+ GP
Sbjct: 29 FQDPDLPNEERIDDLITCMTLEEKVDLMGFVPG-IPRLDV-KYTRISEGYHGVAQGGPSN 86
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYN---VGLAGLTFWSPNIN 169
T FPQ AA+++ +L + +TE R +Y +GL +PN +
Sbjct: 87 WGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPNAD 146
Query: 170 IFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNW 229
+ RDPRWGR +E GEDP L A + GL G P LK + KH+ A ++
Sbjct: 147 LARDPRWGRTEEVYGEDPFLTGTLAAAFASGLA---GDHPRYLKATSLLKHFLANSNEDD 203
Query: 230 KGTDRYHFNAMVIYTYYLIKFKYIVSD--CDSVDVLYNS--------------------- 266
+ F+ + YY F+ + D S+ YN+
Sbjct: 204 RFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHVHPMLRDIVMGEWG 263
Query: 267 -----------------QH--YTKTPEEAAAKSILAGLDLNCGSFLGKHTEA---AVKAG 304
QH Y P A A I AG++L FL HT+A AV+
Sbjct: 264 LDGTICTDGGGLAHLVNQHKTYPDLP-TATAACIKAGINL----FLDNHTQAALDAVEQS 318
Query: 305 LVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCT----QANQDLALDAARQ 360
LV E+ ID I + LG D P PY +G + + R+
Sbjct: 319 LVTEAEIDDVIRGRIRLFLDLGLLDP-PELVPYSNIGHEPGLEPWELPETHAFVREVTRK 377
Query: 361 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI 420
IVLLKN LPL P+ I ++A++GP AN T++ Y GTP P G+ +
Sbjct: 378 SIVLLKNENNILPLDPSKINSVAIVGPLANT--TLLDWYSGTPPYAIPPRDGIEGYANS- 434
Query: 421 YQAGCSNVQCGTAQV----DDAKKAAASADATVLVMGADQSIEA------------ESHD 464
S + G+ V D A + AAS D ++V+G A E+ D
Sbjct: 435 -GPFPSPAKFGSNWVADMSDTALEVAASRDVAIVVVGNHPESNAGWGVVTSPSEGKEAVD 493
Query: 465 RLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISF-AKNDPKITSILWVGYPGQAG 523
R +++L Q+ I +V A I++++S + + + A+N P I I + Q
Sbjct: 494 RQEIILQPDQEEFIQKV-YAANPNTIVVLVSNFPYAMPWAAENAPAIVHIT---HASQEQ 549
Query: 524 GAAIADVCFGRYNPSGRLPMTW 545
G A+ADV FG YNP G+ TW
Sbjct: 550 GNALADVLFGDYNPGGKTVQTW 571
>gi|336430623|ref|ZP_08610567.1| hypothetical protein HMPREF0994_06573 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336017532|gb|EGN47292.1| hypothetical protein HMPREF0994_06573 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 973
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 135/451 (29%), Positives = 212/451 (47%), Gaps = 58/451 (12%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + SL + R+ L+ +T+ EK++ + + RLGIP + EA HGV
Sbjct: 11 FWDPSLPFEERLDWLLSEMTMDEKLSCMATRVPDLERLGIPGFSVGGEAAHGVEARNDQN 70
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGL-AGLTFWSPNINIF 171
P TSFPQ I +A+++ L + G+V TEAR +Y+ GL+ W+P +++
Sbjct: 71 DLGAAEP-TTSFPQPIGMSATWDPDLIKKAGEVTGTEARVIYHRHPDRGLSRWAPTVDLE 129
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLD---N 228
RDPRWGR +E GEDP+L ++ Y++G+Q G P L+VAA KH+ A + +
Sbjct: 130 RDPRWGRTEEGYGEDPVLTGVMSSAYIQGMQ---GDDPKYLRVAATLKHFYANNTEAGRG 186
Query: 229 WKGTDRYHFNAMVIY-----------------TYY----------------LIKFKY--- 252
WK + N +Y T Y ++K +Y
Sbjct: 187 WKSSCVDPRNRQELYLEPFRRAIMNGRAEGVMTAYNKINGIPGMLNPEVREILKKEYGLK 246
Query: 253 --IVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHTEAAVKAGLVNE 308
+V D +++++ N HY E A ++ AG+D + + + A + GL+ E
Sbjct: 247 GHVVCDGGAMELVANLHHYYGLHAETLANAVKAGVDGMSDRPEAVAQAAREAWELGLITE 306
Query: 309 SAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNT 368
ID A+ N F T +RLG +D P PY ++ +D+ + N+ + L +R+ IVLLKN
Sbjct: 307 EEIDEALRNIFRTKLRLGIYDREPC-NPYDRVTEEDLNSPLNKQICLQLSREAIVLLKNE 365
Query: 369 AGSLPLSPTA-IKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAAVVATI--YQAGC 425
LPL P + +A++GP A+ Y GTP TT QG+ + T + G
Sbjct: 366 NKMLPLDPAVPAETMALVGPLAD--SWYQDWYGGTPPYVTTLRQGMKEIQGTQVPFADGL 423
Query: 426 SNV--QCGTAQVDDAK--KAAASADATVLVM 452
V +CG V A + S + V VM
Sbjct: 424 DRVIFRCGQKGVATAADGRLYLSEEPDVFVM 454
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 446 DATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFA 504
+ +L +G + I A E DR + LP Q+ L+ E+ ++ VIL++ S + I+ A
Sbjct: 565 EKVILALGCNSMINAKEEVDRTTIALPPAQEHLLEEIYRINPN-VILVLFSNYPYSINMA 623
Query: 505 KNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ K+ ++LW Q G A+A+ FG+ P+GRL MTWY
Sbjct: 624 QE--KLPAVLWSATGSQDMGTAMAETLFGKNAPAGRLNMTWY 663
>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 799
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 150/531 (28%), Positives = 237/531 (44%), Gaps = 100/531 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E +HG++ H AT+FP I +++N L +G+++
Sbjct: 137 TRLGIP-VDFTNEGIHGLNQ----DH-------ATAFPAPIGIGSTWNKELVHQMGQIIG 184
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
EA+A L ++P +++ RD RWGR ET GEDP L + T G+Q+
Sbjct: 185 REAKA-----LGYTNVYAPILDVARDQRWGRVVETYGEDPFLVAGLGTALAGGIQENG-- 237
Query: 208 SPNRLKVAACCKHYTAYDL-----DNWKGTDRY----HFNAMVIY--------------- 243
VA+ KH+ Y + D TD + + +Y
Sbjct: 238 ------VASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNVHPLGVM 291
Query: 244 --------------TYYLIKF--------KYIVSDCDSVDVLYNSQHYTKTPEEAAAKSI 281
Y+L + Y+VSD +V+ +Y H K +EA +
Sbjct: 292 SSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKEAVKMVM 351
Query: 282 LAGL----DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
AGL + N S +K G ++ ++ + + RLG FD K P
Sbjct: 352 EAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLFDAPYVKDP- 410
Query: 338 GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIG 397
+ K V T+A++ +AL R+ +VLLKN LPLS +N+ V GP A+ + I
Sbjct: 411 -KAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLADEKEHAIS 469
Query: 398 NYEGTPCKYTTPLQGL---AAVVATI-YQAGCSNVQC---------------GTAQVDDA 438
Y + K + L+G+ AA ATI Y GC A+++ A
Sbjct: 470 RYGPSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTPQEIAEMNKA 529
Query: 439 KKAAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGG 498
+AA D + VMG + ES R L LPG+Q L+ E+ K K P++LI+++G
Sbjct: 530 VEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTGK-PMVLILINGQP 588
Query: 499 FDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
I++ + + +IL +PG AGG A+A+ FG YNP G+L T +P++
Sbjct: 589 LTINW--ENRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTT-FPKT 636
>gi|149826|gb|AAB36835.1| glucan-glucohydrolase [Thermobispora bispora]
Length = 986
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 151/476 (31%), Positives = 214/476 (44%), Gaps = 74/476 (15%)
Query: 49 ASLGFCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYV 108
A L F + +L + R+ DL+ RLTL EKI+ L + RLGI ++ +EALHGV++
Sbjct: 37 ADLPFRDPTLPLAQRIDDLLGRLTLDEKISLLHQYQPPIERLGIKSFKTGTEALHGVAWS 96
Query: 109 GPGTHFSNVVPG-ATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWS 165
T VV T FPQ + A++++ L Q +G V EAR + N + GL W+
Sbjct: 97 TDVTDNGAVVTANGTVFPQAVGLASTWDPELNQRVG-TVGEEARGFHAQNPVVWGLNLWA 155
Query: 166 PNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYD 225
P +N+ RDPRWGR +E EDPLL A Y G+Q G P+ L+ A KHY A +
Sbjct: 156 PVVNLLRDPRWGRNEEGYSEDPLLTGAIAIAYGSGIQ---GDDPDHLRAAPTLKHYLANN 212
Query: 226 LDNWKGTDRYHFNAMVIYTYYLIKFKYI-------------------------------- 253
+ + T + V + YY F+
Sbjct: 213 NEIRRDTTSSNLPPRVKHEYYEAPFRAAITAGAATGVMTAYNLVNGRPATVNPDLNDTVR 272
Query: 254 ---------VSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLNCGSFLGKHT------- 297
V+D + + L SQ Y T EA A ++ AG+D SF T
Sbjct: 273 TWTDRDLLNVTDAGAPNNLVGSQAYFATLAEADAAALKAGID----SFTTDETNSAPTIT 328
Query: 298 --EAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLAL 355
+ A+ GL+ E ID A+ + RLG FD P PY ++ P + + A++ LA
Sbjct: 329 AIKTALSQGLLTEQDIDTAVRHILGIRFRLGEFD--PDGGPYAKITPDVIDSPAHRRLAR 386
Query: 356 DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAA 415
+ A + +VLLKN G+LPL P K +AV+GP A+V T Y G P TPL G+
Sbjct: 387 ETAAKAMVLLKNERGTLPLDPG--KKVAVVGPLADVLYTDW--YSGRPTYQVTPLDGIRE 442
Query: 416 VVATIYQA-GCSNV---QCGTAQVDDAKKAAASADATVLVMGADQSIEAESHDRLD 467
A++ + G V T QV + AD L + A E E D D
Sbjct: 443 RAASVTSSEGVDRVAFKDLATGQVHHRGERPEGAD---LRLSATTIGETEQFDVFD 495
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 4/113 (3%)
Query: 435 VDDAKKAAASADATVLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILII 493
+DDA A AD V+V+G+ I E HDR D+ L Q+ L+ V V+++
Sbjct: 594 IDDAVAKAKEADVAVVVVGSMPFINGREDHDRTDMNLAEGQEALVKAVFNANPRTVVVLE 653
Query: 494 MSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
S + + D + +ILW + G G A+ADV +G NP+GRL TWY
Sbjct: 654 NS---YPTTINWIDEHVPAILWTTHAGAETGNALADVLYGDVNPAGRLTQTWY 703
>gi|389840350|ref|YP_006342434.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
gi|417792062|ref|ZP_12439471.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
gi|429121663|ref|ZP_19182278.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
gi|333953864|gb|EGL71757.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
gi|387850826|gb|AFJ98923.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
gi|426323856|emb|CCK13015.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
Length = 765
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 169/632 (26%), Positives = 267/632 (42%), Gaps = 158/632 (25%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLR---VGDLVKRLTLQEKITFLV------------- 81
A + P+LAS F L + R V DL+K++T+ EKI L
Sbjct: 9 MAVSLALQPALASEPFGEHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAI 68
Query: 82 -------------------------DSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
D +SRL IP + + + LHG
Sbjct: 69 RDMIKEGQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------- 117
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPR 175
T FP + A+SFN + +G+V + Y GL W+P +++ RDPR
Sbjct: 118 -----TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPR 166
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTD 233
WGRG E GED L S V+ +Q G SP +R V KH+ AY ++ K +
Sbjct: 167 WGRGSEGFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYN 223
Query: 234 RYHFNAMVIYTYYLIKFK-----------------------------------------Y 252
++ ++ Y+ +K
Sbjct: 224 TVDMSSQRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 253 IVSDCDSVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+SD ++ L +H T + PE+A +I +G+D++ + K+ +K+G V +
Sbjct: 284 TISDHGAIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIV 363
+D A + +G F+ PY LGPKD ++ ++D A AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---A 418
LLKN +LPL + +AV+GP A+ + M+G++ G + T LQG+ V A
Sbjct: 397 LLKNRLETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 419 TIYQAGCSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQS 457
I A +NV + A +D+A A +D V V+G Q
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E+ R DL +P Q+ LI+ + K P++L++M+G ++ K D + +IL
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGR--PLALVKEDQQADAILETW 571
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G GG AIADV FG YNPSG+LP++ +P+S
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS 602
>gi|455641256|gb|EMF20442.1| sugar hydrolase [Streptomyces gancidicus BKS 13-15]
Length = 948
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 136/408 (33%), Positives = 192/408 (47%), Gaps = 69/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L RV DL+ RLT E+I FL A +V RLG+ + EALHGV+++GP
Sbjct: 11 FRDPGLPFAERVDDLLSRLTPDERIGFLHQFAPAVERLGVAAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNV-GLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS EARAM + G GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNEDLVRRVGEAVSREARAMRSRDGRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPRLTSAIATAYTRGLR---GDHPVYWRTAPVLKHWLAHNNETRRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
T V++ Y L F+ V
Sbjct: 178 TTSSSVRPRVLHEYDLRAFRDTVEAGAVAGVMPAYNLVNGRPNHVSPHLREQLRTWTGEE 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDL-----NCGSFLGKHTEAAVKAGL 305
SD + L +S+HY T EEA A ++ AG+D S + + A+ GL
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAAALRAGVDSFTDHGTDSSTIVARIKGALAQGL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ E +D A+ + RLG FD P + PY G + T ++ LA +AA Q VLL
Sbjct: 298 LTEEDVDTAVRRQLSVRFRLGEFD--PDRDPYA--GERAFDTPEHRALAQEAAEQAAVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN G LPL+P + LAV+G A+ K + Y G+ +TPL+GL
Sbjct: 354 KND-GLLPLAPGS--RLAVVGLLADECK--LDWYSGSLIHRSTPLEGL 396
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 7/87 (8%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGP-VILIIMSGGGFDISFAKNDPKITSILWVGYP 519
E+ DR L LP QQ+ L+ A +A P +L ++S +A + + ++LW +
Sbjct: 583 ETEDRTTLRLPAQQERLLR--AALAANPRTVLALVSA----YPYAVDPADLPAVLWTAHG 636
Query: 520 GQAGGAAIADVCFGRYNPSGRLPMTWY 546
GQA G A+A V G +P+GRLP TWY
Sbjct: 637 GQAAGTALARVLAGDVSPAGRLPQTWY 663
>gi|150009689|ref|YP_001304432.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|149938113|gb|ABR44810.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 732
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 164/584 (28%), Positives = 251/584 (42%), Gaps = 106/584 (18%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTVDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VL--YNSQHYTKTPE--------------------------EAAAKSILAGLDLNCGSFL 293
V+ YN E + S+ AGLDL G+ +
Sbjct: 226 VMAAYNKFRGEFCAENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPSMEAGLDLEMGTLI 285
Query: 294 GKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
K+ + AVK+G V S +D + + +++ D P + GP +
Sbjct: 286 DKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGPGSM 339
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN-------VTKTMIGN 398
T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA ++ +
Sbjct: 340 NTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAV 399
Query: 399 YEGTP-------------CKYTTPLQGLAAVVATIYQAG-----CSNVQCGTAQVDDAKK 440
YE TP ++ + L+ V S Q A + +A +
Sbjct: 400 YEVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVE 459
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A ++D +LV G + + ES DRL++ +P Q LI EV K P +++M G
Sbjct: 460 VARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-P 516
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 517 LNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|429101023|ref|ZP_19162997.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
gi|426287672|emb|CCJ89110.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
Length = 759
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 156/566 (27%), Positives = 253/566 (44%), Gaps = 118/566 (20%)
Query: 63 RVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGAT 122
+VG + +T Q+ I + D +SRL IP + + + LHG T
Sbjct: 70 QVGAIFNTVTRQD-IRAMQDQIMQLSRLKIPLF-FAYDVLHGQR---------------T 112
Query: 123 SFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPRWGRGQE 181
FP + A+SFN + +G+V + Y GL W+P +++ RDPRWGRG E
Sbjct: 113 IFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPRWGRGSE 166
Query: 182 TPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTDRYHFNA 239
GED L S V+ +Q G SP +R V KH+ AY ++ K + ++
Sbjct: 167 GFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSS 223
Query: 240 MVIYTYYLIKFK-----------------------------------------YIVSDCD 258
++ Y+ +K +SD
Sbjct: 224 QRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGITISDHG 283
Query: 259 SVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESAIDHAIS 316
++ L +H T + PE+A +I +G+D++ + K+ +K+G V+ +D A
Sbjct: 284 AIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVSMEELDDATR 341
Query: 317 NNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIVLLKNTA 369
+ +G F+ PY LGPKD ++ ++D A + AR+ +VLLKN
Sbjct: 342 HVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLVLLKNRL 396
Query: 370 GSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---ATIYQAG 424
+LPL + +AV+GP A+ + M+G++ G + T LQG+ V A I A
Sbjct: 397 ETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKAKILFAK 454
Query: 425 CSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQSIEAESH 463
+NV + +D+A A +D V V+G Q + E+
Sbjct: 455 GANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQGMAHEAS 514
Query: 464 DRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAG 523
R DL +P Q+ LI+ + K P++L++M+ G ++ K D + +IL + G G
Sbjct: 515 SRTDLTIPQSQRDLISALKATGK-PLVLVLMN--GRPLALVKEDQQADAILETWFAGTEG 571
Query: 524 GAAIADVCFGRYNPSGRLPMTWYPQS 549
G AIADV FG YNPSG+LP++ +P+S
Sbjct: 572 GNAIADVLFGDYNPSGKLPIS-FPRS 596
>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
Length = 815
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 154/528 (29%), Positives = 242/528 (45%), Gaps = 98/528 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ G T FP I AA+++ L + +G V++
Sbjct: 159 TRLGIPLF-LAEEAPHG-----------HMAIGTTVFPTGIGMAATWSPVLIEEVGNVIA 206
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ + P +++ RDPRW R +ET GEDP+L+ + V GL G
Sbjct: 207 KEIRSQ-----GAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGL--GSGD 259
Query: 208 SPNRLKVAACCKHYTAYDL------DNWK--GTDRYHFN------------AMVIYT--- 244
A KH+ AY + N+ GT H N A+ + T
Sbjct: 260 LSREYATIATLKHFLAYAVPEGGQNGNYASVGTRDLHENFLPPFQEAIDAGALSVMTSYN 319
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
YYL+ +F+ ++VSD S++ ++ S T EEAA + + AG+
Sbjct: 320 SIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLYSIEGVHESHFVAPTIEEAAMQVVSAGV 379
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++E+ ID A+ +G F+ HP P + K V
Sbjct: 380 DIDLGGNAFMNLTHAVQSGKISEAVIDTAVCRVLRMKFEMGLFE-HPYVNP--KSATKVV 436
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 405
++ + LA A+ IVLLKN LPL+ IK +AV+GPNA+ M+G+Y P +
Sbjct: 437 RSEEHIRLAHKVAQSSIVLLKNKNSILPLN-KKIKKVAVVGPNADNRYNMLGDYTA-PQE 494
Query: 406 YTTPLQGLAAVVATI------YQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA----- 454
L V++ + Y GC+ ++ + +AA+ ++ + V+G
Sbjct: 495 DENIKTVLDGVISKLSPSKVEYVRGCAIRDTTVNEIAEVVEAASRSEVIIAVVGGSSARD 554
Query: 455 --------------DQSIE----AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSG 496
++SI E DR L L G+QQ L+ + K P+I++ + G
Sbjct: 555 FKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNALKATGK-PLIVVYIEG 613
Query: 497 GGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
D +A ++L YPGQ GG AIADV FG YNP+GRLP++
Sbjct: 614 RPLDKVWASE--YADALLTASYPGQEGGYAIADVLFGDYNPAGRLPVS 659
>gi|256838635|ref|ZP_05544145.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739554|gb|EEU52878.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 732
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 164/584 (28%), Positives = 251/584 (42%), Gaps = 106/584 (18%)
Query: 37 VFACDVVSNPSLASLGFCNTS-LGIDLRVGDLVKRLTLQEKITFLVDSA----GSVSRLG 91
V AC + N + AS + + ++ R+ L+K++TL+EK+ L ++ V RLG
Sbjct: 7 VIACGLFLNAAAASSQVTDKEKVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLG 66
Query: 92 IPKYEW-WSEALHGV---------SYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQA 141
IP EW S+ HGV +Y G ++ P T+F AA++N L
Sbjct: 67 IP--EWSLSDGPHGVRAEINRHDWAYAGWTNDSASYFPTGTAF------AAAWNPELAYR 118
Query: 142 IGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGL 201
G+V+ EAR L G P +NI R P GR E EDP + S A Y+KGL
Sbjct: 119 RGEVLGEEARWRKKDVLLG-----PGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGL 173
Query: 202 QQTDGGSPNRLKVAACCKHYTAYDLDNWKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVD 261
Q D VA KH+ + + + T + + YL FK V + ++
Sbjct: 174 QSRD--------VACSVKHFAVNNQETNRTTVDVECSERALREIYLPAFKAAVQEGGALT 225
Query: 262 VL--YNSQHYTKTPE--------------------------EAAAKSILAGLDLNCGSFL 293
V+ YN E + S+ AGLDL G+ +
Sbjct: 226 VMAAYNKFRGEFCAENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPSMEAGLDLEMGTLI 285
Query: 294 GKHTE--------AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
K+ + AVK+G V S +D + + +++ D P + GP +
Sbjct: 286 DKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD------PKKRFGPGSM 339
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNAN-------VTKTMIGN 398
T+ +Q DAA + IVLLKN LPL ++IK+LAVIG NA ++ +
Sbjct: 340 NTKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAV 399
Query: 399 YEGTP-------------CKYTTPLQGLAAVVATIYQAG-----CSNVQCGTAQVDDAKK 440
YE TP ++ + L+ V S Q A + +A +
Sbjct: 400 YEVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVE 459
Query: 441 AAASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFD 500
A ++D +LV G + + ES DRL++ +P Q LI EV K P +++M G
Sbjct: 460 VARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-P 516
Query: 501 ISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
++ A D +I+W + G GG A+ DV G+ NPSG++P T
Sbjct: 517 LNMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT 560
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 150/527 (28%), Positives = 240/527 (45%), Gaps = 94/527 (17%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP ++ +E + G+ + AT FP A++N L IG+V +
Sbjct: 137 TRLGIP-VDFTNEGIRGLCH-----------DRATYFPAQCGQGATWNKKLIARIGEVEA 184
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTD-- 205
EA V L +SP ++I +DPRWGR ET GEDP L + + LQ+ +
Sbjct: 185 KEA-----VALGYTNIYSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYNLV 239
Query: 206 -------------GG-----------SPNRLKVAACCKHYTAY-------------DLDN 228
GG +P ++ A+ D D
Sbjct: 240 ATPKHFAVYSIPIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDG 299
Query: 229 WKGTDRYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGLDLN 288
T YHF ++ + K Y+VSD ++V+ + N T E+ A+++ AGL++
Sbjct: 300 EPITGSYHFLTEILRQEWGFK-GYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIR 358
Query: 289 C-----GSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY---GQL 340
F+ AV G +++ +D ++ LG FD PY G+
Sbjct: 359 THFTPPADFILP-LRKAVDDGKISQETLDKRVAEILRIKFWLGLFD-----NPYRGNGKQ 412
Query: 341 GPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE 400
+ V ++ +Q ++L+AARQ +VLLKN LPLS +I+++AVIGPNA+ +I Y
Sbjct: 413 AEQIVHSKEHQAVSLEAARQSLVLLKNETHLLPLS-KSIRSIAVIGPNADEQTQLICRYG 471
Query: 401 GTPCKYTTPLQGLAAVV---ATIYQAGCSNVQC-----------GTAQ----VDDAKKAA 442
T QG+ ++ IY+ GC + TA+ + + +AA
Sbjct: 472 PANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHFPESEILDFPKTAEEVRLMQEVIRAA 531
Query: 443 ASADATVLVMGADQSIEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDIS 502
A+ V+V+G ++ E R L LPG+Q+ L+ V K PVIL+++ G I+
Sbjct: 532 KQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSIN 590
Query: 503 FAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+A + +IL +PG+ G A+A+ FG YNP GRL +T +P+S
Sbjct: 591 YAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKS 634
>gi|374320547|ref|YP_005073676.1| glycoside hydrolase [Paenibacillus terrae HPL-003]
gi|357199556|gb|AET57453.1| glycoside hydrolase family protein [Paenibacillus terrae HPL-003]
Length = 767
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 149/547 (27%), Positives = 227/547 (41%), Gaps = 105/547 (19%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
E + + A SRLGIP +G ++ T FP + +++
Sbjct: 90 EAVNLIQRYAVEHSRLGIP------------ILIGEECSHGHMAIDGTVFPVPLSIGSTW 137
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
N L++ + + V++E RA G +SP +++ RDPRWGR +E GEDP L ++A
Sbjct: 138 NVDLYRDMCRAVASETRAQ-----GGAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEFA 192
Query: 195 TGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDN-----------WKG---TDRYHFNAM 240
V+GLQ S + VAA KH+ Y W+ D Y F
Sbjct: 193 VAAVEGLQGESLLSEH--SVAATLKHFAGYGSSEGGRNAGPVHMGWREFLEVDLYPFQKA 250
Query: 241 VIYTYYLIKFKY----------------------------IVSDCDSVDVLYNSQHYTKT 272
V + Y I++DC ++++L N +
Sbjct: 251 VEAGAQSVMPAYNEIDGVPCTVNAELLDGILRQTWGFDGLIITDCGAIEMLANGHDVAED 310
Query: 273 PEEAAAKSILAGLDLN-CGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
+AA ++I AG+D+ G G H AV AG + S +D A+ RLG FD
Sbjct: 311 GSDAAVQAIRAGIDMEMSGEMFGSHLVEAVHAGKLETSVLDRAVRRVLTLKFRLGLFD-- 368
Query: 332 PSKQPY--GQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
+PY + + + + LA A +GIVLLKN G+LPL P K +A+IGPNA
Sbjct: 369 ---KPYVDAERAEQVIGQTEHIRLARQLATEGIVLLKNVDGTLPL-PKTSKRIAIIGPNA 424
Query: 390 NVTKTMIGNYEGTP--CKYTTPLQGLAAVVA-----TIYQAGCSNVQCGTAQVDDAKKAA 442
+ +G+Y + T L G+ + +Y GC ++A A
Sbjct: 425 DQVYNQLGDYTSPQPRSRVITVLDGIRGKLGKDQAGVLYAPGCRIKGESREGFENALACA 484
Query: 443 ASADATVLVMGADQSIE-------------------------AESHDRLDLLLPGQQQLL 477
A D V+V+G + + E DR+ L L G Q L
Sbjct: 485 AEVDTVVMVVGGSSARDFGEGTIDLKTGASKVSDHDWNDMESGEGIDRMTLGLAGVQLQL 544
Query: 478 ITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNP 537
+ EV ++ K ++++ M+G + + W YPGQ GG AIAD+ FG NP
Sbjct: 545 MQEVYRLGK-ELVVVYMNGRPIAEPWVEEHAHAIVEAW--YPGQEGGHAIADILFGDVNP 601
Query: 538 SGRLPMT 544
SGRL ++
Sbjct: 602 SGRLTLS 608
>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
Length = 799
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 164/540 (30%), Positives = 240/540 (44%), Gaps = 123/540 (22%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP EALHG+ V PGATSFPQ I A+SF+ L + I + +
Sbjct: 144 TRLGIPML-MHEEALHGL-----------VAPGATSFPQSIALASSFDPKLVENIFSMAA 191
Query: 148 TEARAM-YNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQ---- 202
EARA N+ LA P +++ RDPRWGR +ET GEDP L ++ ++G Q
Sbjct: 192 KEARARGANLVLA------PVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQGTTM 245
Query: 203 -------------QTDGGSPNR----------------------------LKVAACCKHY 221
T G P L V + Y
Sbjct: 246 PLKSDKVFITLKHMTGHGQPENGTNVGPASLGERTLREDFFPPFEAAVKTLPVMSVMASY 305
Query: 222 TAYD-----LDNWKGTD----RYHFNAMVIYTYYLIKFKYIVSDCDSVDVLYNSQHYTKT 272
D + W TD + F V+ Y+ I+ L H K
Sbjct: 306 NEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIR------------ELITRHHLFKD 353
Query: 273 PEEAAAKSILAGLDLNCGSFLG-KHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGH 331
P++AA +++ AG+D+ H VK G V++ ID+A+ ++R+ F +G
Sbjct: 354 PKDAAQRALDAGVDVETPDGEAYTHLVQLVKQGRVSQGEIDNAVRR----VLRMKF-EGG 408
Query: 332 PSKQPYGQLGPKDVCTQANQDLAL--DAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 389
+ PY ++ T + +AL AAR+ IVLLKN G LPL IK +AVIG +A
Sbjct: 409 LFENPYPEVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHA 468
Query: 390 NVTKTMIGNYEGTPCKYTTPLQGLAAV----VATIYQAGC----------SNVQCGTAQV 435
T IG Y P + L+G+ A A Y G V A V
Sbjct: 469 K--DTPIGGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASV 526
Query: 436 DDAKKA-----AASADATVLVMGADQSIE----AESH--DRLDLLLPGQQQLLITEVAKV 484
+D +A A +AD VLV+G ++++ A++H D L LPG Q L E+ +
Sbjct: 527 NDQLRAQALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELIAL 586
Query: 485 AKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
K PV++I+++G + +++ K +++ Y G+ G AIADV FGRYNP G+LP++
Sbjct: 587 GK-PVVVILLNGRPYAVNYLAE--KAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVS 643
>gi|386847378|ref|YP_006265391.1| beta-glucosidase [Actinoplanes sp. SE50/110]
gi|359834882|gb|AEV83323.1| beta-glucosidase [Actinoplanes sp. SE50/110]
Length = 966
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 140/431 (32%), Positives = 205/431 (47%), Gaps = 71/431 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + +L + RV DLV RLTL EK++ L ++ RLGI ++ +EALHGV++ T
Sbjct: 32 FRDPTLALPARVDDLVGRLTLDEKLSLLHQYQPAIPRLGIAVFKSGTEALHGVAW---ST 88
Query: 113 HFSN----VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY--NVGLAGLTFWSP 166
++N AT FPQ + A+S++ +L + +G V EAR ++ N + GL W+P
Sbjct: 89 DYTNGGAKTDATATVFPQAVGLASSWDPALVRQVGAAVGDEARGLHAQNPTVWGLNLWAP 148
Query: 167 NINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDL 226
+N+ RDPRWGR +E EDPLL+ AT Y +GLQ G P+ L A KHY AY
Sbjct: 149 VVNLLRDPRWGRNEEGYSEDPLLSGTVATAYGRGLQ---GDDPDHLLTAPTLKHYAAY-- 203
Query: 227 DNWKGTDRYHFN-----------------------AMVIYTYYLIKFK------------ 251
+N G DR N V+ +Y L+ +
Sbjct: 204 NNETGRDRTSSNVPQRVLNEYDRKTFEIPLRADAATGVMSSYNLVNGRPATVDPDLGGLV 263
Query: 252 --------YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAG-----LDLNCGSFLGKHTE 298
+ V+D + L SQ Y T EA A + AG +D G+
Sbjct: 264 RGWSDRRLFNVTDAGAPTNLTGSQGYFATQAEADAAIVKAGGDSFTVDDTDGAPTVTALR 323
Query: 299 AAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAA 358
A+ GL++ +D A+ + + RLG FD P PY + DV + A++ LA +AA
Sbjct: 324 QAIDQGLLSVRQVDTAVGDALSIRFRLGEFD--PDGGPYAGIPASDVDSPAHRQLAREAA 381
Query: 359 RQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY-TTPLQGLAAVV 417
+VLLKNT G LPL+ A ++AV GP + T+ ++ G Y TPL G++A
Sbjct: 382 ASSMVLLKNT-GVLPLA--AGGSVAVSGP---LGDTVYTDWYGGQLPYRVTPLDGISAKS 435
Query: 418 ATIYQAGCSNV 428
Y G +
Sbjct: 436 PATYAPGADRI 446
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 461 ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPG 520
E+HDR L L +Q+ LI E + A ++++ S I++A+ + I+W + G
Sbjct: 607 EAHDRTGLGLGARQEALI-EAVRAANPRTVVVVQSSYPEAITWAQ--AHVPGIVWTTHAG 663
Query: 521 QAGGAAIADVCFGRYNPSGRLPMTWY 546
G A+ADV +G NP GRL TWY
Sbjct: 664 AETGHALADVLYGDVNPGGRLTQTWY 689
>gi|297543748|ref|YP_003676050.1| glycoside hydrolase family 3 domain-containing protein
[Thermoanaerobacter mathranii subsp. mathranii str. A3]
gi|296841523|gb|ADH60039.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
Length = 787
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 156/550 (28%), Positives = 236/550 (42%), Gaps = 131/550 (23%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP +H S G + GAT FPQ I A+++N L + + V+
Sbjct: 93 TRLGIPAL------IHEESCSG------YMAKGATIFPQTIGVASTWNPKLVEKMASVIR 140
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
+ +A + +P +++ RDPRWGR +ET GEDP L Y++GLQ +
Sbjct: 141 EQMKA-----VGARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTEN-- 193
Query: 208 SPNRLK--VAACCKHYTAYDLD----NWKGTDRYHFNAMVIYTYYLIKFK---------- 251
LK V A KH+ Y NW H +Y +L F+
Sbjct: 194 ----LKEGVIATGKHFVGYGNSEGGMNWAPA---HIPMRELYEIFLYPFEAAVKEAKLGS 246
Query: 252 --------------------------------YIVSDCDSVDVLYNSQHYTKTPEEAAAK 279
+VSD +++ LY +EAA
Sbjct: 247 IMPGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDYFAINQLYEYHRLASNKKEAAKL 306
Query: 280 SILAGLDLNCGSF--LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPY 337
++ AG+D+ S G + ++ G ++ ++ A+ LG F+ PY
Sbjct: 307 ALEAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAVRRILKAKFLLGLFE-----NPY 361
Query: 338 GQLGPKDVC----TQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTK 393
+ K V TQ + LA A++ IVLLKN + LPL ++++AVIGPNA+ +
Sbjct: 362 --VDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLKKD-LQSIAVIGPNADNIR 418
Query: 394 TMIGNYEGTPCK-------------YTTPL-----------------QGLAAVVA----T 419
MIG+Y PC + TPL QG+ V+
Sbjct: 419 NMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVPIVSVLQGIKEKVSPKTKV 477
Query: 420 IYQAGCSNVQCGTAQVDDAKKAAASADATVLVMG-----ADQSIEAESHDRLDLLLPGQQ 474
IY GC + TA + A + A AD ++V+G D ES DR DL LPG Q
Sbjct: 478 IYAKGCDVISDDTAGFNKAVEIAKQADVAIVVVGDRAGLTDGCTSGESRDRADLNLPGVQ 537
Query: 475 QLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGR 534
+ L+ + + PV++++++G IS + KI +I+ PG+ GG AIADV FG
Sbjct: 538 EQLVKAIYETGT-PVVVVLINGRPMSIS--RLAEKIPAIIEAWLPGEEGGRAIADVIFGD 594
Query: 535 YNPSGRLPMT 544
YNP G+LP++
Sbjct: 595 YNPGGKLPIS 604
>gi|449307585|ref|YP_007439941.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
gi|449097618|gb|AGE85652.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
Length = 765
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 169/632 (26%), Positives = 267/632 (42%), Gaps = 158/632 (25%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLR---VGDLVKRLTLQEKITFLV------------- 81
A + P+LAS F L + R V DL+K++T+ EKI L
Sbjct: 9 MAVSLALQPALASEPFGEHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAI 68
Query: 82 -------------------------DSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
D +SRL IP + + + LHG
Sbjct: 69 RDMIKEGQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------- 117
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPR 175
T FP + A+SFN + +G+V + Y GL W+P +++ RDPR
Sbjct: 118 -----TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPR 166
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTD 233
WGRG E GED L S V+ +Q G SP +R V KH+ AY ++ K +
Sbjct: 167 WGRGSEGFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYN 223
Query: 234 RYHFNAMVIYTYYLIKFK-----------------------------------------Y 252
++ ++ Y+ +K
Sbjct: 224 TVDMSSQRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 253 IVSDCDSVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+SD ++ L +H T + PE+A +I +G+D++ + K+ +K+G V +
Sbjct: 284 TISDHGAIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 341
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIV 363
+D A + +G F+ PY LGPKD ++ ++D A AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARKVARESLV 396
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---A 418
LLKN +LPL + +AV+GP A+ + M+G++ G + T LQG+ V A
Sbjct: 397 LLKNRLETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 419 TIYQAGCSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQS 457
I A +NV + A +D+A A +D V V+G Q
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E+ R DL +P Q+ LI+ + K P++L++M+G ++ K D + +IL
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGR--PLALVKEDLQADAILETW 571
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G GG AIADV FG YNPSG+LP++ +P+S
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS 602
>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
[Tannerella forsythia ATCC 43037]
gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
forsythia ATCC 43037]
Length = 775
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 153/543 (28%), Positives = 252/543 (46%), Gaps = 97/543 (17%)
Query: 75 EKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASF 134
E + L A +RLGIP + + E +HG ++ G T FP I A+++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHG-----------HMAIGTTVFPTSIGQASTW 154
Query: 135 NASLFQAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 194
N +L + +G ++ E R+ + P +++ R+PRW R +ET GEDP+L+
Sbjct: 155 NRTLIEKMGAAIAHETRSQ-----GAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILG 209
Query: 195 TGYVKGLQQTD---------------------GGSPNR-LKVAA---CCKHYTAYDLDNW 229
+ +V+GLQ D GG R ++ A +H +++
Sbjct: 210 SAFVRGLQGKDFADGRHTYSTLKHLAAYGIPVGGHNGRQAQIGARELIAEHLLPFEMAVK 269
Query: 230 KGTDRY--HFNAM-----VIYTYYLIKF--------KYIVSDCDSVDVLYNSQHYTKTPE 274
G +NA+ TY L K ++VSD S++ + + +
Sbjct: 270 AGAQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEGIATTHRVAPDIK 329
Query: 275 EAAAKSILAGLDLNCGSF-LGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPS 333
AAA ++ AG++++ G ++ E A L++ S ID A+S +G F+ P
Sbjct: 330 HAAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISMSEIDDAVSRILRLKFEMGLFES-PY 388
Query: 334 KQPYGQLGPKDVCTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTK 393
QP + + ++ + LA A + IVLLKN A LPLS I ++AVIGPNA+
Sbjct: 389 VQP--SRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPLSKN-IGSIAVIGPNADNLY 445
Query: 394 TMIGNYEG-TPCKY-TTPLQGLAAVVATI----YQAGCSNVQCGTAQVDDAKKAAASADA 447
+G+Y P ++ T L+G+ V+ Y GC+ + +D+A +AA +++A
Sbjct: 446 NQLGDYTAPQPEEHIVTILEGIRNAVSPTTVIRYVKGCAVRDTTQSNIDEAVRAANASNA 505
Query: 448 TVLVMGADQS-------IE-------------------AESHDRLDLLLPGQQQLLITEV 481
VLV+G + IE E +DR L L G Q+ LI +
Sbjct: 506 VVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRKSLTLLGHQEKLIESI 565
Query: 482 AKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRL 541
A K P+I++ + G +++ A D K +++L YPG+ GG A+A+V FG NPSGRL
Sbjct: 566 AATGK-PLIMVYIQGRPLNMNLA--DKKASALLTAWYPGEEGGNAVANVIFGDVNPSGRL 622
Query: 542 PMT 544
P++
Sbjct: 623 PIS 625
>gi|443623759|ref|ZP_21108251.1| putative Sugar hydrolase [Streptomyces viridochromogenes Tue57]
gi|443342757|gb|ELS56907.1| putative Sugar hydrolase [Streptomyces viridochromogenes Tue57]
Length = 945
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 141/408 (34%), Positives = 196/408 (48%), Gaps = 68/408 (16%)
Query: 53 FCNTSLGIDLRVGDLVKRLTLQEKITFLVDSAGSVSRLGIPKYEWWSEALHGVSYVGPGT 112
F + L R+ DL+ RLTL EKI FL A +V RLGIP + EALHGV+++GP
Sbjct: 11 FRDQRLPFAKRIDDLLSRLTLDEKIAFLHQFAPAVERLGIPAFRTGQEALHGVAWMGP-- 68
Query: 113 HFSNVVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMY-NVGLAGLTFWSPNINIF 171
AT FPQ + A++N L + +G+ VS E RAM GL WSP +N+
Sbjct: 69 --------ATVFPQAVGLGATWNTDLVRRVGEAVSKETRAMRARDDRVGLNVWSPTVNLL 120
Query: 172 RDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGGSPNRLKVAACCKHYTAYDLDNWKG 231
R P WGR +E EDP L S AT Y +GL+ G P + A KH+ A++ + +
Sbjct: 121 RHPLWGRNEEGYSEDPKLTSAIATAYTRGLR---GDHPTYWRTAPVLKHWLAHNNETGRD 177
Query: 232 TDRYHFNAMVIYTYYLIKFKYIV------------------------------------- 254
V++ Y L F+ V
Sbjct: 178 VTSSSVRPRVLHEYDLRAFRETVEAGAVAGVMPAYNLVNGRPNHVSPYLREHLRTWTDED 237
Query: 255 ----SDCDSVDVLYNSQHYTKTPEEAAAKSILAGLD--LNCGSFLGKHT---EAAVKAGL 305
SD + L +S+HY T EEA A ++LAG+D + G+ K T A++ GL
Sbjct: 238 LLVCSDAGAPSNLVDSEHYFDTHEEATAAALLAGVDSFTDHGTDSSKITARVRGALERGL 297
Query: 306 VNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQANQDLALDAARQGIVLL 365
+ +S ID A+ + RLG FD P P+ +D T A++ LA +AA Q +VLL
Sbjct: 298 LTDSDIDTAVRRQLSVRFRLGEFD--PEYDPHALT--RDFDTPAHRALAQEAAEQAVVLL 353
Query: 366 KNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGL 413
KN+ G LPL+P LAV+G A+ K + Y GT +TPL+GL
Sbjct: 354 KNSGGLLPLAPGT--RLAVVGLLADECK--LDWYSGTLIHRSTPLEGL 397
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 449 VLVMGADQSIEA-ESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKND 507
++V G D I E+ DR L LP Q+ L+ A+ A +L ++S +A +
Sbjct: 570 LVVAGNDPHINGRETEDRTTLALPAHQERLL-RAARTANPKTVLALVSA----YPYAVDH 624
Query: 508 PKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMTWY 546
+ ++LW + GQA G A+A + G +P+GRLP TWY
Sbjct: 625 AALPAMLWTAHGGQAAGTALARILAGDVSPAGRLPQTWY 663
>gi|429087308|ref|ZP_19150040.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
gi|426507111|emb|CCK15152.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
Length = 765
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 168/632 (26%), Positives = 267/632 (42%), Gaps = 158/632 (25%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLR---VGDLVKRLTLQEKITFLV------------- 81
A + P+LAS F L + R V DL+K++T+ EKI L
Sbjct: 9 MAVSLALQPALASEPFGEHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAI 68
Query: 82 -------------------------DSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
D +SRL IP + + + LHG
Sbjct: 69 RDMIKEGQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------- 117
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPR 175
T FP + A+SFN + +G+V + Y GL W+P +++ RDPR
Sbjct: 118 -----TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPR 166
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTD 233
WGRG E GED L S V+ +Q G SP +R V KH+ AY ++ K +
Sbjct: 167 WGRGSEGFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYN 223
Query: 234 RYHFNAMVIYTYYLIKFK-----------------------------------------Y 252
++ ++ Y+ +K
Sbjct: 224 TVDMSSQRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 283
Query: 253 IVSDCDSVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+SD ++ L +H T + PE+A +I +G+D++ + K+ +K+G V+
Sbjct: 284 TISDHGAIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPDLIKSGKVSMEE 341
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKD-------VCTQANQDLALDAARQGIV 363
+D A + +G F+ PY LGPKD ++ ++D A + AR+ +V
Sbjct: 342 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAESRLHRDDARNVARESLV 396
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---A 418
LLKN +LPL + +AV+GP A+ + M+G++ G + T LQG+ V A
Sbjct: 397 LLKNRLETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKNVAGDKA 454
Query: 419 TIYQAGCSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQS 457
I A +NV + +D+A A +D V V+G Q
Sbjct: 455 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDVVVAVVGEAQG 514
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E+ R DL +P Q+ LI+ + K P++L++M+G ++ K D + +IL
Sbjct: 515 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMNGR--PLALVKEDQQADAILETW 571
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G GG AIADV FG YNPSG+LP++ +P+S
Sbjct: 572 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS 602
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 152/527 (28%), Positives = 244/527 (46%), Gaps = 96/527 (18%)
Query: 88 SRLGIPKYEWWSEALHGVSYVGPGTHFSNVVPGATSFPQVILTAASFNASLFQAIGKVVS 147
+RLGIP + EA HG ++ G T FP I +A+++ +L + +GK ++
Sbjct: 141 TRLGIPVF-LAEEAPHG-----------HMAIGTTVFPTGIGMSATWSPTLIEEVGKAIA 188
Query: 148 TEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYATGYVKGLQQTDGG 207
E R+ + P +++ RDPRW R +ET GEDP+L+ + V GL D
Sbjct: 189 KEIRSQ-----GAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGLGSGDLS 243
Query: 208 SPNRLKVAACCKHYTAYDL------DNWK--GTDRYHFN------------AMVIYT--- 244
+ A KH+ AY + N+ G H N A+ + T
Sbjct: 244 REH--ATIATLKHFLAYAVPEGGQNGNYASVGARDLHENFLPPFREAIEAGALSVMTSYN 301
Query: 245 ----------YYLI--------KFK-YIVSDCDSVDVLYNSQHYTKTPEEAAAKSILAGL 285
+YL+ KF+ ++VSD S++ ++ S T EEAA +++ AG+
Sbjct: 302 SIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVASTMEEAAVQALSAGV 361
Query: 286 DLNCGSFLGKHTEAAVKAGLVNESAIDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDV 345
D++ G + AV++G ++E+ I+ A+ +G F+ HP P + K V
Sbjct: 362 DIDLGGDAFMNLLQAVRSGKLDETQINAAVDRILRMKFEMGLFE-HPYVNP--KTTTKMV 418
Query: 346 CTQANQDLALDAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEG--TP 403
+ + LA A+ +VLL+N LPLS IK +AV+GPNA+ M+G+Y
Sbjct: 419 RNKEHVKLARKVAQSSVVLLENKNSILPLS-KKIKRVAVVGPNADNRYNMLGDYTAPQED 477
Query: 404 CKYTTPLQGLAAVVA---TIYQAGCSNVQCGTAQVDDAKKAAASADATVLVMGA------ 454
T L G+ + ++ Y GC+ ++ +A +AA ++ + V+G
Sbjct: 478 KDIRTVLDGVISKLSPSRVEYVRGCAIRDTTVNEIAEAVEAAHRSEVIIAVVGGSSARDF 537
Query: 455 -------------DQSIE----AESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGG 497
++SI E DR L L G+QQ L+ + K P+I++ + G
Sbjct: 538 KTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNALKTTGK-PLIVVYIEGR 596
Query: 498 GFDISFAKNDPKITSILWVGYPGQAGGAAIADVCFGRYNPSGRLPMT 544
D +A ++L YPGQAGG AIADV FG YNP+GRLP++
Sbjct: 597 PLDKVWASECAD--ALLTASYPGQAGGDAIADVLFGDYNPAGRLPVS 641
>gi|429110963|ref|ZP_19172733.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
gi|426312120|emb|CCJ98846.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
Length = 759
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 169/632 (26%), Positives = 265/632 (41%), Gaps = 158/632 (25%)
Query: 38 FACDVVSNPSLASLGFCNTSLGIDLR---VGDLVKRLTLQEKITFLV------------- 81
A + P+LAS F L + R V DL+K++T+ EKI L
Sbjct: 3 MAVSLALQPALASEPFGEHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAI 62
Query: 82 -------------------------DSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSN 116
D +SRL IP + + + LHG
Sbjct: 63 RDMIKEGQVGAIFNTVTRPDIRAMQDQVMQLSRLKIPLF-FAYDVLHGQR---------- 111
Query: 117 VVPGATSFPQVILTAASFNASLFQAIGKVVSTEARAMYNVGLAGLTF-WSPNINIFRDPR 175
T FP + A+SFN + +G+V + Y GL W+P +++ RDPR
Sbjct: 112 -----TIFPNSLGLASSFNLDAVKTVGRV------SAYEAADDGLNMTWAPMVDVTRDPR 160
Query: 176 WGRGQETPGEDPLLASKYATGYVKGLQQTDGGSP-NRLKVAACCKHYTAYD-LDNWKGTD 233
WGRG E GED L S V+ +Q G SP +R V KH+ AY ++ K +
Sbjct: 161 WGRGSEGFGEDTYLTSMMGKTMVESMQ---GKSPADRYSVMTSVKHFAAYGAVEGGKEYN 217
Query: 234 RYHFNAMVIYTYYLIKFK-----------------------------------------Y 252
++ ++ Y+ +K
Sbjct: 218 TVDMSSQRLFNDYMPPYKEALDAGSGGVMIALNSLNGTPAASDGWLLKDLLRGDWGFKGI 277
Query: 253 IVSDCDSVDVLYNSQHYTKT-PEEAAAKSILAGLDLNCG-SFLGKHTEAAVKAGLVNESA 310
+SD ++ L +H T + PE+A +I +G+D++ + K+ +K+G V +
Sbjct: 278 TISDHGAIKELI--KHGTASDPEDAVRVAIKSGVDMSMADEYYSKYLPNLIKSGKVTMAE 335
Query: 311 IDHAISNNFATLMRLGFFDGHPSKQPYGQLGPKDVCTQ-------ANQDLALDAARQGIV 363
+D A + +G F+ PY LGPKD Q ++D A AR+ +V
Sbjct: 336 LDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAESRLHRDDARKVARESLV 390
Query: 364 LLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYE--GTPCKYTTPLQGLAAVV---A 418
LLKN +LPL + +AV+GP A+ + M+G++ G + T LQG+ V A
Sbjct: 391 LLKNRLETLPLKKSG--TIAVVGPLADSKRDMMGSWSAAGVADQSVTLLQGMKKVAGDKA 448
Query: 419 TIYQAGCSNV---------------------QCGTAQVDDAKKAAASADATVLVMGADQS 457
I A +NV + +D+A A +D V V+G Q
Sbjct: 449 KILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVNVAKQSDMVVAVVGEAQG 508
Query: 458 IEAESHDRLDLLLPGQQQLLITEVAKVAKGPVILIIMSGGGFDISFAKNDPKITSILWVG 517
+ E+ R DL +P Q+ LI+ + K P++L++M+ G ++ K D + +IL
Sbjct: 509 MAHEASSRTDLTIPQSQRDLISALKATGK-PLVLVLMN--GRPLALVKEDQQADAILETW 565
Query: 518 YPGQAGGAAIADVCFGRYNPSGRLPMTWYPQS 549
+ G GG AIADV FG YNPSG+LP++ +P+S
Sbjct: 566 FAGTEGGNAIADVLFGDYNPSGKLPIS-FPRS 596
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,735,962,546
Number of Sequences: 23463169
Number of extensions: 369165067
Number of successful extensions: 859810
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5334
Number of HSP's successfully gapped in prelim test: 1494
Number of HSP's that attempted gapping in prelim test: 827878
Number of HSP's gapped (non-prelim): 16415
length of query: 550
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 402
effective length of database: 8,886,646,355
effective search space: 3572431834710
effective search space used: 3572431834710
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)